Building a Tokenizer for Indonesian David Moeljadi and Hannah Choi Division of Linguistics and Multilingual Studies, Nanyang Technological University, Singapore The 21st International Symposium on Malay/Indonesian Linguistics (ISMIL 21), Langkawi Research Center 4 May 2017 Moeljadi & Choi (LMS, NTU) Tokenizer for Indonesian 4 May 2017 1 / 13