Developing an Android Application for Analyzing Indonesian Syntax: A Rule and Probability-Based POS Tagging Approach
DOI:
https://doi.org/10.31849/reila.v6i2.14975
Keywords:
Android Applications, Digital Language Learning, Grammatical Corpus, Indonesian Syntax, Lexical Categories, POS Tagging
Abstract
Investigations of syntax, particularly Indonesian syntax, have focused exclusively on sentence formation and have not drawn on corpus data. This has left Indonesian grammatical corpus data with little relevance to corpus-based grammatical investigations. This study offers a detailed overview of the use of Android applications based on POS tagging data. The study used a qualitative method and focused on developing an application that uses rule- and probability-based POS tagging data from the Leipzig Indonesian Mix_2013 corpus to determine the categories, functions, and roles of Indonesian syntax, with lexical categories including V (copula, existence, and equative) as potential predicates at the function level. The application was designed to be compatible with the Android system by integrating POS tagging into the System Development Life Cycle (SDLC), making it accessible to a larger user base. The result is a program designed as a tool for searching syntactic categories in Indonesian. The program uses sequential search, a linear search method, to make it easier for users to find specific syntactic functions. By applying syntactic categories and functions to POS tagging data from the Leipzig Indonesian Mix_2013 corpus, the study gained insight into the roles of Indonesian syntax. POS tagging based on the generated rules and probabilities achieves an accuracy of 92.53% for category and function tags.
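The sequential search mentioned in the abstract can be illustrated with a minimal sketch. It is not the application's actual code: the `TaggedToken` type, the `sequential_search` function, the tag labels ("PRON", "V", "N", "S", "P", "Pel"), and the hand-tagged example sentence are all assumptions made for illustration, and Python is used here only for brevity.

```python
from typing import List, NamedTuple, Tuple


class TaggedToken(NamedTuple):
    """One word with its tags (hypothetical structure, not from the study)."""
    word: str
    category: str  # lexical category, e.g. "N", "V", "PRON" (illustrative tag set)
    function: str  # syntactic function, e.g. "S", "P", "Pel" (illustrative labels)


def sequential_search(
    tokens: List[TaggedToken], function: str
) -> List[Tuple[int, TaggedToken]]:
    """Scan the tokens left to right and collect every token whose
    syntactic function equals the requested one (plain linear search)."""
    matches = []
    for index, token in enumerate(tokens):
        if token.function == function:
            matches.append((index, token))
    return matches


# Hand-tagged toy sentence: "Dia adalah guru" ("He/She is a teacher").
# The tags below are assumed for the example, not taken from the corpus.
sentence = [
    TaggedToken("Dia", "PRON", "S"),
    TaggedToken("adalah", "V", "P"),   # copula verb filling the predicate slot
    TaggedToken("guru", "N", "Pel"),   # complement
]

predicates = sequential_search(sentence, "P")
for position, token in predicates:
    print(position, token.word, token.category)
```

A linear scan visits each token once, so the cost grows in proportion to the number of tagged tokens. That is simple to implement and needs no index, which suits short sentences, though a larger corpus might call for a different lookup structure.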


