Developing an Android Application for Analyzing Indonesian Syntax: A Rule and Probability-Based POS Tagging Approach

Authors

  • Herpindo Herpindo Universitas Tidar, Magelang, Indonesia
  • Ristiyani Ristiyani Universitas Muria Kudus, Kudus, Indonesia
  • Miftahula Rizqin Nikmatullah Universitas Tidar, Magelang, Indonesia
  • Ratih Ngestrini Utrecht University, Utrecht, Netherlands

DOI:

https://doi.org/10.31849/reila.v6i2.14975

Keywords:

Android Applications, Digital Language Learning, Grammatical Corpus, Indonesian Syntax, Lexical Categories, POS Tagging

Abstract

The investigation of the grammatical level of syntax, particularly Indonesian, focuses exclusively on sentence formation and does not include the corpus. This renders Indonesian grammatical corpus data less relevant in corpus-based grammatical investigations. The study offers a detailed overview of the utilization of Android applications based on POS Tagging data. The method in this study was qualitative focusing on the development of an application that utilizes Rule and Probability-based POS Tagging data from Leipzig Indonesian Mix_2013 to determine the categories, functions, and roles of Indonesian syntax with lexical categories including V (copula, existence, and equative) as the predicate potential on function. The application was designed to be compatible with the Android system by Integrating POS tagging into the System Development Life Cycle (SDLC), enabling wider accessibility to a larger user base. The result of this research introduces a program designed as a tool to search syntactic categories in Indonesian. The program uses a sequential search technique, which is a linear search method, to make it easier for users to find specific syntactic functions. By applying syntactic categories and functions using POS Tagging data from the Leipzig Indonesian Mix_2013 corpus, the study achieved significant insights into the roles of Indonesian syntax.  POS Tagging based on the generated rules and probabilities achieves an accuracy rate of 92.53% for category tags and functions.

References

Auni, L., & Manan, A. (2023). A contrastive analysis of morphological and syntactic aspects of English and Indonesian adjectives. Studies in English Language and Education, 10(1), 403–423. https://doi.org/10.24815/siele.v10i1.27401

Baker, M. C. (2004). Lexical categories. Cambridge University Press.

Bradley, T.-D., Terilla, J., & Vlassopoulos, Y. (2022). An enriched category theory of language: From syntax to semantics. La Matematica, 1, 551–580. https://doi.org/10.1007/s44007-022-00021-2

Bustomi, T., Kridalaksana, H., & Fachruddin, F. (2018). Aplikasi media pembelajaran Bahasa Indonesia berbasis multimedia. Sebatik, 10(1), 1-7. https://doi.org/10.46984/sebatik.v10i1.58

Carroll, J. (2013). The Routledge handbook of corpus linguistics. Routledge. https://doi.org/10.4324/9780367076399

Changpueng, P., & Patpong, P. (2021). Genre analysis of minutes of meetings conducted in English by Thai engineers. Indonesian Journal of Applied Linguistics, 11(1), 134-145. https://doi.org/10.17509/ijal.v11i1.34584

Cheng, W., Warren, M., & Xun-feng, X. (2003). The language learner as language researcher: Putting corpus linguistics on the timetable. System, 31(2), 173–186. https://doi.org/10.1016/S0346-251X(03)00019-8

Chomsky, N. (2020). Remarks on nominalization. In Oxford University Press (pp. 25–28). https://doi.org/10.1093/oso/9780198865544.003.0002

Chomsky, N. (2021). Linguistics then and now: Some personal reflections. Annual Review of Linguistics, 7(1), 1–11. https://doi.org/10.1146/annurev-linguistics-081720-111352

Dewi, R., & Ubaidi, A. (2020). Pos tagging Bahasa Madura dengan menggunakan algoritma Brill tagger. Jurnal Teknologi Informasi Dan Ilmu Komputer, 7(6), 1121-1128. https://doi.org/10.25126/jtiik.2020722449

Dong, C., & Liu, X. (2013). Development of Android application for language studies. IERI Procedia, 4, 8–16. https://doi.org/10.1016/j.ieri.2013.11.003

Falk, Y. (2011). Lexical-functional grammar. Oxford University Press.

Flemming, U., Erhan, H., & Özkaya, I. (2004). Object-oriented application development in CAD: A graduate course. Automation in Construction, 13(2), 147–158. https://doi.org/10.1016/j.autcon.2003.09.009

Haryono, H., Lelono, B., & Kholifah, A. N. (2018). Typography, morphology, and syntax characteristics of texting. Lingua Cultura, 12(2), 179-185. https://doi.org/10.21512/lc.v12i2.3976

Herpindo, H., Wijayanti, A., & Shalima, I. (2022). Kategori, fungsi, dan peran sintaksis bahasa Indonesia dengan PoS tagging berbasis rule dan probability. Kembara: Jurnal Keilmuwan Bahasa, Sastra Dan Pengajarannya, 8(1), 51–65. https://doi.org/10.22219/kembara.v8i1.18602

Ho, J. (2023). #FleeingWuhan: Legitimation and delegitimation strategies in hostile online discourse. Applied Linguistics, 44(3), 391–419. https://doi.org/10.1093/applin/amac061

Howes, C., & Gibson, H. (2021). Dynamic syntax. Journal of Logic, Language and Information, 30(2), 263–276. https://doi.org/10.1007/s10849-021-09334-x

Ilmiyah, M., & Qoiriah, A. (2021). Sistem deteksi kesalahan tanda baca dan huruf kapital pada karya tulis ilmiah berbahasa Indonesia menggunakan algoritma Boyer-Moore. Journal of Informatics and Computer Science (JINACS), 2(03), 185-193. https://doi.org/10.26740/jinacs.v2n03.p185-193

Indiana, B. D., & Ramadhani, I. (2019). Aplikasi pembelajaran Bahasa Jawa berbasis Android. CAHAYAtech, 8(1), 40-57. https://doi.org/10.47047/ct.v8i1.18

Jia, H., & Liang, J. (2020). Lexical category bias across interpreting types: Implications for synergy between cognitive constraints and language representations. Lingua, 239, 102809. https://doi.org/10.1016/j.lingua.2020.102809

Kaya, Ö. F. (2022). Using corpora for language teaching and assessment in L2 writing: A narrative review. Focus on ELT Journal, 4(3), 46–62. https://doi.org/10.14744/felt.2022.4.3.4

Khairul, K., Haryati, S., & Yusman, Y. (2018). Aplikasi kamus Bahasa Jawa Indonesia dengan algoritma Raita berbasis Android. Jurnal Teknologi Informasi Dan Pendidikan, 11(1). https://doi.org/10.24036/tip.v11i1.102

Kridalaksana, H. (1988). Beberapa prinsip perpaduan leksem dalam Bahasa Indonesia. Kanisius.

Moeliono, A. M., dkk. (2017). Tata bahasa baku Bahasa Indonesia (Edisi keempat). Badan Pengembangan dan Pembinaan Bahasa, Kementerian Pendidikan dan Kebudayaan.

Olivia, N., & Shaklein, V. (2019). Preparation specifics of students-philologists: Modern technologies of literary and linguistic text analysis. International Journal of Language Education, 3(2), 91–98. https://doi.org/10.26858/ijole.v3i2.9749

Ortin, F., & Cueva, J. M. (2004). Dynamic adaptation of application aspects. Journal of Systems and Software, 71(3), 229–243. https://doi.org/10.1016/S0164-1212(02)00157-7

Papenhausen, E., & Mueller, K. (2018). Coding ants: Optimization of GPU code using ant colony optimization. Computer Languages, Systems & Structures, 54, 119–138. https://doi.org/10.1016/j.cl.2018.05.003

Pinciroli, F., Barros Justo, J. L., & Forradellas, R. (2022). Systematic mapping study: On the coverage of aspect-oriented methodologies for the early phases of the software development life cycle. Journal of King Saud University - Computer and Information Sciences, 34(6), 2883–2896. https://doi.org/10.1016/J.JKSUCI.2020.10.029

Pramudita, H. R., Utami, E., & Amborowati, A. (2016). Pengaruh part of speech tagging berbasis aturan dan distribusi probabilitas maximum entropy untuk Bahasa Jawa Krama. Jurnal Buana Informatika, 7(4), 235-244. https://doi.org/10.24002/jbi.v7i4.764

Prasetyo, E. A. (2019). Aplikasi pembelajaran BIPA (Bahasa Indonesia bagi penutur asing) tingkat dasar berbasis Android. J-INTECH, 6(02), 229-234. https://doi.org/10.32664/j-intech.v6i02.256

Purwono, P. Y., & Asteria, P. V. (2021). Pembelajaran BIPA dengan aplikasi AWAN ASA berbasis pengenalan lintas budaya. Fon: Jurnal Pendidikan Bahasa Dan Sastra Indonesia, 17(1), 97-107. https://doi.org/10.25134/fjpbsi.v17i1.3892

Rahmawati, I. Y., Aziz, F., & Wahyono, T. (2020). Rancang bangun aplikasi pengenalan materi pengajaran Bahasa Indonesia bagi penutur asing (BIPA) berbasis Android di Universitas Muhammadiyah Ponorogo. In Konferensi Internasional Pengajaran Bahasa Indonesia bagi Penutur Asing (KIPBIPA) XI.

Ramliyana, R., Pratiwi, N. K., & Megiati, Y. E. (2022). Analysis of Indonesian language error in writing reports of students’ learning results of the Amanah Fitrah Rabbani Foundation using the Sipebi application. Hortatori: Jurnal Pendidikan Bahasa Dan Sastra Indonesia, 6(1), 6–16. https://doi.org/10.30998/jh.v6i1.998

Ratnawati, N., Wahyuningtyas, N., Ruja, I. N., Habibi, M. M., Anggraini, R., & The, H. Y. (2021). Developing multimedia-based learning media for basic skill of teaching material in order to equip professional teachers. International Journal of Emerging Technologies in Learning (IJET), 16(07), 77. https://doi.org/10.3991/ijet.v16i07.21203

Raymondra, K. A. P., & Bukhori, H. A. (2021). Interferensi sintaksis Bahasa Indonesia terhadap Bahasa Jerman pada Schriftlicher Ausdruck dalam matakuliah B1-Prüfüngsvorbereitung. JoLLA: Journal of Language, Literature, and Arts, 1(1). https://doi.org/10.17977/um064v1i12021p25-36

Rianto, A. (2020). A study of language learning strategy use among Indonesian EFL university students. Register Journal, 13(2), 231–256. https://doi.org/10.18326/rgt.v13i2.231-256

Rizki , T., & Meditanala, T. (2018). Perancangan media pembelajaran membaca dan menulis Bahasa Indonesia bagi penutur asing melalui aplikasi berbasis Android. In Seminar Internasional Riksa Bahasa.

Römer, U. (2011). Corpus research applications in second language teaching. Annual Review of Applied Linguistics, 31, 205–225. https://doi.org/10.1017/S0267190511000055

Safara, A., Zaim, M., & Refnaldi. (2019). Mobile-based learning in digital era: Android application as a media to teach grammar. In Proceedings of the 1st International Conference on Education Social Sciences and Humanities (ICESSHum 2019). https://doi.org/10.2991/icesshum-19.2019.157

Shamsujjoha, Md., Grundy, J., Li, L., Khalajzadeh, H., & Lu, Q. (2021). Developing mobile applications via model driven development: A systematic literature review. Information and Software Technology, 140, 106693. https://doi.org/10.1016/j.infsof.2021.106693

Sneddon, J. N., Adelaar, A., Djenar, D. N., & Ewing, M. C. (2010). Indonesian reference grammar. Allen & Unwin.

Sugiarti, R., Budiyono, C., & Sunu. (2021). Fungsi, kategori dan peran sintaksis pada cerita pendek dalam koran Jawa Pos bulan Juli 2016. Buana Bastra, 5(1). https://doi.org/10.36456/bastra.vol5.no1.a3582

Sulistyo, W. D., Khakim, M. N. L., Jauhari, N., & Anggraeni, R. D. (2021). Fun learning history: Explore the history of water sites based on Android. International Journal of Emerging Technologies in Learning (IJET), 16(07), 105. https://doi.org/10.3991/ijet.v16i07.21215

Syahroni, A., & Wahab, H. (2019). Aplikasi penentuan kategori dan fungsi sintaksis kalimat Bahasa Indonesia. InfoTekJar: Jurnal Nasional Informatika Dan Teknologi Jaringan, 4(1). https://doi.org/10.30743/infotekjar.v4i1.1537

Syamsiah, S. (2017). Pengembangan aplikasi multimedia pembelajaran interaktif untuk mata pelajaran Bahasa Indonesia. SAP (Susunan Artikel Pendidikan), 2(1). https://doi.org/10.30998/sap.v2i1.1723

Tallerman, M. (2014). Understanding syntax (4th ed.). Routledge. https://doi.org/10.4324/9781315758084

Tekinerdogan, B., Ali, N., Grundy, J., Mistrik, I., & Soley, R. (2016). Quality concerns in large-scale and complex software-intensive systems. In Software Quality Assurance: In Large Scale and Complex Software-Intensive Systems (pp. 1–17). https://doi.org/10.1016/B978-0-12-802301-3.00001-6

Tyers, F., & Howell, N. (2021). A survey of part-of-speech tagging approaches applied to K’iche’. In Proceedings of the 1st Workshop on Natural Language Processing for Indigenous Languages of the Americas, AmericasNLP 2021. https://doi.org/10.18653/v1/2021.americasnlp-1.6

Van Valin, R. D., Jr. (2001). Syntax: Structure, meaning and function. Cambridge University Press.

Verhaar, J. W., & Alip, B. (1996). Asas-asas linguistik umum. Gadjah Mada University Press.

Widhiyanti, K., & Harjoko, A. (2013). POS tagging Bahasa Indonesia dengan HMM dan rule based. Jurnal Informatika, 8(2). https://doi.org/10.21460/inf.2012.82.125

Yunus, Y. H. (2020). Aplikasi pembelajaran Bahasa Indonesia berbasis Android pada sekolah menengah pertama. Jurnal Teknologi Informasi Indonesia (JTII), 5(2). https://doi.org/10.30869/jtii.v5i2.646

Downloads

Published

2024-07-16

How to Cite

Developing an Android Application for Analyzing Indonesian Syntax: A Rule and Probability-Based POS Tagging Approach. (2024). REiLA : Journal of Research and Innovation in Language, 6(2), 125-142. https://doi.org/10.31849/reila.v6i2.14975