Databases on the Indonesian Prefixes PE- and PEN-

Karlina Denistia

Abstract


This paper provides the theoretical grounding in constituting databases related to PE- and PEN-, two Indonesian nominalizing prefixes, which have various meanings (e.g., patient, agent, or instrument). The first database contains the words with PE- and PEN- whereas the second database provides the cosine similarity between two words of interest. Using a written Indonesian corpus as the primary source (Leipzig Corpora Collection), the databases contain the following information: PE- or PEN- prefixes, allomorph of PEN-, base word, semantics role, morphological variation, cosine similarity, as well as the word frequency. Furthermore, this paper elaborates the theoretical consideration on how each information was cultivated. In building the databases, Indonesian morphological parser and Word to Vector were used to analyze the Indonesian morphological status and to put the words in the corpus into a vector. In addition, manual verification for the data against the Indonesian comprehensive dictionary was also conducted. In the end, the databases are available for free so that the data could be used as materials for a corpus-based analysis on Indonesian morphology. This research shed light to a careful and thorough classification of the open-access databases of PE- and PEN- from their allomorphs, base word, semantics role, and morphological variation. The information provided in this article is hoped to be contributive in Indonesian morphology specifically, and other linguistics fields (e.g., corpus linguistics and quantitative linguistics) in general.  


Keywords


corpus data; morphology; prefixes; cosine similarity

Full Text:

PDF

References


Alwi, H. (2012). Kamus Besar Bahasa Indonesia (fourth). Jakarta: Gramedia Pustaka Utama.

Arka, I. W., Dalrymple, M., Mistica, M., & Mofu, S. (2009). A linguistic and computational morphosyntactic analysis for the applicative -i in Indonesian. In M. Butt & T. H. King (Eds.), International Lexical Functional Grammar Conference (LFG) (pp. 85–105). CSLI Publications.

Benjamin, G. (2009). Affixes, Austronesian and iconicity in Malay. Bijdragen Tot de Taal-, Land- En Volkenkunde, 165(2–3), pp. 291–323.

Booij, G. (2010). Construction morphology. Language and Linguistics Compass, 4(7), pp. 543–555.

Booij, G. E. (1986). Form and Meaning in Morphology: The Case of Dutch Agent Nouns. Linguistics, 24, pp. 503–517.

Booij, G. E. (1996). Inherent versus contextual inflection and the split morphology hypothesis. In G. E. Booij & J. van Marle (Eds.), Yearbook of Morphology 1995 (pp. 1–16). Netherland: Kluwer Academic Publishers.

Booij, G., & Lieber, R. (2004). On the paradigmatic nature of affixal semantics in English and Dutch. Linguistics, 42, pp. 327–357.

Chaer, A. (2008). Morfologi Bahasa Indonesia (Pendekatan Proses). Jakarta: PT Rineka Cipta.

Dalrymple, M., & Mofu, S. (2012). Plural Semantics, Reduplication, and Numeral Modification in Indonesian. Journal of Semantics, 29(2), pp. 229–260. https://doi.org/10.1093/jos/ffr015

Dardjowidjojo, S. (1983). Some Aspects of Indonesian Linguistics. Jakarta: Djambatan.

Denistia, K. (2018). Revisiting the Indonesian Prefixes PEN-, PE2-, and PER-. Linguistik Indonesia, 36(2), pp. 145–159.

Denistia, K., & Baayen, R. H. (2019). The Indonesian prefixes PE- and PEN-: A study in productivity and allomorphy. Morphology, pp. 1–23. https://doi.org/10.1007/s11525-019-09340-7

Denistia, K. & Baayen, R. H. (2022a). Affix substitution in Indonesian: A computational modelling approach. Linguistics. https://doi.org/10.1515/ling-2020-0191

Denistia, K., and Baayen, R. H. (2022b). The morphology of Indonesian: Data and quantitative modeling. In Shei, C., and Li, S. (Eds.) The Routledge Handbook of Asian Linguistics, 605-634. London: Routledge.

Denistia, K., Shafaei-Bajestan, E., & Baayen, H. (2022). Exploring semantic differences between the Indonesian prefixes PE- and PEN- using a vector space model. Corpus Linguistics and Linguistic Theory, 18(3), pp. 573–598. https://doi.org/10.1515/cllt-2020-0023

Ermanto. (2016). Morfologi Afiksasi Bahasa Indonesia Masa Kini: Tinjauan dari Morfologi Derivasi dan Infleksi. Jakarta: Kencana.

Goldhahn, D., Eckart, T., & Quasthoff, U. (2012). Building large monolingual dictionaries at the Leipzig Corpora Collection: From 100 to 200 languages. Proceedings of the Eighth International Conference on Language Resources and Evaluation, pp. 1799–1802.

Heyer, G., & Quasthoff, U. (2004). Calculating Communities by Link Analysis of URLs. Proceedings of IICS-04.

Kridalaksana, H. (2007). Kelas Kata dalam Bahasa Indonesia (second). Jakarta: Gramedia Pustaka Utama.

Kridalaksana, H. (2008). Kamus Linguistik (4th ed.). Jakarta: Gramedia Pustaka Utama.

Kroeger, P. R. (2007). Architectures, Rules, and Preferences: Variations on Themes of Joan Bresnan (A. Zaenen, J. Simpson, T. H. King, G. Jane, J. Maling, & C. Manning, Eds.; pp. 229–251). CSLI Publications.

Larasati, S. D., Kuboň, V., & Zeman, D. (2011). Indonesian morphology tool MorphInd: Towards an Indonesian corpus. In M. C & P. M (Eds.), Systems and Frameworks for Computational Morphology (Vol. 100, pp. 119–129). Springer.

Mikolov, T., Sutskever, I., Chen, K., Corrado, G. S., & Dean, J. (2013). Distributed representations of words and phrases and their compositionality. Advances in Neural Information Processing Systems, pp. 3111–3119.

Pollmächer, J. (2011). Separierung mit FindLinks gecrawlter Texte nach Sprachen [Master’s Thesis]. University of Leipzig.

Putrayasa, I. B. (2008). Kajian Morfologi: Bentuk Derivasional dan Infleksional. Bandung: PT Refika Aditama.

Quasthoff, U., Richter, M., & Biemann, C. (2006). Corpus Portal for Search in Monolingual Corpora. pp. 1799–1802.

R Team, S. (2015). RStudio: Integrated Development for R. RStudio. RStudio, Inc. http://www.rstudio.com/

Rafferty, E. (2002). Reduplication of Nouns and Adjectives in Indonesian. Papers from the Tenth Annual Meeting of the Southeast Asian Linguistics Society, pp. 317–332.

Ramlan, M. (1985). Morfologi: Suatu Tinjuan Deskriptif. Yogyakarta: CV Karyono.

Sneddon, J. N., Adelaar, A., Djenar, D. N., & Ewing, M. C. (2010). Indonesian: A Comprehensive Grammar (second). London: Routledge.

Subroto, E. (2012). Pemerian Morfologi Bahasa Indonesia: Berdasarkan Perspektif Derivasi dan Infleksi Proses Afiksasi. Surakarta: Yuma Pressino.

Sugerman. (2016). Morfologi Bahasa Indonesia: Kajian ke Arah Linguistik Deskriptif. Yogyakarta: Penerbit Ombak.

Sukarno. (2017). The Behaviours of the General Nasal /N/ in Indonesian Active Prefixed Verbs. International Journal of Language and Linguistics, 4(2), pp. 48–52.

Sutanto, I. (2002). Verba berkata dasar sama dengan gabungan afiks meN-i atau meN-kan. Makara, Sosial-Humaniora, 6(2), pp. 82–87.

Tjia, J. (2015). Grammatical relations and grammatical categories in Malay: The Indonesian prefix meN- revisited. Wacana, 16(1), pp. 105–132.

Tomasowa, F. H. (2007). The reflective experiential aspect of meaning of the affix -i in Indonesian. Linguistik Indonesia, 25(2), pp. 83–96.




DOI: https://doi.org/10.24071/joll.v23i1.4967

Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

Journal of Language and Literature (JOLL) is published by  Prodi Sastra Inggris, Fakultas Sastra, Universitas Sanata Dharma.

JOLL is indexed in:

       


This journal is is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License 

View My Stats