UTILIZATION OF CORPUS LINGUISTICS IN THE QATAR DEBATE BOOK: A STUDY OF HIGH FREQUENCY WORDS AND THE STRUCTURE OF ARABIC DEBATES
Abstract
Vocabulary plays an important role in learning a foreign language, but students have
difficulty determining which vocabulary is used in debate. This research is motivated by
the needs of students learning Arabic debate. Its purpose is to produce and describe a
list of high-frequency words (HFW), together with their frequencies of use, drawn from
the Qatar debate book. The study uses a quantitative method to obtain the data and a
qualitative method to analyze the data and draw conclusions. The results show that the
book contains 8,724 word types (distinct words) and 110,624 tokens (word occurrences).
These results are useful for students of Arabic, teachers, writers, and debaters,
especially in Indonesia. HFW analysis is valuable in foreign-language learning: it yields
specialized lists of words and terms that can be categorized according to the delivery
structure of Arabic debates, so that students can understand the vocabulary frequently
used in debates.
Keywords: Corpus Linguistics; High Frequency Words; Qatar Debate Book
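
The distinction between types and tokens in the abstract can be illustrated with a short sketch. It is not the authors' tooling, which this page does not describe. The function names tokenize and frequency_profile, the regular expressions, and the sample sentence are all assumptions made for illustration. The sketch strips Arabic diacritics, splits on runs of Arabic letters, and counts each distinct surface form as one type.

```python
import re
from collections import Counter

# Assumption: short vowels (harakat), superscript alef and tatweel are removed
# so that vocalised and unvocalised spellings count as the same type.
DIACRITICS = re.compile(r"[\u064B-\u0652\u0670\u0640]")
# Assumption: a token is a run of Arabic letters (U+0621 to U+064A).
ARABIC_WORD = re.compile(r"[\u0621-\u064A]+")


def tokenize(text):
    """Return the Arabic word tokens of `text`, diacritics removed."""
    return ARABIC_WORD.findall(DIACRITICS.sub("", text))


def frequency_profile(text, top_n=10):
    """Count tokens and types and list the `top_n` most frequent words."""
    tokens = tokenize(text)
    counts = Counter(tokens)
    return {
        "tokens": len(tokens),   # every occurrence of a word
        "types": len(counts),    # distinct word forms
        "ttr": len(counts) / len(tokens) if tokens else 0.0,
        "hfw": counts.most_common(top_n),
    }


# Invented sample sentence, not taken from the Qatar debate book.
sample = "المناظرة فن الحوار. المناظرة تعلم الحوار، والحوار يبني الفهم."
profile = frequency_profile(sample, top_n=3)
print(profile["tokens"], profile["types"], profile["hfw"])
```

Because the sketch counts surface forms, a word with an attached conjunction or preposition is a different type from the bare word. A real study would probably need lemmatization or clitic segmentation, which is omitted here. Applying the same type-to-token ratio to the figures reported in the abstract gives 8,724 / 110,624, or roughly 0.079.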
Proceeding of International Conference in Arabic Festival, May 15th 2025
Forging Traces of Arabic Civilization in World Art, Literature and Culture
E-ISSN: 2798-9461, P-ISSN: 2798-9041