Unearthing Collocations of Keywords in News Coverage on the Ancient Town of Si Thep, Thailand’s Seventh UNESCO World Heritage Site: A Corpus-based Study
Main Article Content
Abstract
The ancient town of Si Thep in Phetchabun, Thailand, was declared a UNESCO World Heritage Site in 2023, boosting tourism and raising the demand for effective English communication in the area. This study’s objectives are to analyze keywords and their collocational patterns in a specialized corpus of news coverage on Si Thep (hereafter: the ST corpus), comprising 30,311 words, gathered from 76 English language news articles, published between April 2015 and February 2024. The results reveal three grammatical categories of keywords: nouns (N = 81), adjectives (N = 12), and verbs (N = 7). The keywords in the ST corpus are integrated into 172 types of lexical collocations, grammatically classified as Adjective + Noun (43.60%), Noun + Noun (37.79%), Noun + Verb (7.56%), Noun + of Noun (6.40%), Verb + Noun (4.07%), and Verb + Adverb (0.58%). Notably, the keywords “local”, “cultural”, “ancient”, “UNESCO”, and “Dvaravati” are associated with the highest number of collocates in the ST corpus. Various pairs of keywords and collocates also represent nine topics of the ancient town of Si Thep, as reported in the news, e.g., Si Thep Overwhelmed by Overtourism, Si Thep Threatened by Oil Drilling. This study may offer alternatives for teaching collocations of specialized vocabulary and developing instructional materials, reflective of authentic language use, as well as methodological guidelines for English teachers in other locations designated as UNESCO World Heritage Sites.
Article Details

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
References
Anthony, L. (2023). AntConc (Version 4.2.4) [Computer Software]. Waseda University. https://www.laurenceanthony.net/software
Baker, P. (2006). Using corpora in discourse analysis. Continuum.
Baker, P., Gabrielatos, C., & McEnery, T. (2013). Discourse analysis and media attitudes: The representation of Islam in the British press. Cambridge University Press. https://doi.org/10.1017/CBO9780511920103
Bennett, G. R. (2010). Using corpora in the language learning classroom: Corpus linguistics for teachers. University of Michigan Press ELT. https://doi.org/10.3998/mpub.371534
Benson, M., Benson, E., & Ilson, R. (2010). The BBI combinatory dictionary of English: Your guide to collocations and grammar (3rd ed.). John Benjamins. https://doi.org/10.1075/z.bbi
Bin Kadri, H. K. (2023). The sentiment analysis on news reporting: The sentiment of adjectives in news articles. E-Jurnal Bahasa Dan Linguistik (e-JBL), 5(2), 68–74. https://doi.org/10.53840/ejbl.v5i2.154
Bartsch, S., & Evert, S. (2014). Towards a Firthian notion of collocation. In A. Abel & L. Lemnitzer (Eds.), Network strategies, access structures, and automatic extraction of lexicographical information (2nd Work Report of the Academic Network Internet Lexicography, OPAL–Online publizierte Arbeiten zur Linguistik, pp. 48–61). Leibniz Institute for the German Language.
Boonraksa, T., & Naisena, S. (2022). A study on English collocation errors of Thai EFL students. English Language Teaching, 15(1), 164–177. https://doi.org/10.5539/elt.v15n1p164
Boonyarattanasoontorn, P., Sirawich, S., & Chomploen, C. (2020). The effect of a collocation teaching innovation on students’ use of collocations. Journal of Studies in the English Language, 15(2), 98–129. https://so04.tci-thaijo.org/index.php/jsel/article/view/240158
Browne, C. (2014). A new general service list: The better mousetrap we’ve been looking for. Vocabulary Learning and Instruction, 3(2), 1–10. http://dx.doi.org/10.7820/vli.v03.2.2187-2759
Chung, T. M., & Nation, P. (2003). Technical vocabulary in specialized texts. Reading in a Foreign Language, 2, 103–116.
Coxhead, A. (2000). A new academic word list. TESOL Quarterly, 34(2), 213–238. https://doi.org/10.2307/3587951
Evert, S. (2005). The statistics of word co-occurrences: Word pairs and collocations [Unpublished doctoral dissertation]. Institute for Natural Language Processing, University of Stuttgart.
Evert, S. (2009). Corpora and collocations. In A. Lüdeling & M. Kytö (Eds.), Corpus linguistics: An international handbook. (pp. 1212–1248). Mouton de Gruyter. https://doi.org/10.1515/9783110213881.2.1212
Fujita, R. (2019). English for tourism and hospitality. In H. Terauchi, J. Noguchi, & A. Tajino (Eds.), Towards a new paradigm for English Language Teaching: English for Specific Purposes in Asia and beyond (1st ed., pp. 172–180). Routledge. https://doi.org/10.4324/9780429423963
Historical park grapples with crowds. (2023, September 21). Bangkok Post. https://www.bangkokpost.com/thailand/general/2652870/historical-park-grapples-with-crowds
Jaén, M. M. (2009). A corpus-driven design of a test for assessing the ESL collocational competence of university students. International Journal of English Studies, 7(2), 127–148. https://revistas.um.es/ijes/article/view/49031
Khamphairoh, T., & Tangpijaikul, M. (2012). Collocations of keywords found in insurance research articles: A corpus-based analysis. Manutsayasat Wichakan, 19(2), 166–188. https://so04.tci-thaijo.org/index.php/abc/article/view/54124
Koester, A. (2010). Building small specialised corpora. In A. O’Keeffe & M. McCarthy (Eds.), The Routledge handbook of corpus linguistics (pp. 66–79). Routledge.
Lei, L., & Liu, D. (2018). The academic English collocation list: A corpus-driven study. International Journal of Corpus Linguistics, 23(2), 216–243. https://doi.org/10.1075/ijcl.16135.lei
Lewis, M. (1997). Implementing the lexical approach: Putting theory into practice. Language Teaching Publications.
Lewis, M. (2000). Teaching collocation: Further developments in the lexical approach. Language Teaching Publications.
Lyse, G. I., & Andersen, G. (2012). Collocations and statistical analysis of N-grams. Exploring newspaper language: Using the web to create and investigate a large corpus of modern Norwegian. Studies in Corpus Linguistics, 49, 79–109. https://doi.org/10.1075/scl.49
McEnery, T., Xiao, R., & Tono, Y. (2006). Corpus-based language studies: An advanced resource book. Routledge.
Namtapi, I. (2022). Needs analysis of English for specific purposes for tourism personnel in Ayutthaya. LEARN Journal: Language Education and Acquisition Research Network, 15(1), 409–439. https://so04.tci-thaijo.org/index.php/LEARN/article/view/256730
Nation, I. S. P. (2013). Learning vocabulary in another language (2nd ed.). Cambridge University Press.
Nesselhauf, N. (2005). Collocations in a learner corpus. John Benjamins. https://doi.org/10.1075/scl.14
O’Dell, F., & McCarthy, M. (2008). English collocations in advanced use. Cambridge University Press.
Phu Phra Bat named World Heritage site. (2024, July 27). Bangkok Post. https://www.bangkokpost.com/thailand/general/2836758/phu-phra-bat-named-world-heritage-site
Potts, A., & Baker, P. (2012). Does semantic tagging identify cultural change in British and American English? International Journal of Corpus Linguistics, 17(3), 295–324. https://doi.org/10.1075/ijcl.17.3.01pot
Pojanapunya, P., & Lieungnapar, A. (2017). The quality of choices determines the quantity of keywords. In Proceedings of the Doing Research in Applied Linguistics 3 and 19th English in South-East Asia Conference
(DRAL 3/19th ESEA) (pp. 115–124). School of Liberal Arts, King Mongkut’s University of Technology Thonburi. http://sola.kmutt.ac.th/dral2017/proceedings/3-Papers/115-124_The%20quality%20of%20choices%20determines%20the%20quantity%20of%20Key%20words_Punjaporn%20Pojanapunya%
and%20Angvarrah%20Lieungnapar.pdf
Scott, M. (1997). PC analysis of key words - and key key words. System, 25(2), 233–245. https://doi.org/10.1016/S0346-251X(97)00011-0
Scott, M. (2015). WordSmith tools manual (Version 6.0). Lexical Analysis Software Ltd. https://lexically.net/downloads/version6/wordsmith6.pdf
Smith, N. (2009, February 14). Collocations: “Delicious” and “tasty”: How do they behave with other words? Corpus Linguistics: A Practical Web-based Course. https://www.lancaster.ac.uk/fss/courses/ling/corpus/blue/l07_4.htm
Shamsudin, S., Sadoughvanini, S., & Zaid, Y. H. (2013). Iranian EFL learners’ collocational errors in speaking skill. Procedia–Social and Behavioral Sciences, 70, 1295–1302. https://doi.org/10.1016/j.sbspro.2013.01.190
Sukman, K., & Tangkiengsirisin, S. (2024). Lexical collocations of keywords used in business news: A corpus-based study. European Journal of English Language Teaching, 9(1), 23–37. http://dx.doi.org/10.46827/ejel.v9i1.5202
Sukman, K., Triwatwaranon, W., Munkongdee, T., & Chumnumnawin, N. (2022). A corpus-based study of lexical collocations of keywords found in online business news articles. European Journal of English Language Teaching, 7(3), 59–72. http://dx.doi.org/10.46827/ejel.v7i3.4275
Suraprajit, P. (2022). English words and collocations found on logistics magazines: A corpus-based study. Advances in Language and Literary Studies, 13(1), 41–48. https://doi.org/10.7575/aiac.alls.v.13n.1.p.41
Swales, J. M. (2008). Foreword. In D. Belcher & A. Hirvela (Eds.), The oral-literate connection: Perspectives on L2 speaking, writing, and other media interactions (pp. v–viii). University of Michigan Press ELT. https://doi.org/10.3998/mpub.231182
Szudarski, P. (2023). Collocations, corpora and language learning. Cambridge University Press. https://doi.org/10.1017/9781108992602
Thaksanan, P., & Kamsa-ard, T. (2025). Lexical collocation analysis in a corpus of research articles in physical education and sport science. The New English Teacher, 19(1), 51–70. https://assumptionjournal.au.edu/index.php/newEnglishTeacher/article/view/8149/4272
The Office of National Resources and Environmental Policy and Planning, & The Fine Arts Department of Thailand. (2022). The ancient town of Si Thep: World Heritage nomination submitted by the Government of the Kingdom of Thailand AD 2022 (Nomination text). https://whc.unesco.org/document/192676
Trinant, K., & Kijpoonphol, W. (2021). Lexical collocations in a sample corpus of tourism research articles (SCTRA). NKRAFA Journal of Humanities and Social Sciences, 9, 94–108. https://so04.tci-thaijo.org/index.php/KANNICHA/article/view/249524
Trinant, K., & Yodkamlue, B. (2019). Lexical collocations in a Sample Corpus of Nursing Research Articles (SCNRA). Suranaree Journal of Social Science, 13(1), 45–72. https://doi.org/10.55766/OBXC3400
UNESCO World Heritage Convention. (2023). Decision: 45 COM 8B.41 The ancient town of Si Thep and its associated Dvaravati monuments (Thailand). https://whc.unesco.org/archive/2023/whc23-45com-19-en.pdf
Vaughan, E., & Clancy, B. (2013). Small corpora and pragmatics. In J. Roméro-Trillo (Ed.), The yearbook of corpus linguistics and pragmatics 2013: New domains and epistemologies (Vol. 1, pp. 53–73). Springer. https://doi.org/10.1007/978-94-007-6250-3_4
Wray, A. (2002). Formulaic language and the lexicon. Cambridge University Press.