Unearthing Collocations of Keywords in News Coverage on the Ancient Town of Si Thep, Thailand’s Seventh UNESCO World Heritage Site: A Corpus-based Study

Main Article Content

Nateethorn Narkprom

Abstract

The ancient town of Si Thep in Phetchabun, Thailand, was declared a UNESCO World Heritage Site in 2023, boosting tourism and raising the demand for effective English communication in the area. This study’s objectives are to analyze keywords and their collocational patterns in a specialized corpus of news coverage on Si Thep (hereafter: the ST corpus), comprising 30,311 words, gathered from 76 English language news articles, published between April 2015 and February 2024. The results reveal three grammatical categories of keywords: nouns (N = 81), adjectives (N = 12), and verbs (N = 7). The keywords in the ST corpus are integrated into 172 types of lexical collocations, grammatically classified as Adjective + Noun (43.60%), Noun + Noun (37.79%), Noun + Verb (7.56%), Noun + of Noun (6.40%), Verb + Noun (4.07%), and Verb + Adverb (0.58%). Notably, the keywords “local”, “cultural”, “ancient”, “UNESCO”, and “Dvaravati” are associated with the highest number of collocates in the ST corpus. Various pairs of keywords and collocates also represent nine topics of the ancient town of Si Thep, as reported in the news, e.g., Si Thep Overwhelmed by Overtourism, Si Thep Threatened by Oil Drilling. This study may offer alternatives for teaching collocations of specialized vocabulary and developing instructional materials, reflective of authentic language use, as well as methodological guidelines for English teachers in other locations designated as UNESCO World Heritage Sites.

Article Details

How to Cite
Narkprom, N. (2026). Unearthing Collocations of Keywords in News Coverage on the Ancient Town of Si Thep, Thailand’s Seventh UNESCO World Heritage Site: A Corpus-based Study. rEFLections, 33(1), 299–320. https://doi.org/10.61508/refl.v33i1.288739
Section
Research articles

References

Anthony, L. (2023). AntConc (Version 4.2.4) [Computer Software]. Waseda University. https://www.laurenceanthony.net/software

Baker, P. (2006). Using corpora in discourse analysis. Continuum.

Baker, P., Gabrielatos, C., & McEnery, T. (2013). Discourse analysis and media attitudes: The representation of Islam in the British press. Cambridge University Press. https://doi.org/10.1017/CBO9780511920103

Bennett, G. R. (2010). Using corpora in the language learning classroom: Corpus linguistics for teachers. University of Michigan Press ELT. https://doi.org/10.3998/mpub.371534

Benson, M., Benson, E., & Ilson, R. (2010). The BBI combinatory dictionary of English: Your guide to collocations and grammar (3rd ed.). John Benjamins. https://doi.org/10.1075/z.bbi

Bin Kadri, H. K. (2023). The sentiment analysis on news reporting: The sentiment of adjectives in news articles. E-Jurnal Bahasa Dan Linguistik (e-JBL), 5(2), 68–74. https://doi.org/10.53840/ejbl.v5i2.154

Bartsch, S., & Evert, S. (2014). Towards a Firthian notion of collocation. In A. Abel & L. Lemnitzer (Eds.), Network strategies, access structures, and automatic extraction of lexicographical information (2nd Work Report of the Academic Network Internet Lexicography, OPAL–Online publizierte Arbeiten zur Linguistik, pp. 48–61). Leibniz Institute for the German Language.

Boonraksa, T., & Naisena, S. (2022). A study on English collocation errors of Thai EFL students. English Language Teaching, 15(1), 164–177. https://doi.org/10.5539/elt.v15n1p164

Boonyarattanasoontorn, P., Sirawich, S., & Chomploen, C. (2020). The effect of a collocation teaching innovation on students’ use of collocations. Journal of Studies in the English Language, 15(2), 98–129. https://so04.tci-thaijo.org/index.php/jsel/article/view/240158

Browne, C. (2014). A new general service list: The better mousetrap we’ve been looking for. Vocabulary Learning and Instruction, 3(2), 1–10. http://dx.doi.org/10.7820/vli.v03.2.2187-2759

Chung, T. M., & Nation, P. (2003). Technical vocabulary in specialized texts. Reading in a Foreign Language, 2, 103–116.

Coxhead, A. (2000). A new academic word list. TESOL Quarterly, 34(2), 213–238. https://doi.org/10.2307/3587951

Evert, S. (2005). The statistics of word co-occurrences: Word pairs and collocations [Unpublished doctoral dissertation]. Institute for Natural Language Processing, University of Stuttgart.

Evert, S. (2009). Corpora and collocations. In A. Lüdeling & M. Kytö (Eds.), Corpus linguistics: An international handbook. (pp. 1212–1248). Mouton de Gruyter. https://doi.org/10.1515/9783110213881.2.1212

Fujita, R. (2019). English for tourism and hospitality. In H. Terauchi, J. Noguchi, & A. Tajino (Eds.), Towards a new paradigm for English Language Teaching: English for Specific Purposes in Asia and beyond (1st ed., pp. 172–180). Routledge. https://doi.org/10.4324/9780429423963

Historical park grapples with crowds. (2023, September 21). Bangkok Post. https://www.bangkokpost.com/thailand/general/2652870/historical-park-grapples-with-crowds

Jaén, M. M. (2009). A corpus-driven design of a test for assessing the ESL collocational competence of university students. International Journal of English Studies, 7(2), 127–148. https://revistas.um.es/ijes/article/view/49031

Khamphairoh, T., & Tangpijaikul, M. (2012). Collocations of keywords found in insurance research articles: A corpus-based analysis. Manutsayasat Wichakan, 19(2), 166–188. https://so04.tci-thaijo.org/index.php/abc/article/view/54124

Koester, A. (2010). Building small specialised corpora. In A. O’Keeffe & M. McCarthy (Eds.), The Routledge handbook of corpus linguistics (pp. 66–79). Routledge.

Lei, L., & Liu, D. (2018). The academic English collocation list: A corpus-driven study. International Journal of Corpus Linguistics, 23(2), 216–243. https://doi.org/10.1075/ijcl.16135.lei

Lewis, M. (1997). Implementing the lexical approach: Putting theory into practice. Language Teaching Publications.

Lewis, M. (2000). Teaching collocation: Further developments in the lexical approach. Language Teaching Publications.

Lyse, G. I., & Andersen, G. (2012). Collocations and statistical analysis of N-grams. Exploring newspaper language: Using the web to create and investigate a large corpus of modern Norwegian. Studies in Corpus Linguistics, 49, 79–109. https://doi.org/10.1075/scl.49

McEnery, T., Xiao, R., & Tono, Y. (2006). Corpus-based language studies: An advanced resource book. Routledge.

Namtapi, I. (2022). Needs analysis of English for specific purposes for tourism personnel in Ayutthaya. LEARN Journal: Language Education and Acquisition Research Network, 15(1), 409–439. https://so04.tci-thaijo.org/index.php/LEARN/article/view/256730

Nation, I. S. P. (2013). Learning vocabulary in another language (2nd ed.). Cambridge University Press.

Nesselhauf, N. (2005). Collocations in a learner corpus. John Benjamins. https://doi.org/10.1075/scl.14

O’Dell, F., & McCarthy, M. (2008). English collocations in advanced use. Cambridge University Press.

Phu Phra Bat named World Heritage site. (2024, July 27). Bangkok Post. https://www.bangkokpost.com/thailand/general/2836758/phu-phra-bat-named-world-heritage-site

Potts, A., & Baker, P. (2012). Does semantic tagging identify cultural change in British and American English? International Journal of Corpus Linguistics, 17(3), 295–324. https://doi.org/10.1075/ijcl.17.3.01pot

Pojanapunya, P., & Lieungnapar, A. (2017). The quality of choices determines the quantity of keywords. In Proceedings of the Doing Research in Applied Linguistics 3 and 19th English in South-East Asia Conference

(DRAL 3/19th ESEA) (pp. 115–124). School of Liberal Arts, King Mongkut’s University of Technology Thonburi. http://sola.kmutt.ac.th/dral2017/proceedings/3-Papers/115-124_The%20quality%20of%20choices%20determines%20the%20quantity%20of%20Key%20words_Punjaporn%20Pojanapunya%

and%20Angvarrah%20Lieungnapar.pdf

Scott, M. (1997). PC analysis of key words - and key key words. System, 25(2), 233–245. https://doi.org/10.1016/S0346-251X(97)00011-0

Scott, M. (2015). WordSmith tools manual (Version 6.0). Lexical Analysis Software Ltd. https://lexically.net/downloads/version6/wordsmith6.pdf

Smith, N. (2009, February 14). Collocations: “Delicious” and “tasty”: How do they behave with other words? Corpus Linguistics: A Practical Web-based Course. https://www.lancaster.ac.uk/fss/courses/ling/corpus/blue/l07_4.htm

Shamsudin, S., Sadoughvanini, S., & Zaid, Y. H. (2013). Iranian EFL learners’ collocational errors in speaking skill. Procedia–Social and Behavioral Sciences, 70, 1295–1302. https://doi.org/10.1016/j.sbspro.2013.01.190

Sukman, K., & Tangkiengsirisin, S. (2024). Lexical collocations of keywords used in business news: A corpus-based study. European Journal of English Language Teaching, 9(1), 23–37. http://dx.doi.org/10.46827/ejel.v9i1.5202

Sukman, K., Triwatwaranon, W., Munkongdee, T., & Chumnumnawin, N. (2022). A corpus-based study of lexical collocations of keywords found in online business news articles. European Journal of English Language Teaching, 7(3), 59–72. http://dx.doi.org/10.46827/ejel.v7i3.4275

Suraprajit, P. (2022). English words and collocations found on logistics magazines: A corpus-based study. Advances in Language and Literary Studies, 13(1), 41–48. https://doi.org/10.7575/aiac.alls.v.13n.1.p.41

Swales, J. M. (2008). Foreword. In D. Belcher & A. Hirvela (Eds.), The oral-literate connection: Perspectives on L2 speaking, writing, and other media interactions (pp. v–viii). University of Michigan Press ELT. https://doi.org/10.3998/mpub.231182

Szudarski, P. (2023). Collocations, corpora and language learning. Cambridge University Press. https://doi.org/10.1017/9781108992602

Thaksanan, P., & Kamsa-ard, T. (2025). Lexical collocation analysis in a corpus of research articles in physical education and sport science. The New English Teacher, 19(1), 51–70. https://assumptionjournal.au.edu/index.php/newEnglishTeacher/article/view/8149/4272

The Office of National Resources and Environmental Policy and Planning, & The Fine Arts Department of Thailand. (2022). The ancient town of Si Thep: World Heritage nomination submitted by the Government of the Kingdom of Thailand AD 2022 (Nomination text). https://whc.unesco.org/document/192676

Trinant, K., & Kijpoonphol, W. (2021). Lexical collocations in a sample corpus of tourism research articles (SCTRA). NKRAFA Journal of Humanities and Social Sciences, 9, 94–108. https://so04.tci-thaijo.org/index.php/KANNICHA/article/view/249524

Trinant, K., & Yodkamlue, B. (2019). Lexical collocations in a Sample Corpus of Nursing Research Articles (SCNRA). Suranaree Journal of Social Science, 13(1), 45–72. https://doi.org/10.55766/OBXC3400

UNESCO World Heritage Convention. (2023). Decision: 45 COM 8B.41 The ancient town of Si Thep and its associated Dvaravati monuments (Thailand). https://whc.unesco.org/archive/2023/whc23-45com-19-en.pdf

Vaughan, E., & Clancy, B. (2013). Small corpora and pragmatics. In J. Roméro-Trillo (Ed.), The yearbook of corpus linguistics and pragmatics 2013: New domains and epistemologies (Vol. 1, pp. 53–73). Springer. https://doi.org/10.1007/978-94-007-6250-3_4

Wray, A. (2002). Formulaic language and the lexicon. Cambridge University Press.