Textual Analysis of Conceptual Associations in CEFR B2 Level Texts: A Network-based Semantic Representation Approach

Main Article Content

Piyathat Siripol
Seongha Seongha
Suthathip Thirakunkovit
Aphiwit Liang-Itsara

Abstract

Background and Objectives: Lexical cohesion is vital for text comprehension, especially for learners progressing through CEFR levels. While research has focused on logical relations like synonymy and part-whole relationships, conceptual associations remain underexplored. These associations are crucial for cognitive processing and discourse comprehension but appear to be underrepresented in CEFR-B-leveled texts, which may potentially hinder learners’ preparation for C1-level demands. This study examines the patterns and prevalence of conceptual associations in B2 texts, their comparison with logical relations, and the impact of topic complexity on their distribution.


Methodology: The study was conducted in two phases, with the initial phase involving the collection of verified B2 level texts. In the second phase, automated analysis via the UCREL Semantic Analysis System (SAS) was used to categorize words into broad conceptual groups, while a manual approach based on Town’s (2021) taxonomy was used to verify their actual association. A semantic network analysis based on Yang and González-Bailón’s (2017) framework was adapted to examine concept clustering. The semantic network was automatically generated and quantified by the numbers of nodes and clusters present in B2 level texts.


Main Results: B2 texts showed an uneven use of lexical cohesion, relying more on simpler, explicit logical relationships. In all five texts examined, the use of logical relations (such as parent-child and part-whole relationships) outnumbered the use of other conceptual associations. In the five texts combined, logical relations occurred 92 times, whereas conceptual associations occurred 39 times. While this aids initial clarity, it creates a gap for learners moving to higher proficiency levels, where they need to connect ideas less explicitly via modeled B2 texts. Despite the similar totals of cohesive relationships (logical and conceptual associations) from the approximately 20 relationships in each text, B2 texts vary significantly in their use of conceptual association.  


Discussions: The dominance of logical relations in B2 texts may limit learners' development of abstract reasoning and inferencing skills, which are critical at higher proficiency levels. While logical relations provide structural clarity, they lack the deeper conceptual connections needed for C1 comprehension. Topic variations also affect conceptual richness, emphasizing the need for intentional text selection. A balanced integration of conceptual associations with logical relations could enhance engagement and better align with CEFR descriptors and expectations.


Conclusions: These findings contribute to a deeper understanding of lexical cohesion in B2 texts and emphasize the importance of designing instructional materials that bridge the gap between explicit logical structures and abstract conceptual reasoning. To improve text cohesion and support learners’ transition to higher proficiency, B2 materials should incorporate more conceptual associations, particularly entity-based relationships and abstract linkages. By strengthening these connections, reading materials can better align with CEFR descriptors and prepare learners for complex and abstract textual comprehension at the C1 level. Future research should explore effective strategies for integrating conceptual associations into B2 materials and examine their impact on learner comprehension and retention.

Article Details

Section
Research Articles

References

Akmilia, P. M., Faridi, A., & Sakhiyya, Z. (2022). The use of cohesive devices in research paper conferences to achieve text coherence. English Education Journal, 12(1), 67-75. https://doi.org/10.15294/eej.v12i1.53228

Bolognesi, M., Pilgram, R., & van den Heerik, R. (2017). Reliability in content analysis: The case of semantic feature norms classification. Behavior Research Methods, 49, 1984-2001. https://doi.org/10.3758/s13428-016-0838-6

Chanyoo, N. (2018). Cohesive devices and academic writing quality of Thai undergraduate students. Journal of Language Teaching and Research, 9(5), 994-1001. https://doi.org/10.17507/jltr.0905.13

Crossley, S. A. (2020). Linguistic features in writing quality and development: An overview. Journal of Writing Research, 11(3), 415-443. https://doi.org/10.17239/jowr-2020.11.03.01

Crossley, S. A., Allen, D. B., & McNamara, D. S. (2011). Text readability and intuitive simplification: A comparison of readability formulas. Reading in a Foreign Language, 23(1), 84-101. https://doi.org/10.64152/10125/66657

Europe, C. o. (2001). Common European framework of reference for languages: Learning, teaching, assessment. Cambridge University Press.

Fata, I. A., Komariah, E., & Alya, A. R. (2022). Assessment of readability level of reading materials in Indonesia EFL textbooks. Lingua Cultura, 16(1), 97-104. https://doi.org/10.21512/lc.v16i1.8277

Graesser, A. C., McNamara, D. S., Louwerse, M. M., & Cai, Z. (2004). Coh-Metrix: Analysis of text on cohesion and language. Behavior Research Methods, Instruments, & Computers, 36(2), 193-202. https://doi.org/10.3758/BF03195564

Griffiths, T. L., Steyvers, M., & Tenenbaum, J. B. (2007). Topics in semantic representation. Psychological Review, 114(2), 211. https://doi.org/10.1037/0033-295X.114.2.211

Halliday, M. A. K., & Hasan, R. (2014). Cohesion in english. Routledge. https://doi.org/10.4324/9781315836010

Harsch, C. (2018). How suitable is the CEFR for setting university entrance standards? Language Assessment Quarterly, 15(1), 102-108. https://doi.org/10.1080/15434303.2017.1420793

Hiebert, E. H. (2002). Standards, assessment, and text difficulty. In A. E. Farstrup & S. J. Samuels (Eds.), What research has to say about reading instruction (3rd ed., pp. 267–296). International Reading Association. https://doi.org/10.1598/0872071774.15

Hoey, M. (2012). Lexical priming: A new theory of words and language. Routledge. https://doi.org/10.4324/9780203327630

Ismail, F. K. M., & Zubairi, A. M. B. (2022). Item Analysis of a Reading Test in a Sri Lankan Context using Classical Test Theory. International Journal of Learning, Teaching and Educational Research, 21(3), 36-50. https://doi.org/10.26803/ijlter.21.3.3

McNamara, D. S., Kintsch, E., Songer, N. B., & Kintsch, W. (1996). Are good texts always better? Text coherence, background knowledge, and levels of understanding in learning from text. Cognition and Instruction, 14(1), 43. https://doi.org/10.1207/s1532690xci1401_1

Natova, I. (2021). Estimating CEFR reading comprehension text complexity. The Language Learning Journal, 49(6), 699-710. https://doi.org/10.1080/09571736.2019.1665088

North, B. (2005). The CEFR levels and descriptor scales. In Taylor, L., & Weir, C. J. (Eds.), Multilingualism and assessment: Achieving transparency, assuring quality, sustaining diversity, Proceedings of the ALTE Berlin Conference (pp. 21-66). Cambridge University Press.

Ortega, L. (2012). Interlanguage complexity. Linguistic complexity: Second language acquisition, indigenization, contact, 13, 127. https://doi.org/10.1515/9783110229226.127

Sager, J. C. (1990). A practical course in terminology processing. https://doi.org/10.1075/z.44

Sihombing, I., Sidabutar, U., & Tampubolon, S. (2024). An analysis of lexical cohesion on the students thesis at the english teaching department of HKBP Nommensen University. ALACRITY: Journal of Education, 4(3), 370-381. https://doi.org/10.52121/alacrity.v4i3.470

Siripol, P., & Towns, S. G. (2021). A network-based method for identifying and categorizing semantic relationships. In Proceedings of DRAL 4: Doing Research in Applied Linguistics (pp. 142-157). King Mongkut’s University of Technology Thonburi.

Siripol, P., Rhee, S., Thirakunkovit, S., & Liang-Itsara, A. (2025). Evaluating the consistency of automated CEFR analyzers: a study of English language text classification. International Journal of Evaluation and Research in Education, 14, 3283-3294. https://doi.org/10.11591/ijere.v14i4.33528

Sung, Y. T., Lin, W. C., Dyson, S. B., Chang, K. E., & Chen, Y. C. (2015). Leveling L2 texts through readability: Combining multilevel linguistic features with the CEFR. The Modern Language Journal, 99(2), 371-391. https://doi.org/10.1111/modl.12213

Towns, S. G. (2021). A general-purpose semantic relationship taxonomy for the classification of logical relations and conceptual associations. In Proceedings of DRAL 4: Doing Research in Applied Linguistics (pp. 131-141). King Mongkut’s University of Technology Thonburi.

Towns, S. G., & Watson Todd, R. (2017). Methods for identifying relations and associations in text. In Proceedings of DRAL 3: Doing Research in Applied Linguistics (pp. 146-155). King Mongkut’s University of Technology Thonburi.

Towns, S. G., & Watson Todd, R. (2019). Beyond proficiency: Linguistic features of exceptional writing. English Text Construction, 12(2), 265-289. https://doi.org/10.1075/etc.00029.tow

Wang, G., & Liu, Q. (2014). On the theoretical framework of the study of discourse cohesion and coherence. Studies in Literature and Language, 8(2), 32. http://dx.doi.org/10.3968/4512

Wu, L.-l., & Barsalou, L. W. (2009). Perceptual simulation in conceptual combination: Evidence from property generation. Acta Psychologica, 132(2), 173-189. https://doi.org/10.1016/j.actpsy.2009.02.002

Yan, C., de Lange, F. P., & Richter, D. (2023). Conceptual associations generate sensory predictions. Journal of Neuroscience, 43(20), 3733-3742. https://doi.org/10.1523/JNEUROSCI.1874-22.2023

Yang, S., & González-Bailón, S. (2017). Semantic networks and applications in public opinion research. The Oxford handbook of political networks, 327-353. https://doi.org/10.1093/oxfordhb/9780190228217.013.14