A Case Study of AI Agent–Assisted Correction of Intermediate-Level Chinese Writing
คำสำคัญ:
International Chinese Education, Chinese L2 Writing, Automated Scoring, AI Agentบทคัดย่อ
This study aimed to investigate the effectiveness and applicability of an Artificial Intelligence (AI) Agent in assessing intermediate-level Chinese writing produced by learners of Chinese as a second language (L2). The research addressed two primary objectives: first, to examine the degree of consistency between AI Agent scoring and manual scoring conducted by professional teachers, and second, to identify advantages and limitations of using an AI Agent for essay correction in international Chinese language education. The research population consisted of international students in Chinese language programs at Central China Normal University (CCNU). Purposive sampling was employed to collect 31 final examination essays from students taking the Intermediate Chinese Writing II course, comprising 16 essays from the first semester and 15 from the second. All participants passed the HSK Level 4 and represented diverse cultural and linguistic backgrounds. The primary instrument was an AI Agent named “Essay Grading Assistant,” developed on the ByteDance Coze platform using the DeepSeek-V3.2 large language model. This system integrated Optical Character Recognition (OCR), Chinese word segmentation, and document analysis plugins to process handwritten examination scripts. Additionally, two university instructors with over five years of experience independently evaluated the essays using the same analytical scoring rubric.
Data analysis involved descriptive statistics to examine score distributions, Spearman’s rank correlation coefficient to determine the relationship between AI-generated scores and human ratings, and the Intraclass Correlation Coefficient (ICC) using a two-way random effects model with absolute agreement to evaluate inter-rater consistency. The findings revealed a very strong positive correlation between the AI Agent scores and the teachers’ scores, with correlation coefficients of 0.917 and 0.902, respectively (p < .001). The ICC value reached 0.964, indicating excellent agreement. The AI Agent significantly improved grading efficiency, requiring approximately 15 seconds per essay, while enhancing scoring objectivity and consistency in evaluating surface-level linguistic features. However, several limitations were identified, particularly in deep semantic interpretation, pragmatic understanding, and the recognition of unconventional learner expressions. The study concluded that AI Agents possess strong potential as supportive tools for international Chinese writing assessment; nevertheless, a human–AI collaborative scoring approach remains the most appropriate model for ensuring both efficiency and pedagogical quality in second-language writing evaluation.
เอกสารอ้างอิง
Cai, J. (2024). Zhuliu Meiti AI Agent-xia Yidai Zhineng Quanita Shengchan Chuanshou Tixi De Maodian [Mainstream Media AI Agent: The Anchor of the Next-Generation Intelligent All-Media Production and Communication System]. Media Science and Technology of China, 11, 13-17. https://doi.org/10.19483/j.cnki.11-4653/n.2024.11.002
Huang, Z., Xie, J, & Xun, E. (2014). HSK Zidong Zuowen Pingfen De Tezheng Xuanqu Yanjiu [Study of Feature Selection in HSK Automated Essay Scoring]. Computer Engineering and Applications, 50(06), 118-122, 126. https://bit.ly/4vkqQoX
Liu, L. (2024). Rengong Zhineng Shidai Jiaoshi Xinxi Suyang Tisheng Lujing Tansuo [Exploring the Path to Improving Teachers' Information Literacy in the Era of Artificial Intelligence]. China New Telecommunications, 26(22), 150-152. https://bit.ly/4uKlPoN
Ma, R. (2026). Da Yuyan Moxing Funeng Zhongwen Eryu Zuowen Zhineng Pinggai De Yingyong Yu Pingjie [Application and Efficacy Assessment for Large Language Models Enabled Intelligent Evaluation & Correction of L2 Chinese Writing]. Language Teaching and Linguistic Studies, 47(01), 13-23. https://bit.ly/3SoO26W
Ma, R., & Xu, J. (2024). Shuzhi Shidai Guoji Zhongwen Xiezuo Zhihui Jiaoxue Ziyuan Chuangxin Yanfa [Innovative Development of International Chinese Writing Smart Teaching Resources in the Digital and Intelligent Era]. Journal of International Chinese Teaching, 11(01), 13-23. https://bit.ly/44sg9EV
Ren, C. (2004). HSK Zuowen Pingfen Keguanhua Tantao [Exploratory Research on Objective Scoring of HSK Composition]. Chinese Language Learning, 24(06), 58-67. https://bit.ly/4a7YGVS
Wang, X. (2022). Guoji Zhongwen Zuowen Zidong Pigai Ji Fankui Yanjiu [Research on Automatic Scoring and Feedback of Essays in International Chinese Language Education] [Unpublished master's thesis]. Beijing University of Posts and Telecommunications. https://bit.ly/4gy3ibF
Wu, J., Zhou, W., & Lu, D. (2019). Hanyu Muyu Zhe Hanyu Eryu Xiezuo Zhiliang Pinggu Yanjiu-Yi Yuyan Tezheng He Neirong Zhiliang Wei Celiang Weidu [Assessing Chinese L2 Writing Quality on Basis of Language Features and Content Quality]. Chinese Teaching in the World, 33(01), 130-144. https://link.oversea.cnki.net/doi/10.13724/j.cnki.ctiw.2019.01.013
Xiang, J., & Huang, W. (2025). Jiyu Dayuyan Moxing Zhinengti De Zuowen Zidong Pingfen Yanjiu [A Study on Automated Essay Scoring Using LLMs-based Agents]. Corpus Linguistics, 12(01), 34-49. https://bit.ly/3QyIuGk
Xu, C., Chen, D., Wu, Q. & Xie, Z. (2015). Hanyu Zuowei Di Er Yuyan Zuowen Zidong Pingfen Yanjiu Chutan [On Automated Essay Scoring for Learners of Chinese as a Second Language]. International Chinese Teaching and Research, 2(01), 83-89. https://bit.ly/4eTP43E
Zhang, H., Li, D., An, J. & Liu, Y. (2020). Mianxiang Hanyu Xuexi Zhe De Zuowen Zidong Pingfen Xitong Sheji Yu Shixian [Design and Implementation of an Automatic Essay Scoring System for Chinese Learners]. Electronic Technology & Software Engineering, 21, 127-130. https://bit.ly/3ScDbNw
ดาวน์โหลด
เผยแพร่แล้ว
รูปแบบการอ้างอิง
ฉบับ
ประเภทบทความ
สัญญาอนุญาต
ลิขสิทธิ์ (c) 2026 สิกขา วารสารศึกษาศาสตร์

อนุญาตภายใต้เงื่อนไข Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.