可读性
计算机科学
分级(工程)
自然语言处理
应用语言学
语言学
人工智能
程序设计语言
工程类
哲学
土木工程
出处
期刊:IEEE Access
[Institute of Electrical and Electronics Engineers]
日期:2024-01-01
卷期号:12: 88608-88619
被引量:3
标识
DOI:10.1109/access.2024.3418844
摘要
Selecting appropriate reading materials for L2 (second language/foreign language) learners is crucial for improving their proficiency in the target language. However, the limitation of effective Chinese text readability classifiers poses a significant hurdle for students and educators in accurately gauging the precise difficulty level of texts in international Chinese education. This research conducted the readability grading of Chinese as a Second Language (CSL) texts by developing a BERT-Based CSL Readability Classifier (BCRC), which utilizes the BERT architecture specifically trained on CSL texts and incorporates multidimensional linguistic features including lexical richness, syntactic complexity and syntax patterns. The model was evaluated using a dataset of CSL texts, and the results indicate that the BCRC model performs effectively in predicting the readability levels of CSL texts. It achieves high mean accuracy of 92.9% across different readability levels, which outperforms baseline classifiers in terms of classification performance, highlighting the enhancement capabilities of multidimensional linguistic features in CSL readability classification models. This study contributes to the field of CSL education by providing a robust readability classifier as a valuable tool for educators, curriculum designers, and developers of CSL learning materials to ensure appropriate text selection based on learners’ proficiency levels.
科研通智能强力驱动
Strongly Powered by AbleSci AI