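Rasch model
Psychology
Inter-rater reliability
Certification
Rating scale
Facet (psychology)
Scale (ratio)
Consistency (knowledge base)
Mathematics education
Developmental psychology
Social psychology
Computer science
Physics
Personality
Quantum mechanics
Artificial intelligence
Political science
Law
Big Five personality traits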
Authors
Xun Yan, Ping-Lin Chuang
Source
Journal: Language Testing
[SAGE Publishing]
Date: 2022-03-01
Volume (Issue): 40 (1): 153-179
Citations: 3
Identifier
DOI:10.1177/02655322221074913
Abstract
This study employed a mixed-methods approach to examine how rater performance develops during a semester-long rater certification program for an English as a Second Language (ESL) writing placement test at a large US university. From 2016 to 2018, we tracked three groups of novice raters (n = 30) across four rounds in the certification program. Using many-facet Rasch modeling, rater performance was examined in terms of rater agreement, rater consistency, and rater severity. These measurement estimates of rating quality were subjected to multivariate analysis to examine whether and how rater performance changes across rounds. Rater comments on the essays were qualitatively analyzed to obtain a deeper understanding of how raters learn to use the scale over time. The quantitative results showed a non-linear, three-staged developmental pattern of rater performance for all three groups of raters. Findings of this study suggest that rater development resembles a learning curve similar to how one acquires a language and other skills. We argue that understanding the developmental pattern of rater behavior is crucial not only to understanding the effectiveness of rater training, but also to the investigation of rater cognition and development. We will also discuss the practical implications of this study in relation to the effort and expectations needed for rater training for writing assessments.