可靠性(半导体)
一致性(知识库)
数学
统计
陈
卡帕
人工智能
心理学
计算机科学
物理
生物
热力学
功率(物理)
几何学
古生物学
作者
Sandip Sinharay,Matthew Johnson
出处
期刊:Methodology of educational measurement and assessment
日期:2019-01-01
卷期号:: 359-377
被引量:12
标识
DOI:10.1007/978-3-030-05584-4_17
摘要
Gierl, Cui, and Zhou (J Educ Meas 46:293–313, 2009), Cui, Gierl, and Chang (J Educ Meas 49:19–38, 2012), Templin and Bradshaw (J Classif 30:251–275, 2013), Wang, Song, Chen, Meng, and Ding (J Educ Meas 52:457–476, 2015), Johnson and Sinharay (J Educ Meas, 55: 635–664, 2018), and Johnson and Sinharay (J Educ Behav Stat, in press) suggested reliability-like measures for the estimates obtained from a diagnostic classification model. These measures mostly express the agreement between the estimated skill and the true skill, or between estimated skills from parallel assessments. This paper provides a review of these measures and demonstrates some of them for a real data example.
科研通智能强力驱动
Strongly Powered by AbleSci AI