差异项目功能
项目反应理论
两种选择强迫选择
统计
计量经济学
样本量测定
心理学
样品(材料)
瓦尔德试验
度量(数据仓库)
统计能力
项目分析
心理测量学
统计假设检验
计算机科学
数学
数据挖掘
化学
色谱法
作者
Philseok Lee,Seang‐Hwane Joo,Stephen Stark
标识
DOI:10.1177/1094428120959822
摘要
Although modern item response theory (IRT) methods of test construction and scoring have overcome ipsativity problems historically associated with multidimensional forced choice (MFC) formats, there has been little research on MFC differential item functioning (DIF) detection, where item refers to a block, or group, of statements presented for an examinee’s consideration. This research investigated DIF detection with three-alternative MFC items based on the Thurstonian IRT (TIRT) model, using omnibus Wald tests on loadings and thresholds. We examined constrained and free baseline model comparisons strategies with different types and magnitudes of DIF, latent trait correlations, sample sizes, and levels of impact in an extensive Monte Carlo study. Results indicated the free baseline strategy was highly effective in detecting DIF, with power approaching 1.0 in the large sample size and large magnitude of DIF conditions, and similar effectiveness in the impact and no-impact conditions. This research also included an empirical example to demonstrate the viability of the best performing method with real examinees and showed how a DIF and a DTF effect size measure can be used to assess the practical significance of MFC DIF findings.
科研通智能强力驱动
Strongly Powered by AbleSci AI