因果推理
观察研究
推论
倾向得分匹配
随机森林
多级模型
计算机科学
因果模型
随机效应模型
统计推断
统计
机器学习
比例(比率)
结果(博弈论)
计量经济学
数据挖掘
人工智能
数学
荟萃分析
地理
医学
内科学
数理经济学
地图学
作者
Youmi Suk,Hyunseung Kang,Jee‐Seon Kim
标识
DOI:10.1080/00273171.2020.1808437
摘要
There is a growing interest in using machine learning (ML) methods for causal inference due to their (nearly) automatic and flexible ability to model key quantities such as the propensity score or the outcome model. Unfortunately, most ML methods for causal inference have been studied under single-level settings where all individuals are independent of each other and there is little work in using these methods with clustered or nested data, a common setting in education studies. This paper investigates using one particular ML method based on random forests known as Causal Forests to estimate treatment effects in multilevel observational data. We conduct simulation studies under different types of multilevel data, including two-level, three-level, and cross-classified data. Our simulation study shows that when the ML method is supplemented with estimated propensity scores from multilevel models that account for clustered/hierarchical structure, the modified ML method outperforms preexisting methods in a wide variety of settings. We conclude by estimating the effect of private math lessons in the Trends in International Mathematics and Science Study data, a large-scale educational assessment where students are nested within schools.
科研通智能强力驱动
Strongly Powered by AbleSci AI