What, Why, and How: An Empiricist’s Guide to Double/Debiased Machine Learning

计算机科学 机器学习 人工智能 实证研究 钥匙(锁) 面子(社会学概念) 控制(管理) 回归分析 回归 统计模型 随机森林 数据建模 在线机器学习 算法学习理论 统计学习 线性回归 变量 基于实例的学习 方案(数学) 计算学习理论
作者
Bowen Shi,Xiaojie Mao,Mochen Yang,Bo Li
出处
期刊:Information Systems Research [Institute for Operations Research and the Management Sciences]
被引量:1
标识
DOI:10.1287/isre.2024.0888
摘要

We provide an introduction to double/debiased machine learning (DML), a framework that enables effect estimation when dealing with complex, high-dimensional data. In many empirical analyses, especially in fields such as information systems, researchers face difficult choices about which control variables to include and how to model their relationships with the outcome. These modeling decisions can significantly change results, leading to uncertainty about which findings are reliable. DML offers a practical solution by combining modern machine learning with rigorous statistical inference. The idea is to let flexible ML models (such as random forests or gradient boosting) capture complex relationships among control variables while still delivering reliable estimates for the key effect of interest. DML can be applied to many familiar research designs, including standard regression with controls, instrumental variables, difference in differences, and models that incorporate ML-generated features. Empirical studies and simulations show that DML is typically more robust to misspecification than traditional regression and more reliable than earlier semiparametric methods. However, DML is not automatic—it still requires sound research design and high-quality machine learning estimation. Used thoughtfully, DML provides a powerful, flexible, and statistically grounded approach for empirical research in modern data environments.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
科研通AI6.1应助lengchitu采纳,获得10
1秒前
afujiadeluo完成签到,获得积分10
1秒前
2秒前
15286706188完成签到,获得积分10
3秒前
无花果应助1112222采纳,获得10
4秒前
4秒前
5秒前
5秒前
5秒前
哈哈哈发布了新的文献求助10
6秒前
小二郎应助flyabc采纳,获得10
6秒前
7秒前
芒果哥发布了新的文献求助10
8秒前
哈哈发布了新的文献求助10
10秒前
农大馒头发布了新的文献求助10
10秒前
Tony发布了新的文献求助10
11秒前
yookia发布了新的文献求助30
11秒前
lengchitu发布了新的文献求助10
11秒前
11秒前
11秒前
哈哈哈完成签到,获得积分10
12秒前
13秒前
李健应助XHQ采纳,获得10
13秒前
14秒前
15秒前
15秒前
1112222发布了新的文献求助10
16秒前
科研学术完成签到,获得积分10
16秒前
科研通AI6.2应助企鹅舞采纳,获得10
19秒前
MingTtty9发布了新的文献求助10
19秒前
Rider完成签到,获得积分10
19秒前
汉堡包应助芋你呀采纳,获得10
19秒前
希望天下0贩的0应助YJR采纳,获得10
20秒前
20秒前
3232发布了新的文献求助10
21秒前
不敢自称科研人完成签到,获得积分10
21秒前
1112222完成签到,获得积分10
21秒前
Rainni完成签到,获得积分10
21秒前
万能图书馆应助农大馒头采纳,获得10
21秒前
自觉的涵易完成签到 ,获得积分10
23秒前
高分求助中
Introduction to Helicopter and Tiltrotor Flight Simulation, Second Edition 2000
Overcoming Stigma and Bias in Obesity Management 800
Malcolm Fraser : a biography 700
Signals, Systems, and Signal Processing 610
Materials selection in mechanical design 500
Bounds for Statistical Estimation in Semiparametric Models 500
Forced degradation and stability indicating LC method for Letrozole: A stress testing guide 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6483017
求助须知:如何正确求助?哪些是违规求助? 8282982
关于积分的说明 17666989
捐赠科研通 5568072
什么是DOI,文献DOI怎么找? 2912296
邀请新用户注册赠送积分活动 1889526
关于科研通互助平台的介绍 1744940