聚类分析
计量经济学
估计员
差异(会计)
标准误差
推论
统计
变量(数学)
星团(航天器)
采样(信号处理)
分数(化学)
人口
组分(热力学)
观察研究
样品(材料)
总体方差
整群抽样
计算机科学
数学
经济
人工智能
热力学
计算机视觉
程序设计语言
化学
色谱法
社会学
有机化学
滤波器(信号处理)
物理
人口学
会计
数学分析
作者
Alberto Abadie,Susan Athey,Guido W. Imbens,Jeffrey M. Wooldridge
摘要
Abstract Clustered standard errors, with clusters defined by factors such as geography, are widespread in empirical research in economics and many other disciplines. Formally, clustered standard errors adjust for the correlations induced by sampling the outcome variable from a data-generating process with unobserved cluster-level components. However, the standard econometric framework for clustering leaves important questions unanswered: (i) Why do we adjust standard errors for clustering in some ways but not others, for example, by state but not by gender, and in observational studies but not in completely randomized experiments? (ii) Is the clustered variance estimator valid if we observe a large fraction of the clusters in the population? (iii) In what settings does the choice of whether and how to cluster make a difference? We address these and other questions using a novel framework for clustered inference on average treatment effects. In addition to the common sampling component, the new framework incorporates a design component that accounts for the variability induced on the estimator by the treatment assignment mechanism. We show that, when the number of clusters in the sample is a nonnegligible fraction of the number of clusters in the population, conventional clustered standard errors can be severely inflated, and propose new variance estimators that correct for this bias.
科研通智能强力驱动
Strongly Powered by AbleSci AI