分位数回归
推论
分位数
计算机科学
排列(音乐)
统计
回归
重采样
计量经济学
线性回归
回归分析
数学
人工智能
物理
声学
作者
Grace R. Lyden,David M. Vock,Emily S. Barrett,Sheela Sathyanarayana,Shanna H. Swan,Ruby H.N. Nguyen
标识
DOI:10.1177/09622802211013578
摘要
There is a growing demand for methods to determine the effects that chemical mixtures have on human health. One statistical challenge is identifying true “bad actors” from a mixture of highly correlated predictors, a setting in which standard approaches such as linear regression become highly variable. Weighted Quantile Sum regression has been proposed to address this problem, through a two-step process where mixture component weights are estimated using bootstrap aggregation in a training dataset and inference on the overall mixture effect occurs in a held-out test set. Weighted Quantile Sum regression is popular in applied papers, but the reliance on data splitting is suboptimal, and analysts who use the same data for both steps risk inflating the Type I error rate. We therefore propose a modification of Weighted Quantile Sum regression that uses a permutation test for inference, which allows for weight estimation using the entire dataset and preserves Type I error. To minimize computational burden, we propose replacing the bootstrap with L1 or L2 penalization and describe how to choose the appropriate penalty given expert knowledge about a mixture of interest. We apply our method to a national pregnancy cohort study of prenatal phthalate exposure and child health outcomes.
科研通智能强力驱动
Strongly Powered by AbleSci AI