缺少数据
过度拟合
可解释性
插补(统计学)
计算机科学
机器学习
心理信息
Lasso(编程语言)
特征选择
正规化(语言学)
人工智能
回归
数据挖掘
统计
数学
梅德林
人工神经网络
万维网
法学
政治学
作者
Heather Gunn,Panteha Hayati Rezvan,M. Isabel Fernández,W. Scott Comulada
出处
期刊:Psychological Methods
[American Psychological Association]
日期:2022-02-03
卷期号:28 (2): 452-471
被引量:26
摘要
Psychological researchers often use standard linear regression to identify relevant predictors of an outcome of interest, but challenges emerge with incomplete data and growing numbers of candidate predictors. Regularization methods like the LASSO can reduce the risk of overfitting, increase model interpretability, and improve prediction in future samples; however, handling missing data when using regularization-based variable selection methods is complicated. Using listwise deletion or an ad hoc imputation strategy to deal with missing data when using regularization methods can lead to loss of precision, substantial bias, and a reduction in predictive ability. In this tutorial, we describe three approaches for fitting a LASSO when using multiple imputation to handle missing data and illustrate how to implement these approaches in practice with an applied example. We discuss implications of each approach and describe additional research that would help solidify recommendations for best practices. (PsycInfo Database Record (c) 2023 APA, all rights reserved).
科研通智能强力驱动
Strongly Powered by AbleSci AI