自举(财务)
统计
交叉验证
标准误差
均方预测误差
计算机科学
人口
最小二乘函数近似
普通最小二乘法
数学
计量经济学
社会学
估计员
人口学
作者
Stephen Bates,Trevor Hastie,Robert Tibshirani
标识
DOI:10.1080/01621459.2023.2197686
摘要
Cross-validation is a widely-used technique to estimate prediction error, but its behavior is complex and not fully understood. Ideally, one would like to think that cross-validation estimates the prediction error for the model at hand, fit to the training data. We prove that this is not the case for the linear model fit by ordinary least squares; rather it estimates the average prediction error of models fit on other unseen training sets drawn from the same population. We further show that this phenomenon occurs for most popular estimates of prediction error, including data splitting, bootstrapping, and Mallow's
科研通智能强力驱动
Strongly Powered by AbleSci AI