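Missing data
Medicine
Health records
Electronic health records
Predictive modeling
Data mining
Data collection
Data science
Health care
Computer science
Statistics
Machine learning
Mathematics
Economics
Economic growth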
Authors
Shanshan Lin,Rolf H. H. Groenwold,Hemalkumar B. Mehta,Ji Soo Kim,Jodi B Segal
Identifier
DOI:10.7326/annals-24-01516
Abstract
Electronic health record (EHR) data are increasingly used to develop prediction models that guide clinical decision making at the point of care. These include algorithms that use high-frequency data, like in sepsis prediction, as well as simpler equations, such as the Pooled Cohort Equations for cardiovascular outcome prediction. Although EHR data used in prediction models are often highly granular and more current than other data, there is systematic and nonsystematic missingness in EHR data as there is with most data. Despite growing use for clinical decisions, algorithms implemented in EHRs are mostly unregulated and are often opaque to the user. Guidelines about the development, validation, implementation, and reporting on clinical prediction models are sparse in their recommendations regarding missing data. This article characterizes missingness in EHR data, summarizes methods for attending to missing data when developing prediction models, makes recommendations about validation and implementation of models in practice when data are missing, and identifies research needs in this field.