Clinical outcome prediction using observational supervision with electronic health records and audit logs

审计观察研究临床审核逻辑回归医学电子健康档案健康档案机器学习预测建模计算机科学人工智能支持向量机临床决策支持系统随机森林急诊医学数据挖掘医疗保健内科学决策支持系统管理经济经济增长

作者

Nandita Bhaskhar,Wui Ip,Jonathan Chen,Daniel L. Rubin

出处

期刊：Journal of Biomedical Informatics [Elsevier]
日期：2023-11-01 卷期号：147: 104522-104522

链接

nih.govdoi.org

标识

DOI：10.1016/j.jbi.2023.104522

摘要

Audit logs in electronic health record (EHR) systems capture interactions of providers with clinical data. We determine if machine learning (ML) models trained using audit logs in conjunction with clinical data ("observational supervision") outperform ML models trained using clinical data alone in clinical outcome prediction tasks, and whether they are more robust to temporal distribution shifts in the data.Using clinical and audit log data from Stanford Healthcare, we trained and evaluated various ML models including logistic regression, support vector machine (SVM) classifiers, neural networks, random forests, and gradient boosted machines (GBMs) on clinical EHR data, with and without audit logs for two clinical outcome prediction tasks: major adverse kidney events within 120 days of ICU admission (MAKE-120) in acute kidney injury (AKI) patients and 30-day readmission in acute stroke patients. We further tested the best performing models using patient data acquired during different time-intervals to evaluate the impact of temporal distribution shifts on model performance.Performance generally improved for all models when trained with clinical EHR data and audit log data compared with those trained with only clinical EHR data, with GBMs tending to have the overall best performance. GBMs trained with clinical EHR data and audit logs outperformed GBMs trained without audit logs in both clinical outcome prediction tasks: AUROC 0.88 (95% CI: 0.85-0.91) vs. 0.79 (95% CI: 0.77-0.81), respectively, for MAKE-120 prediction in AKI patients, and AUROC 0.74 (95% CI: 0.71-0.77) vs. 0.63 (95% CI: 0.62-0.64), respectively, for 30-day readmission prediction in acute stroke patients. The performance of GBM models trained using audit log and clinical data degraded less in later time-intervals than models trained using only clinical data.Observational supervision with audit logs improved the performance of ML models trained to predict important clinical outcomes in patients with AKI and acute stroke, and improved robustness to temporal distribution shifts.

求助该文献

Clinical outcome prediction using observational supervision with electronic health records and audit logs

今日热心研友