Enhancing Suicide Attempt Risk Prediction Models with Temporal Clinical Note Features

计算机科学医学诊断人工智能机器学习随机森林集合（抽象数据类型）数据集数据挖掘标识符预测建模自然语言处理医学病理程序设计语言

作者

Kevin R. Krause,Sharon E. Davis,Zhijun Yin,Katherine Musacchio Schafer,S. Trent Rosenbloom,Colin G. Walsh

出处

期刊：Applied Clinical Informatics [Thieme Medical Publishers (Germany)]
日期：2024-09-09 卷期号：15 (05): 1107-1120 被引量：1

链接

nih.govdoi.org

标识

DOI：10.1055/a-2411-5796

摘要

Abstract Objectives The objective of this study was to investigate the impact of enhancing a structured-data-based suicide attempt risk prediction model with temporal Concept Unique Identifiers (CUIs) derived from clinical notes. We aimed to examine how different temporal schemes, model types, and prediction ranges influenced the model's predictive performance. This research sought to improve our understanding of how the integration of temporal information and clinical variable transformation could enhance model predictions. Methods We identified modeling targets using diagnostic codes for suicide attempts within 30, 90, or 365 days following a temporally grouped visit cluster. Structured data included medications, diagnoses, procedures, and demographics, whereas unstructured data consisted of terms extracted with regular expressions from clinical notes. We compared models trained only on structured data (controls) to hybrid models trained on both structured and unstructured data. We used two temporalization schemes for clinical notes: fixed 90-day windows and flexible epochs. We trained and assessed random forests and hybrid long short-term memory (LSTM) neural networks using area under the precision recall curve (AUPRC) and area under the receiver operating characteristic, with additional evaluation of sensitivity and positive predictive value at 95% specificity. Results The training set included 2,364,183 visit clusters with 2,009 30-day suicide attempts, and the testing set contained 471,936 visit clusters with 480 suicide attempts. Models trained with temporal CUIs outperformed those trained with only structured data. The window-temporalized LSTM model achieved the highest AUPRC (0.056 ± 0.013) for the 30-day prediction range. Hybrid models generally showed better performance compared with controls across most metrics. Conclusion This study demonstrated that incorporating electronic health record-derived clinical note features enhanced suicide attempt risk prediction models, particularly with window-temporalized LSTM models. Our results underscored the critical value of unstructured data in suicidality prediction, aligning with previous findings. Future research should focus on integrating more sophisticated methods to continue improving prediction accuracy, which will enhance the effectiveness of future intervention.

求助该文献

最长约 10秒，即可获得该文献文件

Enhancing Suicide Attempt Risk Prediction Models with Temporal Clinical Note Features

今日热心研友