A dynamic machine learning model for prediction of NAFLD in a health checkup population: A longitudinal study

布里氏评分 接收机工作特性 机器学习 医学 人工智能 特征选择 逐步回归 逻辑回归 人口 统计 计算机科学 数据挖掘 数学 环境卫生
作者
Yuhan Deng,Yuan Ma,Jingzhu Fu,Xiaona Wang,Canqing Yu,Jun Lv,Sailimai Man,Bo Wang,Liming Li
出处
期刊:Heliyon [Elsevier BV]
卷期号:9 (8): e18758-e18758 被引量:7
标识
DOI:10.1016/j.heliyon.2023.e18758
摘要

Non-alcoholic fatty liver disease (NAFLD) is one of the most common liver diseases worldwide. Currently, most NAFLD prediction models are diagnostic models based on cross-sectional data, which failed to provide early identification or clarify causal relationships. We aimed to use time-series deep learning models with longitudinal health checkup records to predict the onset of NAFLD in the future, and update the model stepwise by incorporating new checkup records to achieve dynamic prediction.10,493 participants with over 6 health checkup records from Beijing MJ Health Screening Center were included to conduct a retrospective cohort study, in which the constantly updated initial 5 checkup data were incorporated stepwise to predict the risk of NAFLD at and after their sixth health checkups. A total of 33 variables were considered, consisting of demographic characteristics, medical history, lifestyle, physical examinations, and laboratory tests. L1-penalized logistic regression (LR) was used for feature selection. The long short-term memory (LSTM) algorithm was introduced for model development, and five-fold cross-validation was conducted to tune and choose optimal hyperparameters. Both internal validation and external validation were conducted, using the 20% randomly divided holdout test dataset and previously unseen data from Shanghai MJ Health Screening Center, respectively, to evaluate model performance. The evaluation metrics included area under the receiver operating characteristic curve (AUROC), sensitivity, specificity, Brier score, and decision curve. Bootstrap sampling was implemented to generate 95% confidence intervals of all the metrics. Finally, the Shapley additive explanations (SHAP) algorithm was applied in the holdout test dataset for model interpretability to obtain time-specific and sample-specific contributions of each feature.Among the 10,493 participants, 1662 (15.84%) were diagnosed with NAFLD at and after their sixth health checkups. The predictive performance of the deep learning model in the internal validation dataset improved over the incorporation of the checkups, with AUROC increasing from 0.729 (95% CI: 0.698,0.760) at baseline to 0.818 (95% CI: 0.798,0.844) when consecutive 5 checkups were included. The external validation dataset, containing 1728 participants, was used to verify the results, in which AUROC increased from 0.700 (95% CI: 0.657,0.740) with only the first checkups to 0.792 (95% CI: 0.758,0.825) with all five. The results of feature significance showed that body fat percentage, alanine transaminase (ALT), and uric acid owned the greatest impact on the outcome, time-specific, individual-specific and dynamic feature contributions were also produced for model interpretability.A dynamic prediction model was successfully established in our study, and the prediction capability kept improving with the renewal of the latest checkup records. In addition, we identified key features associated with the onset of NAFLD, making it possible to optimize the prevention and control strategies of the disease in the general population.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
可爱的函函应助孙星采纳,获得10
刚刚
Jasper应助Z_Miaom采纳,获得10
刚刚
简单若风发布了新的文献求助10
1秒前
alex_wang发布了新的文献求助10
2秒前
2秒前
朱大帅发布了新的文献求助10
2秒前
拼搏冬瓜完成签到,获得积分10
3秒前
科研通AI6.2应助佳烨采纳,获得10
3秒前
隐形曼青应助啊啊采纳,获得10
3秒前
领导范儿应助苹果金毛采纳,获得10
4秒前
4秒前
5秒前
6秒前
香蕉觅云应助科研通管家采纳,获得10
6秒前
FashionBoy应助科研通管家采纳,获得10
6秒前
科目三应助科研通管家采纳,获得10
6秒前
白小橘完成签到 ,获得积分10
6秒前
Lucas应助科研通管家采纳,获得10
6秒前
斯文败类应助科研通管家采纳,获得10
6秒前
脑洞疼应助科研通管家采纳,获得20
6秒前
6秒前
星辰大海应助科研通管家采纳,获得10
6秒前
6秒前
英俊的铭应助科研通管家采纳,获得10
6秒前
6秒前
领导范儿应助zhou采纳,获得10
7秒前
靳元逵发布了新的文献求助10
7秒前
7秒前
AilienWu发布了新的文献求助10
7秒前
9秒前
龙傲天发布了新的文献求助10
9秒前
Jasper应助洁净雨采纳,获得10
9秒前
sxmt123456789发布了新的文献求助10
9秒前
JamesPei应助伶俐的无血采纳,获得10
10秒前
Z_Miaom发布了新的文献求助10
11秒前
11秒前
12秒前
可爱的函函应助wh采纳,获得10
13秒前
哈哈哈完成签到,获得积分10
14秒前
酷波er应助淡然夏瑶采纳,获得10
14秒前
高分求助中
Adhesion Science: Principles & Practice 1234
Cold War Transcended: Australia's China Policy, 1949-1990 998
Signals, Systems, and Signal Processing 610
Fundamentals of Pharmaceutical and Biologics Regulations: A Global Perspective, Second Edition 600
Testimonial Injustice and Trust 510
久松真一著作集〈第5巻〉禅と芸術 500
Comprehensive Natural Products III 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6625839
求助须知:如何正确求助?哪些是违规求助? 8387968
关于积分的说明 17944134
捐赠科研通 5801255
什么是DOI,文献DOI怎么找? 2962790
邀请新用户注册赠送积分活动 1937956
关于科研通互助平台的介绍 1846202