Combinatorial Use of Machine Learning and Logistic Regression for Predicting Carotid Plaque Risk Among 5.4 Million Adults With Fatty Liver Disease Receiving Health Check-Ups: Population-Based Cross-Sectional Study

医学 逻辑回归 接收机工作特性 人口 脂肪肝 随机森林 内科学 统计 疾病 机器学习 环境卫生 计算机科学 数学
作者
Yuhan Deng,Yuan Ma,Jingzhu Fu,X X Wang,Canqing Yu,Jun Lv,Sailimai Man,Bo Wang,Liming Li
出处
期刊:JMIR public health and surveillance [JMIR Publications]
卷期号:9: e47095-e47095 被引量:10
标识
DOI:10.2196/47095
摘要

Background Carotid plaque can progress into stroke, myocardial infarction, etc, which are major global causes of death. Evidence shows a significant increase in carotid plaque incidence among patients with fatty liver disease. However, unlike the high detection rate of fatty liver disease, screening for carotid plaque in the asymptomatic population is not yet prevalent due to cost-effectiveness reasons, resulting in a large number of patients with undetected carotid plaques, especially among those with fatty liver disease. Objective This study aimed to combine the advantages of machine learning (ML) and logistic regression to develop a straightforward prediction model among the population with fatty liver disease to identify individuals at risk of carotid plaque. Methods Our study included 5,420,640 participants with fatty liver from Meinian Health Care Center. We used random forest, elastic net (EN), and extreme gradient boosting ML algorithms to select important features from potential predictors. Features acknowledged by all 3 models were enrolled in logistic regression analysis to develop a carotid plaque prediction model. Model performance was evaluated based on the area under the receiver operating characteristic curve, calibration curve, Brier score, and decision curve analysis both in a randomly split internal validation data set, and an external validation data set comprising 32,682 participants from MJ Health Check-up Center. Risk cutoff points for carotid plaque were determined based on the Youden index, predicted probability distribution, and prevalence rate of the internal validation data set to classify participants into high-, intermediate-, and low-risk groups. This risk classification was further validated in the external validation data set. Results Among the participants, 26.23% (1,421,970/5,420,640) were diagnosed with carotid plaque in the development data set, and 21.64% (7074/32,682) were diagnosed in the external validation data set. A total of 6 features, including age, systolic blood pressure, low-density lipoprotein cholesterol (LDL-C), total cholesterol, fasting blood glucose, and hepatic steatosis index (HSI) were collectively selected by all 3 ML models out of 27 predictors. After eliminating the issue of collinearity between features, the logistic regression model established with the 5 independent predictors reached an area under the curve of 0.831 in the internal validation data set and 0.801 in the external validation data set, and showed good calibration capability graphically. Its predictive performance was comprehensively competitive compared with the single use of either logistic regression or ML algorithms. Optimal predicted probability cutoff points of 25% and 65% were determined for classifying individuals into low-, intermediate-, and high-risk categories for carotid plaque. Conclusions The combination of ML and logistic regression yielded a practical carotid plaque prediction model, and was of great public health implications in the early identification and risk assessment of carotid plaque among individuals with fatty liver.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
pmsl完成签到,获得积分10
3秒前
顾枫完成签到,获得积分10
3秒前
3秒前
复杂沛白完成签到,获得积分20
6秒前
复杂沛白发布了新的文献求助10
9秒前
wwtt完成签到 ,获得积分10
10秒前
低级趣味完成签到,获得积分10
12秒前
傅寒天完成签到,获得积分10
17秒前
wangji_2017完成签到,获得积分10
19秒前
华仔完成签到 ,获得积分10
22秒前
27秒前
innyjiang完成签到,获得积分10
29秒前
dahuihui完成签到,获得积分10
33秒前
34秒前
汉堡包应助louischem采纳,获得10
35秒前
顺心映之发布了新的文献求助10
39秒前
无奈晓筠完成签到,获得积分10
43秒前
坦率的丹云完成签到,获得积分10
44秒前
夜霄咕咕鸽完成签到 ,获得积分10
44秒前
666完成签到 ,获得积分10
45秒前
Drtaoao完成签到 ,获得积分10
45秒前
小珂完成签到 ,获得积分10
46秒前
用户完成签到,获得积分10
47秒前
AllRightReserved应助zhuo_mech采纳,获得10
48秒前
Pluto完成签到,获得积分10
48秒前
无极微光应助科研通管家采纳,获得20
52秒前
星辰大海应助科研通管家采纳,获得10
52秒前
52秒前
52秒前
molihuakai应助科研通管家采纳,获得10
52秒前
52秒前
大模型应助科研通管家采纳,获得10
52秒前
小蘑菇应助科研通管家采纳,获得20
52秒前
52秒前
SciGPT应助科研通管家采纳,获得10
52秒前
我是老大应助科研通管家采纳,获得10
53秒前
末位牛马完成签到,获得积分10
53秒前
慕青应助科研通管家采纳,获得10
53秒前
浮游应助科研通管家采纳,获得10
53秒前
flymove完成签到,获得积分10
54秒前
高分求助中
Signals, Systems, and Signal Processing 610
Annie Ernaux: De la perte au corps glorieux 600
Petrology and Plate Tectonics,2025 500
Direct and Iterative Linear System Solvers 400
Cardiopulmonary Bypass and Mechanical Support: Principles and Practice, Fifth Edition 400
Circular Polar Constellations Providing Continuous Single or Multiple Coverage Above a Specified Latitude 400
Burger's Medicinal Chemistry and Drug Discovery 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6760333
求助须知:如何正确求助?哪些是违规求助? 8487164
关于积分的说明 18090033
捐赠科研通 6045076
什么是DOI,文献DOI怎么找? 3010366
邀请新用户注册赠送积分活动 1987188
关于科研通互助平台的介绍 1960926