Characterisation of cardiovascular disease (CVD) incidence and machine learning risk prediction in middle-aged and elderly populations: data from the China health and retirement longitudinal study (CHARLS)

医学 生物统计学 纵向研究 公共卫生 流行病学 中国 入射(几何) 疾病 纵向数据 老年学 环境卫生 心血管健康 人口学 内科学 病理 物理 光学 社会学 法学 政治学
作者
Qing Huang,Zi-Hao Jiang,Bo Shi,Juan Meng,Shu Li,Fuyong Hu,Jing Mi
出处
期刊:BMC Public Health [BioMed Central]
卷期号:25 (1) 被引量:7
标识
DOI:10.1186/s12889-025-21609-7
摘要

Due to the ageing population and evolving lifestyles occurring in China, middle-aged and elderly populations have become high-risk groups for cardiovascular disease (CVD). The aim of this study was to analyse the incidence characteristics of CVD in these populations and develop a prediction model by using data from the China Health and Retirement Longitudinal Study (CHARLS). We used follow-up data from the CHARLS to analyse CVD incidence in the Chinese middle-aged and elderly population over a time span of 9 years. Five machine learning (ML) algorithms were employed for risk prediction. Data preprocessing included missing value imputation via random forest. Feature selection was performed using the Least Absolute Shrinkage and Selection Operator (Lasso CV) method with cross-validation prior to model training. The application of the synthetic minority over-sampling technique (SMOTE) to address class imbalance. Model performance was evaluated via analyses including the area under the ROC curve (AUC), precision, recall, F1 score, and SHAP plots for interpretability. In accordance with the exclusion criteria, 12,580, 12,061, 11,545, and 11,619 participants were enrolled in four follow-up rounds. The cumulative incidence (CI) of CVD at 2, 4, 7, and 9 years was 2.846%, 8.971%, 17.869% and 20.518%,, respectively. Significant differences in CVD incidence were observed across gender, age, ethnicity, and region, with higher rates observed in females and in the northeast region. Ultimately, 8,080 participants and 24 features were analysed for CVD risk prediction. Five ML models were built based on these features. Although the LGB model achieves an AUC of 0.818, indicating strong overall performance, its F1 score and recall rate are relatively low, at 0.509 and 43.1%, respectively. Shapley additive explanations (SHAP) analyses revealed the importance of key features, such as night sleep duration, TG levels, and waist circumference, in predicting outcomes, and highlighted the nonlinear relationships between these features and CVD risk. Gender, age, ethnicity, and region are significant factors influencing CVD incidence. Although the LGB model demonstrates good overall performance, its low F1 score and recall rate reveal limitations in identifying high-risk cardiovascular disease patients.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
molihuakai应助123456采纳,获得10
刚刚
跳跃的大碗完成签到,获得积分10
1秒前
WSP发布了新的文献求助10
1秒前
1秒前
小刘同学发布了新的文献求助30
2秒前
陈老派发布了新的文献求助10
3秒前
3秒前
大头发布了新的文献求助10
3秒前
微调完成签到,获得积分10
3秒前
风筝与亭完成签到 ,获得积分10
3秒前
繁荣的又亦完成签到,获得积分10
4秒前
4秒前
Xiaoming发布了新的文献求助10
4秒前
4秒前
4秒前
4秒前
orixero应助微笑的剑鬼采纳,获得10
5秒前
fffbl发布了新的文献求助10
5秒前
kanglan发布了新的文献求助10
5秒前
6秒前
碧蓝贞发布了新的文献求助10
6秒前
LiPengpeng完成签到,获得积分10
7秒前
科研通AI2S应助潇洒的访冬采纳,获得10
7秒前
7秒前
宝宝熊的熊宝宝完成签到,获得积分10
7秒前
小鸣发布了新的文献求助20
7秒前
8秒前
百事都可乐完成签到 ,获得积分10
9秒前
谦让靖儿发布了新的文献求助10
9秒前
123456完成签到,获得积分10
9秒前
xxxidora发布了新的文献求助10
9秒前
pri发布了新的文献求助10
9秒前
11秒前
星辰大海应助张zhang采纳,获得10
11秒前
囡囡儿完成签到,获得积分10
12秒前
大模型应助张艺兴的咩咩采纳,获得10
12秒前
12秒前
康康发布了新的文献求助10
12秒前
13秒前
科研通AI6.4应助畅快乐双采纳,获得10
13秒前
高分求助中
Annie Ernaux: De la perte au corps glorieux 600
类器官构建与应用:从基础到前沿 500
Petrology and Plate Tectonics,2025 500
Optical Coating Design with the Essential Macleod 400
A revision of Limenitis helmanni and its related species (Nymphalidae) from Central and South China 400
Moore's Clinically Oriented Anatomy 10th Edition 400
Direct and Iterative Linear System Solvers 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6789883
求助须知:如何正确求助?哪些是违规求助? 8511195
关于积分的说明 18125621
捐赠科研通 6099326
什么是DOI,文献DOI怎么找? 3021833
邀请新用户注册赠送积分活动 1998584
关于科研通互助平台的介绍 1987049