A machine learning-based data mining in medical examination data: a biological features-based biological age prediction model

过度拟合 机器学习 自编码 人工智能 理论(学习稳定性) 计算机科学 疾病 人口 深度学习 医学 人工神经网络 病理 环境卫生
作者
Qing Yang,Sunan Gao,Junfen Lin,Ke Lyu,Zexu Wu,Yuhao Chen,Yinwei Qiu,Yanrong Zhao,Wei Wang,Tianxiang Lin,Huiyun Pan,Ming Chen
出处
期刊:BMC Bioinformatics [BioMed Central]
卷期号:23 (1): 411-411 被引量:17
标识
DOI:10.1186/s12859-022-04966-7
摘要

Abstract Background Biological age (BA) has been recognized as a more accurate indicator of aging than chronological age (CA). However, the current limitations include: insufficient attention to the incompleteness of medical data for constructing BA; Lack of machine learning-based BA (ML-BA) on the Chinese population; Neglect of the influence of model overfitting degree on the stability of the association results. Methods and results Based on the medical examination data of the Chinese population (45–90 years), we first evaluated the most suitable missing interpolation method, then constructed 14 ML-BAs based on biomarkers, and finally explored the associations between ML-BAs and health statuses (healthy risk indicators and disease). We found that round-robin linear regression interpolation performed best, while AutoEncoder showed the highest interpolation stability. We further illustrated the potential overfitting problem in ML-BAs, which affected the stability of ML-Bas’ associations with health statuses. We then proposed a composite ML-BA based on the Stacking method with a simple meta-model (STK-BA), which overcame the overfitting problem, and associated more strongly with CA (r = 0.66, P < 0.001), healthy risk indicators, disease counts, and six types of disease. Conclusion We provided an improved aging measurement method for middle-aged and elderly groups in China, which can more stably capture aging characteristics other than CA, supporting the emerging application potential of machine learning in aging research.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
厄尔尼诺完成签到,获得积分10
刚刚
中级中级完成签到,获得积分10
刚刚
离个大谱完成签到,获得积分10
刚刚
萱1988完成签到,获得积分10
刚刚
小二郎应助kiki采纳,获得10
刚刚
刚刚
做实验太菜完成签到,获得积分10
1秒前
栀染完成签到,获得积分10
1秒前
112233完成签到,获得积分10
1秒前
1秒前
隐形曼青应助文艺冷梅采纳,获得10
1秒前
小桔啊完成签到 ,获得积分10
2秒前
SweetNanchu完成签到,获得积分10
3秒前
jojo发布了新的文献求助10
3秒前
陈小金完成签到,获得积分10
3秒前
4秒前
虚心中蓝完成签到,获得积分10
4秒前
周辰完成签到,获得积分10
4秒前
SciGPT应助胖墩儿驾到采纳,获得10
4秒前
林药师完成签到 ,获得积分10
4秒前
独闯江湖完成签到 ,获得积分10
6秒前
6秒前
妖孽宇完成签到,获得积分10
6秒前
lpttfc完成签到,获得积分10
6秒前
OsamaKareem应助oyjq采纳,获得30
6秒前
6秒前
森莫莓完成签到,获得积分10
7秒前
珠珠完成签到 ,获得积分10
8秒前
8秒前
香妃完成签到,获得积分10
9秒前
9秒前
黄文洁完成签到,获得积分10
10秒前
妖孽宇发布了新的文献求助10
10秒前
蓝星花完成签到 ,获得积分10
11秒前
11秒前
11秒前
青桔柠檬完成签到 ,获得积分10
11秒前
12秒前
老六完成签到,获得积分10
13秒前
xbj笑哈哈完成签到 ,获得积分10
13秒前
高分求助中
Overcoming Stigma and Bias in Obesity Management 800
Malcolm Fraser : a biography 700
Signals, Systems, and Signal Processing 610
Bounds for Statistical Estimation in Semiparametric Models 500
Climate change and sports: Statistics report on climate change and sports 500
Forced degradation and stability indicating LC method for Letrozole: A stress testing guide 500
Ideology and Meaning-Making under the Putin Regime 450
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6474118
求助须知:如何正确求助?哪些是违规求助? 8276997
关于积分的说明 17647720
捐赠科研通 5554680
什么是DOI,文献DOI怎么找? 2909886
邀请新用户注册赠送积分活动 1886660
关于科研通互助平台的介绍 1739204