Development and validation of an artificial intelligence prediction model and a survival risk stratification for lung metastasis in colorectal cancer from highly imbalanced data: A multicenter retrospective study

特征选择 随机森林 单变量 接收机工作特性 决策树 结直肠癌 人工智能 医学 逻辑回归 支持向量机 机器学习 多元统计 肿瘤科 预测建模 内科学 计算机科学 癌症
作者
Weiyuan Zhang,Xu Guan,Shuai Jiao,Guiyu Wang,Xishan Wang
出处
期刊:Ejso [Elsevier]
卷期号:49 (12): 107107-107107
标识
DOI:10.1016/j.ejso.2023.107107
摘要

Background To assist clinicians with diagnosis and optimal treatment decision-making, we attempted to develop and validate an artificial intelligence prediction model for lung metastasis (LM) in colorectal cancer (CRC) patients. Methods The clinicopathological characteristics of 46037 CRC patients from the Surveillance, Epidemiology, and End Results (SEER) database and 2779 CRC patients from a multi-center external validation set were collected retrospectively. After feature selection by univariate and multivariate analyses, six machine learning (ML) models, including logistic regression, K-nearest neighbor, support vector machine, decision tree, random forest, and balanced random forest (BRF), were developed and validated for the LM prediction. In addition, stratified LM patients by risk score were utilized for survival analysis. Results Extremely low rates of LM with 2.59% and 4.50% were present in the development and validation set. As the imbalanced learning strategy, the BRF model with an Area under the receiver operating characteristic curve (AUC) of 0.874 and an average precision (AP) of 0.184 performed best compares with other models and clinical predictor. Patients with LM in the high-risk group had significantly poorer survival (P<0.001) and failed to benefit from resection (P = 0.125). Conclusions In summary, we have utilized the BRF algorithm to develop an effective, non-invasive, and practical model for predicting LM in CRC patients based on highly imbalanced datasets. In addition, we have implemented a novel approach to stratify the survival risk of CRC patients with LM based the output of the model.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
大橙子应助鼓励男孩采纳,获得10
刚刚
一条咸鱼完成签到,获得积分10
1秒前
鱿鱼炒黄瓜发布了新的文献求助200
2秒前
六六完成签到,获得积分10
2秒前
辛勤的qq完成签到 ,获得积分10
3秒前
4秒前
笨笨鲜花完成签到,获得积分10
7秒前
MY999发布了新的文献求助10
9秒前
寻道图强应助稳重的代容采纳,获得30
11秒前
苯环应助文件撤销了驳回
13秒前
13秒前
15秒前
628发布了新的文献求助10
18秒前
斯文败类应助唠叨的傲薇采纳,获得10
19秒前
赘婿应助owlhealth采纳,获得10
21秒前
21秒前
rocky15给埮埮的求助进行了留言
22秒前
娃哈哈发布了新的文献求助10
22秒前
冷酷非笑发布了新的文献求助10
24秒前
25秒前
粗犷的灵松完成签到 ,获得积分10
27秒前
移动马桶完成签到 ,获得积分10
28秒前
fanghongjian完成签到,获得积分10
28秒前
29秒前
搜集达人应助友好听蓉采纳,获得10
29秒前
渔者发布了新的文献求助10
30秒前
孤独的大灰狼完成签到 ,获得积分10
32秒前
33秒前
小马甲应助欣喜的人龙采纳,获得10
33秒前
MY999完成签到,获得积分10
34秒前
淇淇发布了新的文献求助10
37秒前
霸气咖啡豆完成签到 ,获得积分10
39秒前
甲一发布了新的文献求助10
40秒前
42秒前
11完成签到,获得积分10
44秒前
科研通AI2S应助628采纳,获得10
46秒前
li完成签到,获得积分10
48秒前
糖糖发布了新的文献求助10
48秒前
YueJiang完成签到,获得积分10
50秒前
50秒前
高分求助中
Sustainable Land Management: Strategies to Cope with the Marginalisation of Agriculture 1000
Corrosion and Oxygen Control 600
Python Programming for Linguistics and Digital Humanities: Applications for Text-Focused Fields 500
Heterocyclic Stilbene and Bibenzyl Derivatives in Liverworts: Distribution, Structures, Total Synthesis and Biological Activity 500
重庆市新能源汽车产业大数据招商指南(两链两图两池两库两平台两清单两报告) 400
Division and square root. Digit-recurrence algorithms and implementations 400
行動データの計算論モデリング 強化学習モデルを例として 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 有机化学 工程类 生物化学 纳米技术 物理 内科学 计算机科学 化学工程 复合材料 遗传学 基因 物理化学 催化作用 电极 光电子学 量子力学
热门帖子
关注 科研通微信公众号,转发送积分 2547918
求助须知:如何正确求助?哪些是违规求助? 2176366
关于积分的说明 5604231
捐赠科研通 1897190
什么是DOI,文献DOI怎么找? 946750
版权声明 565412
科研通“疑难数据库(出版商)”最低求助积分说明 503899