Machine learning-based prediction of distant metastasis risk in invasive ductal carcinoma of the breast

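Breast cancer, Metastasis, Random forest, Machine learning, Support vector machine, Logistic regression, Medicine, Artificial intelligence, Oncology, Ductal carcinoma, Python (programming language), Internal medicine, Computer science, Cancer, Operating system
Authors
Jingao Dong, R.Y. Lei, Feiyang Ma, Lu Yu, Lanlan Wang, Sheng Xu, Yunhua Hu, Jialin Sun, Wenwen Zhang, Haixia Wang, Li Zhang
Source
Journal: PLOS ONE [Public Library of Science]
Volume and issue: 20 (2): e0310410-e0310410
Identifier
DOI: 10.1371/journal.pone.0310410
Abstract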

More than 90% of breast cancer (BC) deaths are attributable to metastasis-related complications. Invasive ductal carcinoma (IDC) is the most common pathologic type of breast cancer and is highly prone to metastasizing to distant organs. BC patients who develop metastases are more likely to have a poor prognosis and poor quality of life, so it is extremely important to determine as early as possible whether distant metastasis has occurred in IDC. In this study, we developed a non-invasive breast cancer classification system for detecting cancer metastasis. We used Jupyter notebooks in an Anaconda environment to develop Python modules for text mining, data processing, and machine learning (ML). Risk prediction models were constructed with four algorithms: Random Forest, XGBoost, Logistic Regression, and support vector machine (SVM). Additionally, we developed a hybrid model based on a voting mechanism, using these four algorithms as the base models. The models were compared and evaluated using the following metrics: accuracy, precision, recall, F1-score, and area under the ROC curve (AUC). The experimental results show that the voting-based hybrid model achieved the best prediction performance (accuracy: 0.867, precision: 0.929, recall: 0.805, F1-score: 0.856, AUC: 0.94). This stable risk prediction model provides a valuable reference for doctors in assessing and diagnosing the risk of hematogenous metastasis in IDC. It also improves doctors' work efficiency and aims to give patients a better chance of survival.
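The voting mechanism and the evaluation metrics named in the abstract can be illustrated with a minimal, standard-library sketch. Everything specific here is an assumption made for illustration: the abstract does not say whether the vote is hard or soft (this sketch averages predicted probabilities, i.e. soft voting), the 0.5 decision threshold is arbitrary, and the labels and per-model probabilities are made-up toy values, not data from the study. The function names `soft_vote`, `classification_metrics`, and `roc_auc` are hypothetical helpers, not the authors' code.

```python
from typing import Dict, List, Sequence


def soft_vote(prob_by_model: Dict[str, Sequence[float]]) -> List[float]:
    """Average the base models' predicted probabilities, patient by patient."""
    columns = list(prob_by_model.values())
    n_models = len(columns)
    n_patients = len(columns[0])
    return [sum(col[i] for col in columns) / n_models for i in range(n_patients)]


def classification_metrics(y_true: Sequence[int], y_pred: Sequence[int]) -> Dict[str, float]:
    """Accuracy, precision, recall and F1 from the binary confusion counts."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = 2 * precision * recall / (precision + recall) if (precision + recall) else 0.0
    return {
        "accuracy": (tp + tn) / len(y_true),
        "precision": precision,
        "recall": recall,
        "f1": f1,
    }


def roc_auc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the chance that a random positive outscores a random negative (ties count half)."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Toy labels: 1 = distant metastasis, 0 = no metastasis (illustrative only).
y_true = [1, 1, 1, 0, 0, 0]

# Made-up probabilities standing in for the four base models' outputs.
prob_by_model = {
    "random_forest": [0.9, 0.6, 0.4, 0.3, 0.2, 0.6],
    "xgboost": [0.8, 0.7, 0.3, 0.2, 0.1, 0.5],
    "logistic_regression": [0.7, 0.5, 0.5, 0.4, 0.3, 0.4],
    "svm": [0.8, 0.6, 0.4, 0.3, 0.2, 0.3],
}

ensemble_scores = soft_vote(prob_by_model)
y_pred = [1 if s >= 0.5 else 0 for s in ensemble_scores]  # assumed threshold
metrics = classification_metrics(y_true, y_pred)
auc = roc_auc(y_true, ensemble_scores)

for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
print(f"auc: {auc:.3f}")
```

In a real pipeline the probabilities would come from fitted classifiers evaluated on held-out data. A library ensemble such as scikit-learn's `VotingClassifier` is one possible way to do the combining, although the abstract does not say which implementation the authors used.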