Interpretable machine learning models for bladder cancer overall survival prediction development and external validation via SEER database and Chinese cohort analysis

可解释性 机器学习 人工智能 判别式 逻辑回归 布里氏评分 医学 随机森林 特征选择 Lasso(编程语言) 比例危险模型 预测建模 梯度升压 决策树 人口 回归 队列 生存分析 计算机科学 降维 膀胱癌 数据挖掘 特征(语言学) 弹性网正则化 加速失效时间模型 回归分析 交叉验证 Boosting(机器学习) 统计 分类器(UML)
作者
Saimaitikari Abudoubari,Abudouresuli Tuersun,Sailidan Mutailipu,Wenbin Chen,Q Li,Mayidili Nijiati,Xiaoguang Zou
出处
期刊:Discover Oncology [Springer Nature]
标识
DOI:10.1007/s12672-026-05099-6
摘要

OBJECTIVE: We developed interpretable machine learning(ML) models to predict overall survival in bladder cancer patients. This approach aims to improve the interpretability and transparency of our modeling results. METHODS: We collected clinical and pathological information on bladder cancer patients from the SEER database, allocating it to training and validation sets in a 7:3 ratio. At the same time, we obtained an external validation cohort from Kashgar First People's Hospital in Xinjiang, China. We performed LASSO regression and Cox regression analyses to identify relevant risk factors and then combined these to develop CoxPH and six ML models: Random Survival Forest(RSF), Gradient Boosting with Component Linear(GLMboost), decision tree(dt), boosted tree(bt), DeepSurv, and neural multi-task logistic regression(NMTLR). We evaluated the predictive performance of these ML models using the consistency index (C-index), the area under the cumulative/dynamic curve (AUC) and the integrated Brier score and Kolmogorov-Smirnov(KS). For interpretability assessment, we employed three complementary methods: (1)time-dependent variable importance to quantify feature contribution across follow-up periods; (2)partial correlation survival plots to visualize individual variable effects; and (3)aggregated survival SHapley additive interpretation(SurvSHAP) plots with mean absolute deviation metrics to validate feature impact stability at both individual and population levels. RESULTS: The final ML model consists of 14 factors: the patient's age, AJCCStage, chemotherapy, Mstage, marital, Tstage, bone metastasis(BoneMets), stage, radiation, histology, liverMets, Nstage, sex. Our predictive models demonstrates significant discriminative ability, with the boosting tree model performing the best. The AUC for 1-year, 3-year, and 5-year overall survival (OS) was above 0.770 for the training set, validation set, and external validation set, respectively, with the overall Brier score consistently below 0.180. The interpretability analysis of the boosting trees model further indicated that AJCCStage, age, chemotherapy, stage, Mstage, marital were the most influential predictors via quantifiable SurvSHAP values and time-dependent importance weights, with their effects visually validated through partial correlation survival curves. CONCLUSIONS: The boosting trees model prognostic model has the best performance and can be used to predict OS in bladder cancer patients, helping physicians to accurately assess patients' overall survival rates, and providing valuable and important references for patient diagnosis, treatment, and prognosis evaluation.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
刚刚
鱼干发布了新的文献求助10
刚刚
在水一方应助泡泡采纳,获得10
2秒前
3秒前
3秒前
aalihao完成签到,获得积分10
3秒前
考拉完成签到,获得积分10
3秒前
Small丹完成签到,获得积分10
4秒前
sss发布了新的文献求助10
5秒前
7秒前
于涵艺完成签到,获得积分10
8秒前
zjw1997发布了新的文献求助10
8秒前
刘二狗发布了新的文献求助10
8秒前
爱悠悠完成签到 ,获得积分10
9秒前
luojimao完成签到,获得积分10
10秒前
10秒前
马桶盖盖子完成签到,获得积分10
11秒前
11秒前
11秒前
11秒前
Jadedew完成签到,获得积分10
11秒前
MaxZimmer发布了新的文献求助10
11秒前
过时的小土豆完成签到,获得积分20
12秒前
12秒前
英俊的铭应助eason采纳,获得10
12秒前
科研通AI6.2应助七五采纳,获得10
12秒前
fangang完成签到,获得积分10
13秒前
小尚发布了新的文献求助30
13秒前
14秒前
江流石不转完成签到 ,获得积分10
14秒前
殷勤的可冥完成签到,获得积分10
15秒前
15秒前
wik完成签到,获得积分10
15秒前
sss完成签到,获得积分10
17秒前
17秒前
徐1完成签到 ,获得积分10
17秒前
17秒前
18秒前
18秒前
张笑笑完成签到,获得积分10
18秒前
高分求助中
Adhesion Science: Principles & Practice 1234
Signals, Systems, and Signal Processing 610
Petrology and Plate Tectonics,2025 400
Burger's Medicinal Chemistry and Drug Discovery 400
New directions for experimental lessons in science teaching: Myth, Mystery, Necessity? by Emily K. da Silva Cunha Souto (Author), Flávia Lins Silva (Author) 333
Scientific experimentation in the classroom: Comparison between genetic-Socratic-exemplary teaching and workshop teaching by Ingrid Hofer (Author) 333
Programming for Chemical Engineers Using C, C++, and MATLAB 320
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6719368
求助须知:如何正确求助?哪些是违规求助? 8456338
关于积分的说明 18053601
捐赠科研通 5970363
什么是DOI,文献DOI怎么找? 2995645
邀请新用户注册赠送积分活动 1971703
关于科研通互助平台的介绍 1924783