Machine Learning Did Not Outperform Conventional Competing Risk Modeling to Predict Revision Arthroplasty

医学 机器学习 关节置换术 人工智能 外科 计算机科学
作者
Jacobien H. F. Oosterhoff,Anne de Hond,Rinne M Peters,Liza N. van Steenbergen,Juliette C. Sorel,Wierd P Zijlstra,Rudolf W. Poolman,David Ring,Paul C. Jutte,Gino M. M. J. Kerkhoffs,Hein Putter,Ewout W. Steyerberg,Job N. Doornberg
出处
期刊:Clinical Orthopaedics and Related Research [Lippincott Williams & Wilkins]
卷期号:482 (8): 1472-1482 被引量:3
标识
DOI:10.1097/corr.0000000000003018
摘要

Background Estimating the risk of revision after arthroplasty could inform patient and surgeon decision-making. However, there is a lack of well-performing prediction models assisting in this task, which may be due to current conventional modeling approaches such as traditional survivorship estimators (such as Kaplan-Meier) or competing risk estimators. Recent advances in machine learning survival analysis might improve decision support tools in this setting. Therefore, this study aimed to assess the performance of machine learning compared with that of conventional modeling to predict revision after arthroplasty. Question/purpose Does machine learning perform better than traditional regression models for estimating the risk of revision for patients undergoing hip or knee arthroplasty? Methods Eleven datasets from published studies from the Dutch Arthroplasty Register reporting on factors associated with revision or survival after partial or total knee and hip arthroplasty between 2018 and 2022 were included in our study. The 11 datasets were observational registry studies, with a sample size ranging from 3038 to 218,214 procedures. We developed a set of time-to-event models for each dataset, leading to 11 comparisons. A set of predictors (factors associated with revision surgery) was identified based on the variables that were selected in the included studies. We assessed the predictive performance of two state-of-the-art statistical time-to-event models for 1-, 2-, and 3-year follow-up: a Fine and Gray model (which models the cumulative incidence of revision) and a cause-specific Cox model (which models the hazard of revision). These were compared with a machine-learning approach (a random survival forest model, which is a decision tree–based machine-learning algorithm for time-to-event analysis). Performance was assessed according to discriminative ability (time-dependent area under the receiver operating curve), calibration (slope and intercept), and overall prediction error (scaled Brier score). Discrimination, known as the area under the receiver operating characteristic curve, measures the model’s ability to distinguish patients who achieved the outcomes from those who did not and ranges from 0.5 to 1.0, with 1.0 indicating the highest discrimination score and 0.50 the lowest. Calibration plots the predicted versus the observed probabilities; a perfect plot has an intercept of 0 and a slope of 1. The Brier score calculates a composite of discrimination and calibration, with 0 indicating perfect prediction and 1 the poorest. A scaled version of the Brier score, 1 – (model Brier score/null model Brier score), can be interpreted as the amount of overall prediction error. Results Using machine learning survivorship analysis, we found no differences between the competing risks estimator and traditional regression models for patients undergoing arthroplasty in terms of discriminative ability (patients who received a revision compared with those who did not). We found no consistent differences between the validated performance (time-dependent area under the receiver operating characteristic curve) of different modeling approaches because these values ranged between -0.04 and 0.03 across the 11 datasets (the time-dependent area under the receiver operating characteristic curve of the models across 11 datasets ranged between 0.52 to 0.68). In addition, the calibration metrics and scaled Brier scores produced comparable estimates, showing no advantage of machine learning over traditional regression models. Conclusion Machine learning did not outperform traditional regression models. Clinical Relevance Neither machine learning modeling nor traditional regression methods were sufficiently accurate in order to offer prognostic information when predicting revision arthroplasty. The benefit of these modeling approaches may be limited in this context.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
gengsumin完成签到,获得积分10
1秒前
1秒前
梅梅完成签到 ,获得积分10
2秒前
2秒前
Doraemon-2077完成签到,获得积分10
4秒前
木木余发布了新的文献求助10
5秒前
爆米花应助xiaxia采纳,获得10
6秒前
风轻完成签到 ,获得积分10
7秒前
华仔应助毅诚菌采纳,获得10
7秒前
happy发布了新的文献求助10
7秒前
8秒前
9秒前
9秒前
清脆秋玲完成签到,获得积分10
10秒前
11秒前
舒心砖头发布了新的文献求助20
11秒前
blUe完成签到,获得积分10
12秒前
13秒前
14秒前
香橙发布了新的文献求助10
14秒前
无私的亦巧完成签到,获得积分10
14秒前
所所应助Tetrahydron采纳,获得10
14秒前
木村修发布了新的文献求助10
16秒前
sjdhasj完成签到,获得积分20
16秒前
斯文败类应助徐1采纳,获得10
16秒前
奋斗人雄完成签到,获得积分0
17秒前
buqi完成签到,获得积分10
17秒前
陈佳完成签到 ,获得积分10
17秒前
刘牛子发布了新的文献求助10
17秒前
Xc完成签到,获得积分10
18秒前
感动函完成签到 ,获得积分10
18秒前
一一完成签到,获得积分10
18秒前
DO完成签到,获得积分10
18秒前
谨慎水蓝发布了新的文献求助10
19秒前
我我我完成签到,获得积分10
19秒前
19秒前
摩天轮完成签到 ,获得积分10
19秒前
19秒前
21秒前
21秒前
高分求助中
Psychopathic Traits and Quality of Prison Life 1000
Chemistry and Physics of Carbon Volume 18 800
The formation of Australian attitudes towards China, 1918-1941 660
Signals, Systems, and Signal Processing 610
天津市智库成果选编 600
Forced degradation and stability indicating LC method for Letrozole: A stress testing guide 500
全相对论原子结构与含时波包动力学的理论研究--清华大学 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6451729
求助须知:如何正确求助?哪些是违规求助? 8263452
关于积分的说明 17608388
捐赠科研通 5516377
什么是DOI,文献DOI怎么找? 2903719
邀请新用户注册赠送积分活动 1880647
关于科研通互助平台的介绍 1722664