Machine Learning Did Not Outperform Conventional Competing Risk Modeling to Predict Revision Arthroplasty

医学 机器学习 关节置换术 人工智能 外科 计算机科学
作者
Jacobien H. F. Oosterhoff,Anne de Hond,Rinne M Peters,Liza N. van Steenbergen,Juliette C. Sorel,Wierd P Zijlstra,Rudolf W. Poolman,David Ring,Paul C. Jutte,Gino M. M. J. Kerkhoffs,Hein Putter,Ewout W. Steyerberg,Job N. Doornberg
出处
期刊:Clinical Orthopaedics and Related Research [Lippincott Williams & Wilkins]
卷期号:482 (8): 1472-1482 被引量:3
标识
DOI:10.1097/corr.0000000000003018
摘要

Background Estimating the risk of revision after arthroplasty could inform patient and surgeon decision-making. However, there is a lack of well-performing prediction models assisting in this task, which may be due to current conventional modeling approaches such as traditional survivorship estimators (such as Kaplan-Meier) or competing risk estimators. Recent advances in machine learning survival analysis might improve decision support tools in this setting. Therefore, this study aimed to assess the performance of machine learning compared with that of conventional modeling to predict revision after arthroplasty. Question/purpose Does machine learning perform better than traditional regression models for estimating the risk of revision for patients undergoing hip or knee arthroplasty? Methods Eleven datasets from published studies from the Dutch Arthroplasty Register reporting on factors associated with revision or survival after partial or total knee and hip arthroplasty between 2018 and 2022 were included in our study. The 11 datasets were observational registry studies, with a sample size ranging from 3038 to 218,214 procedures. We developed a set of time-to-event models for each dataset, leading to 11 comparisons. A set of predictors (factors associated with revision surgery) was identified based on the variables that were selected in the included studies. We assessed the predictive performance of two state-of-the-art statistical time-to-event models for 1-, 2-, and 3-year follow-up: a Fine and Gray model (which models the cumulative incidence of revision) and a cause-specific Cox model (which models the hazard of revision). These were compared with a machine-learning approach (a random survival forest model, which is a decision tree–based machine-learning algorithm for time-to-event analysis). Performance was assessed according to discriminative ability (time-dependent area under the receiver operating curve), calibration (slope and intercept), and overall prediction error (scaled Brier score). Discrimination, known as the area under the receiver operating characteristic curve, measures the model’s ability to distinguish patients who achieved the outcomes from those who did not and ranges from 0.5 to 1.0, with 1.0 indicating the highest discrimination score and 0.50 the lowest. Calibration plots the predicted versus the observed probabilities; a perfect plot has an intercept of 0 and a slope of 1. The Brier score calculates a composite of discrimination and calibration, with 0 indicating perfect prediction and 1 the poorest. A scaled version of the Brier score, 1 – (model Brier score/null model Brier score), can be interpreted as the amount of overall prediction error. Results Using machine learning survivorship analysis, we found no differences between the competing risks estimator and traditional regression models for patients undergoing arthroplasty in terms of discriminative ability (patients who received a revision compared with those who did not). We found no consistent differences between the validated performance (time-dependent area under the receiver operating characteristic curve) of different modeling approaches because these values ranged between -0.04 and 0.03 across the 11 datasets (the time-dependent area under the receiver operating characteristic curve of the models across 11 datasets ranged between 0.52 to 0.68). In addition, the calibration metrics and scaled Brier scores produced comparable estimates, showing no advantage of machine learning over traditional regression models. Conclusion Machine learning did not outperform traditional regression models. Clinical Relevance Neither machine learning modeling nor traditional regression methods were sufficiently accurate in order to offer prognostic information when predicting revision arthroplasty. The benefit of these modeling approaches may be limited in this context.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
结实的芷烟完成签到,获得积分10
2秒前
李查查完成签到 ,获得积分10
3秒前
烟花应助打开天窗说亮话采纳,获得10
3秒前
酷波er应助成就的初瑶采纳,获得10
4秒前
ESLove完成签到,获得积分10
5秒前
Candy完成签到,获得积分10
5秒前
AriseChen完成签到,获得积分10
6秒前
徐反宁完成签到,获得积分10
6秒前
小马甲应助blueslow采纳,获得10
7秒前
8秒前
天一完成签到,获得积分10
9秒前
orixero应助荔枝采纳,获得10
10秒前
11秒前
13秒前
ddd666完成签到 ,获得积分10
13秒前
开朗的骁发布了新的文献求助10
13秒前
13秒前
zhzssaijj发布了新的文献求助10
15秒前
15秒前
乐观海燕完成签到,获得积分10
16秒前
16秒前
忧郁的冬亦完成签到,获得积分10
17秒前
17秒前
18秒前
马开峰发布了新的文献求助10
20秒前
blueslow发布了新的文献求助10
22秒前
Vincent完成签到 ,获得积分10
22秒前
林祥胜完成签到 ,获得积分10
24秒前
zhzssaijj完成签到,获得积分10
26秒前
YY完成签到,获得积分10
26秒前
Annieqqiu发布了新的文献求助10
28秒前
lihanzhang1047应助您晓采纳,获得10
32秒前
33秒前
CHENYINGYING完成签到 ,获得积分10
36秒前
无限的千山完成签到,获得积分10
38秒前
活泼的机器猫完成签到,获得积分10
39秒前
40秒前
刘雪完成签到 ,获得积分10
40秒前
zuducyow完成签到,获得积分10
42秒前
42秒前
高分求助中
Psychopathic Traits and Quality of Prison Life 1000
Chemistry and Physics of Carbon Volume 18 800
The formation of Australian attitudes towards China, 1918-1941 660
Signals, Systems, and Signal Processing 610
天津市智库成果选编 600
Forced degradation and stability indicating LC method for Letrozole: A stress testing guide 500
全相对论原子结构与含时波包动力学的理论研究--清华大学 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6451847
求助须知:如何正确求助?哪些是违规求助? 8263589
关于积分的说明 17608830
捐赠科研通 5516441
什么是DOI,文献DOI怎么找? 2903751
邀请新用户注册赠送积分活动 1880785
关于科研通互助平台的介绍 1722664