Novel machine learning models outperform risk scores in predicting hepatocellular carcinoma in patients with chronic viral hepatitis

随机森林 肝细胞癌 医学 队列 逻辑回归 决策树 接收机工作特性 内科学 机器学习 回顾性队列研究 人工智能 阿达布思 病毒性肝炎 肝病学 肿瘤科 支持向量机 计算机科学
作者
Grace Lai-Hung Wong,Vicki Wing-Ki Hui,Qingxiong Tan,Jingwen Xu,Hye Sun Lee,Terry Cheuk-Fung Yip,Baoyao Yang,Yee-Kit Tse,Chong Yin,Fei Lyu,Jimmy S. M. Lai,Grace Lui,Henry Lik-Yuen Chan,Pongchi Yuen,Vincent Wai-Sun Wong
出处
期刊:JHEP reports [Elsevier]
卷期号:4 (3): 100441-100441 被引量:4
标识
DOI:10.1016/j.jhepr.2022.100441
摘要

Accurate hepatocellular carcinoma (HCC) risk prediction facilitates appropriate surveillance strategy and reduces cancer mortality. We aimed to derive and validate novel machine learning models to predict HCC in a territory-wide cohort of patients with chronic viral hepatitis (CVH) using data from the Hospital Authority Data Collaboration Lab (HADCL).This was a territory-wide, retrospective, observational, cohort study of patients with CVH in Hong Kong in 2000-2018 identified from HADCL based on viral markers, diagnosis codes, and antiviral treatment for chronic hepatitis B and/or C. The cohort was randomly split into training and validation cohorts in a 7:3 ratio. Five popular machine learning methods, namely, logistic regression, ridge regression, AdaBoost, decision tree, and random forest, were performed and compared to find the best prediction model.A total of 124,006 patients with CVH with complete data were included to build the models. In the training cohort (n = 86,804; 6,821 HCC), ridge regression (area under the receiver operating characteristic curve [AUROC] 0.842), decision tree (0.952), and random forest (0.992) performed the best. In the validation cohort (n = 37,202; 2,875 HCC), ridge regression (AUROC 0.844) and random forest (0.837) maintained their accuracy, which was significantly higher than those of HCC risk scores: CU-HCC (0.672), GAG-HCC (0.745), REACH-B (0.671), PAGE-B (0.748), and REAL-B (0.712) scores. The low cut-off (0.07) of HCC ridge score (HCC-RS) achieved 90.0% sensitivity and 98.6% negative predictive value (NPV) in the validation cohort. The high cut-off (0.15) of HCC-RS achieved high specificity (90.0%) and NPV (95.6%); 31.1% of patients remained indeterminate.HCC-RS from the ridge regression machine learning model accurately predicted HCC in patients with CVH. These machine learning models may be developed as built-in functional keys or calculators in electronic health systems to reduce cancer mortality.Novel machine learning models generated accurate risk scores for hepatocellular carcinoma (HCC) in patients with chronic viral hepatitis. HCC ridge score was consistently more accurate than existing HCC risk scores. These models may be incorporated into electronic medical health systems to develop appropriate cancer surveillance strategies and reduce cancer death.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
4秒前
朱诗源完成签到 ,获得积分10
4秒前
molingyue完成签到 ,获得积分10
9秒前
科研通AI2S应助默默小海豚采纳,获得10
10秒前
科研汪完成签到,获得积分10
13秒前
牛人完成签到,获得积分10
13秒前
莓莓MM完成签到 ,获得积分10
13秒前
小爽完成签到,获得积分10
15秒前
洋洋爱吃枣完成签到 ,获得积分10
24秒前
NULI完成签到 ,获得积分10
24秒前
耙耙柑完成签到 ,获得积分10
29秒前
忐忑的雪糕完成签到 ,获得积分10
31秒前
行云流水完成签到,获得积分10
31秒前
kingmp2完成签到 ,获得积分10
32秒前
luffy189完成签到 ,获得积分10
44秒前
jeanna完成签到 ,获得积分10
49秒前
追梦完成签到 ,获得积分10
51秒前
唐云炳完成签到 ,获得积分10
53秒前
victory_liu完成签到,获得积分10
1分钟前
叶夜南完成签到 ,获得积分10
1分钟前
jessie完成签到 ,获得积分10
1分钟前
高海龙完成签到 ,获得积分10
1分钟前
迢迢万里完成签到 ,获得积分10
1分钟前
楚襄谷完成签到 ,获得积分10
1分钟前
2123121321321完成签到,获得积分10
1分钟前
隐形曼青应助崔迎松采纳,获得30
1分钟前
佳期如梦完成签到 ,获得积分10
1分钟前
1分钟前
彩色的芷容完成签到,获得积分10
1分钟前
崔迎松发布了新的文献求助30
1分钟前
蓝意完成签到,获得积分10
1分钟前
fancy完成签到 ,获得积分10
1分钟前
TK完成签到 ,获得积分10
2分钟前
benben应助科研通管家采纳,获得10
2分钟前
扶光完成签到 ,获得积分10
2分钟前
2分钟前
黄百川发布了新的文献求助10
2分钟前
ycd完成签到,获得积分10
2分钟前
hakuna_matata完成签到 ,获得积分10
2分钟前
大气伯云完成签到 ,获得积分10
2分钟前
高分求助中
Thermodynamic data for steelmaking 3000
Teaching Social and Emotional Learning in Physical Education 900
藍からはじまる蛍光性トリプタンスリン研究 400
Cardiology: Board and Certification Review 400
[Lambert-Eaton syndrome without calcium channel autoantibodies] 340
NEW VALUES OF SOLUBILITY PARAMETERS FROM VAPOR PRESSURE DATA 300
Transformerboard III 300
热门求助领域 (近24小时)
化学 材料科学 医学 生物 有机化学 工程类 生物化学 纳米技术 物理 内科学 计算机科学 化学工程 复合材料 遗传学 基因 物理化学 催化作用 电极 光电子学 量子力学
热门帖子
关注 科研通微信公众号,转发送积分 2361899
求助须知:如何正确求助?哪些是违规求助? 2069806
关于积分的说明 5170008
捐赠科研通 1797975
什么是DOI,文献DOI怎么找? 897950
版权声明 557715
科研通“疑难数据库(出版商)”最低求助积分说明 479304