Application of Large Language Models in Automated Interpretation of Urodynamic Parameters

作者
Zhen Wang,Zhongle Xu,Yongyong Shi,Junhua Xi,Yanbin Zhang
出处
期刊:Neurourology and Urodynamics [Wiley]
标识
DOI:10.1002/nau.70160
摘要

ABSTRACT Background Urodynamic studies (UDS) are essential diagnostic tools in urology, but their interpretation requires significant expertise and is subject to interobserver variability. Large language models (LLMs) have shown promise in various medical diagnostic applications, yet their utility in automated interpretation of urodynamic parameters remains unexplored. Objective To evaluate the diagnostic performance of large language models in the automated interpretation of urodynamic parameters compared to urologists with different experience levels. Methods We analyzed 320 urodynamic studies from patients with various lower urinary tract conditions. Two large language models (Deepseek‐R1 and GPT‐4) were employed to interpret the urodynamic data. Their diagnostic accuracy was compared with that of junior and senior urologists. Performance was evaluated using receiver operating characteristic (ROC) curves, area under the curve (AUC), diagnostic accuracy, and the QUEST framework (Quality of information, Understanding and reasoning, Expression style, Safety, and Trustworthiness). This study was designed and reported following the TRIPOD + AI statement for reporting prediction models using machine learning methods. Results Deepseek‐R1 demonstrated the highest diagnostic accuracy (92.50%) among the automated systems, followed by GPT‐4 (85.94%), comparable to junior urologists (83.75%) but lower than senior urologists (95.94%). The reference standard was established by consensus of three board‐certified urodynamics experts with median 15 years of experience (range 12–22 years). ROC analysis revealed strong performance across different urological conditions, with AUC values ranging from 0.89 to 0.92 for Deepseek‐R1, 0.84–0.88 for GPT‐4, 0.81–0.84 for junior urologists, and 0.94–0.95 for senior urologists. The QUEST framework evaluation showed that Deepseek‐R1 outperformed other systems in information quality, reasoning, expression style, safety, and trustworthiness. Both LLMs demonstrated high clinical utility, with Deepseek‐R1 scoring higher in decision support (4.38/5), time efficiency (2.10/5), and educational value (4.20/5) compared to GPT‐4. Conclusions Large language models, particularly Deepseek‐R1, demonstrate promising capabilities in the automated interpretation of urodynamic parameters, with performance exceeding that of junior urologists and approaching senior urologists. These findings suggest potential applications in clinical decision support, training, and quality assurance in urodynamic practice, which could enhance diagnostic consistency and accessibility of expert‐level interpretation. Clinical Trial Registration This study is a retrospective analysis of deidentified patient data and did not involve any direct patient contact or intervention. Therefore, ethics approval was waived in accordance with institutional and national guidelines.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
王w发布了新的文献求助100
刚刚
1秒前
1秒前
wangwangwang完成签到,获得积分20
1秒前
赘婿应助李文敏采纳,获得10
2秒前
卡卡龍特完成签到,获得积分10
3秒前
4秒前
无极微光应助xiw采纳,获得20
5秒前
kuandong完成签到,获得积分10
8秒前
8秒前
在水一方应助zz采纳,获得10
9秒前
9秒前
10秒前
10秒前
BioGO完成签到,获得积分10
10秒前
好好完成签到,获得积分20
11秒前
雪满头应助悟空最可爱采纳,获得10
12秒前
郝宇发布了新的文献求助10
13秒前
13秒前
斯文文龙完成签到,获得积分10
13秒前
14秒前
15秒前
211JZH发布了新的文献求助10
15秒前
16秒前
芥末发布了新的文献求助10
16秒前
Echo发布了新的文献求助10
16秒前
17秒前
qiuxiu发布了新的文献求助20
17秒前
223老师发布了新的文献求助10
17秒前
哈哈哈完成签到,获得积分10
17秒前
左丘冥完成签到,获得积分10
18秒前
小王同学完成签到 ,获得积分10
19秒前
19秒前
20秒前
doby发布了新的文献求助10
20秒前
kiki完成签到,获得积分10
20秒前
zz发布了新的文献求助10
20秒前
陈陈陈1发布了新的文献求助30
21秒前
22秒前
Gideon完成签到,获得积分10
22秒前
高分求助中
论现代体育科学研究的方法学特征 1000
Invited Discussant 63O and 64O 1000
Ideology and Meaning-Making under the Putin Regime 750
Prompt Engineering for Clinicians: Harnessing AI in Everyday Medical Practice 600
Safety Pharmacology 500
《KNN基无铅压电陶瓷电学性能优化与物理机理研究》 500
A Handbook of User Experience Research & Design in Libraries 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 计算机科学 化学工程 生物化学 物理 内科学 复合材料 催化作用 光电子学 物理化学 电极 细胞生物学 基因 遗传学
热门帖子
关注 科研通微信公众号,转发送积分 6918729
求助须知:如何正确求助?哪些是违规求助? 8609236
关于积分的说明 18265287
捐赠科研通 6333056
什么是DOI,文献DOI怎么找? 3069304
关于科研通互助平台的介绍 2098655
邀请新用户注册赠送积分活动 2046521