Performance of ChatGPT Compared to Clinical Practice Guidelines in Making Informed Decisions for Lumbosacral Radicular Pain: A Cross-sectional Study

医学 腰骶关节 物理疗法 横断面研究 神经根痛 临床实习 腰椎 外科 病理
作者
Silvia Gianola,Silvia Bargeri,Greta Castellini,Chad Cook,Alvisa Palese,Paolo Pillastrini,Silvia Salvalaggio,Andrea Turolla,Giacomo Rossettini
出处
期刊:Journal of Orthopaedic & Sports Physical Therapy [American Physical Therapy Association]
卷期号:54 (3): 222-228 被引量:15
标识
DOI:10.2519/jospt.2024.12151
摘要

OBJECTIVE: To compare the accuracy of an artificial intelligence chatbot to clinical practice guidelines (CPGs) recommendations for providing answers to complex clinical questions on lumbosacral radicular pain. DESIGN: Cross-sectional study. METHODS: We extracted recommendations from recent CPGs for diagnosing and treating lumbosacral radicular pain. Relative clinical questions were developed and queried to OpenAI’s ChatGPT (GPT-3.5). We compared ChatGPT answers to CPGs recommendations by assessing the (1) internal consistency of ChatGPT answers by measuring the percentage of text wording similarity when a clinical question was posed 3 times, (2) reliability between 2 independent reviewers in grading ChatGPT answers, and (3) accuracy of ChatGPT answers compared to CPGs recommendations. Reliability was estimated using Fleiss’ kappa (κ) coefficients, and accuracy by interobserver agreement as the frequency of the agreements among all judgments. RESULTS: We tested 9 clinical questions. The internal consistency of text ChatGPT answers was unacceptable across all 3 trials in all clinical questions (mean percentage of 49%, standard deviation of 15). Intrareliability (reviewer 1: κ = 0.90, standard error [SE] = 0.09; reviewer 2: κ = 0.90, SE = 0.10) and interreliability (κ = 0.85, SE = 0.15) between the 2 reviewers was “almost perfect.” Accuracy between ChatGPT answers and CPGs recommendations was slight, demonstrating agreement in 33% of recommendations. CONCLUSION: ChatGPT performed poorly in internal consistency and accuracy of the indications generated compared to clinical practice guideline recommendations for lumbosacral radicular pain. J Orthop Sports Phys Ther 2024;54(3):222-228. Epub 29 January 2024. doi:10.2519/jospt.2024.12151
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
ding应助help采纳,获得10
1秒前
哭泣青烟完成签到 ,获得积分10
3秒前
在水一方发布了新的文献求助10
3秒前
6秒前
7秒前
9秒前
zhuhua发布了新的文献求助10
11秒前
阿蕉发布了新的文献求助10
13秒前
help发布了新的文献求助10
13秒前
15秒前
上官若男应助聪慧雪糕采纳,获得10
17秒前
dennisysz发布了新的文献求助10
19秒前
xxx完成签到,获得积分10
21秒前
上官若男应助chen采纳,获得10
24秒前
乘风破浪完成签到 ,获得积分10
25秒前
26秒前
打打应助玄妙采纳,获得30
27秒前
joleisalau发布了新的文献求助10
28秒前
聪慧雪糕发布了新的文献求助10
29秒前
科研通AI5应助科研通管家采纳,获得10
31秒前
bc应助科研通管家采纳,获得80
31秒前
燕子应助科研通管家采纳,获得10
31秒前
青羽凌雪应助科研通管家采纳,获得10
31秒前
研友_VZG7GZ应助科研通管家采纳,获得10
31秒前
科研通AI5应助科研通管家采纳,获得10
31秒前
SciGPT应助科研通管家采纳,获得10
31秒前
燕子应助科研通管家采纳,获得10
31秒前
科研通AI5应助科研通管家采纳,获得10
31秒前
昏睡的蟠桃应助科研通管家采纳,获得200
31秒前
搜集达人应助科研通管家采纳,获得10
31秒前
燕子应助科研通管家采纳,获得10
31秒前
小李老博应助科研通管家采纳,获得10
31秒前
Owen应助科研通管家采纳,获得10
32秒前
夕诙应助科研通管家采纳,获得30
32秒前
科研通AI5应助科研通管家采纳,获得10
32秒前
32秒前
南源完成签到,获得积分10
32秒前
冷静的尔冬完成签到 ,获得积分10
36秒前
悦耳的树叶完成签到 ,获得积分10
38秒前
chen发布了新的文献求助10
42秒前
高分求助中
【此为提示信息,请勿应助】请按要求发布求助,避免被关 20000
ISCN 2024 – An International System for Human Cytogenomic Nomenclature (2024) 3000
Continuum Thermodynamics and Material Modelling 2000
Encyclopedia of Geology (2nd Edition) 2000
105th Edition CRC Handbook of Chemistry and Physics 1600
Maneuvering of a Damaged Navy Combatant 650
the MD Anderson Surgical Oncology Manual, Seventh Edition 300
热门求助领域 (近24小时)
化学 材料科学 医学 生物 工程类 有机化学 物理 生物化学 纳米技术 计算机科学 化学工程 内科学 复合材料 物理化学 电极 遗传学 量子力学 基因 冶金 催化作用
热门帖子
关注 科研通微信公众号,转发送积分 3777429
求助须知:如何正确求助?哪些是违规求助? 3322775
关于积分的说明 10211653
捐赠科研通 3038155
什么是DOI,文献DOI怎么找? 1667159
邀请新用户注册赠送积分活动 797971
科研通“疑难数据库(出版商)”最低求助积分说明 758103