亲爱的研友该休息了!由于当前在线用户较少,发布求助请尽量完整的填写文献信息,科研通机器人24小时在线,伴您度过漫漫科研夜!身体可是革命的本钱,早点休息,好梦!

Assessing Ability for ChatGPT to Answer Total Knee Arthroplasty-Related Questions

关节置换术 清晰 骨科手术 相关性(法律) 一致性(知识库) 物理疗法 医学 外科 人工智能 计算机科学 生物化学 化学 政治学 法学
作者
Matthew Magruder,Ariel N. Rodriguez,Che Hang Jason Wong,Orry Erez,Nicolás S. Piuzzi,Giles R. Scuderi,James Slover,Jason H. Oh,Ran Schwarzkopf,Antonia F. Chen,Richard Iorio,Stuart B. Goodman,Michael A. Mont
出处
期刊:Journal of Arthroplasty [Elsevier]
标识
DOI:10.1016/j.arth.2024.02.023
摘要

Introduction Artificial intelligence (AI) in the field of orthopaedics has been a topic of increasing interest and opportunity in recent years. Its applications are widespread both for physicians and patients, including use in clinical decision-making, in the operating room, and in research. In this study, we aimed to assess the quality of ChatGPT answers when asked questions related to total knee arthroplasty (TKA). Methods ChatGPT prompts were created by turning 15 of the American Academy of Orthopaedic Surgeons (AAOS) Clinical Practice Guidelines into questions. An online survey was created, which included screenshots of each prompt and answers to the 15 questions. Surgeons were asked to grade ChatGPT answers from 1 to 5 based on their characteristics: 1) Relevance; 2) Accuracy; 3) Clarity; 4) Completeness; 5) Evidence-based; and 6) Consistency. There were eleven Adult Joint Reconstruction fellowship-trained surgeons who completed the survey. Questions were subclassified based on the subject of the prompt: 1) risk factors, 2) implant/Intraoperative, and 3) pain/functional outcomes. The average and standard deviation for all answers, as well as for each subgroup, were calculated. Inter-rater reliability (IRR) was also calculated. Results All answer characteristics were graded as being above average (i.e., a score > 3). Relevance demonstrated the highest scores (4.43±0.77) by surgeons surveyed, and consistency demonstrated the lowest scores (3.54±1.10). ChatGPT prompts in the Risk Factors group demonstrated the best responses, while those in the Pain/Functional Outcome group demonstrated the lowest. The overall IRR was found to be 0.33 (poor reliability), with the highest IRR for relevance (0.43) and the lowest for evidence-based (0.28). Conclusion ChatGPT can answer questions regarding well-established clinical guidelines in TKA with above-average accuracy but demonstrates variable reliability. This investigation is the first step in understanding large language model (LLM) AIs like ChatGPT and how well they perform in the field of arthroplasty.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
螃螃完成签到 ,获得积分10
16秒前
隐形曼青应助三千采纳,获得10
33秒前
36秒前
三千发布了新的文献求助10
41秒前
42秒前
小二郎应助科研通管家采纳,获得10
1分钟前
852应助科研通管家采纳,获得10
1分钟前
这个手刹不太灵完成签到 ,获得积分10
1分钟前
乐乐应助三千采纳,获得10
1分钟前
1分钟前
三千发布了新的文献求助10
1分钟前
jyy完成签到,获得积分10
1分钟前
Lily岑完成签到,获得积分10
2分钟前
tutu完成签到,获得积分10
2分钟前
深情安青应助三千采纳,获得10
2分钟前
ding应助元一一采纳,获得20
2分钟前
清净163完成签到,获得积分10
3分钟前
3分钟前
小屋藏夏完成签到,获得积分10
3分钟前
元一一完成签到,获得积分20
3分钟前
元一一发布了新的文献求助20
3分钟前
YAOYAO发布了新的文献求助10
3分钟前
汉堡包应助YAOYAO采纳,获得10
3分钟前
清净126完成签到 ,获得积分10
3分钟前
研友_VZG7GZ应助迷糊的橙子采纳,获得10
3分钟前
4分钟前
4分钟前
吹皱一湖春水完成签到 ,获得积分10
4分钟前
酷波er应助研友_闾丘枫采纳,获得10
5分钟前
君君完成签到 ,获得积分10
5分钟前
5分钟前
5分钟前
酷波er应助科研通管家采纳,获得10
5分钟前
zqq完成签到,获得积分10
6分钟前
hhhwww完成签到 ,获得积分10
6分钟前
远山笑你完成签到 ,获得积分10
6分钟前
无限烧鹅完成签到,获得积分10
6分钟前
7分钟前
7分钟前
三千发布了新的文献求助10
7分钟前
高分求助中
Manual of Clinical Microbiology, 4 Volume Set (ASM Books) 13th Edition 1000
Edestus (Chondrichthyes, Elasmobranchii) from the Upper Carboniferous of Xinjiang, China 500
Chinese-English Translation Lexicon Version 3.0 500
Electronic Structure Calculations and Structure-Property Relationships on Aromatic Nitro Compounds 500
マンネンタケ科植物由来メロテルペノイド類の網羅的全合成/Collective Synthesis of Meroterpenoids Derived from Ganoderma Family 500
[Lambert-Eaton syndrome without calcium channel autoantibodies] 440
Two-sample Mendelian randomization analysis reveals causal relationships between blood lipids and venous thromboembolism 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 有机化学 工程类 生物化学 纳米技术 物理 内科学 计算机科学 化学工程 复合材料 遗传学 基因 物理化学 催化作用 电极 光电子学 量子力学
热门帖子
关注 科研通微信公众号,转发送积分 2380946
求助须知:如何正确求助?哪些是违规求助? 2088241
关于积分的说明 5244336
捐赠科研通 1815256
什么是DOI,文献DOI怎么找? 905728
版权声明 558821
科研通“疑难数据库(出版商)”最低求助积分说明 483664