
Harnessing artificial intelligence in bariatric surgery: comparative analysis of ChatGPT-4, Bing, and Bard in generating clinician-level bariatric surgery recommendations

Readability; Medicine; Likert scale; Surgery; Psychology; Developmental psychology; Philosophy; Linguistics
Authors
Yung Lee, Thomas H. Shin, Léa Tessier, Arshia Javidan, James J. Jung, Dennis Hong, Andrew T. Strong, Tyler McKechnie, Sarah Malone, David Jin, Matthew Kroh, Jerry T. Dang
Source
Journal: Surgery for Obesity and Related Diseases [Elsevier BV]
Volume (issue): 20 (7): 603-608 Cited by: 22
Identifier
DOI:10.1016/j.soard.2024.03.011
Abstract

Background: The formulation of clinical recommendations pertaining to bariatric surgery is essential in guiding healthcare professionals. However, the extensive and continuously evolving body of literature in bariatric surgery presents a considerable challenge for staying abreast of the latest developments and for efficient information acquisition. Artificial intelligence (AI) has the potential to streamline access to the salient points of clinical recommendations in bariatric surgery.
Objective: The study aims to appraise the quality and readability of AI-chat-generated answers to frequently asked clinical inquiries in the field of bariatric and metabolic surgery.
Setting: Remote.
Methods: Question prompts for AI large language models (LLMs) were created based on pre-existing clinical practice guidelines regarding bariatric and metabolic surgery. The prompts were submitted to three LLMs: OpenAI ChatGPT-4, Microsoft Bing, and Google Bard. The responses from each LLM were entered into a spreadsheet for randomized and blinded duplicate review. Accredited bariatric surgeons in North America independently assessed the appropriateness of each recommendation using a 5-point Likert scale. Scores of 4 and 5 were deemed appropriate, while scores of 1 to 3 indicated a lack of appropriateness. A Flesch Reading Ease (FRE) score was calculated to assess the readability of the responses generated by each LLM.
Results: There was a significant difference between the three LLMs in their 5-point Likert scores, with mean values of 4.46 (SD 0.82), 3.89 (0.80), and 3.11 (0.72) for ChatGPT-4, Bard, and Bing, respectively (P<0.001). There was a significant difference between the three LLMs in the proportion of appropriate answers, with ChatGPT-4 at 85.7%, Bard at 74.3%, and Bing at 25.7% (P<0.001). The mean FRE scores for ChatGPT-4, Bard, and Bing were 21.68 (SD 2.78), 42.89 (4.03), and 14.64 (5.09), respectively, with higher scores representing easier readability.
Conclusion: LLM-based AI chat models can effectively generate appropriate responses to clinical questions related to bariatric surgery, though the performance of different models can vary greatly. Therefore, caution should be taken when interpreting clinical information provided by LLMs, and clinician oversight is necessary to ensure accuracy. Future investigation is warranted to explore how LLMs might enhance healthcare provision and clinical decision-making in bariatric surgery.
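
The two scoring steps named in the Methods (dichotomizing 5-point Likert ratings at 4 and computing a Flesch Reading Ease score) can be illustrated with a minimal Python sketch. The abstract does not say which tool the authors used to compute FRE, so everything here is an assumption for illustration: the function names are hypothetical, and the syllable counter is a rough vowel-group heuristic that will not match dictionary-based tools exactly. Only the FRE formula itself is the standard one.

```python
import re


def count_syllables(word: str) -> int:
    """Rough syllable estimate (an assumption, not the authors' method):
    count vowel groups and drop one for a trailing silent 'e'."""
    word = word.lower()
    count = len(re.findall(r"[aeiouy]+", word))
    if word.endswith("e") and not word.endswith("le") and count > 1:
        count -= 1
    return max(count, 1)


def flesch_reading_ease(text: str) -> float:
    """Standard Flesch Reading Ease formula:
    206.835 - 1.015 * (words / sentences) - 84.6 * (syllables / words).
    Higher scores mean easier text."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z']+", text)
    if not sentences or not words:
        raise ValueError("text must contain at least one sentence and one word")
    syllables = sum(count_syllables(w) for w in words)
    return (
        206.835
        - 1.015 * (len(words) / len(sentences))
        - 84.6 * (syllables / len(words))
    )


def proportion_appropriate(scores: list[int]) -> float:
    """Share of Likert ratings deemed appropriate (4 or 5), per the Methods."""
    if not scores:
        raise ValueError("scores must not be empty")
    return sum(1 for s in scores if s >= 4) / len(scores)


if __name__ == "__main__":
    # Hypothetical ratings and text, not data from the study.
    print(proportion_appropriate([5, 4, 3, 2, 1]))
    print(flesch_reading_ease("The cat sat."))
```

Because FRE penalizes long sentences and polysyllabic words, dense clinical prose tends to score low, which is consistent with the abstract's note that higher scores represent easier readability.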
