Real‐world deployment and evaluation of PEri ‐operative AI CHatbot ( PEACH ): a large language model chatbot for peri‐operative medicine

作者
Yuhe Ke,Liyuan Jin,Kabilan Elangovan,Bee Suan Ong,Chin Yang Oh,Jacqueline Sim,Kenny Loh,Chai Rick Soh,Jonathan Cheng,Aaron Kwang Yang Lee,Daniel Shu Wei Ting,Nan Liu,Hairil Rizal Abdullah
出处
期刊:Anaesthesia [Wiley]
被引量:2
标识
DOI:10.1111/anae.16755
摘要

Summary Introduction Large Language Models are emerging as powerful tools in healthcare, particularly for complex, domain‐specific tasks. This study describes the development and evaluation of PEri‐operative AI CHatbot (PEACH). It was developed by embedding 35 institutional peri‐operative protocols into a secure large language model environment, with iterative prompt engineering and internal testing to ensure clinical relevance and accuracy. Methods The system was tested with a silent deployment using real‐world data. Accuracy, safety and usability were assessed. Accuracy was evaluated by comparing the responses from PEACH against institutional guidelines and expert consensus. Deviations and hallucinations were categorised based on potential harm, and user feedback was evaluated using the Davis' Technology Acceptance Model. Updates to PEACH were made after the initial silent deployment to make minor amendments to one of the protocols. Results In total, 240 real‐world clinical iterations were evaluated. First‐generation accuracy was 97.5% (78/80), with an overall accuracy of 96.7% (232/240) across three iterations. In the updated PEACH, accuracy improved to 97.9% (235/240), with a statistically significant difference from the null hypothesis of 95% accuracy (p = 0.018). Hallucinations and deviations were minimal (1/240 and 2/240, respectively). There was high usability, with clinicians noting that PEACH expedited decisions in 95% of cases. The κ statistic for inter‐rater reliability for PEACH was 0.772 and 0.893 between three iterations, compared with 0.610 and 0.784 for experienced peri‐operative physicians. Discussion PEACH is an accurate, adaptable tool that enhances consistency and efficiency in peri‐operative decision‐making. Future research should explore scalability across specialties and its impact on clinical outcomes.

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
PDF的下载单位、IP信息已删除 (2025-6-4)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
刚刚
齐琪关注了科研通微信公众号
刚刚
zxm发布了新的文献求助10
1秒前
共享精神应助感谢佬采纳,获得10
1秒前
MYe发布了新的文献求助10
1秒前
雨泽发布了新的文献求助10
1秒前
Dyying发布了新的文献求助100
2秒前
2秒前
简单花花发布了新的文献求助10
2秒前
慕青应助1diandiant采纳,获得10
3秒前
英姑应助勤恳钢笔采纳,获得10
3秒前
James应助马铃薯采纳,获得20
3秒前
LZ完成签到,获得积分10
3秒前
Orange应助典雅西牛采纳,获得10
5秒前
小石头完成签到,获得积分10
5秒前
Frozen Flame发布了新的文献求助20
6秒前
七七发布了新的文献求助10
6秒前
orixero应助Kai采纳,获得10
6秒前
ss发布了新的文献求助10
7秒前
7秒前
蜘蛛人完成签到 ,获得积分10
7秒前
罗dd发布了新的文献求助10
7秒前
花城发布了新的文献求助10
8秒前
Halo完成签到,获得积分10
8秒前
gmace发布了新的文献求助10
9秒前
2哇哇哇发布了新的文献求助10
9秒前
lulu完成签到,获得积分10
9秒前
9秒前
9秒前
阔达的访风应助WILD采纳,获得10
9秒前
烟花应助111采纳,获得10
10秒前
nrghhjm完成签到 ,获得积分10
10秒前
10秒前
10秒前
11秒前
mm发布了新的文献求助10
11秒前
甜橙关注了科研通微信公众号
11秒前
最牛的菠萝隐士完成签到,获得积分10
11秒前
爆米花应助wang采纳,获得10
11秒前
CodeCraft应助湘湘采纳,获得10
11秒前
高分求助中
美国药典 2000
Fermented Coffee Market 2000
合成生物食品制造技术导则,团体标准,编号:T/CITS 396-2025 1000
The Leucovorin Guide for Parents: Understanding Autism’s Folate 1000
Pipeline and riser loss of containment 2001 - 2020 (PARLOC 2020) 1000
Critical Thinking: Tools for Taking Charge of Your Learning and Your Life 4th Edition 500
Comparing natural with chemical additive production 500
热门求助领域 (近24小时)
化学 医学 生物 材料科学 工程类 有机化学 内科学 生物化学 物理 计算机科学 纳米技术 遗传学 基因 复合材料 化学工程 物理化学 病理 催化作用 免疫学 量子力学
热门帖子
关注 科研通微信公众号,转发送积分 5239828
求助须知:如何正确求助?哪些是违规求助? 4407067
关于积分的说明 13717174
捐赠科研通 4275655
什么是DOI,文献DOI怎么找? 2346104
邀请新用户注册赠送积分活动 1343227
关于科研通互助平台的介绍 1301291