Performance of Large Language Models in Patient Complaint Resolution: Web-Based Cross-Sectional Survey

投诉 医疗保健 患者满意度 医学 横断面研究 家庭医学 护理部 政治学 病理 法学
作者
Lorraine Pei Xian Yong,Joshua Yi Min Tung,Zi Yao Lee,Win Sen Kuan,Mui Teng Chua
出处
期刊:Journal of Medical Internet Research [JMIR Publications]
卷期号:26: e56413-e56413 被引量:5
标识
DOI:10.2196/56413
摘要

Background Patient complaints are a perennial challenge faced by health care institutions globally, requiring extensive time and effort from health care workers. Despite these efforts, patient dissatisfaction remains high. Recent studies on the use of large language models (LLMs) such as the GPT models developed by OpenAI in the health care sector have shown great promise, with the ability to provide more detailed and empathetic responses as compared to physicians. LLMs could potentially be used in responding to patient complaints to improve patient satisfaction and complaint response time. Objective This study aims to evaluate the performance of LLMs in addressing patient complaints received by a tertiary health care institution, with the goal of enhancing patient satisfaction. Methods Anonymized patient complaint emails and associated responses from the patient relations department were obtained. ChatGPT-4.0 (OpenAI, Inc) was provided with the same complaint email and tasked to generate a response. The complaints and the respective responses were uploaded onto a web-based questionnaire. Respondents were asked to rate both responses on a 10-point Likert scale for 4 items: appropriateness, completeness, empathy, and satisfaction. Participants were also asked to choose a preferred response at the end of each scenario. Results There was a total of 188 respondents, of which 115 (61.2%) were health care workers. A majority of the respondents, including both health care and non–health care workers, preferred replies from ChatGPT (n=164, 87.2% to n=183, 97.3%). GPT-4.0 responses were rated higher in all 4 assessed items with all median scores of 8 (IQR 7-9) compared to human responses (appropriateness 5, IQR 3-7; empathy 4, IQR 3-6; quality 5, IQR 3-6; satisfaction 5, IQR 3-6; P<.001) and had higher average word counts as compared to human responses (238 vs 76 words). Regression analyses showed that a higher word count was a statistically significant predictor of higher score in all 4 items, with every 1-word increment resulting in an increase in scores of between 0.015 and 0.019 (all P<.001). However, on subgroup analysis by authorship, this only held true for responses written by patient relations department staff and not those generated by ChatGPT which received consistently high scores irrespective of response length. Conclusions This study provides significant evidence supporting the effectiveness of LLMs in resolution of patient complaints. ChatGPT demonstrated superiority in terms of response appropriateness, empathy, quality, and overall satisfaction when compared against actual human responses to patient complaints. Future research can be done to measure the degree of improvement that artificial intelligence generated responses can bring in terms of time savings, cost-effectiveness, patient satisfaction, and stress reduction for the health care system.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
2秒前
行歌完成签到,获得积分10
2秒前
Ratee完成签到,获得积分10
2秒前
3秒前
超帅的谷蓝完成签到,获得积分10
3秒前
tiptip应助失眠苑睐采纳,获得10
4秒前
ZephytBat发布了新的文献求助10
5秒前
wsedqw完成签到,获得积分10
5秒前
6秒前
6秒前
yy发布了新的文献求助10
6秒前
YaoHe发布了新的文献求助10
6秒前
领导范儿应助M先生采纳,获得10
6秒前
6秒前
Nina发布了新的文献求助10
7秒前
金金发布了新的文献求助10
7秒前
7秒前
7秒前
7秒前
殷硕完成签到,获得积分10
8秒前
9秒前
9秒前
情怀应助党meng采纳,获得10
9秒前
Akim应助健忘的城采纳,获得10
9秒前
脑洞战士完成签到,获得积分10
11秒前
11秒前
跳跃发布了新的文献求助10
12秒前
molihuakai应助YaoHe采纳,获得10
13秒前
无花果应助温茶月伴夜采纳,获得10
13秒前
14秒前
M先生完成签到,获得积分20
14秒前
17秒前
Elsa发布了新的文献求助10
17秒前
水若琳发布了新的文献求助10
17秒前
17秒前
花生完成签到 ,获得积分10
18秒前
18秒前
18秒前
19秒前
19秒前
高分求助中
Overcoming Stigma and Bias in Obesity Management 800
Malcolm Fraser : a biography 700
Signals, Systems, and Signal Processing 610
Bounds for Statistical Estimation in Semiparametric Models 500
Climate change and sports: Statistics report on climate change and sports 500
Forced degradation and stability indicating LC method for Letrozole: A stress testing guide 500
A Foreign Missionary on the Long March: The Unpublished Memoirs of Arnolis Hayman of the China Inland Mission 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6469184
求助须知:如何正确求助?哪些是违规求助? 8274289
关于积分的说明 17643501
捐赠科研通 5545138
什么是DOI,文献DOI怎么找? 2908571
邀请新用户注册赠送积分活动 1885457
关于科研通互助平台的介绍 1734704