Performance of Large Language Models in Patient Complaint Resolution: Web-Based Cross-Sectional Survey

投诉 医疗保健 患者满意度 医学 横断面研究 家庭医学 护理部 政治学 病理 法学
作者
Lorraine Pei Xian Yong,Joshua Yi Min Tung,Zi Yao Lee,Win Sen Kuan,Mui Teng Chua
出处
期刊:Journal of Medical Internet Research [JMIR Publications]
卷期号:26: e56413-e56413 被引量:5
标识
DOI:10.2196/56413
摘要

Background Patient complaints are a perennial challenge faced by health care institutions globally, requiring extensive time and effort from health care workers. Despite these efforts, patient dissatisfaction remains high. Recent studies on the use of large language models (LLMs) such as the GPT models developed by OpenAI in the health care sector have shown great promise, with the ability to provide more detailed and empathetic responses as compared to physicians. LLMs could potentially be used in responding to patient complaints to improve patient satisfaction and complaint response time. Objective This study aims to evaluate the performance of LLMs in addressing patient complaints received by a tertiary health care institution, with the goal of enhancing patient satisfaction. Methods Anonymized patient complaint emails and associated responses from the patient relations department were obtained. ChatGPT-4.0 (OpenAI, Inc) was provided with the same complaint email and tasked to generate a response. The complaints and the respective responses were uploaded onto a web-based questionnaire. Respondents were asked to rate both responses on a 10-point Likert scale for 4 items: appropriateness, completeness, empathy, and satisfaction. Participants were also asked to choose a preferred response at the end of each scenario. Results There was a total of 188 respondents, of which 115 (61.2%) were health care workers. A majority of the respondents, including both health care and non–health care workers, preferred replies from ChatGPT (n=164, 87.2% to n=183, 97.3%). GPT-4.0 responses were rated higher in all 4 assessed items with all median scores of 8 (IQR 7-9) compared to human responses (appropriateness 5, IQR 3-7; empathy 4, IQR 3-6; quality 5, IQR 3-6; satisfaction 5, IQR 3-6; P<.001) and had higher average word counts as compared to human responses (238 vs 76 words). Regression analyses showed that a higher word count was a statistically significant predictor of higher score in all 4 items, with every 1-word increment resulting in an increase in scores of between 0.015 and 0.019 (all P<.001). However, on subgroup analysis by authorship, this only held true for responses written by patient relations department staff and not those generated by ChatGPT which received consistently high scores irrespective of response length. Conclusions This study provides significant evidence supporting the effectiveness of LLMs in resolution of patient complaints. ChatGPT demonstrated superiority in terms of response appropriateness, empathy, quality, and overall satisfaction when compared against actual human responses to patient complaints. Future research can be done to measure the degree of improvement that artificial intelligence generated responses can bring in terms of time savings, cost-effectiveness, patient satisfaction, and stress reduction for the health care system.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
PDF的下载单位、IP信息已删除 (2025-6-4)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
刚刚
1秒前
JamesPei应助文献来来来采纳,获得10
2秒前
Ai_niyou发布了新的文献求助10
2秒前
3秒前
木屋完成签到 ,获得积分10
4秒前
4秒前
5秒前
sfsdg发布了新的文献求助10
6秒前
万能图书馆应助ling采纳,获得10
6秒前
Hus11221完成签到,获得积分10
7秒前
shiqiang mu应助ljjxd采纳,获得10
7秒前
小哥完成签到,获得积分10
7秒前
8秒前
Orange应助小灰灰采纳,获得10
8秒前
在水一方应助沉默白猫采纳,获得10
8秒前
大胆香彤完成签到,获得积分10
9秒前
aldehyde应助甜甜的契采纳,获得10
9秒前
共享精神应助明亮的元柏采纳,获得10
10秒前
妮儿发布了新的文献求助10
10秒前
斯文败类应助蒋丞采纳,获得10
10秒前
10秒前
风清扬发布了新的文献求助10
10秒前
11秒前
ste11ar发布了新的文献求助10
11秒前
wenwei发布了新的文献求助10
11秒前
今后应助朴素青荷采纳,获得10
12秒前
彭于晏应助天天采纳,获得10
12秒前
Jasper应助天天采纳,获得10
12秒前
13秒前
星辰大海应助Yu_采纳,获得10
13秒前
www发布了新的文献求助10
14秒前
fb12000发布了新的文献求助10
15秒前
15秒前
16秒前
16秒前
WMYY完成签到,获得积分20
16秒前
17秒前
17秒前
shui发布了新的文献求助10
18秒前
高分求助中
ФОРМИРОВАНИЕ АО "МЕЖДУНАРОДНАЯ КНИГА" КАК ВАЖНЕЙШЕЙ СИСТЕМЫ ОТЕЧЕСТВЕННОГО КНИГОРАСПРОСТРАНЕНИЯ 3000
Les Mantodea de Guyane: Insecta, Polyneoptera [The Mantids of French Guiana] 2500
Future Approaches to Electrochemical Sensing of Neurotransmitters 1000
Electron microscopy study of magnesium hydride (MgH2) for Hydrogen Storage 1000
Finite Groups: An Introduction 800
壮语核心名词的语言地图及解释 600
生物降解型栓塞微球市场(按产品类型、应用和最终用户)- 2030 年全球预测 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 工程类 有机化学 物理 生物化学 纳米技术 计算机科学 化学工程 内科学 复合材料 物理化学 电极 遗传学 量子力学 基因 冶金 催化作用
热门帖子
关注 科研通微信公众号,转发送积分 3906398
求助须知:如何正确求助?哪些是违规求助? 3452162
关于积分的说明 10867853
捐赠科研通 3177611
什么是DOI,文献DOI怎么找? 1755523
邀请新用户注册赠送积分活动 848812
科研通“疑难数据库(出版商)”最低求助积分说明 791323