癌症
医学物理学
医学
患者满意度
放射科
心理学
护理部
内科学
作者
Amit Gupta,Swarndeep Singh,Hema Malhotra,Himanshu Pruthi,Aparna Sharma,Amit Kumar,Mukesh Yadav,Devasenathipathy Kandasamy,Atul Batra,Krithika Rangarajan
摘要
To explore the perceived utility and effect of simplified radiology reports on oncology patients' knowledge and feasibility of large language models (LLMs) to generate such reports. This study was approved by the Institute Ethics Committee. In phase I, five state-of-the-art LLMs (Generative Pre-Trained Transformer-4o [GPT-4o], Google Gemini, Claude Opus, Llama-3.1-8B, and Phi-3.5-mini) were tested to simplify 50 oncology computed tomography (CT) report impressions using five distinct prompts with each LLM. The outputs were evaluated quantitatively using readability indices. Five LLM-prompt combinations with best average readability scores were also assessed qualitatively, and the best LLM-prompt combination was selected. In phase II, 100 consecutive oncology patients were randomly assigned into two groups: original report (received original report impression) and simplified report (received LLM-generated simplified versions of their CT report impressions under the supervision of a radiologist). A questionnaire assessed the impact of these reports on patients' knowledge and perceived utility. In phase I, Claude Opus-Prompt 3 (explain to a 15-year-old) performed slightly better than other LLMs, although scores for GPT-4o, Gemini, Claude Opus, and Llama-3.1 were not significantly different (P > .0033 on Wilcoxon signed-rank test with Bonferroni correction). In phase II, simplified report group patients demonstrated significantly better knowledge of primary site and extent of their disease as well as showed significantly higher confidence and understanding of the report (P < .05 for all). Only three (of 50) simplified reports required corrections by the radiologist. Simplified radiology reports significantly enhanced patients' understanding and confidence in comprehending their medical condition. LLMs performed very well at this simplification task; therefore, they can be potentially used for this purpose, although there remains a need for human oversight.
科研通智能强力驱动
Strongly Powered by AbleSci AI