Feasibility of Using the Privacy-preserving Large Language Model Vicuna for Labeling Radiology Reports

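Medicine; Set (abstract data type); Retrospective cohort study; Receiver operating characteristic; Radiography; Machine learning; Radiology; Computer science; Pathology; Internal medicine; Programming language
Authors
Pritam Mukherjee, Benjamin Hou, Ricardo Bigolin Lanfredi, Ronald M. Summers
Source
Journal: Radiology [Radiological Society of North America]
Volume (issue): 309 (1) Citations: 29
Identifier
DOI:10.1148/radiol.231147
Abstract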

Background: Large language models (LLMs) such as ChatGPT, though proficient in many text-based tasks, are not suitable for use with radiology reports due to patient privacy constraints.
Purpose: To test the feasibility of using an alternative LLM (Vicuna-13B) that can be run locally for labeling radiography reports.
Materials and Methods: Chest radiography reports from the MIMIC-CXR and National Institutes of Health (NIH) data sets were included in this retrospective study. Reports were examined for 13 findings. Outputs reporting the presence or absence of the 13 findings were generated by Vicuna by using a single-step or multistep prompting strategy (prompts 1 and 2, respectively). Agreements between Vicuna outputs and CheXpert and CheXbert labelers were assessed using Fleiss κ. Agreement between Vicuna outputs from three runs under a hyperparameter setting that introduced some randomness (temperature, 0.7) was also assessed. The performance of Vicuna and the labelers was assessed in a subset of 100 NIH reports annotated by a radiologist with use of area under the receiver operating characteristic curve (AUC).
Results: A total of 3269 reports from the MIMIC-CXR data set (median patient age, 68 years [IQR, 59–79 years]; 161 male patients) and 25 596 reports from the NIH data set (median patient age, 47 years [IQR, 32–58 years]; 1557 male patients) were included. Vicuna outputs with prompt 2 showed, on average, moderate to substantial agreement with the labelers on the MIMIC-CXR (κ median, 0.57 [IQR, 0.45–0.66] with CheXpert and 0.64 [IQR, 0.45–0.68] with CheXbert) and NIH (κ median, 0.52 [IQR, 0.41–0.65] with CheXpert and 0.55 [IQR, 0.41–0.74] with CheXbert) data sets, respectively. Vicuna with prompt 2 performed at par (median AUC, 0.84 [IQR, 0.74–0.93]) with both labelers on nine of 11 findings.
Conclusion: In this proof-of-concept study, outputs of the LLM Vicuna reporting the presence or absence of 13 findings on chest radiography reports showed moderate to substantial agreement with existing labelers.
© RSNA, 2023. Supplemental material is available for this article. See also the editorial by Cai in this issue.
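The agreement statistic named in the abstract can be illustrated with a minimal sketch. It computes Fleiss κ per finding for binary present/absent labels from several raters (for example, three repeated runs) and then takes the median across findings, mirroring how the results are summarized. This is not the authors' code: the function name `fleiss_kappa`, the finding names, and all label values are hypothetical toy data, and the sketch assumes every report has the same number of ratings.

```python
from statistics import median


def fleiss_kappa(ratings, categories=(0, 1)):
    """Fleiss kappa for a list of items, each rated by the same number of raters.

    ratings: list of lists, e.g. [[1, 1, 0], ...], one inner list per report.
    Returns NaN when every rating falls in one category (kappa is undefined).
    """
    n_items = len(ratings)
    n_raters = len(ratings[0])
    # counts[i][j] = number of raters who assigned report i to category j
    counts = [[row.count(c) for c in categories] for row in ratings]
    # Observed agreement for each report, then averaged over reports
    p_i = [
        (sum(c * c for c in row) - n_raters) / (n_raters * (n_raters - 1))
        for row in counts
    ]
    p_bar = sum(p_i) / n_items
    # Chance agreement from the overall proportion of each category
    p_j = [
        sum(row[j] for row in counts) / (n_items * n_raters)
        for j in range(len(categories))
    ]
    p_e = sum(p * p for p in p_j)
    if p_e == 1.0:
        return float("nan")
    return (p_bar - p_e) / (1.0 - p_e)


# Hypothetical toy labels: 4 reports x 3 runs per finding (1 = present, 0 = absent)
runs_by_finding = {
    "finding_A": [[1, 1, 1], [0, 0, 0], [1, 1, 1], [0, 0, 0]],
    "finding_B": [[1, 1, 0], [0, 0, 0], [1, 1, 1], [0, 0, 1]],
    "finding_C": [[1, 0, 1], [0, 1, 0], [1, 1, 0], [0, 0, 1]],
}

kappas = {name: fleiss_kappa(r) for name, r in runs_by_finding.items()}
median_kappa = median(kappas.values())
```

In practice a maintained implementation such as the one in statsmodels would likely be preferable; the hand-written version is shown only to make the arithmetic visible.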