Improving large language model applications in biomedicine with retrieval-augmented generation: a systematic review, meta-analysis, and clinical development guidelines

生物医学 计算机科学 荟萃分析 人工智能 自然语言处理 情报检索 数据科学 医学 生物信息学 病理 生物
作者
Siru Liu,Allison B. McCoy,Adam Wright
出处
期刊:Journal of the American Medical Informatics Association [Oxford University Press]
卷期号:32 (4): 605-615 被引量:99
标识
DOI:10.1093/jamia/ocaf008
摘要

OBJECTIVE: The objectives of this study are to synthesize findings from recent research of retrieval-augmented generation (RAG) and large language models (LLMs) in biomedicine and provide clinical development guidelines to improve effectiveness. MATERIALS AND METHODS: We conducted a systematic literature review and a meta-analysis. The report was created in adherence to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses 2020 analysis. Searches were performed in 3 databases (PubMed, Embase, PsycINFO) using terms related to "retrieval augmented generation" and "large language model," for articles published in 2023 and 2024. We selected studies that compared baseline LLM performance with RAG performance. We developed a random-effect meta-analysis model, using odds ratio as the effect size. RESULTS: Among 335 studies, 20 were included in this literature review. The pooled effect size was 1.35, with a 95% confidence interval of 1.19-1.53, indicating a statistically significant effect (P = .001). We reported clinical tasks, baseline LLMs, retrieval sources and strategies, as well as evaluation methods. DISCUSSION: Building on our literature review, we developed Guidelines for Unified Implementation and Development of Enhanced LLM Applications with RAG in Clinical Settings to inform clinical applications using RAG. CONCLUSION: Overall, RAG implementation showed a 1.35 odds ratio increase in performance compared to baseline LLMs. Future research should focus on (1) system-level enhancement: the combination of RAG and agent, (2) knowledge-level enhancement: deep integration of knowledge into LLM, and (3) integration-level enhancement: integrating RAG systems within electronic health records.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
2秒前
2秒前
自然友菱完成签到,获得积分10
3秒前
P_Zh_CN发布了新的文献求助10
3秒前
3秒前
4秒前
陈M雯发布了新的文献求助10
4秒前
5秒前
温暖的绮完成签到,获得积分10
5秒前
洋洋洋耶发布了新的文献求助10
5秒前
李沐阳发布了新的文献求助10
6秒前
8秒前
8秒前
device完成签到 ,获得积分10
8秒前
Dky完成签到,获得积分10
8秒前
Ava应助吕懿采纳,获得10
9秒前
科研通AI6.1应助陈昇采纳,获得30
9秒前
科研卿完成签到,获得积分20
10秒前
10秒前
11秒前
温暖的绮发布了新的文献求助10
11秒前
蓝天发布了新的文献求助20
11秒前
12秒前
12秒前
所所应助莱斯够瓦瑞丝采纳,获得10
13秒前
氨气完成签到 ,获得积分10
14秒前
滴滴滴完成签到,获得积分10
15秒前
核桃发布了新的文献求助10
16秒前
深情安青应助你好采纳,获得10
16秒前
科研通AI6.2应助insist采纳,获得10
16秒前
VDC发布了新的文献求助10
16秒前
lwt1235完成签到,获得积分10
17秒前
17秒前
明天见发布了新的文献求助10
17秒前
lml关注了科研通微信公众号
17秒前
18秒前
Gu完成签到,获得积分10
18秒前
20秒前
yun完成签到,获得积分10
20秒前
洋洋洋耶完成签到,获得积分10
20秒前
高分求助中
GL 2 A method for assessing the in-place cleanability of food processing equipment, Fourth Edition, December 2023 3000
Annie Ernaux: De la perte au corps glorieux 600
Writing Systems 500
Invited Discussant 63O and 64O 400
Thermodynamics of Natural Systems 400
A revision of Limenitis helmanni and its related species (Nymphalidae) from Central and South China 400
Direct and Iterative Linear System Solvers 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6820565
求助须知:如何正确求助?哪些是违规求助? 8534170
关于积分的说明 18165739
捐赠科研通 6154108
什么是DOI,文献DOI怎么找? 3033202
关于科研通互助平台的介绍 2012391
邀请新用户注册赠送积分活动 2010147