Radiology report generation from a singular perspective using transformers with Knowledge Distillation

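Perspective (graphical) Computer science Transformer Distillation Artificial intelligence Medical physics Medicine Chromatography Electrical engineering Chemistry Engineering Voltage
Authors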
Asad Khan,Mashood Mohammad Mohsan,Muhammad Usman Akram,Taimur Hassan,Sajid Gul Khawaja,Adil Qayyum
Source
Journal: Biomedical Signal Processing and Control [Elsevier BV]
Volume/Issue: 111: 108340-108340; Cited by: 1
Identifier
DOI:10.1016/j.bspc.2025.108340
Abstract

Nearly two billion chest X-rays (CXRs) are performed annually, making them the most widely used imaging technique in radiology for the diagnosis of pulmonary disorders. The accompanying report with the findings from a chest X-ray forms a crucial part of the examination. An accurate report enables healthcare professionals to make better decisions about the care being provided. To this end, we propose an end-to-end radiology report generation framework built on transformers trained on text reports in conjunction with visual characteristics of the chest X-ray to generate a reliable report that accurately describes the findings from a single CXR taken from either the Anterior-Posterior or the Posterior-Anterior position. A foundation model is utilised to perform Knowledge Distillation (KD) in conjunction with the Encoder, which is fine-tuned during the training phase. In addition, using a large corpus of radiology reports to pre-train the foundation model in an unsupervised manner is shown to improve performance on smaller datasets. This training methodology results in performance comparable to that of architectures that employ far more parameters. The proposed framework is evaluated on multiple datasets, including the Indiana University dataset, the MIMIC dataset, the MIMIC-PRO dataset, and the BRAX dataset. The incorporation of KD increases the BLEU-1 score on the Indiana dataset by 4% and the BERTScore by 7.5%. Similarly, pre-training on larger datasets in combination with KD further increases the BLEU-1 score on the Indiana dataset by 7.2% and the BERTScore by 3%. For the MIMIC dataset, comparable performance is achieved for the Findings and the Impression sections of the report, while the proposed framework outperforms other techniques when both of these sections are combined. For the MIMIC-PRO dataset, an s_emb score of 0.4069 and a RadGraph F1 score of 0.1165 are achieved, outperforming other techniques in the literature. Finally, the proposed framework is also evaluated on a locally gathered dataset and a BRAX subset without any re-training or fine-tuning, resulting in a BLEU-1 score of 0.3827 and a BERTScore of 0.4392 for the former and a BLEU-1 score of 0.1671 and a BERTScore of 0.2186 for the latter, showing generalisation ability.
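
For readers unfamiliar with the BLEU-1 metric quoted in the abstract, the following is a minimal sketch of sentence-level BLEU-1 (clipped unigram precision multiplied by a brevity penalty). It is illustrative only: the function name `bleu1`, the whitespace tokenisation, the lowercasing, and the example sentences are assumptions made here, and the paper's actual evaluation tooling and settings (for example corpus-level aggregation or smoothing) are not stated in the abstract and may differ.

```python
import math
from collections import Counter


def bleu1(candidate: str, reference: str) -> float:
    """Sentence-level BLEU-1 against a single reference (illustrative sketch)."""
    # Assumption: simple lowercase whitespace tokenisation.
    cand = candidate.lower().split()
    ref = reference.lower().split()
    if not cand:
        return 0.0

    cand_counts = Counter(cand)
    ref_counts = Counter(ref)

    # Clipped unigram matches: a candidate word is credited at most as many
    # times as it occurs in the reference.
    clipped = sum(min(count, ref_counts[word]) for word, count in cand_counts.items())
    precision = clipped / len(cand)

    # Brevity penalty: candidates shorter than the reference are penalised.
    if len(cand) >= len(ref):
        brevity_penalty = 1.0
    else:
        brevity_penalty = math.exp(1.0 - len(ref) / len(cand))

    return brevity_penalty * precision


if __name__ == "__main__":
    # Hypothetical report sentences, not taken from any dataset.
    generated = "no acute disease"
    ground_truth = "no acute cardiopulmonary disease"
    print(f"BLEU-1: {bleu1(generated, ground_truth):.4f}")
```

In this sketch every word of the shorter generated sentence appears in the reference, so the unigram precision is 1 and the score is reduced only by the brevity penalty.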