Automated Radiographic Report Generation Purely on Transformer: A Multicriteria Supervised Approach

计算机科学 判别式 人工智能 变压器 编码器 判决 加权 自然语言处理 模式识别(心理学) 机器学习 量子力学 医学 操作系统 物理 放射科 电压
作者
Zhanyu Wang,Hongwei Han,Lei Wang,Xiu Li,Luping Zhou
出处
期刊:IEEE Transactions on Medical Imaging [Institute of Electrical and Electronics Engineers]
卷期号:41 (10): 2803-2813 被引量:90
标识
DOI:10.1109/tmi.2022.3171661
摘要

Automated radiographic report generation is challenging in at least two aspects. First, medical images are very similar to each other and the visual differences of clinic importance are often fine-grained. Second, the disease-related words may be submerged by many similar sentences describing the common content of the images, causing the abnormal to be misinterpreted as the normal in the worst case. To tackle these challenges, this paper proposes a pure transformer-based framework to jointly enforce better visual-textual alignment, multi-label diagnostic classification, and word importance weighting, to facilitate report generation. To the best of our knowledge, this is the first pure transformer-based framework for medical report generation, which enjoys the capacity of transformer in learning long range dependencies for both image regions and sentence words. Specifically, for the first challenge, we design a novel mechanism to embed an auxiliary image-text matching objective into the transformer's encoder-decoder structure, so that better correlated image and text features could be learned to help a report to discriminate similar images. For the second challenge, we integrate an additional multi-label classification task into our framework to guide the model in making correct diagnostic predictions. Also, a term-weighting scheme is proposed to reflect the importance of words for training so that our model would not miss key discriminative information. Our work achieves promising performance over the state-of-the-arts on two benchmark datasets, including the largest dataset MIMIC-CXR.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
1秒前
李健的小迷弟应助StrawCc采纳,获得10
2秒前
聪明以筠完成签到,获得积分10
2秒前
徐锋发布了新的文献求助10
2秒前
123完成签到,获得积分10
2秒前
molihuakai应助nemo_yu采纳,获得10
2秒前
瓜子仁完成签到,获得积分20
2秒前
Mireia发布了新的文献求助10
2秒前
zhaoye完成签到,获得积分10
2秒前
清嘉发布了新的文献求助10
3秒前
搜集达人应助冯老师采纳,获得10
4秒前
5秒前
酷波er应助秋秋采纳,获得10
5秒前
ding应助Jane2024采纳,获得10
5秒前
聪明以筠发布了新的文献求助10
6秒前
风城发布了新的文献求助10
7秒前
情怀应助Ammiba采纳,获得10
8秒前
allen1994关注了科研通微信公众号
9秒前
9秒前
10秒前
充电宝应助JHM采纳,获得10
10秒前
lin完成签到,获得积分10
11秒前
11秒前
11秒前
lj完成签到,获得积分20
12秒前
12秒前
CipherSage应助yuxiaohua采纳,获得10
12秒前
13秒前
传奇3应助yyy0109采纳,获得10
14秒前
14秒前
14秒前
lj发布了新的文献求助10
15秒前
zzy发布了新的文献求助10
16秒前
CipherSage应助DUDU采纳,获得10
16秒前
清风完成签到,获得积分10
17秒前
caffeine应助端庄的小海豚采纳,获得10
18秒前
18秒前
惜灵发布了新的文献求助10
18秒前
18秒前
乐乐应助Jane2024采纳,获得10
19秒前
高分求助中
Malcolm Fraser : a biography 700
Signals, Systems, and Signal Processing 610
天津市智库成果选编 600
Climate change and sports: Statistics report on climate change and sports 500
Forced degradation and stability indicating LC method for Letrozole: A stress testing guide 500
Organic Reactions Volume 118 400
A Foreign Missionary on the Long March: The Unpublished Memoirs of Arnolis Hayman of the China Inland Mission 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6465431
求助须知:如何正确求助?哪些是违规求助? 8272420
关于积分的说明 17638041
捐赠科研通 5539652
什么是DOI,文献DOI怎么找? 2907657
邀请新用户注册赠送积分活动 1884755
关于科研通互助平台的介绍 1732248