计算机科学
医学影像学
人工智能
医学物理学
计算机视觉
放射科
医学
作者
Xiulong Yi,You Fu,Jianzhi Yu,Ruiqing Liu,Hao Zhang,Rong Hua
标识
DOI:10.1109/tmi.2024.3507073
摘要
Radiology report generation that aims to accurately describe medical findings for given images, is pivotal in contemporary computer-aided diagnosis. Recently, despite considerable progress, current radiology report generation models still struggled to achieve consistent quality across difficult and easy samples, which dramatically impacts their clinical value. To solve this problem, we explore the difficult samples mining in radiology report generation and propose the Linear Hybrid-Reward based Reinforced Focal Learning (LHR-RFL) to effectively guide the model to allocate more attention towards some difficult samples, thereby enhancing its overall performance in both general and intricate scenarios. In implementation, we first propose the Linear Hybrid-Reward (LHR) module to better quantify the learning difficulty, which employs a linear weighting scheme that assigns varying weights to three representative Natural Language Generation (NLG) evaluation metrics. Then, we propose the Reinforced Focal Learning (RFL) to adaptively adjust the contributions of difficult samples during training, thereby augmenting their impact on model optimization. The experimental results demonstrate that our proposed LHR-RFL improves the performance of the base model across all NLG evaluation metrics, achieving an average performance improvement of 20.9% and 13.2% on IU X-ray and MIMIC-CXR datasets, respectively. Further analysis also proves that our LHR-RFL can dramatically improve the quality of reports for difficult samples. The source code will be available at https://github.com/SKD-HPC/LHR-RFL.
科研通智能强力驱动
Strongly Powered by AbleSci AI