计算机科学
分级(工程)
眼底摄影
眼底(子宫)
人工智能
摄影
糖尿病性视网膜病变
领域(数学)
计算机视觉
眼科
医学
数学
艺术
土木工程
视力
荧光血管造影
纯数学
工程类
视觉艺术
糖尿病
内分泌学
作者
Junlin Hou,Jilan Xu,Fan Xiao,Rui-Wei Zhao,Yuejie Zhang,Haidong Zou,Lina Lu,Wenwen Xue,Rui Feng
标识
DOI:10.1109/bibm55620.2022.9995459
摘要
Automatic diabetic retinopathy (DR) grading based on fundus photography has been widely explored to benefit the routine screening and early treatment. Existing researches generally focus on single-field fundus images, which have limited field of view for precise eye examinations. In clinical applications, ophthalmologists adopt two-field fundus photography as the dominating tool, where the information from each field (i.e., macula-centric and optic disc-centric) is highly correlated and complementary, and benefits comprehensive decisions. However, automatic DR grading based on two-field fundus photography remains a challenging task due to the lack of publicly available datasets and effective fusion strategies. In this work, we first construct a new benchmark dataset (DRTiD) for DR grading, consisting of 3,100 two-field fundus images. To the best of our knowledge, it is the largest public DR dataset with diverse and high-quality two-field images. Then, we propose a novel DR grading approach, namely Cross-Field Transformer (CrossFiT), to capture the correspondence between two fields as well as the long-range spatial correlations within each field. Considering the inherent two-field geometric constraints, we particularly define aligned position embeddings to preserve relative consistent position in fundus. Besides, we perform masked cross-field attention during interaction to filter the noisy relations between fields. Extensive experiments on our DRTiD dataset and a public DeepDRiD dataset demonstrate the effectiveness of our CrossFiT network. The new dataset and the source code of CrossFiT will be publicly available at https://github.com/DU-VTS/DRTiD.
科研通智能强力驱动
Strongly Powered by AbleSci AI