解码方法
人工智能
计算机科学
分级(工程)
糖尿病性视网膜病变
计算机视觉
编码
一般化
编码(社会科学)
模式识别(心理学)
机器学习
深度学习
语音识别
状态空间
约束(计算机辅助设计)
视网膜
监督学习
生物识别
作者
Jingjun Yi,Qi Bi,Hao Zheng,Haolan Zhan,Wei Ji,Huimin Huang,Yong Liang,Xian Wu,Yefeng Zheng
标识
DOI:10.1109/iccvw69036.2025.00462
摘要
Diabetic retinopathy (DR) is a leading cause of vision loss. The rapid advancement of deep learning has significantly propelled the development of automated DR grading methods. However, retinal images are typically collected from diverse clinical centers using equipments from various vendors, leading to domain shifts. As a result, the pre-trained model for DR diagnosis has to handle the unseen retinal images when doing inference. In this paper, we propose Decoupled State Space Decoding (DSSD), a selective state space-based model designed to enhance the generalization capability of DR grading methods to unseen domains. Leveraging the selective scan mechanism to encode local patches relevant to lesions and the recurrent modeling to capture long-range dependencies, DSSD aims to mitigate cross-domain style variations through both channel-wise and sample-wise decoupling. Additionally, it introduces an implicit constraint between shallower and deeper state embeddings to ensure stable recurrent modeling for grade-level predictions. Experiments conducted under both leave-one-domain-out and extreme-single-domain-generalization settings demonstrate its state-of-the-art performance.
科研通智能强力驱动
Strongly Powered by AbleSci AI