计算机科学
频域
卷积神经网络
人工智能
卷积(计算机科学)
分割
模式识别(心理学)
班级(哲学)
领域(数学分析)
深度学习
上下文图像分类
图像分割
人工神经网络
图像(数学)
计算机视觉
数学
数学分析
作者
Yonghao Huang,Chuan Zhou,Leiting Chen,Junjing Chen,Shanlin Lan
标识
DOI:10.1109/bibm52615.2021.9669443
摘要
Medical image segmentation and classification tasks have become increasing accurate by employing deep neural networks. However, existing convolution neural networks models (CNNs) are challenging to achieve quite satisfactory results as medical objects and backgrounds are usually indistinguishable in spatial-domain images. In comparison, it is easier to analyze complex objects in frequency-domain images as different object information is retained in different frequency components. However, training CNNs in the frequency domain requires complex modification for network architecture. Thus, this paper proposes a method of learning in the frequency domain to train CNNs called Frequency domain attention (FDAM) Workflow, which only requires little parameters rise and modification in CNNs. FDAM utilizes the relationship of intra-class frequency to retain valuable frequency information and suppress trivial ones. Furthermore, to reduce computation, a Gate module is designed for deleting redundant frequency channels by exploiting the relationship of inter-class frequency. The proposed methods can be applied in various CNNs, such as U-Net, ResNet and DenseNet, while accepting frequency-domain data as input. Experiment results show a significant performance improvement compared to original CNNs for retinal vessel segmentation, glaucoma classification and pneumonia classification. Specifically, Gate module can improve accuracy while using less input data size.
科研通智能强力驱动
Strongly Powered by AbleSci AI