Artificial intelligence
Computer science
Computer vision
Image registration
Optical coherence tomography
Fundus (eye)
Convolution (computer science)
Similarity measure
Distortion (optics)
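Pattern recognition (psychology)
Image (mathematics)
Optics
Artificial neural network
Physics
Medicine
Ophthalmology
Amplifier
Bandwidth (computing)
Computer network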
Authors
Yuntong Tian,Yan Hu,Yuhui Ma,Huaying Hao,Lei Mou,Jianlong Yang,Yitian Zhao,Jiang Liu
Identifier
DOI:10.1109/embc44109.2020.9175613
Abstract
Registration of multimodal retinal images, such as color fundus images and optical coherence tomography (OCT) images, is of great importance in facilitating the diagnosis and treatment of many eye diseases. However, ground truth is difficult to obtain, and most existing algorithms perform rigid registration without considering optical distortion. In this paper, we present an unsupervised learning method for deformable registration between the two modalities. To solve the registration problem, the network structure achieves a multi-level receptive field and takes both contour and local detail into account. To measure the edge difference caused by the different distortions at the optical center and at the edge, an edge similarity (ES) loss term is proposed, so the loss function is composed of local cross-correlation, edge similarity, and a diffusion regularizer on the spatial gradients of the deformation matrix. Accordingly, we propose a multi-scale input layer, a U-Net with dilated convolutions, a squeeze-and-excitation (SE) block, and spatial transformer layers. Quantitative experiments show that the proposed framework performs best among several conventional and deep learning-based methods, and that our ES loss and the structure combined with U-Net and multi-scale layers achieve competitive results on normal and abnormal images.
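The three-term loss named in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the abstract does not define the ES term, so the edge term below (mean absolute difference of gradient-magnitude maps) is an assumption, as are the window size `win`, the weights `alpha` and `lam`, and all function names.

```python
import numpy as np

def box_sum(x, win):
    """Sum of x over every win x win window (valid positions), via an integral image."""
    c = np.cumsum(np.cumsum(x, axis=0), axis=1)
    c = np.pad(c, ((1, 0), (1, 0)))
    return c[win:, win:] - c[:-win, win:] - c[win:, :-win] + c[:-win, :-win]

def local_cross_correlation(fixed, moved, win=9, eps=1e-5):
    """Mean squared local normalized cross-correlation (higher means more similar)."""
    n = win * win
    sf, sm = box_sum(fixed, win), box_sum(moved, win)
    cross = box_sum(fixed * moved, win) - sf * sm / n
    var_f = box_sum(fixed * fixed, win) - sf * sf / n
    var_m = box_sum(moved * moved, win) - sm * sm / n
    return float(np.mean(cross * cross / (var_f * var_m + eps)))

def edge_map(img):
    """Gradient magnitude as a simple edge map (assumed stand-in for the paper's edges)."""
    gy, gx = np.gradient(img)
    return np.sqrt(gx * gx + gy * gy)

def edge_similarity_loss(fixed, moved):
    """Assumed ES term: mean absolute difference between the two edge maps."""
    return float(np.mean(np.abs(edge_map(fixed) - edge_map(moved))))

def diffusion_regularizer(flow):
    """Mean squared forward differences of a displacement field of shape (2, H, W)."""
    dy = np.diff(flow, axis=1)
    dx = np.diff(flow, axis=2)
    return float(np.mean(dy * dy) + np.mean(dx * dx))

def total_loss(fixed, moved, flow, alpha=1.0, lam=0.01):
    """Negative LCC plus weighted edge and smoothness terms (weights are hypothetical)."""
    return (-local_cross_correlation(fixed, moved)
            + alpha * edge_similarity_loss(fixed, moved)
            + lam * diffusion_regularizer(flow))
```

Here `moved` stands for the moving image after warping by `flow`; in a learned setting the warp would be applied by the spatial transformer layers, which this sketch omits.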