卷积(计算机科学)
计算机科学
红外线的
人工智能
融合
计算机视觉
图像融合
变压器
对偶(语法数字)
图像(数学)
光学
物理
电气工程
电压
人工神经网络
工程类
艺术
哲学
语言学
文学类
作者
Honggang Zhang,Haitao Yang,Fengjie Zheng,Haoyu Wang,Guo Ningbo,Yifan Xu
摘要
Fusion of texture information and semantic information from visible and infrared images is important for many fields. However, existing algorithms extract features with a limited range of receptive fields which causes the loss of contextual information. This paper introduces DCTDFusion, a two-branch network integrating dilated convolution and Transformer to address this issue. We propose Residual Dilated Convolutions Block (RDCB) to extract local information of different receptive field ranges. To get the global information at multiple scales, we introduce the Transformer Dilated Convolutions Block. In addition, the Sobel feature extractor is used to retain the gradient information of both branches. Numerous experimental evidences show that the fused images of DCDTFusion contain rich information of significant targets and background in source images. Additionally, our method improves about 20%, 4%, 1% and 2% in MI, VIF, QAB/F and SSIM metrics comparing with the second best.
科研通智能强力驱动
Strongly Powered by AbleSci AI