人工智能
计算机科学
变压器
级联
模式识别(心理学)
卷积神经网络
残余物
分割
目标检测
计算机视觉
特征学习
深度学习
算法
工程类
电压
化学工程
电气工程
作者
J. W. Li,Lei Liu,Hong Song,Yuqi Huang,Junjun Jiang,Jian Yang
标识
DOI:10.1109/tim.2023.3325520
摘要
Deep learning has become a popular technique for infrared and visible image fusion due to its powerful feature representation abilities. Existing methods often employ the CNN, Transformer, or mixed CNN-Transformer to treat different modalities indiscriminately. However, we observe that these homogeneous fusion networks (i.e., same network for cross-modal) struggle to handle the specific characteristics of each modality, leading to fused images being contaminated with interference or lacking modality-specific information. To alleviate this issue, we propose a heterogeneous Dual-branch multi-cascade fusion network based on CNN and Transformer, named DCTNet, aims independently to maintain the focus of respective modalities, such as radiation intensity of infrared images and texture details of visible images. In DCTNet, the CNN branch is specifically designed to exploit local features in visible images by utillizing stacked residual dense CNNs, while the Transformer branch is tailored to model the long-range dependencies, capturing the overall temperature distribution and thermal patterns of a scene in infrared images by cascading residual Transformers. In addition, we introduce an Adaptive Fusion Interaction Module (AFIM) that leverages attention mechanisms to adaptively high-light informative regions in the fused feature maps. The module assigns weights to these regions based on their contributions and dynamically merges features from the dual-branch at multiple levels. Experimental results on three public datasets demonstrate that the proposed method outperforms state-of-the-art methods in terms of quantitative and qualitative evaluations. Moreover, we showcase the promising performance of DCTNet in downstream object detection and semantic segmentation applications on a widely-accepted benchmark.
科研通智能强力驱动
Strongly Powered by AbleSci AI