仿射变换
人工智能
计算机科学
模式识别(心理学)
特征(语言学)
背景(考古学)
转化(遗传学)
卷积(计算机科学)
相关性
突出
计算机视觉
特征提取
RGB颜色模型
模态(人机交互)
人工神经网络
数学
生物
基因
哲学
古生物学
生物化学
化学
纯数学
语言学
几何学
作者
Zhengzheng Tu,Zhun Li,Chenglong Li,Jin Tang
标识
DOI:10.1109/tip.2022.3176540
摘要
RGBT Salient Object Detection (SOD) focuses on common salient regions of a pair of visible and thermal infrared images. Existing methods perform on the well-aligned RGBT image pairs, but the captured image pairs are always unaligned and aligning them requires much labor cost. To handle this problem, we propose a novel deep correlation network (DCNet), which explores the correlations across RGB and thermal modalities, for weakly alignment-free RGBT SOD. In particular, DCNet includes a modality alignment module based on the spatial affine transformation, the feature-wise affine transformation and the dynamic convolution to model the strong correlation of two modalities. Moreover, we propose a novel bi-directional decoder model, which combines the coarse-to-fine and fine-to-coarse processes for better feature enhancement. In particular, we design a modality correlation ConvLSTM by adding the first two components of modality alignment module and a global context reinforcement module into ConvLSTM, which is used to decode hierarchical features in both top-down and button-up manners. Extensive experiments on three public benchmark datasets show the remarkable performance of our method against state-of-the-art methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI