计算机科学
卷积神经网络
人工智能
变压器
语义特征
深度学习
语义计算
特征(语言学)
模式识别(心理学)
计算机视觉
语义网
工程类
语言学
哲学
电压
电气工程
作者
Xin Yang,Hongtao Huo,Chang Li,Xiaowen Liu,Wenxi Wang,Cheng Wang
标识
DOI:10.1016/j.patcog.2023.110223
摘要
Deep learning based fusion mechanisms have achieved sophisticated performance in the field of image fusion. However, most existing approaches focus on learning global and local features but seldom consider to modeling semantic information, which might result in inadequate source information preservation. In this work, we propose a semantic perceptive infrared and visible image fusion Transformer (SePT). The proposed SePT extracts local feature through convolutional neural network (CNN) based module and learns long-range dependency through Transformer based modules, and meanwhile designs two semantic modeling modules based on Transformer architecture to manage high-level semantic information. One semantic modeling module maps the shallow features of source images into deep semantic, the other learns the deep semantic information in different receptive fields. The final fused results are recovered from the combination of local feature, long-range dependency and semantic feature. Extensive comparison experiments demonstrate the superiority of SePT compare to other advanced fusion approaches.
科研通智能强力驱动
Strongly Powered by AbleSci AI