SwinFusion: Cross-domain Long-range Learning for General Image Fusion via Swin Transformer

计算机科学 融合 人工智能 变压器 计算机视觉 电气工程 工程类 哲学 电压 语言学
作者
Jiayi Ma,Linfeng Tang,Fan Fan,Jun Huang,Xiaoguang Mei,Yong Ma
出处
期刊:IEEE/CAA Journal of Automatica Sinica [Institute of Electrical and Electronics Engineers]
卷期号:9 (7): 1200-1217 被引量:235
标识
DOI:10.1109/jas.2022.105686
摘要

This study proposes a novel general image fusion framework based on cross-domain long-range learning and Swin Transformer, termed as SwinFusion. On the one hand, an attention-guided cross-domain module is devised to achieve sufficient integration of complementary information and global interaction. More specifically, the proposed method involves an intra-domain fusion unit based on self-attention and an inter-domain fusion unit based on cross-attention, which mine and integrate long dependencies within the same domain and across domains. Through long-range dependency modeling, the network is able to fully implement domain-specific information extraction and cross-domain complementary information integration as well as maintaining the appropriate apparent intensity from a global perspective. In particular, we introduce the shifted windows mechanism into the self-attention and cross-attention, which allows our model to receive images with arbitrary sizes. On the other hand, the multi-scene image fusion problems are generalized to a unified framework with structure maintenance, detail preservation, and proper intensity control. Moreover, an elaborate loss function, consisting of SSIM loss, texture loss, and intensity loss, drives the network to preserve abundant texture details and structural information, as well as presenting optimal apparent intensity. Extensive experiments on both multi-modal image fusion and digital photography image fusion demonstrate the superiority of our SwinFusion compared to the state-of-the-art unified image fusion algorithms and task-specific alternatives. Implementation code and pre-trained weights can be accessed at https://github.com/Linfeng-Tang/SwinFusion.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
1秒前
在水一方应助Charon采纳,获得10
5秒前
6秒前
丰富的乐儿完成签到 ,获得积分10
7秒前
8秒前
乐乐应助公西富采纳,获得30
13秒前
14秒前
kakawang完成签到 ,获得积分10
16秒前
涨涨涨完成签到 ,获得积分10
17秒前
薇薇完成签到,获得积分10
17秒前
李爱国应助Viviiviii采纳,获得30
17秒前
SciGPT应助科研通管家采纳,获得10
18秒前
上官若男应助科研通管家采纳,获得10
18秒前
天天快乐应助科研通管家采纳,获得10
18秒前
18秒前
英俊的铭应助科研通管家采纳,获得10
18秒前
大模型应助科研通管家采纳,获得10
18秒前
18秒前
薇薇发布了新的文献求助10
21秒前
llh完成签到 ,获得积分10
22秒前
wenxiansci完成签到,获得积分0
22秒前
嘻嘻耶耶给嘻嘻耶耶的求助进行了留言
22秒前
wxzk发布了新的文献求助10
26秒前
研友_LpvElZ完成签到,获得积分10
26秒前
桐桐应助含蓄的梦山采纳,获得10
27秒前
harrylee应助Zbmd采纳,获得10
27秒前
纯真的诗兰完成签到,获得积分10
28秒前
cctv18应助长情的小虾米采纳,获得10
28秒前
Lucas应助云襄采纳,获得10
29秒前
研友_LOK59L发布了新的文献求助10
29秒前
gaogao完成签到,获得积分10
29秒前
丹曦完成签到,获得积分10
31秒前
33秒前
胡通才是ke研通完成签到,获得积分10
33秒前
丘比特应助lalala采纳,获得10
34秒前
丹曦发布了新的文献求助10
36秒前
37秒前
小灯完成签到,获得积分10
38秒前
38秒前
40秒前
高分求助中
请在求助之前详细阅读求助说明!!!! 20000
One Man Talking: Selected Essays of Shao Xunmei, 1929–1939 1000
The Three Stars Each: The Astrolabes and Related Texts 900
Yuwu Song, Biographical Dictionary of the People's Republic of China 800
Multifunctional Agriculture, A New Paradigm for European Agriculture and Rural Development 600
Bernd Ziesemer - Maos deutscher Topagent: Wie China die Bundesrepublik eroberte 500
A radiographic standard of reference for the growing knee 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 有机化学 工程类 生物化学 纳米技术 物理 内科学 计算机科学 化学工程 复合材料 遗传学 基因 物理化学 催化作用 电极 光电子学 量子力学
热门帖子
关注 科研通微信公众号,转发送积分 2477251
求助须知:如何正确求助?哪些是违规求助? 2141085
关于积分的说明 5457541
捐赠科研通 1864315
什么是DOI,文献DOI怎么找? 926807
版权声明 562872
科研通“疑难数据库(出版商)”最低求助积分说明 495905