已入深夜,您辛苦了!由于当前在线用户较少,发布求助请尽量完整地填写文献信息,科研通机器人24小时在线,伴您度过漫漫科研夜!祝你早点完成任务,早点休息,好梦!

FusionINV: A Diffusion-Based Approach for Multimodal Image Fusion

图像融合 计算机科学 人工智能 计算机视觉 图像处理 融合 图像(数学) 模式识别(心理学) 语言学 哲学
作者
Pengwei Liang,Junjun Jiang,Qing Ma,Chenyang Wang,Xianming Liu,Jiayi Ma
出处
期刊:IEEE transactions on image processing [Institute of Electrical and Electronics Engineers]
卷期号:34: 5355-5368 被引量:4
标识
DOI:10.1109/tip.2025.3593775
摘要

Infrared images exhibit a significantly different appearance compared to visible counterparts. Existing infrared and visible image fusion (IVF) methods fuse features from both infrared and visible images, producing a new "image" appearance not inherently captured by any existing device. From an appearance perspective, infrared, visible, and fused images belong to different data domains. This difference makes it challenging to apply fused images because their domain-specific appearance may be difficult for downstream systems, e.g., pre-trained segmentation models. Therefore, accurately assessing the quality of the fused image is challenging. To address those problem, we propose a novel IVF method, FusionINV, which produces fused images with an appearance similar to visible images. FusionINV employs the pre-trained Stable Diffusion (SD) model to invert infrared images into the noise feature space. To inject visible-style appearance information into the infrared features, we leverage the inverted features from visible images to guide this inversion process. In this way, we can embed all the information of infrared and visible images in the noise feature space, and then use the prior of the pre-trained SD model to generate visually friendly images that align more closely with the RGB distribution. Specially, to generate the fused image, we design a tailored fusion rule within the denoising process that iteratively fuses visible-style infrared and visible features. In this way, the fused image falls into the visible domain and can be directly applied to existing downstream machine systems. Thanks to advancements in image inversion, FusionINV can directly produce fused images in a training-free manner. Extensive experiments demonstrate that FusionINV achieves outstanding performance in both human visual evaluation and machine perception tasks. The code is available at https://github.com/erfect2020/FusionINV.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
顾矜应助风华笔墨采纳,获得10
刚刚
1秒前
马马完成签到 ,获得积分10
2秒前
安静的春天完成签到,获得积分10
3秒前
升级小水桶完成签到 ,获得积分10
3秒前
胖头鱼发布了新的文献求助10
4秒前
汉堡包应助落水鎏情采纳,获得10
4秒前
4秒前
5秒前
JamesPei应助Hu采纳,获得10
5秒前
传统的丹雪完成签到 ,获得积分10
5秒前
6秒前
马浩航发布了新的文献求助20
6秒前
Robin发布了新的文献求助10
7秒前
7秒前
哇咔咔完成签到 ,获得积分10
7秒前
8秒前
123完成签到,获得积分20
9秒前
9秒前
lyw发布了新的文献求助10
9秒前
10秒前
SciGPT应助拼搏的万言采纳,获得20
11秒前
小二郎应助洛清河采纳,获得10
11秒前
早晨发布了新的文献求助10
12秒前
风华笔墨发布了新的文献求助10
12秒前
烂漫以山发布了新的文献求助10
13秒前
充电宝应助晓倩采纳,获得10
15秒前
15秒前
15秒前
欢呼的映秋完成签到,获得积分20
16秒前
李健的小迷弟应助Joif采纳,获得10
17秒前
19秒前
落水鎏情发布了新的文献求助10
20秒前
20秒前
21秒前
zhangli完成签到,获得积分10
21秒前
开心丸子发布了新的文献求助10
23秒前
甘乐发布了新的文献求助10
23秒前
lyw完成签到,获得积分10
24秒前
25秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Real Analysis: Theory of Measure and Integration (3rd Edition) Epub版 1200
AnnualResearch andConsultation Report of Panorama survey and Investment strategy onChinaIndustry 1000
卤化钙钛矿人工突触的研究 1000
Engineering for calcareous sediments : proceedings of the International Conference on Calcareous Sediments, Perth 15-18 March 1988 / edited by R.J. Jewell, D.C. Andrews 1000
Continuing Syntax 1000
Signals, Systems, and Signal Processing 610
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6261156
求助须知:如何正确求助?哪些是违规求助? 8083252
关于积分的说明 16889883
捐赠科研通 5332532
什么是DOI,文献DOI怎么找? 2838488
邀请新用户注册赠送积分活动 1815941
关于科研通互助平台的介绍 1669586