哈尔
哈尔小波转换
人工智能
图像融合
小波变换
计算机视觉
对偶(语法数字)
融合
红外线的
小波
计算机科学
离散小波变换
图像(数学)
光学
物理
艺术
语言学
哲学
文学类
作者
Yuequn Wang,Jie Liu,Jianli Wang,Leqiang Yang,Bo Dong,Zhengwei Li
标识
DOI:10.1016/j.patcog.2025.111594
摘要
Infrared-visible image fusion remains challenging due to the inherent conflict between preserving multi-modal complementary features and minimizing reconstruction loss. Existing methods often suffer from inadequate feature representation and information degradation during fusion. To address this, we propose HaarFuse, a wavelet-enhanced auto-encoder network that hierarchically integrates multi-scale features for robust fusion. The network first employs wavelet transform to extend the receptive field of convolutional layers, extracting shared shallow features that encode both low-frequency structural contours and high-frequency texture primitives. Subsequently, the shallow features are decomposed into high-frequency and low-frequency components through Haar wavelet transform, and techniques such as INN, Gabor layer, and Transformer are adopted to further optimize and process these features. Finally, the fused image is reconstructed via the inverse wavelet transform. Experiments on TNO, MSRS, and M3FD benchmarks validate HaarFuse's superiority: it achieves the highest thermal saliency (SD=45.78, +5.5%↑ on MSRS; EN=6.98, +4.0%↑ on M3FD), optimal edge fidelity (Qabf=0.62, +1.6%↑ on M3FD), and 34.2 × faster inference than SwinFusion with 0.468MB parameters. Further validation in machine vision and medical imaging confirms its robustness for real-time applications.
科研通智能强力驱动
Strongly Powered by AbleSci AI