计算机科学
人工智能
计算机视觉
水准点(测量)
过程(计算)
代表(政治)
领域(数学分析)
图像配准
特征(语言学)
像素
职位(财务)
红外线的
方向(向量空间)
特征提取
模式识别(心理学)
翻译(生物学)
忠诚
残余物
校准
基本事实
图像(数学)
图像融合
编码(集合论)
目标检测
迭代重建
光学接近校正
比例(比率)
图像传感器
光流
光谱带
作者
Zhiying Jiang,Zengxi Zhang,Jinyuan Liu
标识
DOI:10.1109/tip.2025.3607585
摘要
Infrared and visible image alignment is essential and critical to the fusion and multi-modal perception applications. It addresses discrepancies in position and scale caused by spectral properties and environmental variations, ensuring precise pixel correspondence and spatial consistency. Existing manual calibration requires regular maintenance and exhibits poor portability, challenging the adaptability of multi-modal application in dynamic environments. In this paper, we propose a harmonized representation based infrared and visible image alignment, achieving both high accuracy and scene adaptability. Specifically, with regard to the disparity between multi-modal images, we develop an invertible translation process to establish a harmonized representation domain that effectively encapsulates the feature intensity and distribution of both infrared and visible modalities. Building on this, we design a hierarchical framework to correct deformations inferred from the harmonized domain in a coarse-to-fine manner. Our framework leverages advanced perception capabilities alongside residual estimation to enable accurate regression of sparse offsets, while an alternate correlation search mechanism ensures precise correspondence matching. Furthermore, we propose the first ground truth available misaligned infrared and visible image benchmark for evaluation. Extensive experiments validate the effectiveness of the proposed method against the state-of-the-arts, advancing the subsequent applications further. Code and dataset are available at https://github.com/Jzy2017/HR4IR.
科研通智能强力驱动
Strongly Powered by AbleSci AI