高光谱成像
计算机科学
人工智能
计算机视觉
跟踪(教育)
传感器融合
对象(语法)
融合
遥感
视频跟踪
雷达跟踪器
地质学
雷达
电信
哲学
心理学
语言学
教育学
作者
Zhuanfeng Li,Fengchao Xiong,Jun Zhou,Jianfeng Lu,Zhuang Zhao,Yuntao Qian
标识
DOI:10.1109/tgrs.2024.3366536
摘要
Hyperspectral videos (HSVs) have more potential in object tracking than color videos thanks to their material identification ability. Nevertheless, previous works have not fully explored the benefits of the material information, resulting in limited representation ability and tracking accuracy. To address this issue, this paper introduces a material-guided multi-view fusion network for improved tracking. Specifically, we combine false-color information, hyperspectral information, and material information obtained by hyperspectral unmixing to provide a rich multi-view representation of the object. Cross-material attention is employed to capture the interaction among materials, enabling the network to focus on the most relevant materials for the target. Furthermore, leveraging the discriminative ability of material view, a novel material-guided multi-view fusion module is proposed to capture both intra-view and cross-view long-range spatial dependencies for effective feature aggregation. Thanks to the enhanced representation ability of each view and the integration of the complementary advantages of all views, our network is more capable of suppressing the tracking drift in various challenging scenes and achieving accurate object localization. Extensive experiments show that our tracker achieves state-of-the-art tracking performance. The source code will be available at https://github.com/hscv/MMF-Net.
科研通智能强力驱动
Strongly Powered by AbleSci AI