模态(人机交互)
计算机科学
计算机视觉
人工智能
作者
Yishuo Chen,Boran Wang,Wenbin Zhu,Yuan Jing
标识
DOI:10.1109/yac63405.2024.10598725
摘要
Utilizing cross-modality information between infrared and visible light presents a novel approach to enhancing detection performance. However, existing methods for leveraging visible-infrared information are relatively simplistic, failing to thoroughly explore the relationship between infrared and visible light features. Therefore, this paper introduces RI-YOLO, a new framework for infrared-visible images object detection. Firstly, to retain the inherent advantages of each modality image, we propose G&M-DI, a technique for modality-specific reconstruction and cross-modality information fusion. Secondly, by introducing AWI, we employ an adaptive weight allocation strategy to mitigate cross-modality interference, effectively leveraging the relationship between different modalities. Experimental results on datasets M3FD and LLVIP demonstrate that RI-YOLO surpasses existing state-of-the-art (SOTA) models.
科研通智能强力驱动
Strongly Powered by AbleSci AI