计算机科学
目标检测
人工智能
情态动词
特征(语言学)
利用
互补性(分子生物学)
传感器融合
模式
频道(广播)
相互信息
补偿(心理学)
计算机视觉
特征提取
对象(语法)
融合机制
融合
模式识别(心理学)
特征学习
信息融合
数据挖掘
模态(人机交互)
编码(内存)
视觉对象识别的认知神经科学
信息处理
能见度
编码(集合论)
机器学习
作者
Tao Zhang,Ruitao Lu,Xiaogang Yang,Dingwen Zhang,Yansheng Li,Xueli Xie,Yunsong Li
标识
DOI:10.1109/tgrs.2025.3647293
摘要
Visible-Infrared (RGB-IR) object detection plays a crucial role in UAV-based vision tasks. However, existing methods still suffer from learning bias caused by imbalanced information distribution and inaccurate fusion due to modal conflicts. Inspired by the human multisensory information processing mechanism, we propose a novel “CompensationFusion” progressive detection framework, CCSFuse, to fully exploit the complementary relationship between modalities while eliminating conflict interference. Specifically, we design a cross-modal feature compensation module, which establishes inter-modal information interaction to achieve mutual complementarity and enhancement during feature extraction, effectively mitigating the issue of imbalanced modal information distribution. Additionally, we introduce an adaptive feature-selection fusion module to address modal conflicts. We employ a cross-modal channel attention to calibrate channel features of different modalities and utilizes a selective fusion strategy to dynamically assess modal importance, thereby achieving adaptive modal fusion. Finally, we validate the effectiveness of CCSFuse on the DroneVehicle and LLVIP datasets. The results confirm that CCSFuse significantly improves the efficiency of feature optimization and integration. In UAV-based object detection scenarios, CCSFuse outperforms state-of-the-art methods in both qualitative and quantitative comparisons, particularly for small objects and low-quality modalities. The code is available at https://github.com/ZhangT-xxl/CCSFuse.
科研通智能强力驱动
Strongly Powered by AbleSci AI