人工智能
计算机科学
稳健性(进化)
模式识别(心理学)
目标检测
特征提取
融合
RGB颜色模型
计算机视觉
图像融合
图像纹理
传感器融合
互补性(分子生物学)
特征(语言学)
光学(聚焦)
人工神经网络
信息融合
代表(政治)
融合机制
图像处理
特征检测(计算机视觉)
上下文图像分类
特征学习
视觉对象识别的认知神经科学
图像分割
信息丢失
作者
C. Li,Suiping Zhou,Z. Liu,Wenjie Zhang,Ting Wu
标识
DOI:10.1109/taes.2025.3646996
摘要
To improve the accuracy and robustness of anti-UAV detection, this study introduces TransAUAV, a Transformer-based RGB- infrared image fusion detection network. This approach strengthens object feature representation by leveraging multi-modal data fusion and self-attention mechanisms. Specifically, 1) we propose a multi-modal attention fusion module, which enhances the complementarity of RGB and infrared image features through a dual-path attention mechanism; 2) we propose a cross-layer multi-scale Transformer module, which improves detection performance by extracting multi-scale features and facilitating cross-modal information interaction; 3) we propose a texture information focus module, which enhances the representation of local texture details. Additionally, this paper designs a hybrid loss function to improve feature discrimination capability and training efficiency. Experimental results on anti-UAV and Drone-detection datasets show that TransAUAV outperforms state-of-the-art methods on all evaluation metrics.
科研通智能强力驱动
Strongly Powered by AbleSci AI