Boosting(机器学习)
计算机科学
遥感
块(置换群论)
红外线的
人工智能
光学
地质学
物理
几何学
数学
作者
Yonghao Gu,Ying Guo,Wei Xie,WU Zhe,Shibo Dong,Gaogang Xie,Weifeng Xu
标识
DOI:10.1109/tgrs.2025.3566889
摘要
This paper tackles the challenges of infrared small target detection, aiming to improve detection accuracy and robustness in complex, low-contrast infrared environments. We propose several novel enhancements to YOLO-based models, commonly employed in real-time target detection tasks. First, we introduce a Multi-Scale Dilated Separable Fusion (MDSF) block, a flexible plug-in that can replace traditional convolution layers and be inserted at various stages of the network. This module enhances the network’s sensitivity to small targets by leveraging large convolution kernels in conjunction with multi-scale decomposition. Next, we design a Deep Feature Fusion (DFF) module and a Multi-Scale Dilated Separable Fusion Head (MDSF-Head) based on the MDSF block, and integrate them into YOLO models (v5-v11), resulting in significant performance gains, with mAP@50 values improving by 5.4% to 9.6%. Furthermore, we propose the coarse-to-fine Spatial and Channel Reconstruction Convolution (C2f_SCConv) module, which effectively fuses shallow spatial features with deep semantic features, boosting detection performance, particularly for occluded and small targets. Additionally, we incorporate the Spatial-to-Depth Convolution (SPD) module and replace the traditional Complete IoU (CIoU) with Efficient-IoU (EIoU) to further optimize the model. Experimental results on the Forward Looking Infrared ADAS (FLIR) dataset demonstrate that our approach outperforms the baseline YOLOv8n, with improvements of 10.9% in mAP@50 and 10.3% in mAP@50-95. On the High-altitude Infrared Thermal dataset for Unmanned Aerial Vehicle-based object detection (HIT-UAV) dataset, we observe similar improvements, with mAP@50 increasing by 8.1% and mAP@50-95 by 9.7%. These results validate the effectiveness of our proposed method, substantially enhancing detection accuracy, robustness, and adaptability in challenging infrared environments.
科研通智能强力驱动
Strongly Powered by AbleSci AI