红外线的
比例(比率)
特征(语言学)
融合
计算机科学
人工智能
模式识别(心理学)
算法
计算机视觉
物理
光学
语言学
量子力学
哲学
作者
mei Da,Y. Tao,Lin Jiang,Jue Hu,Zhijian Zhang
标识
DOI:10.1088/1361-6501/adbe96
摘要
Abstract In infrared target detection, achieving reliable detection results at high speed is essential. To address the problems of low accuracy, large number of parameters, and complexity of the target detection model for infrared images in complex backgrounds, it is difficult to achieve a better balance between accuracy and speed, we propose a YOLO-GCSPNet-EDAM Attention (YOLO-GEA) infrared target detection algorithm. Firstly, to significantly reduce model parameters while maintaining detection accuracy, we designed a lightweight partial multi-scale feature aggregation module(CSP-PMFA). Secondly, we proposed an Efficient Dual Attention Mechanism (EDAM), which adaptively learns the importance of each channel and spatial position, thereby better capturing key information in the image. Additionally, we developed the GCSPNet backbone network based on GhostNet and the CSP-PMFA module to optimize the feature aggregation process and achieve model compression. Finally, the WIoUv3 loss function is employed to further improve the precision of bounding box regression. Experiments demonstrate that on the IFdata and FLIR datasets, the mAP@0.5 reached 90.1% and 86.0%, respectively, representing improvements of 3.4% and 1.5% compared to the original YOLOv8, while reducing the number of parameters by 13.0%. Additionally, on the NEU-DET dataset, the mAP@0.5 achieved 79.8%, validating the model's generalization performance across different datasets.
科研通智能强力驱动
Strongly Powered by AbleSci AI