Fine-tuned YOLOv5 for real-time vehicle detection in UAV imagery: Architectural improvements and performance boost

计算机科学最小边界框卷积神经网络目标检测人工智能推论失败深度学习跳跃式监视特征（语言学）实时计算机器学习计算机视觉模式识别（心理学）图像（数学）语言学哲学并行计算

作者

Mohammad Hossein Hamzenejadi,Hadis Mohseni

出处

期刊：Expert Systems With Applications [Elsevier BV]
日期：2023-06-16 卷期号：231: 120845-120845 被引量：37

标识

DOI：10.1016/j.eswa.2023.120845

摘要

Nowadays, Unmanned Aerial Vehicles (UAVs) have become useful for various civil applications, such as traffic monitoring and smart parkings, where real-time vehicle detection and classification is one of the key tasks. There are many challenges in detecting vehicles including small size objects and the variety in the UAV’s altitude and angle. As classic object detection solutions have limitations in confronting these challenges, recent methods are developed based on convolutional neural networks and their ability in effective feature learning. Due to the computational complexity in these networks and the need for accurate and real-time object detection, balancing the accuracy and inference speed is obligatory for efficiency. This paper aims to propose an accurate, efficient and real-time vehicle detection network based on the successful YOLOv5 object detection model. This is done by improving the structure of the model, adding attention mechanism and using an adaptive bounding box regression loss function. Also, considering the need for real-time inference speed, the depth and width of the model was balanced and ghost convolution was incorporated into the Neck unit to further improve the balance between accuracy and inference speed. The proposed method is evaluated on three different urban UAV imagery datasets, VisDrone, CARPK and VAID, specifically intended for civil applications. Comparing the obtained results from the proposed method with YOLOv5 baseline models, it achieved 3.52% higher mAP50 and 207.15% higher FPS than YOLOv5X on VisDrone dataset, while it is much smaller in size and GFLOPS. Totally, the proposed network outcomes show how the applied structural and conceptual modifications can upgrade the YOLO family towards being small in size, high in accuracy and fast in inference speed.

求助该文献

最长约 10秒，即可获得该文献文件

Fine-tuned YOLOv5 for real-time vehicle detection in UAV imagery: Architectural improvements and performance boost

今日热心研友