计算机科学
目标检测
特征(语言学)
比例(比率)
人工智能
趋同(经济学)
先验与后验
频道(广播)
对象(语法)
计算机视觉
像素
模式识别(心理学)
数据挖掘
算法
地理
哲学
经济
认识论
地图学
经济增长
语言学
计算机网络
作者
Lingzhi Shen,Hongfeng Tao,Yuanzhi Ni,Yue Wang,Vladimir Stojanović
标识
DOI:10.1088/1361-6501/acb075
摘要
Abstract Road object detection is an essential and imperative step for driving intelligent vehicles. Generally, road objects, such as vehicles and pedestrians, present the characteristic of multi-scale and uncertain distribution which puts a high demand on the detection algorithm. Therefore, this paper proposes a YOLOv3 (You Only Look Once v3)-based method aimed at enhancing the capability of cross-scale detection and focusing on the valuable area. The proposed method fills an urgent need for multi-scale detection, and its individual components will be useful in road object detection. The K-means-GIoU algorithm is designed to generate a priori boxes whose shapes are close to real boxes. This greatly reduces the complexity of training, paving the way for fast convergence. Then, a detection branch is added to detect small targets, and a feature map cropping module is introduced into the newly added detection branch to remove the areas with high probability of background targets and easy-to-detect targets, and the cropped areas of the feature map are filled with a value of 0. Further, a channel attention module and spatial attention module are added to strengthen the network’s attention to major regions. The experiment results on the KITTI dataset show that the proposed method maintains a fast detection speed and increases the mAP (mean average precision) value by as much as 2.86 % compared with YOLOv3-ultralytics, and especially improves the detection performance for small-scale objects.
科研通智能强力驱动
Strongly Powered by AbleSci AI