An improved lightweight small object detection framework applied to real-time autonomous driving

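Computer science, Pruning, Block (permutation group theory), Kernel (algebra), Object (grammar), Convolution (computer science), Object detection, Ranking (information retrieval), Artificial intelligence, Deep learning, Computer vision, Pattern recognition (psychology), Artificial neural network, Mathematics, Biology, Combinatorics, Agronomy, Geometry
Authors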
Bharat Mahaur,Krishn Kumar Mishra,Amit Kumar
Source
Journal: Expert Systems With Applications [Elsevier]
Volume: 234: 121036-121036 Cited by: 8
Identifier
DOI:10.1016/j.eswa.2023.121036
Abstract

Recent deep learning-based object detectors have shown compelling performance for the detection of large objects in autonomous driving applications. However, the detection of small objects like traffic signs and traffic lights is challenging owing to the complex nature of such objects. This article investigates how an existing object detector can be adjusted to address specific tasks and how these modifications can impact the detection of small objects. In particular, we explore and introduce architectural changes to the different components of the popular YOLOv5 model in order to improve its performance in the detection of small objects for autonomous driving. Initially, we propose group depthwise separable convolution as the improved convolution unit to replace standard convolution. We then integrate this unit to create the attention-based dilated CSP block. Lastly, this block is combined with several proposed modules, including the improved SPP, improved PANet, and improved information paths, to form our IS-YOLOv5 model. We also integrate kernel pruning on the network to accelerate model deployment on vehicle-mounted mobile platforms, which have limited computing resources and real-time constraints. Specifically, we propose the versatile network pruning (VNP) technique based on Taylor criterion ranking to prune less-essential kernels in the network. We show that our modifications barely increase the complexity but significantly improve the detection accuracy and speed. Compared to the conventional YOLOv5, the proposed IS-YOLOv5 model increases the mAP by 8.35% on the BDD100K dataset. Besides, our proposed model improves the detection speed in FPS by 3.10% compared to the YOLOv5 model. When using the VNP scheme, FPS is further increased by 52.14%, while the model size and complexity are reduced by 39.29% and 47.81%, with almost no change in mAP. Compared to state-of-the-art models, IS-YOLOv5+VNP is found to be conducive to deployment in autonomous driving systems.
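
The abstract describes pruning "less-essential kernels" by ranking them with a Taylor criterion but does not give the details of VNP. As a rough illustration only, the sketch below uses the common first-order Taylor importance score (the absolute value of the mean of activation times gradient over a kernel's feature map) and picks the lowest-ranked kernels. The function names, the toy data, and the `prune_ratio` parameter are assumptions made for this example, not the authors' implementation.

```python
from typing import List, Sequence

# A feature map is a list of rows; each kernel (output channel) has one map.
FeatureMap = Sequence[Sequence[float]]


def taylor_score(activation: FeatureMap, gradient: FeatureMap) -> float:
    """First-order Taylor importance of one kernel for one sample:
    |mean over positions of (activation * dLoss/dActivation)|."""
    total = 0.0
    count = 0
    for act_row, grad_row in zip(activation, gradient):
        for a, g in zip(act_row, grad_row):
            total += a * g
            count += 1
    return abs(total / count)


def rank_kernels(activations: Sequence[FeatureMap],
                 gradients: Sequence[FeatureMap]) -> List[int]:
    """Return kernel indices ordered from least to most important."""
    scores = [taylor_score(a, g) for a, g in zip(activations, gradients)]
    return sorted(range(len(scores)), key=lambda k: scores[k])


def kernels_to_prune(activations: Sequence[FeatureMap],
                     gradients: Sequence[FeatureMap],
                     prune_ratio: float) -> List[int]:
    """Pick the lowest-ranked fraction of kernels.
    prune_ratio is a hypothetical knob, not a value from the paper."""
    order = rank_kernels(activations, gradients)
    n_prune = int(len(order) * prune_ratio)
    return sorted(order[:n_prune])


# Toy data (made up): three kernels, each with a 2x2 activation and gradient map.
acts = [
    [[1.0, 2.0], [3.0, 4.0]],
    [[0.1, 0.0], [0.0, 0.1]],
    [[2.0, 2.0], [2.0, 2.0]],
]
grads = [
    [[0.5, 0.5], [0.5, 0.5]],
    [[0.1, 0.1], [0.1, 0.1]],
    [[-1.0, -1.0], [-1.0, -1.0]],
]

ranking = rank_kernels(acts, grads)                            # least important first
pruned = kernels_to_prune(acts, grads, prune_ratio=0.34)       # one of three kernels
print(ranking, pruned)
```

In a real network the activations and gradients would come from forward and backward passes over a batch, and scores are usually averaged over many samples and normalized per layer before ranking. Whether VNP does either is not stated in the abstract.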