Improving the Robustness of Pedestrian Detection in Autonomous Driving With Generative Data Augmentation

稳健性(进化) 计算机科学 行人 行人检测 分类器(UML) 人工智能 数据质量 机器学习 数据挖掘 运输工程 工程类 生物化学 化学 基因 公制(单位) 运营管理
作者
Yalun Wu,Yingxiao Xiang,Endong Tong,Yuqi Ye,Zhibo Cui,Yunzhe Tian,Lejun Zhang,Jiqiang Liu,Zhen Han,Wenjia Niu
出处
期刊:IEEE Network [Institute of Electrical and Electronics Engineers]
卷期号:38 (3): 63-69 被引量:7
标识
DOI:10.1109/mnet.2024.3366232
摘要

Pedestrian detection plays a crucial role in autonomous driving by identifying the position, size, orientation, and dynamic features of pedestrians in images or videos, assisting autonomous vehicles in making better decisions and controls. It's worth noting that the performance of pedestrian detection models largely depends on the quality and diversity of available training data. Current datasets for autonomous driving have limitations in terms of diversity, scale, and quality. In recent years, numerous studies have proposed the use of data augmentation strategies to expand the coverage of datasets, aiming to maximize the utilization of existing training data. However, these data augmentation methods often overlook the diversity of data scenarios. To overcome this challenge, in this paper, we propose a more comprehensive method for data augmentation, based on image descriptions and diffusion models. This method aims to cover a wider range of scene variations, including different weather conditions and lighting situations. We have designed a classifier to select data samples for augmentation, followed by extracting visual features based on image captions and converting them into high-level semantic information as textual descriptions for the corresponding samples. Finally, we utilize diffusion models to generate new variants. Additionally, we have designed three modification patterns to increase diversity in aspects such as weather conditions, lighting, and pedestrian poses within the data. We conducted extensive experiments on the KITTI dataset and in real-world environments, demonstrating that our proposed method significantly enhances the performance of pedestrian detection models in complex scenarios. This meticulous consideration of data augmentation will notably enhance the applicability and robustness of pedestrian detection models in actual autonomous driving scenarios.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
13nnk完成签到,获得积分10
刚刚
树123发布了新的文献求助10
刚刚
1秒前
1秒前
2秒前
2秒前
2秒前
科研通AI6.3应助李某某采纳,获得10
3秒前
思源应助灵巧大地采纳,获得10
5秒前
畔畔应助whoknowsname采纳,获得50
5秒前
畔畔应助whoknowsname采纳,获得50
5秒前
完美世界应助whoknowsname采纳,获得10
6秒前
AN发布了新的文献求助30
6秒前
肖亚鑫发布了新的文献求助10
6秒前
che发布了新的文献求助10
8秒前
虎攀伟发布了新的文献求助10
9秒前
11秒前
Doctor Tang完成签到,获得积分10
11秒前
愤怒的嚣完成签到,获得积分20
11秒前
petrichor完成签到 ,获得积分10
13秒前
zdd关闭了zdd文献求助
14秒前
未晚发布了新的文献求助10
14秒前
14秒前
妮子发布了新的文献求助10
15秒前
16秒前
魏沉完成签到,获得积分10
17秒前
上官若男应助虎攀伟采纳,获得10
17秒前
18秒前
小蘑菇应助dipsy采纳,获得10
18秒前
小哈哈哈完成签到,获得积分10
18秒前
充电宝应助znlion采纳,获得10
18秒前
一水入秋发布了新的文献求助10
20秒前
小牧鱼完成签到,获得积分10
21秒前
22秒前
平淡柚子应助姚奕如采纳,获得10
23秒前
今后应助张粤采纳,获得10
23秒前
凳子完成签到,获得积分10
25秒前
26秒前
Suoogler完成签到,获得积分10
28秒前
ss发布了新的文献求助10
28秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Les Mantodea de Guyane Insecta, Polyneoptera 2000
Pulse width control of a 3-phase inverter with non sinusoidal phase voltages 777
Signals, Systems, and Signal Processing 610
Research Methods for Applied Linguistics: A Practical Guide 600
Research Methods for Applied Linguistics 500
Chemistry and Physics of Carbon Volume 15 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6407333
求助须知:如何正确求助?哪些是违规求助? 8226407
关于积分的说明 17447387
捐赠科研通 5460011
什么是DOI,文献DOI怎么找? 2885253
邀请新用户注册赠送积分活动 1861547
关于科研通互助平台的介绍 1701812