Robust Lane Change Decision Making for Autonomous Vehicles: An Observation Adversarial Reinforcement Learning Approach

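Reinforcement learning; Robustness (evolution); Markov decision process; Adversarial system; Computer science; Perception; Artificial intelligence; Process (computing); Bayesian probability; Markov process; Mathematics; Biochemistry; Chemistry; Statistics; Neuroscience; Biology; Gene; Operating system
Authors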
Xiangkun He,Haohan Yang,Zhongxu Hu,Chen Lv
Source
Journal: IEEE Transactions on Intelligent Vehicles [Institute of Electrical and Electronics Engineers]
Volume (issue): 8 (1): 184-193 Citations: 164
Identifier
DOI:10.1109/tiv.2022.3165178
Abstract

Reinforcement learning holds the promise of allowing autonomous vehicles to learn complex decision-making behaviors through interaction with other traffic participants. However, many real-world driving tasks involve unpredictable perception errors or measurement noise, which may mislead an autonomous vehicle into making unsafe decisions or even cause catastrophic failures. In light of these risks, autonomous vehicles must be able to cope with worst-case observation perturbations to ensure safety under perception uncertainty. This paper therefore proposes a novel observation adversarial reinforcement learning approach for robust lane change decision making of autonomous vehicles. A constrained observation-robust Markov decision process is presented to model the lane change decision-making behaviors of autonomous vehicles under policy constraints and observation uncertainties. A black-box attack technique based on Bayesian optimization is implemented to approximate the optimal adversarial observation perturbations efficiently. Furthermore, a constrained observation-robust actor-critic algorithm is developed to optimize autonomous driving lane change policies while keeping the variations of the policies attacked by the optimal adversarial observation perturbations within bounds. Finally, the robust lane change decision-making approach is evaluated in three stochastic mixed traffic flows with different densities. The results demonstrate that the proposed method not only enhances the performance of an autonomous vehicle but also improves the robustness of lane change policies against adversarial observation perturbations.
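The black-box attack idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation, and several pieces are assumptions made only for illustration: plain random search stands in for Bayesian optimization, the KL divergence between clean and attacked action distributions is assumed as the measure of policy variation, and `toy_policy`, its weights, and the two-element observation are hypothetical. The attacker only queries the policy's outputs, which is what makes it black-box.

```python
import math
import random


def toy_policy(obs):
    """Hypothetical linear-softmax policy over three actions
    (keep lane, change left, change right). Not from the paper."""
    weights = [[0.5, -0.2], [-0.3, 0.4], [0.1, 0.1]]  # illustrative values
    logits = [sum(w * x for w, x in zip(row, obs)) for row in weights]
    top = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(l - top) for l in logits]
    total = sum(exps)
    return [e / total for e in exps]


def kl_divergence(p, q):
    """KL(p || q) for two discrete distributions with positive entries."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def worst_case_perturbation(policy, obs, epsilon, n_queries=200, seed=0):
    """Approximate the perturbation within an L-infinity ball of radius
    epsilon that changes the policy output the most.

    Assumption: random search replaces the Bayesian optimization named
    in the abstract; only policy outputs are queried (black-box).
    """
    rng = random.Random(seed)
    clean = policy(obs)
    best_delta, best_score = [0.0] * len(obs), 0.0
    for _ in range(n_queries):
        delta = [rng.uniform(-epsilon, epsilon) for _ in obs]
        attacked = policy([o + d for o, d in zip(obs, delta)])
        score = kl_divergence(clean, attacked)
        if score > best_score:
            best_delta, best_score = delta, score
    return best_delta, best_score


delta, score = worst_case_perturbation(toy_policy, [1.0, 2.0], epsilon=0.1)
print("perturbation:", delta, "policy variation (KL):", score)
```

In a constrained robust actor-critic of the kind the abstract describes, a score like this would plausibly feed a constraint that keeps the attacked policy close to the clean one. How the paper formulates that constraint is not stated in the abstract, so it is not sketched here.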