Deep Reinforcement Learning for Autonomous Driving based on Safety Experience Replay
计算机科学
强化学习
人机交互
人工智能
作者
Xiaohan Huang,Yuhu Cheng,Qiang Yu,Xuesong Wang
出处
期刊:IEEE Transactions on Cognitive and Developmental Systems [Institute of Electrical and Electronics Engineers] 日期:2024-05-30卷期号:16 (6): 2070-2084被引量:4
标识
DOI:10.1109/tcds.2024.3405896
摘要
In the field of autonomous driving, safety has always been a top priority, especially in recent years with the development and increasing application of deep reinforcement learning in autonomous driving. Ensuring the safety of algorithms has become an indispensable concern. Reinforcement learning, which involves interacting with the environment through trial and error, may result in unsafe behavior in autonomous driving without any safety constraints. Such behavior could result in the drive path deviation and even collision, causing catastrophic accidents. Therefore, this paper proposes a reinforcement learning algorithm based on a safety experience replay mechanism, which is primarily to enhance the safety of reinforcement learning in autonomous driving. Firstly, the ego vehicle conducts preliminary exploration of the environment to collect data. Based on the performance of completing tasks observed from each data trajectory, safety labels of different levels are assigned to all state-action pairs, which establishes a safety experience buffer. Further, a safety critic network is constructed, which is trained by randomly sampling from the safety experience buffer. This enables the network to quantitatively evaluate the safety of driving actions, and the goal of safe driving for ego vehicle is achieved. The experimental results indicate that the proposed method can effectively reduce driving risks and improve task success rates compared with conventional reinforcement learning algorithms.