拖拉机
理论(学习稳定性)
拖车
运动学
强化学习
计算机科学
趋同(经济学)
数学优化
集合(抽象数据类型)
控制理论(社会学)
人工智能
算法
汽车工程
数学
工程类
控制(管理)
机器学习
经济增长
程序设计语言
经济
经典力学
物理
计算机网络
作者
Jian Wang,Huawei Liang,Pan Zhao,Zhiyuan Li,Zhiling Wang
标识
DOI:10.1109/icma54519.2022.9856006
摘要
Deep reinforcement learning has an excellent performance in decision-making and is widely used in areas such as autonomous driving. To improve the ability of decision-making of tractor-trailers, this paper presented an unmanned tractor-trailer decision-making model based on Deep Deterministic Policy Gradient (DDPG) algorithm. The DDPG algorithm has the problems of slow convergence in the early stage, poor stability, and easily falling into the local minimum value. Combined with the kinematics characteristics of the tractor-trailer, an improved DDPG algorithm is proposed. By adding improved artificial potential fields to the DDPG algorithm, the local minimums can be avoided. Compression of state set and action set of agent improves training speed. The stability of the algorithm is improved by adding a penalty item for a large angle between tractor and trailer and a penalty term for deviating from the desired trajectory to improve algorithm stability. Experimental results showed that the improved model improves learning efficiency, security, and average velocity of the tractor-trailer while ensuring the effectiveness of decision-making.
科研通智能强力驱动
Strongly Powered by AbleSci AI