导线
强化学习
计算机科学
过程(计算)
匹配(统计)
马尔可夫链
机制(生物学)
马尔可夫决策过程
模拟
人工智能
能源消耗
燃料效率
能量(信号处理)
工程类
马尔可夫过程
汽车工程
机器学习
地理
操作系统
大地测量学
认识论
哲学
统计
数学
电气工程
作者
Chuntao Zhang,Wenhui Huang,Xingyu Zhou,Chen Lv,Chao Sun
出处
期刊:Energy
[Elsevier]
日期:2023-10-25
卷期号:286: 129472-129472
被引量:11
标识
DOI:10.1016/j.energy.2023.129472
摘要
Eco-driving methods incorporating lateral motion exhibit enhanced energy-saving prospects in multi-lane traffic contexts, yet the randomly distributed obstructing vehicles and sparse traffic lights pose challenges in assessing the long-term value of instantaneous actions, impeding further improvement in energy efficiency. In response to this issue, a deep reinforcement learning (DRL)-based eco-driving method is proposed and augmented with the expert demonstration mechanism. Specifically, a Markov decision process matching with the target eco-driving scenario is systematically constructed, with which, the formulated DRL algorithm, parametrized soft actor-critic (PSAC), is trained to realize the integrated optimization of speed planning and lane-changing maneuver. To promote the training performance of PSAC under sparse rewards concerning traffic lights, an expert eco-driving model and an adaptive sampling approach are incorporated to constitute the expert demonstration mechanism. Simulation results highlight the superior performance of the proposed DRL-based eco-driving method and its training mechanism. Compared with the performance of the PSAC with a pure exploration-based training mechanism, the expert demonstration mechanism promotes the training efficiency and cumulated rewards of PSAC by about 60 % and 21.89 % respectively in the training phase, while in the test phase, a further reduction of 4.23 % benchmarked on a rule-based method is achieved in fuel consumption.
科研通智能强力驱动
Strongly Powered by AbleSci AI