发布文献求助

Expert-demonstration-augmented reinforcement learning for lane-change-aware eco-driving traversing consecutive traffic lights

导线强化学习计算机科学过程（计算）匹配（统计）马尔可夫链机制（生物学）马尔可夫决策过程模拟人工智能能源消耗燃料效率能量（信号处理）工程类马尔可夫过程汽车工程机器学习地理操作系统哲学电气工程大地测量学认识论统计数学

作者

Chuntao Zhang,Wenhui Huang,Xingyu Zhou,Chen Lv,Chao Sun

出处

期刊：Energy [Elsevier]
日期：2023-10-25 卷期号：286: 129472-129472 被引量：11

标识

DOI：10.1016/j.energy.2023.129472

摘要

Eco-driving methods incorporating lateral motion exhibit enhanced energy-saving prospects in multi-lane traffic contexts, yet the randomly distributed obstructing vehicles and sparse traffic lights pose challenges in assessing the long-term value of instantaneous actions, impeding further improvement in energy efficiency. In response to this issue, a deep reinforcement learning (DRL)-based eco-driving method is proposed and augmented with the expert demonstration mechanism. Specifically, a Markov decision process matching with the target eco-driving scenario is systematically constructed, with which, the formulated DRL algorithm, parametrized soft actor-critic (PSAC), is trained to realize the integrated optimization of speed planning and lane-changing maneuver. To promote the training performance of PSAC under sparse rewards concerning traffic lights, an expert eco-driving model and an adaptive sampling approach are incorporated to constitute the expert demonstration mechanism. Simulation results highlight the superior performance of the proposed DRL-based eco-driving method and its training mechanism. Compared with the performance of the PSAC with a pure exploration-based training mechanism, the expert demonstration mechanism promotes the training efficiency and cumulated rewards of PSAC by about 60 % and 21.89 % respectively in the training phase, while in the test phase, a further reduction of 4.23 % benchmarked on a rule-based method is achieved in fuel consumption.

求助该文献

最长约 10秒，即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI

我的文献求助列表浏览历史

一分钟了解求助规则 | 捐赠本站 | 历史今天

更新

新增更精细的自定义提醒设置 (2026-1-4)

新增

🕒每天60秒读懂世界·精选全球要闻 (2026-1-2)

更新

2025年影响因子查询已上线 (2025-6-18)

新增

PDF的下载单位、IP信息已删除 (2025-6-4)

科研通是完全免费的文献互助平台，具备全网最快的应助速度，最高的求助完成率。对每一个文献求助，科研通都将尽心尽力，给求助人一个满意的交代。

实时播报: Sxq完成签到，获得积分10

刚刚; wind完成签到，获得积分10

刚刚; mrlow完成签到，获得积分10

1秒前; 大力怀绿完成签到，获得积分10

1秒前; FashionBoy上传了应助文件

1秒前; Silvia完成签到，获得积分10

1秒前; sfgggfds完成签到，获得积分10

1秒前; tang完成签到，获得积分0

2秒前; Kai完成签到，获得积分10

2秒前; 贪玩的秋柔上传了应助文件

2秒前; natianhao发布了新的文献求助10

3秒前; na嘛关注了科研通微信公众号

3秒前; 轴承完成签到，获得积分10

3秒前; Jasper上传了应助文件

3秒前; 情怀上传了应助文件

4秒前; 丘比特的应助被chengche采纳，获得10

4秒前; 111222333完成签到，获得积分10

4秒前; 烟花的应助被苏州第一深情采纳，获得10

4秒前; 丘比特上传了应助文件

4秒前; yangzhixiao发布了新的文献求助10

4秒前; 善学以致用上传了应助文件

5秒前; 华仔上传了应助文件

5秒前; 尔尔完成签到，获得积分10

5秒前; yeSui3yi上传了应助文件

5秒前; 时光机带哥走发布了新的文献求助10

5秒前; peeer完成签到，获得积分10

5秒前; 完美世界上传了应助文件

5秒前; 思源上传了应助文件

6秒前; Helen上传了应助文件

7秒前; 贪玩的秋柔的应助被活泼的眼神采纳，获得10

7秒前; fjhsg25完成签到，获得积分20

7秒前; 西西完成签到，获得积分20

7秒前; 科研通AI2S上传了应助文件

7秒前; 田田田完成签到，获得积分10

7秒前; FashionBoy的应助被抗体药物偶联采纳，获得10

8秒前; 彭于晏上传了应助文件

8秒前; HaiYan03完成签到，获得积分10

8秒前; zhangwj226完成签到，获得积分10

8秒前; 大力的灵雁的应助被科研通管家采纳，获得30

8秒前; 科研通管家关闭了Valerie的文献求助

8秒前

高分求助中: (应助此贴封号)【重要！！请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000; Handbook of pharmaceutical excipients, Ninth edition 5000; Aerospace Standards Index - 2026 ASIN2026 3000; Polymorphism and polytypism in crystals 1000; Signals, Systems, and Signal Processing 610; Discrete-Time Signals and Systems 610; T/SNFSOC 0002—2025 独居石精矿碱法冶炼工艺技术标准 600

热门求助领域（近24小时）

热门帖子: 关注科研通微信公众号，转发送积分 6043522; 求助须知：如何正确求助？哪些是违规求助？ 7806800; 关于积分的说明 16240738; 捐赠科研通 5189292; 什么是DOI，文献DOI怎么找？ 2776883; 邀请新用户注册赠送积分活动 1759902; 关于科研通互助平台的介绍 1643374

今日热心研友

蓝莓橘子酱

大力的灵雁

友好的季节

你嵙这个期刊没买

注：热心度 = 本日应助数 + 本日被采纳获取积分÷10

Copyright © 2020-2026 AbleSci.COM, 科研通, All Right Reserved

科研通是非营利科研互助平台，不忘初心，为科研助力

本站互助的所有文件仅供个人学习研究用，禁止任何人把求助的所得文献进行盈利或传播

皖ICP备2024041134号-1

皖公网安备34019202002308

科研通【文献互助QQ群】：如果您有特殊求助，或发布求助超过24小时未得到应助，可加群求助，群号：821889395【点击一键加群】

科研通【志愿服务QQ群】：如果您热爱文献互助，有热心愿意为更多人服务，请加入小伙伴群，点击申请加入

关注微信服务号

科研通