发布文献求助

Traffic signal control using reinforcement learning based on the teacher-student framework

强化学习计算机科学信号（编程语言）控制（管理）功能（生物学）钢筋国家（计算机科学）交通信号灯控制信号人工智能机器学习实时计算算法工程类程序设计语言电信结构工程进化生物学传输（电信）生物

作者

Junxiu Liu,Sheng Qin,Min Su,Yuling Luo,Shunsheng Zhang,Yanhu Wang,Su Yang

出处

期刊：Expert Systems With Applications [Elsevier BV]
日期：2023-10-01 卷期号：228: 120458-120458 被引量：4

标识

DOI：10.1016/j.eswa.2023.120458

摘要

Reinforcement Learning (RL) is an effective method for adaptive traffic signals control. As one type of RL, the teacher-student framework has been found helpful in improving the model performance for different application fields (such as robot control, game, hybrid intelligence), but it is rarely applied for traffic control due to that the hyper-parameters and the number of state-action pairs experienced are difficult to determine. In this work, the teacher-student framework is used for traffic signal control, where only a single reward function is designed to guide the student agent and by using this method the number of hyper-parameters and the model complexity are reduced. Specifically, the teacher agent uses an importance function to evaluate and guide the student, where the importance function combines with environment reward to form a synthetic reward for the student agent. Experimental results under different traffic environments show that the proposed method achieves the expected performance enhancement and is better than most of the state-of-the-art RL-based traffic signal control methods.

求助该文献

科研通智能强力驱动
Strongly Powered by AbleSci AI

我的文献求助列表浏览历史

一分钟了解求助规则 | 捐赠本站 | 历史今天

更新

⚡ 2026年影响因子、分区 已更新！ (2026-6-17)

更新

📰 新增『新锐期刊分区』 (2026-3-24)

更新

💬 新增更精细的自定义提醒设置 (2026-1-4)

新增

🕒 每天60秒读懂世界·精选全球要闻 (2026-1-2)

新增

PDF的下载单位、IP信息已删除 (2025-6-4)

科研通是完全免费的文献互助平台，具备全网最快的应助速度，最高的求助完成率。对每一个文献求助，科研通都将尽心尽力，给求助人一个满意的交代。

实时播报: 天穹雨的应助被端庄的紫南采纳，获得30

1秒前; 南瓜好吃完成签到，获得积分10

1秒前; 科研通AI6.4上传了应助文件

1秒前; 悦风发布了新的文献求助20

1秒前; 自然醒完成签到，获得积分20

2秒前; 倪仕丽完成签到，获得积分10

2秒前; 爆米花的应助被木马病毒采纳，获得10

3秒前; 科研通AI6.4的应助被聂雨声采纳，获得10

3秒前; WilliamYen发布了新的文献求助10

3秒前; 科研通AI6.2的应助被MGSansan采纳，获得10

3秒前; 田様的应助被零一秒采纳，获得10

4秒前; NexusExplorer上传了应助文件

4秒前; 秀xiu关闭了秀xiu的文献求助

4秒前; 巴哒完成签到，获得积分10

5秒前; lizishu上传了应助文件

5秒前; 我要发JACS发布了新的文献求助10

5秒前; Lucas上传了应助文件

5秒前; 万能图书馆上传了应助文件

7秒前; Copyright上传了应助文件

7秒前; 梅西完成签到，获得积分0

7秒前; 斯文钢笔的应助被俏皮诺言采纳，获得10

8秒前; 内啡肽完成签到，获得积分10

8秒前; Q女士的论文在哪里完成签到，获得积分10

9秒前; 慕青的应助被此刻永恒采纳，获得10

9秒前; 自然醒关注了科研通微信公众号

9秒前; 门门发布了新的文献求助10

10秒前; Lillian完成签到，获得积分10

10秒前; 彭佳丽发布了新的文献求助10

10秒前; 落寞冰巧发布了新的文献求助10

10秒前; 沉静胜完成签到，获得积分10

11秒前; 蜂蜜柚子茶iii发布了新的文献求助10

12秒前; 科研通AI6.4的应助被xin采纳，获得10

13秒前; 无极微光上传了应助文件

14秒前; 思源上传了应助文件

15秒前; highlight发布了新的文献求助10

15秒前; blueyh关注了科研通微信公众号

16秒前; 隐形曼青的应助被哩哩采纳，获得10

17秒前; 爆米花上传了应助文件

17秒前; 我是老大上传了应助文件

17秒前; Vvv完成签到，获得积分10

17秒前

高分求助中: Principles of Economics, 11th Edition 10000; University Physics with Modern Physics, 16th edition 10000; (应助此贴封号)【重要！！请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000; Arthritis and Related Conditions, An Issue of Orthopedic Clinics 1000; Development of a Bridge Weigh-In-Motion System: A technology to convert the bridge response to the passage of traffic into data on vehicle configurations, speeds, times of travel and weights 1000; ズームレンズの光学設計に関する研究 800; Fundamentals of Pharmaceutical and Biologics Regulations: A Global Perspective, Second Edition 700

热门求助领域（近24小时）

热门帖子: 关注科研通微信公众号，转发送积分 7288397; 求助须知：如何正确求助？哪些是违规求助？ 8908118; 关于积分的说明 18853649; 捐赠科研通 6957135; 什么是DOI，文献DOI怎么找？ 3208896; 关于科研通互助平台的介绍 2378670; 邀请新用户注册赠送积分活动 2184667

今日热心研友

学术文献互助

注：热心度 = 本日应助数 + 本日被采纳获取积分÷10

Copyright © 2020-2026 AbleSci.COM, 科研通, All Right Reserved

科研通是非营利科研互助平台，不忘初心，为科研助力

本站互助的所有文件仅供个人学习研究用，禁止任何人把求助的所得文献进行盈利或传播

皖ICP备2024041134号-1

皖公网安备34019202002308

科研通【文献互助QQ群】：如果您有特殊求助，或发布求助超过24小时未得到应助，可加群求助，群号：821889395【点击一键加群】

科研通【志愿服务QQ群】：如果您热爱文献互助，有热心愿意为更多人服务，请加入小伙伴群，点击申请加入

关注微信服务号

科研通