Distributed Multi-Agent Reinforcement Learning for Collaborative Path Planning and Scheduling in Blockchain-Based Cognitive Internet of Vehicles

计算机科学 强化学习 分布式计算 调度(生产过程) 延迟(音频) 交通拥挤 马尔可夫决策过程 计算机网络 计算卸载 边缘计算 云计算 马尔可夫过程 人工智能 工程类 统计 操作系统 电信 数学 运输工程 运营管理
作者
Huigang Chang,Yiming Liu,Zhengguo Sheng
出处
期刊:IEEE Transactions on Vehicular Technology [Institute of Electrical and Electronics Engineers]
卷期号:73 (5): 6301-6317 被引量:11
标识
DOI:10.1109/tvt.2023.3344934
摘要

The collaborative path planning and scheduling can overcome the limitations of single vehicle intelligence to obtain a globally optimal decision strategy in cognitive internet of vehicles (CIoVs). The collaboration of vehicles necessitates the exchange of environmental and decision information, generating massive collaborative computing tasks with strict latency requirements. Leveraging mobile edge computing (MEC) technology, computing tasks can be processed near the vehicles to reduce latency. However, traffic congestion and computational load imbalance seriously affect traffic efficiency and computational latency. In hybrid driving scenarios, it is challenging to fulfill the diverse service requirements of vehicles with different intelligence levels. Moreover, non-collaborative tend to result in traffic congestion due to vehicle aggregation effects, while centralized solutions lack flexibility and have high computational complexity. To address these concerns, a distributed multi-agent reinforcement learning (DMARL) algorithm is proposed for collaborative path planning and scheduling in a blockchain-based collaboration framework. In this framework, we model the communication, traffic situation and task processing of the system and formulate a joint optimization problem to minimize both travel time and computation latency. Last, we convert the scheduling problem for different types of vehicles into Markov decision processes (MDPs) and propose Q-learning-based DMARL algorithm to achieve proactive load balancing of both road infrastructures and MEC nodes (MECNs). Simulation results demonstrate that the proposed approach outperforms the comparison schemes in terms of load balance indexes of roads and MECNs, travel time, and computation latency.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
可乐完成签到 ,获得积分10
刚刚
1秒前
1秒前
1秒前
哈哈发布了新的文献求助10
2秒前
hh关注了科研通微信公众号
3秒前
aaa发布了新的文献求助10
3秒前
4秒前
Ava应助伶俐的血茗采纳,获得10
5秒前
尔东完成签到 ,获得积分10
5秒前
Akim应助sjy采纳,获得10
5秒前
6秒前
锂电说发布了新的文献求助10
6秒前
7秒前
铮铮发布了新的文献求助10
7秒前
深情安青应助Alien采纳,获得10
7秒前
赘婿应助动听的强炫采纳,获得10
8秒前
kyt发布了新的文献求助10
8秒前
8秒前
cynthia完成签到,获得积分10
8秒前
怪僻完成签到,获得积分10
9秒前
shlw完成签到,获得积分10
9秒前
Lllll发布了新的文献求助10
10秒前
xia xianxin发布了新的文献求助10
12秒前
充电宝应助苹果亦巧采纳,获得10
13秒前
14秒前
搜集达人应助BuMAMAHAHA采纳,获得10
15秒前
p65完成签到,获得积分10
16秒前
科研通AI6.4应助露露采纳,获得10
16秒前
领导范儿应助白衣卿相采纳,获得10
19秒前
小慈爱鸡完成签到 ,获得积分10
19秒前
wanci应助平淡的初翠采纳,获得10
19秒前
19秒前
小蘑菇应助xinxiangshicheng采纳,获得10
20秒前
20秒前
hh发布了新的文献求助10
20秒前
20秒前
21秒前
21秒前
曹子睿完成签到 ,获得积分10
21秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Chemistry and Physics of Carbon Volume 18 800
The Organometallic Chemistry of the Transition Metals 800
Leading Academic-Practice Partnerships in Nursing and Healthcare: A Paradigm for Change 800
The formation of Australian attitudes towards China, 1918-1941 640
Signals, Systems, and Signal Processing 610
Research Methods for Business: A Skill Building Approach, 9th Edition 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6424534
求助须知:如何正确求助?哪些是违规求助? 8242462
关于积分的说明 17523544
捐赠科研通 5478671
什么是DOI,文献DOI怎么找? 2893672
邀请新用户注册赠送积分活动 1870020
关于科研通互助平台的介绍 1707906