Distributed Multi-Agent Reinforcement Learning for Collaborative Path Planning and Scheduling in Blockchain-Based Cognitive Internet of Vehicles

计算机科学强化学习分布式计算调度（生产过程）延迟（音频）交通拥挤马尔可夫决策过程计算机网络计算卸载边缘计算云计算马尔可夫过程人工智能工程类电信运营管理统计数学运输工程操作系统

作者

Huigang Chang,Yiming Liu,Zhengguo Sheng

出处

期刊：IEEE Transactions on Vehicular Technology [Institute of Electrical and Electronics Engineers]
日期：2023-12-20 卷期号：73 (5): 6301-6317 被引量：4

标识

DOI：10.1109/tvt.2023.3344934

摘要

The collaborative path planning and scheduling can overcome the limitations of single vehicle intelligence to obtain a globally optimal decision strategy in cognitive internet of vehicles (CIoVs). The collaboration of vehicles necessitates the exchange of environmental and decision information, generating massive collaborative computing tasks with strict latency requirements. Leveraging mobile edge computing (MEC) technology, computing tasks can be processed near the vehicles to reduce latency. However, traffic congestion and computational load imbalance seriously affect traffic efficiency and computational latency. In hybrid driving scenarios, it is challenging to fulfill the diverse service requirements of vehicles with different intelligence levels. Moreover, non-collaborative tend to result in traffic congestion due to vehicle aggregation effects, while centralized solutions lack flexibility and have high computational complexity. To address these concerns, a distributed multi-agent reinforcement learning (DMARL) algorithm is proposed for collaborative path planning and scheduling in a blockchain-based collaboration framework. In this framework, we model the communication, traffic situation and task processing of the system and formulate a joint optimization problem to minimize both travel time and computation latency. Last, we convert the scheduling problem for different types of vehicles into Markov decision processes (MDPs) and propose Q-learning-based DMARL algorithm to achieve proactive load balancing of both road infrastructures and MEC nodes (MECNs). Simulation results demonstrate that the proposed approach outperforms the comparison schemes in terms of load balance indexes of roads and MECNs, travel time, and computation latency.

求助该文献

最长约 10秒，即可获得该文献文件

Distributed Multi-Agent Reinforcement Learning for Collaborative Path Planning and Scheduling in Blockchain-Based Cognitive Internet of Vehicles

今日热心研友