计算机科学
强化学习
死锁
可扩展性
分布式计算
死锁预防算法
规划师
学习自动机
人工智能
机器学习
自动机
数据库
作者
Zhaohui Ye,Yanjie Li,Ronghao Guo,Jianqi Gao,Fuxi Wen
标识
DOI:10.1007/978-3-031-13844-7_47
摘要
The learning-based approach has been proved to be an effective way to solve multi-agent path finding (MAPF) problems. For large warehouse systems, the distributed strategy based on learning method can effectively improve efficiency and scalability. But compared with the traditional centralized planner, the learning-based approach is more prone to deadlocks. Communication learning has also made great progress in the field of multi-agent in recent years and has been be introduced into MAPF. However, the current communication methods provide redundant information for reinforcement learning and interfere with the decision-making of agents. In this paper, we combine the reinforcement learning with communication learning. The agents select its communication objectives based on priority and mask off redundant communication links. Then we use a feature interactive network based on graph neural network to achieve the information aggregation. We also introduce an additional deadlock detection mechanism to increase the likelihood of an agent escaping a deadlock. Experiments demonstrate our method is able to plan collision-free paths in different warehouse environments.
科研通智能强力驱动
Strongly Powered by AbleSci AI