强化学习
灵活性(工程)
避碰
计算机科学
适应性
控制(管理)
任务(项目管理)
过程(计算)
人工智能
工程类
分布式计算
碰撞
生态学
统计
数学
计算机安全
系统工程
生物
操作系统
作者
Jinchao Chen,Tingyang Li,Ying Zhang,Tao You,Yantao Lu,Prayag Tiwari,Neeraj Kumar
出处
期刊:IEEE Transactions on Vehicular Technology
[Institute of Electrical and Electronics Engineers]
日期:2023-10-26
卷期号:73 (3): 4194-4206
被引量:102
标识
DOI:10.1109/tvt.2023.3327571
摘要
Due to the strong adaptability and high flexibility, unmanned aerial vehicles (UAVs) have been extensively studied and widely applied in both civil and military applications. Although UAVs can achieve significant cost reduction and performance enhancement in large-scale systems by taking full advantage of their cooperation and coordination, they result in a serious cooperative behaviour control problem. Especially in dynamic environments, the cooperative behaviour control problem which has to quickly produce a safe and effective behaviour decision for each UAV to achieve group missions, is NP-hard and difficult to settle. In this work, we design a global-and-local attention-based reinforcement learning algorithm for the cooperative behaviour control problem of UAVs. First, with the motion and coordination models, we analyze the collision avoidance, motion state update, and task execution constraints of multiple UAVs, and abstract the cooperative behaviour control problem as a multi-constraint decision-making one. Then, inspired from the human-learning process where more attention is devoted to the important parts of data, we design a multi-agent reinforcement learning algorithm with a global-and-local attention mechanism to cooperatively control the behaviours of UAVs and achieve the coordination. Simulation experiments in a multi-agent particle environment provided by OpenAI are conducted to verify the effectiveness and efficiency of the proposed approach. Compared with baselines, our approach shows significant advantages in mean reward, training time, and coordination effect.
科研通智能强力驱动
Strongly Powered by AbleSci AI