Dual-Centralized Q-Network-Based Reinforcement Learning for Cooperative Path Planning of Multiple UAVs

运动规划强化学习计算机科学灵活性（工程）能源消耗任务（项目管理）障碍物避障控制工程路径（计算）工程类数学优化智能交通系统车辆动力学光学（聚焦）人工智能遥控水下航行器实时计算碰撞弹道避碰模拟增强学习分布式计算能量（信号处理）同种类的最优化问题

作者

Jinchao Chen,Chongde Ren,Yujiao Hu,Ying Zhang,Yantao Lu,Qing Li,Tao You,Joel J. P. C. Rodrigues

出处

期刊：IEEE Transactions on Intelligent Transportation Systems [Institute of Electrical and Electronics Engineers]
日期：2025-07-18 卷期号：26 (9): 13232-13246 被引量：7

标识

DOI：10.1109/tits.2025.3587392

摘要

Due to the low cost and high maneuverability, unmanned aerial vehicles (UAVs) have been commonly used and played an important role in both the civilian and military fields. Although UAVs can significantly achieve enhanced flexibility and extensibility for large-scale intelligent systems, they result in a serious path planning problem. Especially in complex environments with a large number of irregular obstacles, UAVs have to efficiently find near-optimization flight paths and automatically move to target positions to finish the group task while avoiding collisions and satisfying various constraints. In this work, we focus on the cooperative path planning problem of homogeneous UAVs and present a multi-agent reinforcement learning-based approach to solve the problem. First, with the UAV and obstacle models, we analyse the collision avoidance, motion continuity, and energy consumption constraints in UAV flying, and formulate the cooperative path planning problem as a multi-constraint combinatorial optimization one with a high computational complexity. Then, inspired by the twin delayed deep deterministic policy gradient algorithm where clipped dual Q-networks are used to decrease the overestimation error of critic networks, we propose a multi-agent reinforcement learning-based approach with a dual-centralized Q-network mechanism to automatically produce feasible and collision-free flight path for each UAV. Finally, simulation experiments are conducted in a multi-agent particle environment to evaluate the effectiveness and efficiency of the proposed approach.

求助该文献

最长约 10秒，即可获得该文献文件

Dual-Centralized Q-Network-Based Reinforcement Learning for Cooperative Path Planning of Multiple UAVs

今日热心研友