运动规划
强化学习
势场
路径(计算)
计算机科学
领域(数学)
灾害应对
人工智能
工程类
模拟
应急管理
航空学
机器人
计算机网络
地质学
数学
法学
纯数学
政治学
地球物理学
作者
Xiuyan Ren,Na Geng,Zhang Yon,Lei Xiao,Dunwei Gong
标识
DOI:10.1109/tase.2025.3601665
摘要
To address the target inaccessibility issue of the Artificial Potential Field (APF) method and the slow convergence of deep reinforcement learning algorithms, we propose a method combining the artificial potential field and an improved twin delayed deep deterministic policy gradient algorithm (PG-ITD3) for UAV path planning in post-disaster scenarios. This approach considers Unmanned Aerial Vehicles (UAV) kinematic constraints and dynamically adjusts the repulsive gain coefficient through deep reinforcement learning to enhance planning efficiency. The introduction of a virtual obstacle strategy, combining the aftershock probability model with prioritized experience replay (PER), and a novel reward function facilitates real-time reward acquisition and accelerates convergence. Simulation results demonstrate that our proposed algorithm outperforms traditional APF method and other advanced deep reinforcement learning methods in success rate, path length, path reward, and global smoothness. This research offers an effective solution for post-disaster UAV rescue path planning, contributing significantly to enhancing rescue efficiency and safety.
科研通智能强力驱动
Strongly Powered by AbleSci AI