强化学习
避障
计算机科学
加速度
障碍物
运动规划
人工智能
功能(生物学)
控制(管理)
避碰
车辆动力学
深度学习
模拟
工程类
航空航天工程
计算机安全
物理
法学
移动机器人
生物
政治学
碰撞
经典力学
机器人
进化生物学
作者
Sina Sabzekar,Mahdi Samadzad,Asal Mehditabrizi,Ala Nekouvaght Tak
出处
期刊:Unmanned Systems
[World Scientific]
日期:2023-10-11
卷期号:12 (03): 477-498
被引量:20
标识
DOI:10.1142/s2301385024420044
摘要
Unmanned aerial vehicles (UAVs) are experiencing a rapid expansion in their applications across various domains, including goods delivery, video capturing, and traffic control. The crucial aspect for UAVs to execute successful target tracking and obstacle avoidance maneuvers lies in the accuracy of their path planning operations. This research paper aims to contribute to the existing body of knowledge by presenting a novel model that incorporates acceleration control, accounting for changing variables such as UAV velocity and altitude, while also incorporating vehicle dynamics. To enhance the realism of the model, we include drag force as a factor. In this study, we focus on exploring the potential of deep reinforcement learning (DRL), specifically the deep deterministic policy gradient (DDPG) algorithm, for modeling a 3D continuous environment with a continuous set of actions. In order to improve the UAV’s performance in executing target tracking and obstacle avoidance maneuvers, we propose an innovative reward function based on the inner product. The training results show that the UAV successfully learns to perform the aforementioned tasks. Also, simulation results demonstrate the superior performance of the proposed UAV modeling and reward function compared to existing works.
科研通智能强力驱动
Strongly Powered by AbleSci AI