强化学习
避碰
计算机科学
工作区
运动规划
自编码
人工智能
任务(项目管理)
计算机视觉
代表(政治)
防撞系统
路径(计算)
碰撞
实时计算
深度学习
模拟
机器人
工程类
计算机安全
系统工程
政治
法学
政治学
程序设计语言
作者
Huaxing Huang,Guijie Zhu,Zhun Fan,H. R. Zhai,Yuwei Cai,Ze Shi,Zhaohui Dong,Zhifeng Hao
标识
DOI:10.1109/iros47612.2022.9981803
摘要
Online path planning for multiple unmanned aerial vehicle (multi-UAV) systems is considered a challenging task. It needs to ensure collision-free path planning in real-time, especially when the multi-UAV systems can become very crowded on certain occasions. In this paper, we presented a vision-based decentralized collision-avoidance policy learning method for multi-UAV systems. The policy takes depth images and inertial measurements as sensory inputs and outputs UAV's steering commands, and it is trained together with the latent representation of depth images using a policy gradient-based reinforcement learning algorithm and autoencoder in the multi-UAV three-dimensional workspaces. Each UAV follows the same trained policy and acts independently to reach the goal without colliding or communicating with other UAVs. We validate our method in various simulated scenarios. The experimental results show that our learned policy can guarantee fully autonomous collision-free navigation for multi-UAV in three-dimensional workspaces, and its navigation performance will not be greatly affected by the increase in the number of UAVs.
科研通智能强力驱动
Strongly Powered by AbleSci AI