水下
动作(物理)
路径(计算)
运动规划
空格(标点符号)
计算机科学
控制理论(社会学)
航空航天工程
地质学
大地测量学
人工智能
物理
工程类
计算机网络
机器人
海洋学
控制(管理)
量子力学
操作系统
作者
Zhenzhong Chu,Yu Wang,Daqi Zhu
标识
DOI:10.1109/tsmc.2023.3348827
摘要
In this article, the local two-dimensional (2-D) path planning problem is studied for an unmanned underwater vehicle (UUV) under continuous action space, and an improved algorithm is proposed based on the twin-delayed deep deterministic policy gradient (TD3). The mean function is added to the policy gradient to bring the output action of the algorithm closer to the mean of the action space. Hence, it suppresses the trend of a large number of boundary actions output by the TD3 algorithm. Based on the experience replay buffer, action storage is constructed to realize the automatic adjustment of the weight coefficient. Therefore, it reduces the additional hyperparameter tuning work caused by the change in the structure of the algorithm. In the setting of environmental variables and reward functions, real-time sonar variables are added to make the algorithm model more consistent with the actual underwater navigation situation. Based on ROS, a simulation environment is built and used to verify the path planning performance of the proposed algorithm.
科研通智能强力驱动
Strongly Powered by AbleSci AI