强化学习
无人机
路径(计算)
控制器(灌溉)
马尔可夫决策过程
计算机科学
过程(计算)
控制理论(社会学)
人工智能
增强学习
马尔可夫过程
模拟
控制(管理)
工程类
数学
海洋工程
程序设计语言
操作系统
统计
生物
农学
作者
Joohyun Woo,Chan-Woo Yu,Nakwan Kim
标识
DOI:10.1016/j.oceaneng.2019.04.099
摘要
In this paper, a deep reinforcement learning (DRL)-based controller for path following of an unmanned surface vehicle (USV) is proposed. The proposed controller can self-develop a vehicle’s path following capability by interacting with the nearby environment. A deep deterministic policy gradient (DDPG) algorithm, which is an actor-critic-based reinforcement learning algorithm, was adapted to capture the USV’s experience during the path-following trials. A Markov decision process model, which includes the state, action, and reward formulation, specially designed for the USV path-following problem is suggested. The control policy was trained with repeated trials of path-following simulation. The proposed method’s path-following and self-learning capabilities were validated through USV simulation and a free-running test of the full-scale USV.
科研通智能强力驱动
Strongly Powered by AbleSci AI