强化学习
障碍物
模仿
计算机科学
人工智能
逃避(道德)
趋同(经济学)
过程(计算)
集合(抽象数据类型)
理论(学习稳定性)
点(几何)
避障
机器学习
机器人
移动机器人
数学
心理学
经济
法学
免疫学
生物
程序设计语言
经济增长
免疫系统
政治学
社会心理学
几何学
操作系统
作者
Xiuqing Qu,Wenhao Gan,Dalei Song,Liqin Zhou
标识
DOI:10.1016/j.oceaneng.2023.114016
摘要
Aiming at the confrontation game problems between pursuit-evasion unmanned surface vehicles under complex multi-obstacle environment, a pursuit-evasion game strategy is proposed. Firstly, the multi-obstacle environment is set up, and the gaming situation can be judged by the perception between pursuit-evasion USVs. For the pursuers, the model training is performed based on multi-agent deep reinforcement learning, so that they can quickly plan a reasonable obstacle avoidance and pursuit route, and form an effective encirclement posture before the evader approaches the target point. Meanwhile, the credit assignment problem among the members of the pursuing group is considered. For the evader, deep reinforcement learning is combined with imitation learning to train the escape model, so that it can reach the preset point in as short a time as possible and avoid the obstacles on the way. Finally, an adversarial-evolutionary game training method under multiple random scenarios is designed and combined with curriculum learning to iteratively update the pursuit and escape models. Through the detailed comparative analysis of the model training process and simulation experiments, it is proved that the proposed two types of models have higher convergence efficiency and stability, and they can have higher intelligence to pursue, escape and avoid obstacles respectively.
科研通智能强力驱动
Strongly Powered by AbleSci AI