追求者
追逃
强化学习
计算机科学
微分博弈
适应(眼睛)
数学优化
逃避(道德)
高斯分布
高斯过程
人工智能
控制理论(社会学)
数学
控制(管理)
物理
免疫系统
量子力学
光学
免疫学
生物
作者
Nick-Marios T. Kokolakis,Kyriakos G. Vamvoudakis
标识
DOI:10.1109/tnnls.2022.3203977
摘要
This article develops a safe pursuit-evasion game for enabling finite-time capture, optimal performance as well as adaptation to an unknown cluttered environment. The pursuit-evasion game is formulated as a zero-sum differential game wherein the pursuer seeks to minimize its relative distance to the target while the evader attempts to maximize it. A critic-only reinforcement learning (RL)-based algorithm is then proposed for learning online and in finite time the pursuit-evasion policies and thus enabling finite-time capture of the evader. Safety is ensured by means of barrier functions associated with the obstacles, which are integrated into the running cost. Using Gaussian processes (GPs), a learning-based mechanism is devised for safely learning the unknown environment. Simulation results illustrate the efficacy of the proposed approach.
科研通智能强力驱动
Strongly Powered by AbleSci AI