微分博弈
数学优化
非线性系统
最优控制
零(语言学)
理论(学习稳定性)
功能(生物学)
计算机科学
班级(哲学)
差速器(机械装置)
零和博弈
动态规划
迭代法
控制理论(社会学)
索引(排版)
人工神经网络
数学
控制(管理)
纳什均衡
人工智能
工程类
物理
万维网
哲学
航空航天工程
机器学习
生物
进化生物学
量子力学
语言学
作者
Derong Liu,Qinglai Wei
摘要
SUMMARY In this paper, multiperson zero‐sum differential games for a class of continuous‐time uncertain nonlinear systems are solved using a new iterative adaptive dynamic programming (ADP) algorithm. The idea is to use ADP technique to obtain the optimal control pair iteratively that makes the performance index function reach the optimal solution of the zero‐sum differential games without the system model. It proves that the iterative performance index functions are convergent to the optimal solution of the game. Stability properties of the system under the iterative control pairs are also presented. Neural networks are used to build the system model, approximate the performance index function, and compute the optimal control policy, respectively, for facilitating the implementation of the iterative ADP method. Finally, two simulation examples are given to demonstrate the performance of the proposed method. Copyright © 2012 John Wiley & Sons, Ltd.
科研通智能强力驱动
Strongly Powered by AbleSci AI