数学优化
动态规划
非线性系统
最优控制
趋同(经济学)
等价(形式语言)
贝尔曼方程
计算机科学
非线性规划
蒙特卡罗方法
数学
量子力学
经济
统计
物理
经济增长
离散数学
作者
Xiong Yang,Haibo He,X. Zhong
标识
DOI:10.1109/tcyb.2019.2926248
摘要
In this paper, we study the constrained optimization problem of a class of uncertain nonlinear interconnected systems. First, we prove that the solution of the constrained optimization problem can be obtained through solving an array of optimal control problems of constrained auxiliary subsystems. Then, under the framework of approximate dynamic programming, we present a simultaneous policy iteration (SPI) algorithm to solve the Hamilton-Jacobi-Bellman equations corresponding to the constrained auxiliary subsystems. By building an equivalence relationship, we demonstrate the convergence of the SPI algorithm. Meanwhile, we implement the SPI algorithm via an actor-critic structure, where actor networks are used to approximate optimal control policies and critic networks are applied to estimate optimal value functions. By using the least squares method and the Monte Carlo integration technique together, we are able to determine the weight vectors of actor and critic networks. Finally, we validate the developed control method through the simulation of a nonlinear interconnected plant.
科研通智能强力驱动
Strongly Powered by AbleSci AI