哈密顿量(控制论)
动态规划
非线性系统
贝尔曼方程
最优控制
人工神经网络
数学
缩小
哈密顿系统
控制理论(社会学)
应用数学
数学优化
计算机科学
数学分析
控制(管理)
物理
人工智能
机器学习
量子力学
作者
Yongliang Yang,Donald C. Wunsch,Yixin Yin
标识
DOI:10.1109/tnnls.2017.2654324
摘要
This paper presents a Hamiltonian-driven framework of adaptive dynamic programming (ADP) for continuous time nonlinear systems, which consists of evaluation of an admissible control, comparison between two different admissible policies with respect to the corresponding the performance function, and the performance improvement of an admissible control. It is showed that the Hamiltonian can serve as the temporal difference for continuous-time systems. In the Hamiltonian-driven ADP, the critic network is trained to output the value gradient. Then, the inner product between the critic and the system dynamics produces the value derivative. Under some conditions, the minimization of the Hamiltonian functional is equivalent to the value function approximation. An iterative algorithm starting from an arbitrary admissible control is presented for the optimal control approximation with its convergence proof. The implementation is accomplished by a neural network approximation. Two simulation studies demonstrate the effectiveness of Hamiltonian-driven ADP.
科研通智能强力驱动
Strongly Powered by AbleSci AI