跟踪(教育)
非线性系统
控制理论(社会学)
动力学(音乐)
控制(管理)
离散时间和连续时间
计算机科学
数学
物理
人工智能
统计
心理学
教育学
声学
量子力学
作者
Shijie Song,Dawei Gong,Minglei Zhu,Yuyang Zhao,Cong Huang
标识
DOI:10.1109/tnnls.2023.3323142
摘要
This article aims to solve the optimal tracking problem (OTP) for a class of discrete-time (DT) nonlinear systems with completely unknown dynamics. A novel data-driven deterministic approximate dynamic programming (ADP) algorithm is proposed to solve this kind of problem with only input-output (I/O) data. The proposed algorithm has two advantages compared to existing data-driven deterministic ADP algorithms for the OTP. First, our algorithm can guarantee optimality while achieving better performance in the aspects of time-saving and robustness to data. Second, the near-optimal control policy learned by our algorithm can be implemented without considering expected control and enable the system states to track the user-specified reference signals. Therefore, the tracking performance is guaranteed while simplifying the algorithm implementation. Furthermore, the convergence and stability of the proposed algorithm are strictly proved through theoretical analysis, in which the errors caused by neural networks (NNs) are considered. At the end of this article, the developed algorithm is compared with two representative deterministic ADP algorithms through a numerical example and applied to solve the tracking problem for a two-link robotic manipulator. The simulation results demonstrate the effectiveness and advantages of the developed algorithm.
科研通智能强力驱动
Strongly Powered by AbleSci AI