控制理论(社会学)
强化学习
偏转(物理)
自适应控制
估计员
计算机科学
控制器(灌溉)
有效载荷(计算)
弹道
模糊逻辑
控制工程
人工智能
工程类
数学
控制(管理)
天文
计算机网络
网络数据包
物理
光学
统计
农学
生物
作者
Santanu Kumar Pradhan,Bidyadhar Subudhi
标识
DOI:10.1109/tase.2012.2189004
摘要
This paper exploits reinforcement learning (RL) for developing real-time adaptive control of tip trajectory and deflection of a two-link flexible manipulator handling variable payloads. This proposed adaptive controller consists of a proportional derivative (PD) tracking loop and an actor-critic-based RL loop that adapts the actor and critic weights in response to payload variations while suppressing the tip deflection and tracking the desired trajectory. The actor-critic-based RL loop uses a recursive least square (RLS)-based temporal difference (TD) learning with eligibility trace and an adaptive memory to estimate the critic weights and a gradient-based estimator for estimating actor weights. Tip trajectory tracking and suppression of tip deflection performances of the proposed RL-based adaptive controller (RLAC) are compared with that of a nonlinear regression-based direct adaptive controller (DAC) and a fuzzy learning-based adaptive controller (FLAC). Simulation and experimental results envisage that the RLAC outperforms both the DAC and FLAC.
科研通智能强力驱动
Strongly Powered by AbleSci AI