稳健性(进化)
强化学习
计算机科学
任务(项目管理)
人工智能
分解
能源消耗
机器人
理论(学习稳定性)
控制理论(社会学)
基线(sea)
算法
机械臂
动作(物理)
工程类
控制(管理)
功能(生物学)
控制工程
能量(信号处理)
机器人学
国家(计算机科学)
期限(时间)
控制系统
智能控制
分级控制系统
分层数据库模型
遗传算法
任务分析
标识
DOI:10.1049/icp.2025.4584
摘要
An adaptive operation model for intelligent robotic arms based on the integration of the Soft Actor–Critic (SAC) algorithm and hierarchical teaching is proposed. Building upon the existing reinforcement learning control framework, this model incorporates hierarchical teaching's high-level task decomposition and low-level action optimization mechanisms, establishing a comprehensive system spanning state modeling, reward function design, and policy execution. Experimental validation demonstrates that Hierarchical Demonstration Soft Actor–Critic (HD-SAC) achieves a 94.6% success rate in grasping and assembly tasks—approximately 11% higher than traditional SAC—while reducing energy consumption to 111.8 J and completion time to 16.2 s. Even under perturbed conditions, it maintains a success rate above 85% (with the highest reaching 89.3%) and lowers the collision rate to 4.3%, significantly outperforming baseline methods such as SAC and Twin Delayed Deep Deterministic Policy Gradient (TD3). The results confirm that this method exhibits superior stability and robustness in complex tasks.
科研通智能强力驱动
Strongly Powered by AbleSci AI