分类
强化学习
计算机科学
抓住
人工智能
一般化
理论(学习稳定性)
机器人
排序算法
集合(抽象数据类型)
机器学习
算法
数学
数学分析
程序设计语言
作者
Qing An,Yanhua Chen,Hui Zeng,Junhua Wang
标识
DOI:10.1142/s1793962323410076
摘要
Radioactive waste sorting often faces an unstructured and locally radioactive working environment. At present, remote operation sorting has problems such as low sorting efficiency, greater difficulty in operation, longer training periods for personnel, and poor autonomous control capabilities. Based on the premise of improving the adaptability and autonomous operation ability of robots in an unstructured environment, this paper uses the dual deep Q learning algorithm to optimize the classic deep Q learning algorithm to improve training speed and improve sorting efficiency and stability. Secondly, the sorting algorithm model of deep reinforcement learning is used to determine the optimal behavior in this state. Set up multiple sets of simulations and physical experiments to verify the sorting method. The results show that the robotic arm can autonomously complete sorting tasks under complex conditions and can significantly improve work efficiency when pushing and grasping collaborative operations and will preferentially grasp objects with high radioactivity in the radioactive area. The algorithm has migration ability and good generalization.
科研通智能强力驱动
Strongly Powered by AbleSci AI