人工智能
强化学习
计算机科学
模仿
机器人
对象(语法)
过程(计算)
机器人学
心理干预
任务(项目管理)
计算机视觉
人机交互
心理学
工程类
操作系统
精神科
社会心理学
系统工程
作者
Yafei Ou,Sadra Zargarzadeh,Mahdi Tavakoli
出处
期刊:Journal of medical robotics research
[World Scientific]
日期:2023-10-14
卷期号:08 (03n04)
标识
DOI:10.1142/s2424905x23400044
摘要
Recent studies in surgical robotics have focused on automating common surgical subtasks such as grasping and manipulation using deep reinforcement learning (DRL). In this work, we consider surgical endoscopic camera control for object tracking e.g. using the endoscopic camera manipulator (ECM) from the da Vinci Research Kit (dVRK) (Intuitive Inc., Sunnyvale, CA, USA) as a typical surgical robot learning task. A DRL policy for controlling the robot joint space movements is first trained in a simulation environment and then continues the learning in the real world. To speed up training and avoid significant failures (in this case, losing view of the object), human interventions are incorporated into the training process and regular DRL is combined with generative adversarial imitation learning (GAIL) to encourage imitating human behaviors. Experiments show that an average reward of 159.8 can be achieved within 1000 steps compared to only 121.8 without human interventions, and the view of the moving object is lost only twice during the training process out of 3 trials. These results show that human interventions can improve learning speed and significantly reduce failures during the training process.
科研通智能强力驱动
Strongly Powered by AbleSci AI