Artificial intelligence
Computer science
Gripper
Transformer
Inference
Computer vision
Underactuation
Convolutional neural network
Pattern recognition (psychology)
Robot
Engineering
Voltage
Mechanical engineering
Electrical engineering
Authors
Yunhai Han, Kelin Yu, Rahul Batra, Nathan Boyd, C. H. Mehta, Tuo Zhao, Yu She, Seth Hutchinson, Ye Zhao
Source
Journal: IEEE/ASME Transactions on Mechatronics
[Institute of Electrical and Electronics Engineers]
Date: 2024-01-01
Pages: 1-13
Citations: 2
Identifier
DOI: 10.1109/tmech.2024.3400789
Abstract
Reliable robotic grasping, especially of deformable objects such as fruits, remains challenging due to underactuated contact interactions with the gripper and unknown object dynamics and geometries. In this study, we propose a Transformer-based robotic grasping framework for rigid grippers that leverages tactile and visual information for safe object grasping. Specifically, the Transformer models learn physical feature embeddings from sensor feedback collected while performing two pre-defined exploratory actions (pinching and sliding), and predict the grasping outcome for a given grasping strength through a multilayer perceptron (MLP). Using these predictions, the gripper infers a safe grasping strength. Compared with convolutional recurrent networks, the Transformer models can capture long-term dependencies across image sequences and process spatial-temporal features simultaneously. We first benchmark the Transformer models on a public dataset for slip detection. Following that, we show that the Transformer models outperform a CNN+LSTM model in terms of grasping accuracy and computational efficiency. We also collect our own fruit grasping dataset and conduct online grasping experiments using the proposed framework for both seen and unseen fruits. Our code and dataset are publicly available on GitHub.
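The pipeline described in the abstract can be illustrated with a minimal sketch: a Transformer encoder over the tactile/visual feature sequences gathered during the pinch and slide actions, an MLP head that scores a candidate grasping strength, and a search for the smallest strength predicted to be safe. All module names, dimensions, pooling choices, and the threshold-based strength search below are illustrative assumptions, not the authors' released implementation (see their GitHub repository for that).

```python
# Hypothetical sketch of a Transformer-based grasp-outcome predictor.
# Not the authors' code; architecture details are assumptions for illustration.
import torch
import torch.nn as nn


class GraspOutcomePredictor(nn.Module):
    def __init__(self, feat_dim=256, n_heads=4, n_layers=2, seq_len=20):
        super().__init__()
        # Learnable positional embedding so attention sees temporal order.
        self.pos_emb = nn.Parameter(torch.zeros(1, seq_len, feat_dim))
        layer = nn.TransformerEncoderLayer(
            d_model=feat_dim, nhead=n_heads, batch_first=True
        )
        self.encoder = nn.TransformerEncoder(layer, num_layers=n_layers)
        # MLP head: pooled sequence embedding + scalar grasping strength -> success probability.
        self.mlp = nn.Sequential(
            nn.Linear(feat_dim + 1, 128), nn.ReLU(), nn.Linear(128, 1)
        )

    def forward(self, seq_feats, strength):
        # seq_feats: (B, T, feat_dim) tactile/visual embeddings from pinch + slide
        # strength:  (B, 1) candidate grasping strength
        x = self.encoder(seq_feats + self.pos_emb[:, : seq_feats.size(1)])
        pooled = x.mean(dim=1)  # temporal average pooling (one of several possible choices)
        return torch.sigmoid(self.mlp(torch.cat([pooled, strength], dim=-1)))


def infer_safe_strength(model, seq_feats, candidates, threshold=0.9):
    """Return the smallest candidate strength whose predicted success exceeds the threshold.

    Assumes a single grasp instance (batch size 1).
    """
    with torch.no_grad():
        for s in sorted(candidates):
            strength = torch.full((seq_feats.size(0), 1), float(s))
            if model(seq_feats, strength).item() > threshold:
                return s
    return max(candidates)  # fall back to the strongest grasp if none qualifies
```

Average pooling over time is only one way to aggregate the sequence embedding; the paper's actual model may use a classification token or another aggregation scheme, and the safe strength could equally be chosen by a different search or optimization over the outcome predictions.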