Attention-based object pose estimation with feature fusion and geometry enhancement

姿势 人工智能 计算机视觉 特征(语言学) 融合 对象(语法) 计算机科学 几何学 模式识别(心理学) 数学 语言学 哲学
作者
Shuai Yang,Bin Wang,Junyuan Tao,Zhe Ruan,Hong Liu
出处
期刊:Industrial Robot-an International Journal [Emerald Publishing Limited]
卷期号:52 (4): 581-590
标识
DOI:10.1108/ir-08-2024-0366
摘要

Purpose The 6D pose estimation is a crucial branch of robot vision. However, the authors find that due to the failure to make full use of the complementarity of the appearance and geometry information of the object, the failure to deeply explore the contributions of the features from different regions to the pose estimation, and the failure to take advantage of the invariance of the geometric structure of keypoints, the performances of the most existing methods are not satisfactory. This paper aims to design a high-precision 6D pose estimation method based on above insights. Design/methodology/approach First, a multi-scale cross-attention-based feature fusion module (MCFF) is designed to aggregate the appearance and geometry information by exploring the correlations between appearance features and geometry features in the various regions. Second, the authors build a multi-query regional-attention-based feature differentiation module (MRFD) to learn the contribution of each region to each keypoint. Finally, a geometric enhancement mechanism (GEM) is designed to use structure information to predict keypoints and optimize both pose and keypoints in the inference phase. Findings Experiments on several benchmarks and real robot show that the proposed method performs better than existing methods. Ablation studies illustrate the effectiveness of each module of the authors’ method. Originality/value A high-precision 6D pose estimation method is proposed by studying the relationship between the appearance and geometry from different object parts and the geometric invariance of the keypoints, which is of great significance for various robot applications.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
Jasper应助33采纳,获得10
刚刚
NexusExplorer应助奶冻采纳,获得10
刚刚
1秒前
2秒前
Li梨发布了新的文献求助10
2秒前
4秒前
5秒前
科研通AI6.2应助周媚媚采纳,获得10
5秒前
6秒前
6秒前
传奇3应助青黛采纳,获得10
6秒前
过冷风发布了新的文献求助30
6秒前
YJDlXX完成签到,获得积分10
8秒前
深情安青应助南星采纳,获得10
8秒前
Ava应助小恩采纳,获得10
9秒前
受昂夫应助啧啧啧采纳,获得10
9秒前
9秒前
10秒前
orixero应助谨慎果汁采纳,获得10
11秒前
小马甲应助caixk采纳,获得10
12秒前
13秒前
贝塔完成签到 ,获得积分10
13秒前
Georges-09发布了新的文献求助10
13秒前
14秒前
Wang发布了新的文献求助10
15秒前
15秒前
16秒前
奶冻发布了新的文献求助10
17秒前
SciGPT应助虚幻笑晴采纳,获得10
18秒前
kylin发布了新的文献求助10
18秒前
科研通AI2S应助xiiiiiin采纳,获得10
19秒前
青黛完成签到,获得积分10
19秒前
林鹿发布了新的文献求助10
19秒前
20秒前
打打应助姜鹏采纳,获得10
20秒前
Wss完成签到 ,获得积分10
22秒前
23秒前
24秒前
科研通AI6.2应助lsly采纳,获得10
24秒前
123456发布了新的文献求助10
25秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Developing Genetic Editing Tools for Lysobacter 2000
Моделирование процессов самоорганизации в кристаллообразующих системах 1000
Adhesion Science: Principles & Practice 800
Signals, Systems, and Signal Processing 610
IEST-RP-CC018: Cleanroom Cleaning and Sanitization: Operating and Monitoring Procedures 600
Fundamentals of Pharmaceutical and Biologics Regulations: A Global Perspective, Second Edition 600
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6528272
求助须知:如何正确求助?哪些是违规求助? 8321362
关于积分的说明 17813807
捐赠科研通 5629908
什么是DOI,文献DOI怎么找? 2930672
邀请新用户注册赠送积分活动 1907425
关于科研通互助平台的介绍 1766795