姿势
人工智能
计算机视觉
计算机科学
对象(语法)
特征(语言学)
融合
估计
模式识别(心理学)
基于对象
工程类
语言学
系统工程
哲学
作者
Xiaomei Lei,Wenhuan Lu,Jiu Yong,Jianguo Wei
出处
期刊:Electronics
[MDPI AG]
日期:2024-09-04
卷期号:13 (17): 3518-3518
被引量:2
标识
DOI:10.3390/electronics13173518
摘要
Six degrees-of-freedom (6D) object pose estimation plays an important role in pattern recognition of fields such as robotics and augmented reality. However, there are issues with low accuracy and real-time performance of 6D object pose estimation in complex scenes. To address these challenges, in this article, RFF-PoseNet (a 6D object pose estimation network based on robust feature fusion) is proposed for complex scenes. Firstly, a more lightweight Ghost module is used to replace the convolutional blocks in the feature extraction network. Then, a pyramid pooling module is added to the semantic label branch of PoseCNN to fuse the features of different pooling layers and enhance the network’s ability to capture information about objects in complex scenes and the correlations between contextual information. Finally, a pose regression and optimization module is utilized to further improve object pose estimation in complex scenes. Simulation experiments conducted on the YCB-Video and Occlusion LineMOD datasets show that the RFF-PoseNet algorithm can strengthen the correlation of features between different levels and the recognition ability of unclear targets, thereby achieving excellent accuracy and real-time performance, as well as strong robustness.
科研通智能强力驱动
Strongly Powered by AbleSci AI