计算机科学
人工智能
分割
姿势
点云
RGB颜色模型
计算机视觉
对象(语法)
图像分割
过程(计算)
像素
尺度空间分割
模式识别(心理学)
操作系统
作者
Fang Gao,Qiujun Li,Qingyi Sun
标识
DOI:10.1109/smc53992.2023.10394339
摘要
The performance of 6D pose estimation, which is important for scene understanding, can be improved by more accurate object segmentation. RGB-D data including depth maps can provide more accurate position information than RGB data for semantic segmentation. In this work, we propose a novel two-stage RGBD-based pose estimation network, which can provide more precise semantic segmentation and effective point cloud features. Firstly, we use a lightweight semantic segmentation head to process the RBG-D data to get the pixel-level clustered mask, and then use a multi-scale and attention-based backbone to extract the point cloud features for pose estimation. We analyze the performance of our network on the YCB-Video dataset and the results show that our method is comparable to current state-of-the-art methods after optimization.
科研通智能强力驱动
Strongly Powered by AbleSci AI