计算机科学
初始化
分割
判别式
人工智能
模式识别(心理学)
变压器
计算机视觉
量子力学
物理
电压
程序设计语言
作者
Jiahao Liu,Jiacheng Deng,Chuxin Wang,Jianfeng He,Tianzhu Zhang
标识
DOI:10.1109/iccv51070.2023.01697
摘要
3D instance segmentation aims to predict a set of object instances in a scene and represent them as binary foreground masks with corresponding semantic labels. However, object instances are diverse in shape and category, and point clouds are usually sparse, unordered, and irregular, which leads to a query sampling dilemma. Besides, noise background queries interfere with proper scene perception and accurate instance segmentation. To address the above issues, we propose the Query Refinement Transformer termed QueryFormer. The key to our approach is to exploit a query initialization module to optimize the initialization process for the query distribution with a high coverage and low repetition rate. Additionally, we design an affiliated transformer decoder that suppresses the interference of noise background queries and helps the foreground queries focus on instance discriminative parts to predict final segmentation results. Extensive experiments on ScanNetV2 and S3DIS datasets show that our QueryFormer can surpass state-of-the-art 3D instance segmentation methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI