点云
计算机科学
最小边界框
推论
目标检测
跳跃式监视
图像扭曲
探测器
人工智能
对象(语法)
云计算
卷积神经网络
特征提取
计算机视觉
模式识别(心理学)
数据挖掘
图像(数学)
电信
操作系统
作者
Chenhang He,Hui Zeng,Jianqiang Huang,Xian-Sheng Hua,Lei Zhang
标识
DOI:10.1109/cvpr42600.2020.01189
摘要
3D object detection from point cloud data plays an essential role in autonomous driving. Current single-stage detectors are efficient by progressively downscaling the 3D point clouds in a fully convolutional manner. However, the downscaled features inevitably lose spatial information and cannot make full use of the structure information of 3D point cloud, degrading their localization precision. In this work, we propose to improve the localization precision of single-stage detectors by explicitly leveraging the structure information of 3D point cloud. Specifically, we design an auxiliary network which converts the convolutional features in the backbone network back to point-level representations. The auxiliary network is jointly optimized, by two point-level supervisions, to guide the convolutional features in the backbone network to be aware of the object structure. The auxiliary network can be detached after training and therefore introduces no extra computation in the inference stage. Besides, considering that single-stage detectors suffer from the discordance between the predicted bounding boxes and corresponding classification confidences, we develop an efficient part-sensitive warping operation to align the confidences to the predicted bounding boxes. Our proposed detector ranks at the top of KITTI 3D/BEV detection leaderboards and runs at 25 FPS for inference.
科研通智能强力驱动
Strongly Powered by AbleSci AI