点云
计算机科学
分割
人工智能
目标检测
计算机视觉
水准点(测量)
Boosting(机器学习)
激光雷达
探测器
任务(项目管理)
RGB颜色模型
图像分割
稳健性(进化)
遥感
电信
生物化学
化学
管理
大地测量学
地质学
经济
基因
地理
作者
Shaoqing Xu,Dingfu Zhou,Jin Fang,Junbo Yin,Bin Zhou,Liangjun Zhang
标识
DOI:10.1109/itsc48978.2021.9564951
摘要
Accurate detection of obstacles in 3D is an essential task for autonomous driving and intelligent transportation. In this work, we propose a general multimodal fusion framework FusionPainting to fuse the 2D RGB image and 3D point clouds at a semantic level for boosting the 3D object detection task. Especially, the FusionPainting framework consists of three main modules: a multi-modal semantic segmentation module, an adaptive attention-based semantic fusion module, and a 3D object detector. First, semantic information is obtained for 2D image and 3D Lidar point clouds based on 2D and 3D segmentation approaches. Then the segmentation results from different sensors are adaptively fused based on the proposed attention-based semantic fusion module. Finally, the point clouds painted with the fused semantic label are sent to the 3D detector for obtaining the 3D objection results. The effectiveness of the proposed framework has been verified on the large-scale nuScenes detection benchmark by comparing with three different baselines. The experimental results show that the fusion strategy can significantly improve the detection performance compared to the methods using only point clouds, and the methods using point clouds only painted with 2D segmentation information. Furthermore, the proposed approach outperforms other state-of-the-art methods on the nuScenes testing benchmark. Code will be available at https://github.com/Shaoqing26/FusionPainting/.
科研通智能强力驱动
Strongly Powered by AbleSci AI