卷积神经网络
计算机科学
卷积(计算机科学)
点云
现场可编程门阵列
散列函数
计算科学
并行计算
计算
加速
云计算
人工智能
人工神经网络
算法
计算机硬件
操作系统
计算机安全
作者
Gong-Lang Zhou,Kaiyuan Guo,Xiang Chen,Kwok Wa Leung
标识
DOI:10.1109/fccm57271.2023.00039
摘要
3D convolutional neural networks (CNNs) are commonly used to process and analyze point clouds for object detection. However, the submanifold sparse convolutions and traditional sparse convolutions, which play important roles in 3D CNNs on point cloud, often encounter performance issues when accelerated by existing convolutional neural network accelerators due to the unstructured sparsity of sparse convolution. In this paper, we present SpCNA, an FPGA-based accelerator for sparse CNNs on 3D point clouds. To avoid computations that involve a large number of zeros in point cloud, it utilizes hash tables to map input-weight-output pairs. The computational performance of our design achieves 11.00 GFLOP/s. It can provide 1.2× and 1.98× energy efficiency improvement compared to Nvidia Xavier SoC and Nvidia 2080ti GPU, respectively.
科研通智能强力驱动
Strongly Powered by AbleSci AI