收缩阵列
卷积(计算机科学)
计算机科学
卷积神经网络
数据流
数组数据结构
人工神经网络
并行计算
算法
超大规模集成
人工智能
嵌入式系统
程序设计语言
作者
Rui Xu,Sheng Ma,Yaohua Wang,Yang Guo
标识
DOI:10.1109/iccd50377.2020.00089
摘要
The systolic array is one of the most popular choices for convolutional neural network accelerators. However, when computing special convolution, such as small-scale convolution or depthwise convolution, the utilization rate of the array fluctuates or even declines sharply. To address these issues, we design a configurable multi-directional systolic array (CMSA). The array can switch data mapping or dataflow for special convolution by changing the data transmission direction and configuring the array. Meanwhile, it keeps the original systolic array architecture and computing mode. Our design makes the systolic array flexible. Based on our evaluation, CMSA can increase the units utilization rate by up to 1.6× compared to the typical systolic array when running last layers of ResNet. When running depthwise convolution in MobileNet, CMSA can increase the utilization rate by up to 14.8×.
科研通智能强力驱动
Strongly Powered by AbleSci AI