计算机科学
水准点(测量)
静态随机存取存储器
并行计算
计算机体系结构
算法
人工智能
计算机硬件
大地测量学
地理
作者
Xiaochen Peng,Ankit Kaul,Muhannad S. Bakir,Shimeng Yu
标识
DOI:10.1109/ted.2021.3111857
摘要
Emerging nonvolatile memory (eNVM)-based compute-in-memory (CIM) accelerators have been proven in silicon for machine learning at the macrolevel. To fully unleash the system-level benefits, the heterogeneous 3-D integration (H3D) using through-silicon via (TSV) is a promising approach, to: 1) address the challenges of area-hungry peripheries in CIM accelerators; 2) solve the 2-D scaling challenges of eNVM; and 3) stack enormous amount of embedded memories that are required in state-of-the-art deep neural network models. This article presents an electrical-thermal co-design of multitier CIM accelerators, based on SRAM and/or eNVM, with hybrid technology nodes for logic and memory tiers. We benchmark the CIM accelerators on 8-bit ResNet-34 for ImageNet recognition, with layer-by-layer and pipelined schemes, respectively. By sweeping TSV diameter from $30~\mu \text{m}$ to 100 nm, we investigate the tradeoffs of system performance metrics (TOPS/W, TOPS, and TOPS/mm 2 ) and H3D challenges (thermal and IR-drop in power delivery). Finally, we find the sweet spot of TSV diameter for multitier H3D system is 1– $3~\mu \text{m}$ , to guarantee balanced area-overhead, performance, and IR-drop in power delivery. The extended benchmark framework is released on GitHub ( https://github.com/neurosim ) as an open-source tool for the research community.
科研通智能强力驱动
Strongly Powered by AbleSci AI