计算机科学
瓶颈
德拉姆
并行计算
核(代数)
计算机硬件
嵌入式系统
数学
组合数学
作者
Lingxi Wu,Rasool Sharifi,Ashish Venkat,Kevin Skadron
标识
DOI:10.1109/lca.2022.3201168
摘要
Exact pattern matching is a widely used kernel in many applications. A DRAM-based processing-in-memory (PIM) architecture, Sieve, was recently proposed to alleviate the bottleneck stage of sequence matching in genomics. This paper observes that other exact-pattern-matching-intensive workloads can benefit from a similar architecture. We extend Sieve with several cost-effective modifications, such as a population count logic, chip-level parallelism support, and a hardware data transposition unit, making a general-purpose DRAM-CAM and key-value store that outperforms both CPU and various PIM solutions.
科研通智能强力驱动
Strongly Powered by AbleSci AI