计算机科学
统一内存访问
交错存储器
相似性(几何)
注册存储器
内存管理
德拉姆
共享内存
分布式存储器
节点(物理)
内存映射
平面存储模型
辅助存储器
并行计算
计算机硬件
半导体存储器
人工智能
结构工程
图像(数学)
工程类
作者
Wenjie Liu,Xubin He,Qing Liu
出处
期刊:IEEE Transactions on Parallel and Distributed Systems
[Institute of Electrical and Electronics Engineers]
日期:2023-03-01
卷期号:34 (3): 797-809
标识
DOI:10.1109/tpds.2022.3227544
摘要
With the increasing problem complexity, more irregular applications are deployed on high-performance clusters due to the parallel working paradigm, and yield irregular memory access behaviors across nodes. However, the irregularity of memory access behaviors is not comprehensively studied, which results in low utilization of the integrated hybrid memory system compositing of stacked DRAM and off-chip DRAM. To address this problem, we devise a novel method called Similarity-Managed Hybrid Memory System ( SM-HMS ) to improve the hybrid memory system performance by leveraging the memory access similarity among nodes in a cluster. Within SM-HMS , two techniques are proposed, Memory Access Similarity Measuring and Similarity-based Memory Access Behavior Sharing . To quantify the memory access similarity, memory access behaviors of each node are vectorized, and the distance between two vectors is used as the memory access similarity. The calculated memory access similarity is used to share memory access behaviors precisely across nodes. With the shared memory access behaviors, SM-HMS divides the stacked DRAM into two sections, the sliding window section and the outlier section . The shared memory access behaviors guide the replacement of the sliding window section while the outlier section is managed in the LRU manner. Our evaluation results with a set of irregular applications on various clusters consisting of up to 256 nodes have shown that SM-HMS outperforms the state-of-the-art approaches, Cameo , Chameleon , and Hyrbid2 , on job finish time reduction by up to $58.6\%$ , $56.7\%$ , and $31.3\%$ , with $46.1\%$ , $41.6\%$ , and $19.3\%$ on average, respectively. SM-HMS can also achieve up to $98.6\%$ ( $91.9\%$ on average) of the ideal hybrid memory system performance.
科研通智能强力驱动
Strongly Powered by AbleSci AI