Improving in-memory file system reading performance by fine-grained user-space cache mechanisms

计算机科学 隐藏物 缓存污染 缓存着色 页面缓存 智能缓存 缓存失效 缓存算法 缓存不经意算法 操作系统 分布式计算 并行计算 CPU缓存
作者
Rong Gu,Chongjie Li,Haipeng Dai,Yili Luo,Xiaolong Xu,Shaohua Wan,Yihua Huang
出处
期刊:Journal of Systems Architecture [Elsevier BV]
卷期号:115: 101994-101994 被引量:2
标识
DOI:10.1016/j.sysarc.2021.101994
摘要

Nowadays, as the memory capacity of servers become larger and larger, distributed in-memory file systems, which enable applications to interact with data at fast speed, have been widely used. However, the existing distributed in-memory file systems still face the problem of low data access performance in small data reading, which seriously reduce their usefulness in many important big data scenarios. In this paper, we analyze the factors that affect the performance of reading in-memory files and propose a two-layer user space cache management mechanism: in the first layer, we cache data packet references to reduce frequent page fault interruptions (packet-level cache); in the second layer, we cache and manage small file data units to avoid redundant inter-process communications (object-level cache). We further design a fine-grained caching model based on the submodular function optimization theory, for efficiently managing the variable-length cache units with partially overlapping fragments on the client side. Experimental results on synthetic and real-world workloads show that compared with the existing cutting-edge systems, the first level cache can double the reading performance on average, and the second level cache can improve random reading performance by more than 4 times. Our caching strategies also outperform the cutting-edge cache algorithms over 20% on hit ratio. Furthermore, the proposed client-side caching framework idea has been adopted by the Alluxio open source community, which shows the practical benefits of this work. • A packet-level cache policy for reducing page faults during sequential data reading. • An in-memory cache layer to cache the variable-length hot file fragments. • Proved that the variable-length cache problem is a submodular optimization problem theoretically. • Comprehensive performance evaluation of the client-side cache strategies.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
1秒前
1秒前
1秒前
可爱的函函应助zzy加油采纳,获得10
2秒前
2秒前
2秒前
2秒前
科研通AI6.2应助hsx采纳,获得10
3秒前
3秒前
dablack完成签到,获得积分10
3秒前
科研通AI6.1应助li采纳,获得10
3秒前
云一完成签到,获得积分10
3秒前
已注销关注了科研通微信公众号
4秒前
4秒前
4秒前
4秒前
齐冉完成签到 ,获得积分10
4秒前
小包子完成签到,获得积分10
5秒前
SPEAKERZ发布了新的文献求助10
6秒前
6秒前
李立发布了新的文献求助10
6秒前
6秒前
Tricia发布了新的文献求助10
7秒前
卢仕成完成签到,获得积分10
7秒前
lllllll发布了新的文献求助10
7秒前
7秒前
夏天发布了新的文献求助10
7秒前
小晨晨发布了新的文献求助10
7秒前
Jr L发布了新的文献求助10
7秒前
8秒前
Dabiel1213完成签到,获得积分10
8秒前
852应助重要的书桃采纳,获得10
8秒前
ding应助HasenPanzer采纳,获得10
8秒前
深情安青应助Alma采纳,获得10
8秒前
9秒前
10秒前
科研通AI6.1应助英子采纳,获得10
10秒前
10秒前
10秒前
罗丹丹完成签到,获得积分10
10秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Adhesion Science: Principles & Practice 800
The Graphene Handbook (2019 Edition) 700
Signals, Systems, and Signal Processing 610
IEST-RP-CC018: Cleanroom Cleaning and Sanitization: Operating and Monitoring Procedures 600
Fundamentals of Pharmaceutical and Biologics Regulations: A Global Perspective, Second Edition 600
Fundamentals of Modern Mathematics: A Practical Review (Dover Books on Mathematics) 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6532836
求助须知:如何正确求助?哪些是违规求助? 8325843
关于积分的说明 17831269
捐赠科研通 5634110
什么是DOI,文献DOI怎么找? 2933548
邀请新用户注册赠送积分活动 1909880
关于科研通互助平台的介绍 1768819