Improving in-memory file system reading performance by fine-grained user-space cache mechanisms

计算机科学 隐藏物 缓存污染 缓存着色 页面缓存 智能缓存 缓存失效 缓存算法 缓存不经意算法 操作系统 分布式计算 并行计算 CPU缓存
作者
Rong Gu,Chongjie Li,Haipeng Dai,Yili Luo,Xiaolong Xu,Shaohua Wan,Yihua Huang
出处
期刊:Journal of Systems Architecture [Elsevier BV]
卷期号:115: 101994-101994 被引量:2
标识
DOI:10.1016/j.sysarc.2021.101994
摘要

Nowadays, as the memory capacity of servers become larger and larger, distributed in-memory file systems, which enable applications to interact with data at fast speed, have been widely used. However, the existing distributed in-memory file systems still face the problem of low data access performance in small data reading, which seriously reduce their usefulness in many important big data scenarios. In this paper, we analyze the factors that affect the performance of reading in-memory files and propose a two-layer user space cache management mechanism: in the first layer, we cache data packet references to reduce frequent page fault interruptions (packet-level cache); in the second layer, we cache and manage small file data units to avoid redundant inter-process communications (object-level cache). We further design a fine-grained caching model based on the submodular function optimization theory, for efficiently managing the variable-length cache units with partially overlapping fragments on the client side. Experimental results on synthetic and real-world workloads show that compared with the existing cutting-edge systems, the first level cache can double the reading performance on average, and the second level cache can improve random reading performance by more than 4 times. Our caching strategies also outperform the cutting-edge cache algorithms over 20% on hit ratio. Furthermore, the proposed client-side caching framework idea has been adopted by the Alluxio open source community, which shows the practical benefits of this work. • A packet-level cache policy for reducing page faults during sequential data reading. • An in-memory cache layer to cache the variable-length hot file fragments. • Proved that the variable-length cache problem is a submodular optimization problem theoretically. • Comprehensive performance evaluation of the client-side cache strategies.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
2秒前
2秒前
烟花应助Lee采纳,获得10
3秒前
hsy309完成签到,获得积分10
3秒前
Daisy完成签到,获得积分10
3秒前
4秒前
忐忑的蓝血完成签到,获得积分10
4秒前
CipherSage应助拾染采纳,获得10
5秒前
差异显著发布了新的文献求助10
5秒前
勇yi完成签到,获得积分10
5秒前
chenxin完成签到,获得积分10
5秒前
珹钰钰完成签到 ,获得积分10
6秒前
6秒前
Luojiayi完成签到,获得积分10
6秒前
何文珍发布了新的文献求助30
6秒前
7秒前
大模型应助wuhuhu采纳,获得10
8秒前
科研通AI6.2应助zbidnh采纳,获得30
9秒前
慕容松完成签到,获得积分10
10秒前
11秒前
11秒前
柏果完成签到,获得积分10
11秒前
ljx完成签到,获得积分10
12秒前
丘比特应助Xie采纳,获得10
13秒前
13秒前
思源应助xx采纳,获得10
13秒前
差异显著完成签到,获得积分10
14秒前
香蕉觅云应助何文珍采纳,获得10
14秒前
14秒前
科研通AI6.2应助斯文画板采纳,获得10
15秒前
15秒前
Oasis完成签到,获得积分10
15秒前
Itazu完成签到,获得积分10
16秒前
16秒前
Lemon发布了新的文献求助10
16秒前
17秒前
Ava应助Qiaoqiao采纳,获得10
17秒前
蓝莓酱蘸橘子完成签到 ,获得积分10
18秒前
18秒前
拾染完成签到,获得积分10
19秒前
高分求助中
Clinical Epidemiology: The Essentials, 6e 10000
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
The Graphene Handbook (2019 Edition) 800
Adhesion Science: Principles & Practice 800
Signals, Systems, and Signal Processing 610
Fundamentals of Pharmaceutical and Biologics Regulations: A Global Perspective, Second Edition 600
The Immune System (Fifth Edition) 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6558238
求助须知:如何正确求助?哪些是违规求助? 8341642
关于积分的说明 17872274
捐赠科研通 5677554
什么是DOI,文献DOI怎么找? 2941084
邀请新用户注册赠送积分活动 1916888
关于科研通互助平台的介绍 1788227