已入深夜,您辛苦了!由于当前在线用户较少,发布求助请尽量完整地填写文献信息,科研通机器人24小时在线,伴您度过漫漫科研夜!祝你早点完成任务,早点休息,好梦!

A Distributed Framework for Large-scale Protein-protein Interaction Data Analysis and Prediction Using MapReduce

计算机科学 树(集合论) 比例(比率) 任务(项目管理) 数据挖掘 大数据 吞吐量 分布式计算 物理 管理 量子力学 经济 无线 数学分析 电信 数学
作者
Lun Hu,Shicheng Yang,Xin Luo,Huaqiang Yuan,Khaled Sedraoui,MengChu Zhou
出处
期刊:IEEE/CAA Journal of Automatica Sinica [Institute of Electrical and Electronics Engineers]
卷期号:9 (1): 160-172 被引量:67
标识
DOI:10.1109/jas.2021.1004198
摘要

Protein-protein interactions are of great significance for human to understand the functional mechanisms of proteins. With the rapid development of high-throughput genomic technologies, massive protein-protein interaction (PPI) data have been generated, making it very difficult to analyze them efficiently. To address this problem, this paper presents a distributed framework by reimplementing one of state-of-the-art algorithms, i.e., CoFex, using MapReduce. To do so, an in-depth analysis of its limitations is conducted from the perspectives of efficiency and memory consumption when applying it for large-scale PPI data analysis and prediction. Respective solutions are then devised to overcome these limitations. In particular, we adopt a novel tree-based data structure to reduce the heavy memory consumption caused by the huge sequence information of proteins. After that, its procedure is modified by following the MapReduce framework to take the prediction task distributively. A series of extensive experiments have been conducted to evaluate the performance of our framework in terms of both efficiency and accuracy. Experimental results well demonstrate that the proposed framework can considerably improve its computational efficiency by more than two orders of magnitude while retaining the same high accuracy.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
dax大雄完成签到 ,获得积分10
刚刚
1秒前
Layover完成签到 ,获得积分10
1秒前
合适缘分完成签到 ,获得积分10
3秒前
yn关注了科研通微信公众号
3秒前
山茶花开完成签到,获得积分10
3秒前
科研通AI6.4应助dqs采纳,获得10
4秒前
思源应助竺七采纳,获得10
5秒前
海洋完成签到 ,获得积分10
6秒前
SHJ发布了新的文献求助10
8秒前
8秒前
昌莆完成签到 ,获得积分10
11秒前
打打应助HB采纳,获得10
12秒前
12秒前
烟花应助科研通管家采纳,获得10
12秒前
Tong应助科研通管家采纳,获得10
12秒前
12秒前
东坡发布了新的文献求助10
13秒前
14秒前
随缘完成签到 ,获得积分10
18秒前
19秒前
19秒前
20秒前
yyf完成签到 ,获得积分10
22秒前
东坡完成签到,获得积分10
22秒前
23秒前
Jodie发布了新的文献求助10
24秒前
25秒前
Qi完成签到 ,获得积分10
25秒前
在水一方应助HB采纳,获得10
25秒前
竺七发布了新的文献求助10
25秒前
So发布了新的文献求助10
29秒前
30秒前
30秒前
NiceSunnyDay完成签到 ,获得积分10
32秒前
32秒前
33秒前
LX有理想完成签到 ,获得积分10
34秒前
Xenomorph给Xenomorph的求助进行了留言
34秒前
zzz发布了新的文献求助10
35秒前
高分求助中
卤化钙钛矿人工突触的研究 2000
Malcolm Fraser : a biography 700
Signals, Systems, and Signal Processing 610
Software that combines deep learning,3D reconstruction and CFD to analyze the state of carotid arteries from ultrasound imaging 500
Bounds for Statistical Estimation in Semiparametric Models 500
Forced degradation and stability indicating LC method for Letrozole: A stress testing guide 500
Ideology and Meaning-Making under the Putin Regime 450
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6495194
求助须知:如何正确求助?哪些是违规求助? 8292076
关于积分的说明 17694484
捐赠科研通 5588685
什么是DOI,文献DOI怎么找? 2916457
邀请新用户注册赠送积分活动 1893336
关于科研通互助平台的介绍 1752403