A Parallel Multiclassification Algorithm for Big Data Using an Extreme Learning Machine

大数据 人工智能 机器学习 人工神经网络 并行算法 深度学习
作者
Mingxing Duan,Kenli Li,Xiangke Liao,Keqin Li
出处
期刊:IEEE Transactions on Neural Networks [Institute of Electrical and Electronics Engineers]
卷期号:29 (6): 2337-2351 被引量:73
标识
DOI:10.1109/tnnls.2017.2654357
摘要

As data sets become larger and more complicated, an extreme learning machine (ELM) that runs in a traditional serial environment cannot realize its ability to be fast and effective. Although a parallel ELM (PELM) based on MapReduce to process large-scale data shows more efficient learning speed than identical ELM algorithms in a serial environment, some operations, such as intermediate results stored on disks and multiple copies for each task, are indispensable, and these operations create a large amount of extra overhead and degrade the learning speed and efficiency of the PELMs. In this paper, an efficient ELM based on the Spark framework (SELM), which includes three parallel subalgorithms, is proposed for big data classification. By partitioning the corresponding data sets reasonably, the hidden layer output matrix calculation algorithm, matrix $\mathbf {\hat {U}}$ decomposition algorithm, and matrix $\mathbf {V}$ decomposition algorithm perform most of the computations locally. At the same time, they retain the intermediate results in distributed memory and cache the diagonal matrix as broadcast variables instead of several copies for each task to reduce a large amount of the costs, and these actions strengthen the learning ability of the SELM. Finally, we implement our SELM algorithm to classify large data sets. Extensive experiments have been conducted to validate the effectiveness of the proposed algorithms. As shown, our SELM achieves an $8.71\times$ speedup on a cluster with ten nodes, and reaches a $13.79\times$ speedup with 15 nodes, an $18.74\times$ speedup with 20 nodes, a $23.79\times$ speedup with 25 nodes, a $28.89\times$ speedup with 30 nodes, and a $33.81\times$ speedup with 35 nodes.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
焱阳完成签到 ,获得积分10
1秒前
Biofly526完成签到,获得积分10
1秒前
臭皮完成签到,获得积分10
3秒前
Chem34完成签到,获得积分10
3秒前
Sally完成签到 ,获得积分10
6秒前
xxxxam完成签到,获得积分10
7秒前
777完成签到 ,获得积分10
7秒前
tingalan完成签到,获得积分10
7秒前
9秒前
马騳骉完成签到,获得积分10
10秒前
爱学习的医学小白完成签到 ,获得积分10
11秒前
HC3完成签到 ,获得积分10
12秒前
12秒前
123完成签到 ,获得积分10
14秒前
betty完成签到 ,获得积分10
19秒前
mingtian完成签到,获得积分10
22秒前
yueyue爱科研完成签到,获得积分10
22秒前
自然梦岚完成签到 ,获得积分10
22秒前
24秒前
风屿完成签到 ,获得积分10
25秒前
耶耶耶完成签到 ,获得积分10
27秒前
wangsiheng发布了新的文献求助10
28秒前
Chikit完成签到,获得积分10
32秒前
CAST1347完成签到,获得积分10
32秒前
夏侯卿完成签到,获得积分10
33秒前
脑洞疼应助科研通管家采纳,获得10
34秒前
忐忑的甜瓜完成签到,获得积分10
36秒前
胡图图完成签到,获得积分10
37秒前
约礼完成签到,获得积分10
37秒前
点点123完成签到,获得积分10
38秒前
wangsiheng完成签到,获得积分10
39秒前
sukiyaki完成签到,获得积分10
41秒前
42秒前
huangqian完成签到,获得积分10
45秒前
韭菜盒子发布了新的文献求助10
46秒前
静默完成签到 ,获得积分10
47秒前
abc105完成签到 ,获得积分10
48秒前
时光倒流ltt完成签到 ,获得积分10
49秒前
livra1058完成签到,获得积分10
50秒前
孙非完成签到,获得积分10
52秒前
高分求助中
Teaching Social and Emotional Learning in Physical Education 900
Survey Development 600
Particle strengthening of metals and alloys 500
Monocentric experience of transforaminal endoscopic lumbar discectomy and foraminotomy outcomes: pushing the indications and avoiding failure. Report of 200 cases 400
Transferrin affects food intake and reproduction in the hard tick Haemaphysalis longicornis 400
Lexique et typologie des poteries: pour la normalisation de la description des poteries (Full Book) 400
Sustainable Land Management: Strategies to Cope with the Marginalisation of Agriculture 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 有机化学 工程类 生物化学 纳米技术 物理 内科学 计算机科学 化学工程 复合材料 遗传学 基因 物理化学 催化作用 电极 光电子学 量子力学
热门帖子
关注 科研通微信公众号,转发送积分 2354974
求助须知:如何正确求助?哪些是违规求助? 2061402
关于积分的说明 5142550
捐赠科研通 1791537
什么是DOI,文献DOI怎么找? 894926
版权声明 557276
科研通“疑难数据库(出版商)”最低求助积分说明 477591