Data parallelism
Computer science
Parallelism (grammar)
Parallel computing
Process (computing)
Task parallelism
Workload
Limit (mathematics)
Artificial neural network
Instruction-level parallelism
Artificial intelligence
Programming language
Mathematical analysis
Mathematics
Operating system
Authors
Weizheng Xu, Youtao Zhang, Xulong Tang
Source
Journal: Companion Proceedings of the Web Conference 2021
Date: 2021-04-19
Pages: 174-178
Cited by: 13
Identifier
DOI: 10.1145/3442442.3452055
Abstract
In recent years, Deep Neural Networks (DNNs) have emerged as a widely adopted approach in many application domains. Training DNN models is also becoming a significant fraction of the datacenter workload. Recent evidence has demonstrated that modern DNNs are becoming more complex and the size of DNN parameters (i.e., weights) is also increasing. In addition, a large amount of input data is required to train DNN models to reach target accuracy. As a result, training performance becomes one of the major challenges that limit DNN adoption in real-world applications. Recent works have explored different parallelism strategies (i.e., data parallelism and model parallelism) and used multiple GPUs in datacenters to accelerate the training process. However, naively adopting data parallelism and model parallelism across multiple GPUs can lead to sub-optimal execution. The major reasons are i) the large amount of data movement that prevents the system from feeding the GPUs with the required data in a timely manner (for data parallelism); and ii) low GPU utilization caused by data dependencies between layers placed on different devices (for model parallelism).
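To make the two strategies concrete, here is a minimal sketch, not taken from the paper, showing how each looks in PyTorch. The layer sizes, batch size, and two-GPU setup are illustrative assumptions; the class name `TwoDeviceNet` is hypothetical.

```python
# Minimal sketch (not from the paper) contrasting the two parallelism
# strategies the abstract describes. Assumes a machine with two CUDA GPUs;
# layer and batch sizes are illustrative.
import torch
import torch.nn as nn

# Data parallelism: each GPU holds a full replica of the model and works on
# a different slice of the batch. Feeding every replica its input slice and
# synchronizing gradients is the data-movement cost the abstract highlights.
model = nn.Sequential(nn.Linear(1024, 4096), nn.ReLU(), nn.Linear(4096, 10))
if torch.cuda.device_count() > 1:
    replicas = nn.DataParallel(model.cuda())       # scatters the batch across GPUs
    out = replicas(torch.randn(256, 1024).cuda())  # gathers outputs on GPU 0

# Model parallelism: layers live on different devices, so cuda:1 sits idle
# until cuda:0 finishes and its activations are copied over.
class TwoDeviceNet(nn.Module):
    def __init__(self):
        super().__init__()
        self.part0 = nn.Linear(1024, 4096).to("cuda:0")
        self.part1 = nn.Linear(4096, 10).to("cuda:1")

    def forward(self, x):
        h = torch.relu(self.part0(x.to("cuda:0")))
        return self.part1(h.to("cuda:1"))  # activation transfer between devices
```

In the model-parallel sketch, cuda:1 cannot start until the activation transfer from cuda:0 completes; this inter-layer dependency stall is exactly the source of the low GPU utilization the abstract points to.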