Deep Reinforcement Learning Enhanced Greedy Algorithm for Online Scheduling of Batched Tasks in Cloud in Cloud HPC Systems

计算机科学 云计算 服务器 强化学习 调度(生产过程) 贪婪算法 任务(项目管理) 作业车间调度 分布式计算 算法 人工智能 操作系统 数学优化 计算机网络 数学 布线(电子设计自动化) 管理 经济
作者
Yuanhao Yang,Hong Shen
出处
期刊:IEEE Transactions on Parallel and Distributed Systems [Institute of Electrical and Electronics Engineers]
卷期号:: 1-1 被引量:8
标识
DOI:10.1109/tpds.2021.3138459
摘要

In a large cloud data center HPC system, a critical problem is how to allocate the submitted tasks to heterogenous servers for achieving the goal of maximize systems net gain defined as the value of completed tasks minus system operation cost. We consider this problem in the online setting that tasks arrive in batches and propose a novel deep reinforcement learning (DRL) enhanced greedy algorithm of two-stage scheduling interacting task sequencing and task allocation. For task sequencing we deploy a DRL module to make prediction for the best allocation sequence for each arriving batch of tasks based on knowledge (allocation strategies) learnt from prior batches. For task allocation, we propose a greedy strategy that allocates tasks to servers one by one online following the allocation sequence to maximally increase the total gain. We show that our greedy strategy has a performance guarantee of competitive ratio 1/(1+k) to the optimal offline solution, which improves the existing result for the same problem, where k is upper bounded by the maximum cost-to-gain ratio of each task. While our DRL module enhances the greedy by providing the likely-optimal allocation sequence for each batch of arriving tasks, our greedy strategy bounds DRLs prediction error within a proven performance guarantee for any allocation sequence, enabling a better solution quality than that obtainable from both DRL and greedy optimization alone. Extensive experiment evaluation results in both simulation and real application environments demonstrate the effectiveness and efficiency of our proposed algorithm. Compared with the state-of-the-art baselines, our algorithm increases the system gain by about 10% to 30%. Our algorithm provides an interesting example of joining machine-learning and greedy optimization techniques to improve ML-based solutions with a worst-case performance guarantee for solving hard optimization problems.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
多情的灵安完成签到,获得积分10
刚刚
1秒前
Fury完成签到,获得积分10
1秒前
虎皮猫大人完成签到,获得积分10
1秒前
1秒前
2秒前
在水一方应助nn采纳,获得10
2秒前
3秒前
lisitian完成签到,获得积分10
3秒前
唱跳双c完成签到,获得积分10
3秒前
bkagyin应助冷艳的半莲采纳,获得10
3秒前
niniyiya发布了新的文献求助10
4秒前
4秒前
5秒前
传奇3应助aabsd采纳,获得10
5秒前
Fury发布了新的文献求助10
5秒前
熠熠生辉发布了新的文献求助10
5秒前
5秒前
6秒前
6秒前
内向芷雪发布了新的文献求助10
6秒前
郇郇发布了新的文献求助10
6秒前
Bassss发布了新的文献求助20
7秒前
7秒前
FashionBoy应助蓝天采纳,获得10
7秒前
浮生只梦欢完成签到,获得积分10
7秒前
笨笨山芙发布了新的文献求助10
8秒前
fannyeast完成签到,获得积分10
8秒前
8秒前
8秒前
9秒前
9秒前
小蘑菇应助239287采纳,获得10
9秒前
淡丹丹关注了科研通微信公众号
9秒前
李爱国应助追寻纲采纳,获得10
9秒前
10秒前
陆玖笙完成签到 ,获得积分10
10秒前
11秒前
在水一方应助HM采纳,获得10
11秒前
阿良发布了新的文献求助10
11秒前
高分求助中
GL 2 A method for assessing the in-place cleanability of food processing equipment, Fourth Edition, December 2023 3000
Annie Ernaux: De la perte au corps glorieux 600
Writing Systems 500
Media Today Mass Communication in a Converging World 9th Edition 400
Understanding Modeling and Simulation of Polymerization Reactions 400
Invited Discussant 63O and 64O 400
A revision of Limenitis helmanni and its related species (Nymphalidae) from Central and South China 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6833660
求助须知:如何正确求助?哪些是违规求助? 8543954
关于积分的说明 18178255
捐赠科研通 6178076
什么是DOI,文献DOI怎么找? 3037725
关于科研通互助平台的介绍 2023882
邀请新用户注册赠送积分活动 2014748