Reliability-Aware Deep Reinforcement Learning Scheduling for Microservices in Maritime Distributed Multi-Cloud Environment

计算机科学强化学习分布式计算动态优先级调度作业车间调度调度（生产过程）公平份额计划马尔可夫决策过程两级调度单调速率调度云计算微服务可靠性（半导体）循环调度启发式最早截止时间优先安排任务分析马尔可夫过程固定优先级先发制人调度实时计算动态规划学习效果

作者

Hao Qiu,Chen Ma,Jingqi Wu,Ning Li

标识

DOI：10.1109/icctit68197.2025.11406424

摘要

The maritime distributed computing environment exhibits inherent characteristics such as high network latency, resource constraints, and significant heterogeneity in node reliability. In this context, individual vessels can be abstracted as homogeneous distributed cloud units, collectively forming a dynamic ship-based multi-cloud computing platform. To ensure the efficient execution of data-intensive maritime missions, this paper investigates the microservice scheduling problem within this platform, focusing on the critical challenge of co-optimizing task completion makespan and reliability assurance. Traditional heuristic scheduling algorithms struggle to achieve a dynamic balance among these multi-dimensional objectives in such a dynamic environment, often leading to suboptimal resource efficiency or reliability violations. To address this, we propose a reliability-constrained cooperative microservice scheduling framework based on deep reinforcement learning (DRL). First, a formal model of the maritime multi-cloud system is established, explicitly incorporating variable communication delays and node failure rates. The scheduling problem is then formulated as a Markov Decision Process (MDP), and a policy-based DRL agent is designed to learn an end-to-end dynamic scheduling policy that jointly considers execution efficiency and reliability. This agent is trained via a composite reward function, which guides it to minimize the overall scheduling makespan while strictly adhering to predefined task reliability thresholds. Extensive simulations under various scenarios demonstrate that the proposed framework outperforms several classical heuristic and meta-heuristic baseline algorithms across different load conditions. Particularly under high-load scenarios, it effectively reduces the maximum task completion time while guaranteeing that task reliability remains above the specified threshold.

求助该文献

最长约 10秒，即可获得该文献文件

Reliability-Aware Deep Reinforcement Learning Scheduling for Microservices in Maritime Distributed Multi-Cloud Environment

今日热心研友