计算机科学
强化学习
分布式计算
动态优先级调度
作业车间调度
调度(生产过程)
公平份额计划
马尔可夫决策过程
两级调度
单调速率调度
云计算
微服务
可靠性(半导体)
循环调度
启发式
最早截止时间优先安排
任务分析
马尔可夫过程
固定优先级先发制人调度
实时计算
动态规划
学习效果
作者
Hao Qiu,Chen Ma,Jingqi Wu,Ning Li
标识
DOI:10.1109/icctit68197.2025.11406424
摘要
The maritime distributed computing environment exhibits inherent characteristics such as high network latency, resource constraints, and significant heterogeneity in node reliability. In this context, individual vessels can be abstracted as homogeneous distributed cloud units, collectively forming a dynamic ship-based multi-cloud computing platform. To ensure the efficient execution of data-intensive maritime missions, this paper investigates the microservice scheduling problem within this platform, focusing on the critical challenge of co-optimizing task completion makespan and reliability assurance. Traditional heuristic scheduling algorithms struggle to achieve a dynamic balance among these multi-dimensional objectives in such a dynamic environment, often leading to suboptimal resource efficiency or reliability violations. To address this, we propose a reliability-constrained cooperative microservice scheduling framework based on deep reinforcement learning (DRL). First, a formal model of the maritime multi-cloud system is established, explicitly incorporating variable communication delays and node failure rates. The scheduling problem is then formulated as a Markov Decision Process (MDP), and a policy-based DRL agent is designed to learn an end-to-end dynamic scheduling policy that jointly considers execution efficiency and reliability. This agent is trained via a composite reward function, which guides it to minimize the overall scheduling makespan while strictly adhering to predefined task reliability thresholds. Extensive simulations under various scenarios demonstrate that the proposed framework outperforms several classical heuristic and meta-heuristic baseline algorithms across different load conditions. Particularly under high-load scenarios, it effectively reduces the maximum task completion time while guaranteeing that task reliability remains above the specified threshold.
科研通智能强力驱动
Strongly Powered by AbleSci AI