强化学习
计算机科学
马尔可夫决策过程
调度(生产过程)
作业车间调度
地铁列车时刻表
人工智能
流水车间调度
马尔可夫过程
机器学习
工业工程
数学优化
工程类
数学
统计
操作系统
作者
Zijun Liao,Qiwen Li,Yuanzhi Dai,Zizhen Zhang
标识
DOI:10.1109/smc53654.2022.9945585
摘要
The job-shop scheduling problem (JSSP) is a classic combinatorial optimization problem in the areas of computer science and operations research. It is closely associated with many industrial scenarios. In today’s society, the demand for efficient and stable scheduling algorithms has significantly increased. More and more researchers have recently tried new methods to solve JSSP. In this paper, we effectively formulate the scheduling process of JSSP as a Semi-Markov Decision Process. We then propose a method of using hierarchical reinforcement learning with graph neural networks to solve JSSP. We also demonstrate that larger-sized instances require the support of a bigger number of sub-policies and different scheduling phases require using different sub-policies.
科研通智能强力驱动
Strongly Powered by AbleSci AI