Dynamic job shop scheduling based on deep reinforcement learning for multi-agent manufacturing systems

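Keywords: Computer science; Reinforcement learning; Scheduling (production processes); Testbed; Job shop; Job shop scheduling; Artificial intelligence; Distributed computing; Operations research; Industrial engineering; Machine learning; Flow shop scheduling; Engineering; Operations management; Metro train timetable; Operating system; Computer network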

Authors

Yi Zhang, Haihua Zhu, Dunbing Tang, Tong Zhou, Yong Gui

Source

Journal: Robotics and Computer-integrated Manufacturing [Elsevier BV]
Date: 2022-07-06 Volume/issue: 78: 102412-102412 Citations: 159

Identifier

DOI: 10.1016/j.rcim.2022.102412

Abstract

Personalized orders bring challenges to the production paradigm, and workshops urgently need dynamic responsiveness and the ability to adjust themselves. Traditional dispatching rules and heuristic algorithms solve production planning and control problems by making schedules. However, these methods do not work well in a changeable workshop environment that faces a large number of stochastic disturbances to orders and resources. Recently, the potential of artificial intelligence (AI) algorithms for solving the dynamic scheduling problem has attracted researchers' attention. Therefore, this paper presents a multi-agent manufacturing system based on deep reinforcement learning (DRL), which integrates a self-organization mechanism and a self-learning strategy. Firstly, each piece of manufacturing equipment in the workshop is constructed as an equipment agent with the support of an edge computing node, and an improved contract network protocol (CNP) is applied to guide the cooperation and competition among multiple agents, so as to complete personalized orders efficiently. Secondly, a multi-layer perceptron is employed to establish the decision-making module, called the AI scheduler, inside the equipment agent. According to the perceived workshop state information, the AI scheduler generates a production strategy to perform task allocation. Then, based on the collected sample trajectories of the scheduling process, the AI scheduler is periodically trained and updated through the proximal policy optimization (PPO) algorithm to improve its decision-making performance. Finally, in the multi-agent manufacturing system testbed, dynamic events such as stochastic job insertions and unpredictable machine failures are considered in the verification experiments. The experimental results show that the proposed method obtains scheduling solutions that meet various performance metrics, and that it deals with resource or task disturbances efficiently and autonomously.
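
The announce / bid / award cycle that a contract-network-style protocol relies on can be illustrated with a minimal sketch. This is not the paper's improved CNP, whose details the abstract does not give. The names `Task`, `EquipmentAgent` and `allocate` are hypothetical, and the bid is a hand-written completion-time estimate standing in for the learned AI scheduler, which in the paper is a multi-layer perceptron trained with PPO.

```python
from dataclasses import dataclass


@dataclass
class Task:
    job_id: str
    operation: str          # capability the task needs, e.g. "milling"
    processing_time: float  # nominal processing time


@dataclass
class EquipmentAgent:
    name: str
    capabilities: set
    queue_time: float = 0.0  # work already committed to this machine
    available: bool = True   # set to False to model a machine failure

    def bid(self, task):
        """Return an estimated completion time, or None to decline."""
        if not self.available or task.operation not in self.capabilities:
            return None
        # Assumption: a simple heuristic bid, not the learned policy.
        return self.queue_time + task.processing_time

    def award(self, task):
        """Accept the task and extend this machine's committed work."""
        self.queue_time += task.processing_time


def allocate(task, agents):
    """Run one announce / bid / award round; return the winner or None."""
    bids = [(agent.bid(task), agent) for agent in agents]
    valid = [(b, a) for b, a in bids if b is not None]
    if not valid:
        return None  # no agent can take the task right now
    _, winner = min(valid, key=lambda pair: pair[0])
    winner.award(task)
    return winner


agents = [
    EquipmentAgent("M1", {"milling"}, queue_time=5.0),
    EquipmentAgent("M2", {"milling", "drilling"}, queue_time=2.0),
    EquipmentAgent("M3", {"turning"}),
]

first = allocate(Task("J1", "milling", 3.0), agents)
print(first.name)  # M2 bids 5.0, M1 bids 8.0, so M2 wins

agents[1].available = False  # simulate an unexpected failure of M2
second = allocate(Task("J2", "milling", 1.0), agents)
print(second.name)  # only M1 can still bid for milling
```

Because every allocation is decided by a fresh round of bids, a failed machine simply stops bidding and a newly inserted job is just another announcement, which is why this style of negotiation suits the dynamic events named in the abstract.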
