Job-shop scheduling
Computer science
Job shop
Dynamic priority scheduling
Scheduling (production processes)
Markov decision process
Mathematical optimization
Generalizability
Reinforcement learning
Operations research
Flow-shop scheduling
Distributed computing
Artificial intelligence
Industrial engineering
Metro train timetable
Markov process
Engineering
Psychotherapist
Operating systems
Statistics
Mathematics
Psychology
Authors
Yuxin Li, Wenjin Gu, Minghai Yuan, Tang Ya-ming
Identifier
DOI:10.1016/j.rcim.2021.102283
Abstract
With the extensive application of automated guided vehicles in manufacturing systems, production scheduling under limited transportation resources has become a difficult problem. At the same time, real manufacturing systems are prone to various disturbance events, which increase the complexity and uncertainty of the shop floor. To this end, this paper addresses the dynamic flexible job shop scheduling problem with insufficient transportation resources (DFJSP-ITR) to minimize the makespan and total energy consumption. As a sequential decision-making problem, DFJSP-ITR can be modeled as a Markov decision process in which the agent determines the scheduling object and the allocation of resources at each decision point, so this paper adopts deep reinforcement learning to solve it. First, a multiobjective optimization model of DFJSP-ITR is established. Then, so that the agent learns to choose an appropriate rule based on the production state at each decision point, a hybrid deep Q network (HDQN) is developed for this problem, which combines the deep Q network with three extensions. Moreover, a shop-floor state model is established, and the decision points, generic state features, genetic-programming-based action space, and reward function are designed. On this basis, the training method using HDQN and the strategy for handling new job insertions and machine breakdowns are proposed. Finally, comprehensive experiments are conducted, and the results show that HDQN has superiority and generality compared with current optimization-based approaches and can effectively deal with disturbance events and unseen situations through learning.
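The abstract describes an agent that, at each decision point, observes shop-floor state features and selects a dispatching rule via a Q network. The following is a minimal sketch of that decision loop, not the paper's HDQN: it uses a simple linear Q-function with an epsilon-greedy policy and a semi-gradient TD update in place of a deep network with the three extensions, and the rule names, feature size, and reward proxy are illustrative assumptions.

```python
import random
import numpy as np

# Hypothetical dispatching rules the agent chooses among at each decision
# point (the paper derives its action space via genetic programming).
RULES = ["SPT", "LPT", "FIFO", "MOST_REMAINING_WORK"]

class RuleSelectionAgent:
    """DQN-style rule selector, simplified to a linear Q-approximation."""

    def __init__(self, n_features, n_actions, lr=0.01, gamma=0.95, eps=0.1):
        self.W = np.zeros((n_actions, n_features))  # one weight row per rule
        self.lr, self.gamma, self.eps = lr, gamma, eps

    def q_values(self, state):
        # Q(s, a) for every action a, given state feature vector s.
        return self.W @ state

    def select_action(self, state):
        if random.random() < self.eps:               # explore
            return random.randrange(len(self.W))
        return int(np.argmax(self.q_values(state)))  # exploit

    def update(self, s, a, r, s_next, done):
        # Semi-gradient TD(0) update toward the bootstrapped target.
        target = r if done else r + self.gamma * np.max(self.q_values(s_next))
        td_error = target - self.q_values(s)[a]
        self.W[a] += self.lr * td_error * s

# Toy interaction loop with random state transitions standing in for the
# shop-floor simulation; reward is a stand-in for makespan/energy feedback.
random.seed(0)
np.random.seed(0)
agent = RuleSelectionAgent(n_features=4, n_actions=len(RULES))
state = np.random.rand(4)
for step in range(100):
    action = agent.select_action(state)
    next_state = np.random.rand(4)
    reward = -next_state.sum()  # lower "cost" features -> higher reward
    agent.update(state, action, reward, next_state, done=(step == 99))
    state = next_state

print("chosen rule:", RULES[agent.select_action(state)])
```

In the paper's setting the linear map would be replaced by the HDQN, and disturbance events (new job insertions, machine breakdowns) would trigger additional decision points in the loop.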