强化学习
计算机科学
启发式
自动化
比例(比率)
分布式计算
人工智能
深度学习
卡车
工业工程
工程类
物理
操作系统
航空航天工程
机械工程
量子力学
作者
Chi Zhang,Philip Odonkor,Shuai Zheng,Hamed Khorasgani,Susumu Serita,Chetan Gupta,Haiyan Wang
标识
DOI:10.1109/bigdata50022.2020.9378191
摘要
Dynamic dispatching is one of the core problems for operation optimization in traditional industries such as mining, as it is about how to smartly allocate the right resources to the right place at the right time. Conventionally, the industry relies on heuristics or even human intuitions which are often short-sighted and sub-optimal solutions. Leveraging the power of AI and Internet of Things (IoT), data-driven automation is reshaping this area. However, facing its own challenges such as large-scale and heterogenous trucks running in a highly dynamic environment, it can barely adopt methods developed in other domains (e.g., ride-sharing). In this paper, we propose a novel Deep Reinforcement Learning approach to solve the dynamic dispatching problem in mining. We first develop an event-based mining simulator with parameters calibrated in real mines. Then we propose an experience-sharing Deep Q Network with a novel abstract state/action representation to learn memories from heterogeneous agents altogether and realizes learning in a centralized way. We demonstrate that the proposed methods significantly outperform the most widely adopted approaches in the industry by 5.56% in terms of productivity. The proposed approach has great potential in a broader range of industries (e.g., manufacturing, logistics) which have a large-scale of heterogenous equipment working in a highly dynamic environment, as a general framework for dynamic resource allocation.
科研通智能强力驱动
Strongly Powered by AbleSci AI