A reinforcement learning-based hyper-heuristic for AGV task assignment and route planning in parts-to-picker warehouses

强化学习 任务(项目管理) 启发式 计算机科学 钢筋 线路规划 人工智能 运输工程 工程类 系统工程 结构工程
作者
Kunpeng Li,Tengbo Liu,P.N. Ram Kumar,Xuefang Han
出处
期刊:Transportation Research Part E-logistics and Transportation Review [Elsevier BV]
卷期号:185: 103518-103518 被引量:45
标识
DOI:10.1016/j.tre.2024.103518
摘要

Globally, e-commerce warehouses have begun implementing robotic mobile fulfillment systems (RMFS), which can improve order-picking efficiency by using automated guided vehicles (AGVs) to realize operations from parts to pickers. AGVs depart from their initial points, move to a target rack position, and subsequently transport racks to picking stations. The AGVs return the racks to their original positions after the workers pick them up. When all tasks are completed, the AGVs return to their starting point. In this context, the main challenge is the task assignment and route planning of multiple AGVs to minimize travel times. We formulate a mixed-integer linear programming (MILP) model with valid inequalities to solve small problem instances optimally. We introduce a reinforcement learning (RL)-based hyper-heuristic (HH) framework to solve large instances to near-optimality. A typical HH framework comprises two levels: high-level heuristics (HLH) and low-level heuristics (LLH). The framework starts from an initial solution and improves iteratively through LLHs, while the HLH invokes a selection strategy and an acceptance criterion to generate a new solution. We propose a novel selection strategy based on the improved Multi-Armed Bandits algorithm called Co-SLMAB and Exponential Monte Carlo with counters (EMCQ) as the acceptance criterion. The corresponding collision avoidance rules are then formulated for different conflicts to construct a conflict-free traveling route for AGVs. Besides testing the proposed framework's effectiveness in real-life warehouse layouts, we perform extensive computational experiments and a thorough sensitivity analysis. The results show that (i) the proposed valid inequalities aid in obtaining better lower bounds and significantly speed up the solution process; (ii) the Co-SLMAB-HH framework is quite competitive compared to CPLEX, outperforming the other tested hyper-heuristics and the problem-specific heuristic regarding convergence and computation time; and (iii) a pool of LLHs consisting of a wide range of different operators is advantageous over a limited set of simple operators while solving problems using hyper-heuristics.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
Orange应助duang采纳,获得10
刚刚
共享精神应助ming采纳,获得10
2秒前
3秒前
我是老大应助sunaijia采纳,获得10
3秒前
4秒前
4秒前
动听剑心发布了新的文献求助10
4秒前
DAWN完成签到,获得积分10
4秒前
5秒前
rmajly完成签到,获得积分10
6秒前
6秒前
景易完成签到,获得积分10
7秒前
华仔应助hao采纳,获得10
7秒前
8秒前
Maiqi919发布了新的文献求助10
8秒前
qqqyy发布了新的文献求助10
8秒前
killy发布了新的文献求助10
9秒前
9秒前
Jasper应助duang采纳,获得10
10秒前
12秒前
阿瑠发布了新的文献求助10
14秒前
14秒前
14秒前
Lucas应助心灵美的大叔采纳,获得10
16秒前
哆哆给哆哆的求助进行了留言
16秒前
16秒前
17秒前
kai完成签到,获得积分10
17秒前
姜菡完成签到 ,获得积分10
18秒前
18秒前
老猪佩奇发布了新的文献求助10
19秒前
研友_LpQRrn完成签到 ,获得积分10
19秒前
19秒前
19秒前
orixero应助Qing采纳,获得10
19秒前
青年才俊发布了新的文献求助10
20秒前
21秒前
Antraliel完成签到,获得积分10
21秒前
22秒前
lithion发布了新的文献求助10
22秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
The Organometallic Chemistry of the Transition Metals 800
Chemistry and Physics of Carbon Volume 18 800
The Organometallic Chemistry of the Transition Metals 800
The formation of Australian attitudes towards China, 1918-1941 640
Signals, Systems, and Signal Processing 610
全相对论原子结构与含时波包动力学的理论研究--清华大学 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6440179
求助须知:如何正确求助?哪些是违规求助? 8253986
关于积分的说明 17569044
捐赠科研通 5498308
什么是DOI,文献DOI怎么找? 2899634
邀请新用户注册赠送积分活动 1876393
关于科研通互助平台的介绍 1716828