计算机科学
启发式
强化学习
水准点(测量)
车辆路径问题
启发式
编码器
布线(电子设计自动化)
数学优化
算法
人工智能
计算机网络
数学
大地测量学
操作系统
地理
作者
Abhinav Gupta,Supratim Ghosh,Anulekha Dhara
标识
DOI:10.1145/3493700.3493723
摘要
Vehicle routing problem (VRP) is a well known NP-hard combinatorial optimization problem having several variants. In this paper, we consider VRP along with additional constraints of capacity and time-windows (CVRPTW) and aim to provide a fast and approximately optimal solutions to large-scale CVRPTW problems. We present a deep Q-network with encoder-decoder based reinforcement learning approach to solve CVRPTW. The encoder is based on the attention mechanism whereas decoder is fully connected neural network. Via numerical experiments on benchmark datasets, we show the efficacy and computational speed our approach compared to baseline heuristics, a meta-heuristic algorithm, and a multi-agent reinforcement learning (RL) based framework.
科研通智能强力驱动
Strongly Powered by AbleSci AI