计算机科学
启发式
稳健性(进化)
灵活性(工程)
布线(电子设计自动化)
人工智能
车辆路径问题
变化(天文学)
机器学习
数学优化
分布式计算
数学
计算机网络
生物化学
化学
统计
物理
天体物理学
基因
操作系统
作者
Guillaume Bono,Jilles Dibangoye,Olivier Simonin,Laëtitia Matignon,Florian Pereyron
标识
DOI:10.1109/tits.2020.3009289
摘要
Routing delivery vehicles to serve customers in dynamic and uncertain environments like dense city centers is a challenging task that requires robustness and flexibility. Most existing approaches to routing problems produce solutions offline in the form of plans, which only apply to the situation they have been optimized for. Instead, we propose to learn a policy that provides decision rules to build the routes from online measurements of the environment state, including the customers configuration itself. Doing so, we can generalize from past experiences and quickly provide decision rules for new instances of the problem without re-optimizing any parameters of our policy. The difficulty with this approach comes from the complexity to represent this state. In this paper, we introduce a sequential multi-agent decision-making model to formalize the description and the temporal evolution of a Dynamic and Stochastic Vehicle Routing Problem. We propose a variation of Deep Neural Network using Attention Mechanisms to learn generalizable representation of the state and output online decision rules adapted to dynamic and stochastic information. Using artificially-generated data, we show promising results in these dynamic and stochastic environments, while staying competitive in deterministic ones compared to offline classical heuristics.
科研通智能强力驱动
Strongly Powered by AbleSci AI