强化学习
计算机科学
嵌入
集合(抽象数据类型)
多目标优化
帕累托原理
旅行商问题
背景(考古学)
启发式
数学优化
人工智能
组合优化
布线(电子设计自动化)
机器学习
算法
数学
古生物学
计算机网络
生物
程序设计语言
作者
Zhenkun Wang,Shunyu Yao,Genghui Li,Qingfu Zhang
标识
DOI:10.1109/tcyb.2023.3312476
摘要
This article proposes utilizing a single deep reinforcement learning model to solve combinatorial multiobjective optimization problems. We use the well-known multiobjective traveling salesman problem (MOTSP) as an example. Our proposed method employs an encoder-decoder framework to learn the mapping from the MOTSP instance to its Pareto-optimal set. Specifically, it leverages a novel routing encoder to extract information for both the entire multiobjective aspect and every individual objective from the MOTSP instance. The global embeddings and each objective's embeddings are adaptively aggregated via a routing network to form the subproblems' embedding that can well represent the MOTSP features. Using a modified context embedding, the subproblems' embeddings are fed into a decoder to produce a set of approximate Pareto-optimal solutions in parallel. Additionally, we develop a Top-k baseline to enable more efficient data utilization and lightweight training for our proposed method. We compare our method with heuristic-based and learning-based ones on various types of MOTSP instances, and the experimental results show that our method can solve MOTSP instances in real-time and outperform the other algorithms, especially on large-scale problem instances.
科研通智能强力驱动
Strongly Powered by AbleSci AI