列生成
强化学习
车辆路径问题
钢筋
栏(排版)
计算机科学
布线(电子设计自动化)
人工智能
数学优化
心理学
数学
计算机网络
社会心理学
帧(网络)
作者
Abdo Abouelrous,Laurens Bliek,Adriana F. Gabor,Yaoxin Wu,Yingqian Zhang
出处
期刊:Cornell University - arXiv
日期:2025-04-03
标识
DOI:10.48550/arxiv.2504.02383
摘要
In this paper, we address the problem of Column Generation (CG) using Reinforcement Learning (RL). Specifically, we use a RL model based on the attention-mechanism architecture to find the columns with most negative reduced cost in the Pricing Problem (PP). Unlike previous Machine Learning (ML) applications for CG, our model deploys an end-to-end mechanism as it independently solves the pricing problem without the help of any heuristic. We consider a variant of Vehicle Routing Problem (VRP) as a case study for our method. Through a set of experiments where our method is compared against a Dynamic Programming (DP)-based heuristic for solving the PP, we show that our method solves the linear relaxation up to a reasonable objective gap within 9% in significantly shorter running times, up to over 300 times faster for instances with 100 customers.
科研通智能强力驱动
Strongly Powered by AbleSci AI