启发式
重写
计算机科学
基线(sea)
强化学习
调度(生产过程)
数学优化
趋同(经济学)
人工智能
人工神经网络
作业车间调度
平面图(考古学)
机器学习
数学
程序设计语言
地质学
操作系统
海洋学
历史
经济
考古
地铁列车时刻表
经济增长
作者
Xinyun Chen,Yuandong Tian
摘要
For problem solving, making reactive decisions based on problem description is fast but inaccurate, while search-based planning using heuristics gives better solutions but could be exponentially slow. In this paper, we propose a new approach that improves an existing solution by iteratively picking and rewriting its local components until convergence. The rewriting policy employs a neural network trained with reinforcement learning. We evaluate our approach in two domains: job scheduling and expression simplification. Compared to common effective heuristics, baseline deep models and search algorithms, our approach efficiently gives solutions with higher quality.
科研通智能强力驱动
Strongly Powered by AbleSci AI