管道(软件)
强化学习
计算机科学
人工智能
程序设计语言
作者
Zixin Huang,Ziang Wei,Junwei Zhou,Hao Fu
标识
DOI:10.1177/01423312251326630
摘要
Automatic pipeline layout optimization in complex environment is a highly challenging task. However, the traditional pipeline layout optimization method is not enough to deal with the complex changes of the environment and the requirements of multi-pipeline layout process. Therefore, this paper transforms this pipeline layout optimization problem into an MDP and proposes a pipeline layout optimization algorithm based on deep reinforcement learning. Specifically, we introduce a design method of layers in pipeline space to reduce the computational burden caused by an increase in the number of pipelines and combine with the discrete Soft Actor-Critic algorithm to improve the exploration efficiency of pipeline space, thereby to avoid falling into the local optimum of pipeline layout prematurely. Then, we verify the proposed pipeline layout optimization algorithm through simulation experiments. Experimental results show that the automatic pipeline layout optimization is significantly promoted in the complex, and various environments and the craft requirements of multi-pipeline layout are also satisfied.
科研通智能强力驱动
Strongly Powered by AbleSci AI