A constrained multi-item EOQ inventory model for reusable items: Reinforcement learning-based differential evolution and particle swarm optimization

粒子群优化经济订货量计算机科学强化学习差异进化数学优化库存控制元启发式重新使用人工智能运筹学算法数学供应链生物生态学法学政治学

作者

Ali Fallahi,Erfan Amani Bani,Seyed Taghi Akhavan Niaki

出处

期刊：Expert Systems With Applications [Elsevier]
日期：2022-07-01 卷期号：207: 118018-118018 被引量：50

标识

DOI：10.1016/j.eswa.2022.118018

摘要

The growing environmental concerns, governmental regulations, and significant cost savings are the primary motivations for companies to consider the reuse and recovery of products in their inventory system. The previous research ignored several realistic features of reusable items inventory systems, such as the presence of multiple products and operational constraints. For the first time, this paper presents a new multiproduct economic order quantity inventory model for an inventory system of reusable products. The goal of the model is to determine the optimal replenishment quantity and reuse quantity of each item so that the system's total cost is minimized. Several operational constraints are considered to provide a more realistic framework, such as the total available budget, warehouse space, and holding cost. Due to the nonlinearity of the presented model, differential evolution (DE) and particle swarm optimization (PSO) algorithms are utilized as two solution approaches. However, these algorithms' performance is highly dependent on their control parameters. Therefore, for the first time, two new variants of these algorithms, called DEQL and PSOQL, are presented in this study, where the control parameters of algorithms are not pre-determined. A powerful reinforcement learning algorithm, Q-learning, adapts these values intelligently. In other words, as a research contribution, this research aims at introducing a new variant of hybrid the DE and PSO algorithms in which a machine learning algorithm controls the value of metaheuristic parameters. The other parameters of the proposed algorithms are tunned employing the Taguchi method. Extensive numerical examples are established in different sizes, and the outputs are discussed in terms of several criteria. Statistical analysis of the results is performed, demonstrating that the proposed reinforcement learning-based parameter adaption has significantly improved algorithms' performance.

求助该文献

最长约 10秒，即可获得该文献文件

A constrained multi-item EOQ inventory model for reusable items: Reinforcement learning-based differential evolution and particle swarm optimization

今日热心研友