计算机科学
期限(时间)
数学优化
光学(聚焦)
随机建模
运筹学
数理经济学
数学
统计
物理
量子力学
光学
作者
Jaideep J. Rao,Kiran Kumar Ravulapati,Tapas K. Das
标识
DOI:10.1080/00207720310001640755
摘要
Abstract Non-cooperative decision-making problems in a decentralized supply chain can be characterized and studied using a stochastic game model. In an earlier paper, the authors developed a methodology that uses machine learning for finding (near) optimal policies for non-zero sum stochastic games, and applied their methodology on an N-retailer and W-warehouse inventory-planning problem. The focus of this paper is on making the methodology more amenable to practical applications by making it completely simulation-based. It is also demonstrated, through numerical example problems, how this methodology can be used to find (near) equilibrium policies, and evaluate short-term rewards of stochastic games. Short-term rewards of stochastic games could be, in many instances, more critical than equilibrium rewards. To our knowledge, no methodology exists in the open literature that can capture the short-term behaviour of non-zero sum stochastic games as examined in this paper.
科研通智能强力驱动
Strongly Powered by AbleSci AI