蒙特卡罗方法
数学优化
计算机科学
生产(经济)
最小二乘函数近似
能量(信号处理)
强化学习
数学
经济
人工智能
估计员
统计
宏观经济学
作者
Bo Yang,Selvaprabu Nadarajah,Nicola Secomandi
出处
期刊:Operations Research
[Institute for Operations Research and the Management Sciences]
日期:2023-06-29
卷期号:72 (6): 2758-2775
被引量:1
标识
DOI:10.1287/opre.2018.0341
摘要
Modeling as real options the operations of energy production companies that operate in wholesale markets gives rise to a challenging Markov decision process. In “Least Squares Monte Carlo and Pathwise Optimization for Merchant Energy Production,” Yang, Nadarajah, and Secomandi study the performance of two reinforcement learning techniques that can be used to determine feasible operating policies and optimality bounds for this model, namely least squares Monte Carlo and pathwise optimization, extending the applicability of the latter method beyond optimal stopping by using principal component analysis and block coordinate descent. They find that both approaches lead to near optimal policies, but pathwise optimization outperforms least squares Monte Carlo in terms of dual bounds at the expense of more sizable computational requirements. These findings have potential relevance for managers of energy production assets that use analytics to optimize their operations and researchers interested in broadening the scope of pathwise optimization.
科研通智能强力驱动
Strongly Powered by AbleSci AI