强化学习
微电网
计算机科学
调度(生产过程)
人工智能
数学优化
控制(管理)
数学
作者
Yang Li,Jiankai Gao,Yuanzheng Li,Chen Chen,Sen Li,Mohammad Shahidehpour,Zhe Chen
标识
DOI:10.1109/tia.2024.3522486
摘要
To coordinate the interests of operator and users in a microgrid under complex and changeable operating conditions, this paper proposes a microgrid scheduling model considering the thermal flexibility of thermostatically controlled loads and demand response by leveraging physical informedinspired deep reinforcement learning (DRL) based bi-level programming.To overcome the non-convex limitations of karushkuhn-tucker (KKT)-based methods, a novel optimization solution method based on DRL theory is proposed to handle the bilevel programming through alternate iterations between levels.Specifically, by combining a DRL algorithm named asynchronous advantage actor-critic (A3C) and automated machine learningprioritized experience replay (AutoML-PER) strategy to improve the generalization performance of A3C to address the above problems, an improved A3C algorithm, called AutoML-PER-A3C, is designed to solve the upper-level problem; while the DOCPLEX optimizer is adopted to address the lower-level problem.In this solution process, AutoML is used to automatically optimize hyperparameters and PER improves learning efficiency and quality by extracting the most valuable samples.The test results demonstrate that the presented approach manages to reconcile the interests between multiple stakeholders in MG by fully exploiting various flexibility resources.Furthermore, in terms of economic viability and computational efficiency, the proposal vastly exceeds other advanced reinforcement learning methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI