默契串通
强化学习
投标
计算机科学
电力市场
纳什均衡
完整信息
共谋
电
数学优化
趋同(经济学)
人工智能
数学
数理经济学
经济
工程类
经济增长
电气工程
微观经济学
作者
Yanchang Liang,Chunlin Guo,Zhaohao Ding,Huichun Hua
标识
DOI:10.1109/tpwrs.2020.2999536
摘要
Game theoretic methods and simulations based on reinforcement learning (RL) are often used to analyze electricity market equilibrium. However, the former is limited to a simple market environment with complete information, and difficult to visually reflect the tacit collusion; while the conventional RL algorithm is limited to low-dimensional discrete state and action spaces, and the convergence is unstable. To address the aforementioned problems, this paper adopts deep deterministic policy gradient (DDPG) algorithm to model the bidding strategies of generation companies (GenCos). Simulation experiments, including different settings of GenCo, load and network, demonstrate that the proposed method is more accurate than conventional RL algorithm, and can converge to the Nash equilibrium of complete information even in the incomplete information environment. Moreover, the proposed method can intuitively reflect the different tacit collusion level by quantitatively adjusting GenCos' patience parameter, which can be an effective means to analyze market strategies.
科研通智能强力驱动
Strongly Powered by AbleSci AI