强化学习
计算机科学
调度(生产过程)
钢筋
实时计算
人工智能
工程类
运营管理
结构工程
作者
Shuoyao Wang,Suzhi Bi,Ying–Jun Angela Zhang
标识
DOI:10.1109/tii.2019.2950809
摘要
This article proposes a reinforcement-learning (RL) approach for optimizing charging scheduling and pricing strategies that maximize the system objective of a public electric vehicle (EV) charging station. The proposed algorithm is "online" in the sense that the charging and pricing decisions made at each time depend only on the observation of past events, and is "model-free" in the sense that the algorithm does not rely on any assumed stochastic models of uncertain events. To cope with the challenge arising from the time-varying continuous state and action spaces in the RL problem, we first show that it suffices to optimize the total charging rates to fulfill the charging requests before departure times. Then, we propose a feature-based linear function approximator for the state-value function to further enhance the efficiency and generalization ability of the proposed algorithm. Through numerical simulations with real-world data, we show that the proposed RL algorithm achieves on average 138.5% higher charging-station profit than representative benchmark algorithms.
科研通智能强力驱动
Strongly Powered by AbleSci AI