强化学习
马尔可夫决策过程
随机性
水准点(测量)
计算机科学
电动汽车
电
数学优化
过程(计算)
缩小
人工神经网络
增强学习
马尔可夫过程
人工智能
功率(物理)
工程类
数学
物理
地理
量子力学
统计
操作系统
大地测量学
电气工程
作者
Sichen Li,Weihao Hu,Di Cao,Tomislav Dragičević,Qi Huang,Zhe Chen,Frede Blaabjerg
标识
DOI:10.35833/mpce.2020.000460
摘要
A time-variable time-of-use electricity price can be used to reduce the charging costs for electric vehicle (EV) owners. Considering the uncertainty of price fluctuation and the randomness of EV owner's commuting behavior, we propose a deep reinforcement learning based method for the minimization of individual EV charging cost. The charging problem is first formulated as a Markov decision process (MDP), which has unknown transition probability. A modified long short-term memory (LSTM) neural network is used as the representation layer to extract temporal features from the electricity price signal. The deep deterministic policy gradient (DDPG) algorithm, which has continuous action spaces, is used to solve the MDP. The proposed method can automatically adjust the charging strategy according to electricity price to reduce the charging cost of the EV owner. Several other methods to solve the charging problem are also implemented and quantitatively compared with the proposed method which can reduce the charging cost up to 70.2% compared with other benchmark methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI