期刊:2019 34th International Technical Conference on Circuits/Systems, Computers and Communications (ITC-CSCC)日期:2019-06-01被引量:10
标识
DOI:10.1109/itc-cscc.2019.8793294
摘要
We studied wireless power transfer (WPT) system where unmanned aerial vehicle (UAV) broadcasts power to energy receivers (ERs) on the ground to solve the fairness problem. Since the design of the optimal UAV trajectory based on the location information of all ERs is not practical for UAV and requires high complexity, we apply Q-learning among reinforcement learning techniques to design the suboptimal trajectory with lower complexity. We confirmed that the proposed reinforcement learning-based trajectory design approaches the outer bound of the achievable region of two ERs.