Online Altitude Control and Scheduling Policy for Minimizing AoI in UAV-assisted IoT Wireless Networks

计算机科学马尔可夫决策过程 Lyapunov优化调度（生产过程）强化学习计算机网络基站在线算法最优化问题上传无线信道状态信息软件部署无线网络实时计算马尔可夫过程分布式计算数学优化电信人工智能 Lyapunov重新设计李雅普诺夫指数统计数学算法混乱的程序设计语言操作系统

作者

Moataz Samir,Chadi Assi,Sanaa Sharafeddine,Ali Ghrayeb

出处

期刊：IEEE Transactions on Mobile Computing [IEEE Computer Society]
日期：2020-01-01 卷期号：: 1-1 被引量：92

标识

DOI：10.1109/tmc.2020.3042925

摘要

This article considers unmanned aerial vehicle (UAV) assisted Internet of Things (IoT) networks, where low resource IoT devices periodically sample a stochastic process and need to upload more recent information to a Base Station (BS). Among the myriad of applications, there is a need for timely delivery of data (for example, status-updates) before the data becomes outdated and loses its value. Since transmission capabilities of IoT devices are limited, it may not always be feasible to transmit over one hop transmission to the BS. To address this challenge, UAVs with virtual queues are deployed as middle layer between IoT devices and the BS to relay recent information over unreliable channels. In the absence of channel conditions, the optimal online scheduling policy is investigated as well as dynamic UAV altitude control that maintains a fresh status of information at the BS. The objective of this paper is to minimize the Expected Weighted Sum Age of Information (EWSA) for IoT devices. First, the problem is formulated as an optimization problem that is however generally hard to solve. Second, an online model free Deep Reinforcement Learning (DRL) is proposed, where the deployed UAV obtains instantaneous channel state information (CSI) in real time along with any adjustment to its deployment altitude. Third, we formulate the online problem as a Markov Decision Process (MDP) and Proximal Policy Optimization (PPO) algorithm, which is a highly stable state-of-the-art DRL algorithm, is leveraged to solve the formulated problem. Finally, extensive simulations are conducted to verify findings and comprehensive comparisons with other baseline approaches are provided to demonstrate the effectiveness of the proposed design.

求助该文献

最长约 10秒，即可获得该文献文件

Online Altitude Control and Scheduling Policy for Minimizing AoI in UAV-assisted IoT Wireless Networks

今日热心研友