计算机科学
马尔可夫决策过程
Lyapunov优化
调度(生产过程)
强化学习
计算机网络
基站
在线算法
最优化问题
上传
无线
信道状态信息
软件部署
无线网络
实时计算
马尔可夫过程
分布式计算
数学优化
电信
人工智能
Lyapunov重新设计
李雅普诺夫指数
统计
数学
算法
混乱的
程序设计语言
操作系统
作者
Moataz Samir,Chadi Assi,Sanaa Sharafeddine,Ali Ghrayeb
标识
DOI:10.1109/tmc.2020.3042925
摘要
This article considers unmanned aerial vehicle (UAV) assisted Internet of Things (IoT) networks, where low resource IoT devices periodically sample a stochastic process and need to upload more recent information to a Base Station (BS). Among the myriad of applications, there is a need for timely delivery of data (for example, status-updates) before the data becomes outdated and loses its value. Since transmission capabilities of IoT devices are limited, it may not always be feasible to transmit over one hop transmission to the BS. To address this challenge, UAVs with virtual queues are deployed as middle layer between IoT devices and the BS to relay recent information over unreliable channels. In the absence of channel conditions, the optimal online scheduling policy is investigated as well as dynamic UAV altitude control that maintains a fresh status of information at the BS. The objective of this paper is to minimize the Expected Weighted Sum Age of Information (EWSA) for IoT devices. First, the problem is formulated as an optimization problem that is however generally hard to solve. Second, an online model free Deep Reinforcement Learning (DRL) is proposed, where the deployed UAV obtains instantaneous channel state information (CSI) in real time along with any adjustment to its deployment altitude. Third, we formulate the online problem as a Markov Decision Process (MDP) and Proximal Policy Optimization (PPO) algorithm, which is a highly stable state-of-the-art DRL algorithm, is leveraged to solve the formulated problem. Finally, extensive simulations are conducted to verify findings and comprehensive comparisons with other baseline approaches are provided to demonstrate the effectiveness of the proposed design.
科研通智能强力驱动
Strongly Powered by AbleSci AI