强化学习
计算机科学
匹配(统计)
缩小
数据收集
博弈论
物联网
人工智能
完整信息
机器学习
实时计算
计算机安全
万维网
数学
统计
数理经济学
作者
Guobin Zhang,Wei Xin,Xiao Tan,Zhu Han,Guangchi Zhang
标识
DOI:10.1109/tcomm.2025.3525566
摘要
A space-air-ground integrated network consisting of a satellite, high altitude platforms (HAPs), unmanned aerial vehicles (UAVs), and terrestrial Internet of Things (IoT) devices is constructed to collect wide-area information. The IoT devices sense the environmental information, the UAVs fly to collect data, and the HAPs deliver the computation results to the satellite. In order to improve the information freshness, the age of information (AoI) of the system is minimized by the UAV trajectory design and network configuration under the cost and practical constraints. The optimization is decomposed into two stages, which are jointly conducted by the HAPs and UAVs. In the first stage, each UAV and IoT device cluster are paired, and the UAV obtains the minimum AoI along with the optimal destined position by deep reinforcement learning (DRL). Afterwards, the HAP performs the matching between the UAVs and the IoT device clusters by the Gale-Shapley algorithm. In the second stage, the HAPs complete the configuration of the coverage area and height of the HAPs and UAVs by the soft actor-critic DRL algorithm. The extensive simulation verifies the AoI deduction of the proposed scheme and depicts the regularities of network configuration and UAV trajectory design for the minimum AoI achievement.
科研通智能强力驱动
Strongly Powered by AbleSci AI