计算机科学
比例(比率)
内容(测量理论)
计算机网络
数学
量子力学
物理
数学分析
作者
Ranshu Peng,Shi Chen,Changbin Xue
标识
DOI:10.1109/lwc.2024.3443154
摘要
In this letter, we investigate the content caching problem within large-scale integrated satellite-terrestrial networks, focusing on a fusion scenario of future large-scale remote sensing constellations and communication satellite networks. Our investigation relies on deep reinforcement learning techniques aimed at minimizing the long-term average content delivery delay. To address the inherent challenge of convergence in single-agent algorithms, we propose clustering intelligent remote sensing satellites, with each cluster headed by an intelligent agent. Based on the characteristics of the model, we modify the multi-agent proximal policy optimization (MAPPO) algorithm by integrating long short-term memory (LSTM) to capture the correlation of the state information of different agents in the time domain. Simulation results show that the proposed LSTM-MAPPO outperforms the benchmarks, exhibiting faster convergence speed and lower standard deviation.
科研通智能强力驱动
Strongly Powered by AbleSci AI