强化学习
计算机科学
能源消耗
还原(数学)
无人机
编码器
接头(建筑物)
控制(管理)
多智能体系统
能量(信号处理)
人工智能
高效能源利用
机制(生物学)
基线(sea)
面子(社会学概念)
机器学习
吞吐量
混合学习
编码(内存)
帧(网络)
人工神经网络
实时计算
作者
Xinlong Gong,Siye Wang,Wenbo Xu,Luoyu Gao
标识
DOI:10.1109/ccai65422.2025.11189331
摘要
This paper proposes a hybrid non-terrestrial network architecture integrating drones and satellites, combining radio frequency (RF) and free-space optical (FSO) links. All drones are centrally controlled via control links to optimize energy consumption and system throughput. To address this communication scenario, we introduce a Transformer-based multi-agent reinforcement learning (MARL) algorithm, TransComm-MARL. The encoder extracts internal representations from joint observations, while the decoder employs an autoregressive approach for sequential decision-making. A masking mechanism ensures that each agent considers only the decisions of its predecessor, thereby reducing the complexity of joint decision-making. Experiments show that TransComm-MARL outperforms baseline models, achieving a $93.57 \%$ reduction in energy consumption and a $31.77 \%$ increase in throughput after convergence.
科研通智能强力驱动
Strongly Powered by AbleSci AI