诺玛
强化学习
电信线路
计算机科学
资源配置
资源(消歧)
分布式计算
计算机网络
人工智能
作者
Yuanyan Huang,Jingjing Su,Xuan Lu,Shoulin Huang,Hongyan Zhu,Haiyong Zeng
出处
期刊:Entropy
[Multidisciplinary Digital Publishing Institute]
日期:2025-07-29
卷期号:27 (8): 811-811
被引量:2
摘要
This paper studies deep reinforcement learning (DRL)-based joint resource allocation and three-dimensional (3D) trajectory optimization for unmanned aerial vehicle (UAV)-ground access point (GAP) cooperative non-orthogonal multiple access (NOMA) communication in Industrial Internet of Things (IIoT) systems. Cooperative and non-cooperative users adopt different signal transmission strategies to meet diverse, task-oriented, quality-of-service requirements. Specifically, the DRL framework based on the Soft Actor-Critic algorithm is proposed to jointly optimize user scheduling, power allocation, and UAV trajectory in continuous action spaces. Closed-form power allocation and maximum weight bipartite matching are integrated to enable efficient user pairing and resource management. Simulation results show that the proposed scheme significantly enhances system performance in terms of throughput, spectral efficiency, and interference management, while enabling robustness against channel uncertainties in dynamic IIoT environments. The findings indicate that combining model-free reinforcement learning with conventional optimization provides a viable solution for adaptive resource management in dynamic UAV-GAP cooperative communication scenarios.
科研通智能强力驱动
Strongly Powered by AbleSci AI