计算机科学
强化学习
趋同(经济学)
符号
光谱(功能分析)
人工智能
方案(数学)
算法
理论计算机科学
机器学习
数学
算术
经济增长
量子力学
经济
数学分析
物理
作者
Hao-Hsuan Chang,Yifei Song,Thinh T. Doan,Lingjia Liu
标识
DOI:10.1109/twc.2022.3233436
摘要
Dynamic spectrum access (DSA) has been introduced as a promising technology that allows a secondary system to access the licensed spectrum of the primary system to improve spectrum utilization. In this paper, we introduce Fed-MADRL by incorporating federated learning (FL) and multi-agent deep reinforcement learning (MADRL) to design a collaborative DSA strategy. Our Fed-MADRL scheme employs FL to enable multiple users to collaboratively optimize the system goal without sharing their training data. By keeping all the training data at the user end, FL improves the communication efficiency and strengthens user data privacy. To further reduce the communication overheads, each user only shares quantized information. We provide the convergence analysis to characterize the trade-off between the communication efficiency and the system performance. In particular, we show that the introduced method converges at a rate $\mathcal {O}(1/K^{1/4})$ , where $K$ is the number of FL iterations. To the best of our knowledge, Fed-MADRL is the first work that utilizes FL in DSA networks under quantized communication. Performance evaluation results show that the introduced Fed-MADRL method outperforms the independent learning method and achieves comparable performance with the centralized MADRL method, which requires much higher communication overheads.
科研通智能强力驱动
Strongly Powered by AbleSci AI