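Computer science
Bottleneck
Distributed computing
Asynchronous communication
Enhanced Data Rates for GSM Evolution (EDGE)
Edge device
Overhead (engineering)
Reinforcement learning
Cloud computing
Computer network
Artificial intelligence
Embedded system
Operating system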
Authors
Qiong Wu, Xu Chen, Tao Ouyang, Zhi Zhou, Xiaoxi Zhang, Shusen Yang, Junshan Zhang
Identifier
DOI: 10.1109/tpds.2023.3238049
Abstract
Federated learning (FL) is a promising paradigm that enables collaboratively learning a shared model across massive numbers of clients while keeping the training data local. However, in many existing FL systems, clients need to frequently exchange large model parameters directly with the remote cloud server via wide-area networks (WAN), leading to significant communication overhead and long transmission times. To mitigate the communication bottleneck, we resort to the hierarchical federated learning paradigm of HiFL, which reaps the benefits of mobile edge computing and combines synchronous client-edge model aggregation with asynchronous edge-cloud model aggregation to greatly reduce the traffic volume of WAN transmissions. Specifically, we first analyze the convergence bound of HiFL theoretically and identify the key controllable factors for model performance improvement. We then advocate an enhanced design, HiFlash, which integrates deep reinforcement learning based adaptive staleness control and a heterogeneity-aware client-edge association strategy to boost system efficiency and mitigate the staleness effect without compromising model accuracy. Extensive experiments corroborate the superior performance of HiFlash in model accuracy, communication reduction, and system efficiency.
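The two aggregation tiers named in the abstract can be illustrated with a minimal Python sketch. It is not the paper's algorithm: the function names, the sample-weighted average at the edge, and the staleness-discounted mixing rule at the cloud (`base_mix / (1 + staleness)`) are assumptions chosen for illustration, and the sketch does not model the reinforcement-learning staleness control or the client-edge association strategy.

```python
from typing import List, Sequence

Vector = List[float]


def edge_aggregate(client_models: Sequence[Vector],
                   sample_counts: Sequence[int]) -> Vector:
    """Synchronous client-edge step (assumed FedAvg-style weighting).

    The edge node waits for all of its clients, then averages their
    parameters weighted by each client's number of training samples.
    """
    total = sum(sample_counts)
    dim = len(client_models[0])
    return [
        sum(model[i] * n for model, n in zip(client_models, sample_counts)) / total
        for i in range(dim)
    ]


def cloud_async_update(global_model: Vector,
                       edge_model: Vector,
                       staleness: int,
                       base_mix: float = 0.5) -> Vector:
    """Asynchronous edge-cloud step (hypothetical mixing rule).

    The cloud merges one edge model as soon as it arrives. The mixing
    weight shrinks as the edge model gets staler, i.e. the more global
    updates have happened since that edge last pulled the global model.
    """
    alpha = base_mix / (1 + staleness)
    return [(1 - alpha) * g + alpha * e for g, e in zip(global_model, edge_model)]


# Two clients under one edge node, holding 1 and 3 samples respectively.
edge_model = edge_aggregate([[1.0, 2.0], [3.0, 4.0]], [1, 3])

# The same edge model merged when fresh (staleness 0) and when one round late.
fresh = cloud_async_update([0.0, 0.0], edge_model, staleness=0)
stale = cloud_async_update([0.0, 0.0], edge_model, staleness=1)
```

In this sketch only the edge-to-cloud call crosses the WAN, and a stale edge model moves the global model less than a fresh one, which is the intuition behind controlling staleness.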