斯塔克伯格竞赛
强化学习
计算机科学
服务器
马尔可夫决策过程
激励
GSM演进的增强数据速率
人工智能
任务(项目管理)
机构设计
博弈论
数学优化
资源配置
付款
马尔可夫过程
微观经济学
工程类
经济
数学
计算机网络
统计
系统工程
万维网
作者
Nan Zhao,Yiyang Pei,Ying‐Chang Liang,Dusit Niyato
出处
期刊:IEEE Transactions on Vehicular Technology
[Institute of Electrical and Electronics Engineers]
日期:2023-05-19
卷期号:72 (10): 13530-13535
被引量:11
标识
DOI:10.1109/tvt.2023.3276898
摘要
Federated edge learning (FEL) is capable of training large-scale machine learning models without exposing the raw data of edge devices (EDs). Considering that the learning performance heavily depends on the active participation of EDs, it is essential to motivate the resource-limited EDs to contribute their efforts to learning tasks. In this paper, a learning-based multi-task FEL mechanism is proposed to design the economic incentive and participation contribution strategy jointly. Specifically, the incentive-based interaction between the edge servers and EDs is formulated as a multi-leader multi-follower Stackelberg game. Then, the theoretical analysis is provided to prove the existence and uniqueness of the Stackelberg equilibrium. To obtain the equilibrium solution under the incomplete information, a Markov decision process is formulated for the two-stage Stackelberg game. Considering the high dimensionality of the continuous action space, a multi-agent double actors deep deterministic policy gradient algorithm is employed to achieve the optimal training-ratio of EDs and the payment policies of edge servers. Numerical results validate the effectiveness and efficiency of our proposed incentive mechanism.
科研通智能强力驱动
Strongly Powered by AbleSci AI