EXTRA: An Exact First-Order Algorithm for Decentralized Consensus Optimization

数学迭代函数凸函数数学优化共识订单（交换）正多边形收敛速度最优化问题趋同（经济学）拓扑（电路）算法多智能体系统计算机科学组合数学频道（广播）数学分析几何学人工智能经济经济增长计算机网络财务

作者

Wei Shi,Qing Ling,Gang Wu,Wotao Yin

出处

期刊：Siam Journal on Optimization [Society for Industrial and Applied Mathematics]
日期：2015-01-01 卷期号：25 (2): 944-966 被引量：1181

链接

arxiv.org arxiv.org arxiv.org arxiv.org arxiv.orgdoi.org

标识

DOI：10.1137/14096668x

摘要

Recently, there has been growing interest in solving consensus optimization problems in a multiagent network. In this paper, we develop a decentralized algorithm for the consensus optimization problem $\mathrm{minimize}_{x\in\mathbb{R}^p}~\bar{f}(x)=\frac{1}{n}\sum_{i=1}^n f_i(x),$ which is defined over a connected network of $n$ agents, where each function $f_i$ is held privately by agent $i$ and encodes the agent's data and objective. All the agents shall collaboratively find the minimizer while each agent can only communicate with its neighbors. Such a computation scheme avoids a data fusion center or long-distance communication and offers better load balance to the network. This paper proposes a novel decentralized exact first-order algorithm (abbreviated as EXTRA) to solve the consensus optimization problem. “Exact” means that it can converge to the exact solution. EXTRA uses a fixed, large step size, which can be determined independently of the network size or topology. The local variable of every agent $i$ converges uniformly and consensually to an exact minimizer of $\bar{f}$. In contrast, the well-known decentralized gradient descent (DGD) method must use diminishing step sizes in order to converge to an exact minimizer. EXTRA and DGD have the same choice of mixing matrices and similar per-iteration complexity. EXTRA, however, uses the gradients of the last two iterates, unlike DGD which uses just that of the last iterate. EXTRA has the best known convergence rates among the existing synchronized first-order decentralized algorithms for minimizing convex Lipschitz--differentiable functions. Specifically, if the $f_i$'s are convex and have Lipschitz continuous gradients, EXTRA has an ergodic convergence rate $O(\frac{1}{k})$ in terms of the first-order optimality residual. In addition, as long as $\bar{f}$ is (restricted) strongly convex (not all individual $f_i$'s need to be so), EXTRA converges to an optimal solution at a linear rate $O(C^{-k})$ for some constant $C>1$.

求助该文献

最长约 10秒，即可获得该文献文件

EXTRA: An Exact First-Order Algorithm for Decentralized Consensus Optimization

今日热心研友