Keywords
Mathematics, Iterated function, Uniqueness, Ode, Ordinary differential equation, Bounded function, Stochastic differential equation, Applied mathematics, Function (biology), Rate of convergence, Convergence (economics), Mathematical analysis, Differential equation, Key (lock), Evolutionary biology, Biology, Economic growth, Economics, Ecology
Authors
Anas Barakat, Pascal Bianchi
Abstract
Adam is a popular variant of stochastic gradient descent for finding a local minimizer of a function. In the constant stepsize regime, assuming that the objective function is differentiable and nonconvex, we establish the long-run convergence of the iterates to a stationary point under a stability condition. The key ingredient is the introduction of a continuous-time version of Adam, in the form of a nonautonomous ordinary differential equation (ODE). This continuous-time system is a relevant approximation of the Adam iterates, in the sense that the interpolated Adam process converges weakly toward the solution of the ODE. The existence and uniqueness of this solution are established. We further show the convergence of the solution toward the critical points of the objective function and quantify its convergence rate under a Łojasiewicz assumption. We then introduce a novel decreasing stepsize version of Adam. Under mild assumptions, the iterates are shown to be almost surely bounded and to converge almost surely to critical points of the objective function. Finally, we analyze the fluctuations of the algorithm by means of a conditional central limit theorem.
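For context, below is a minimal Python sketch of the standard Adam recursion (Kingma and Ba) with a constant stepsize, the regime analyzed first in the abstract. The names and hyperparameter values (grad_fn, alpha, beta1, beta2, eps, n_steps) are illustrative assumptions, not the paper's notation; the time-dependent bias-correction factors are what make the associated continuous-time system nonautonomous.

import numpy as np

def adam(grad_fn, x0, alpha=1e-3, beta1=0.9, beta2=0.999, eps=1e-8, n_steps=10_000):
    # Standard Adam iterates with a constant stepsize alpha.
    # grad_fn(x) returns a (possibly stochastic) gradient estimate at x.
    x = np.asarray(x0, dtype=float)
    m = np.zeros_like(x)  # first-moment estimate (EMA of gradients)
    v = np.zeros_like(x)  # second-moment estimate (EMA of squared gradients)
    for k in range(1, n_steps + 1):
        g = grad_fn(x)
        m = beta1 * m + (1.0 - beta1) * g
        v = beta2 * v + (1.0 - beta2) * g * g
        m_hat = m / (1.0 - beta1 ** k)  # bias correction: explicitly time-dependent,
        v_hat = v / (1.0 - beta2 ** k)  # hence a nonautonomous continuous-time limit
        x = x - alpha * m_hat / (np.sqrt(v_hat) + eps)
    return x

# Illustrative usage on a noisy quadratic; 2*x is the exact gradient of ||x||^2.
rng = np.random.default_rng(0)
x_min = adam(lambda x: 2.0 * x + 0.1 * rng.standard_normal(x.shape), x0=np.ones(3))

The decreasing stepsize variant mentioned in the abstract replaces the constant alpha with a vanishing sequence; a schedule such as alpha / sqrt(k) is one common choice, though the paper's own schedule and conditions may differ.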