A Differential Equation for Modeling Nesterov’s Accelerated Gradient Method: Theory and Insights

2020-01-19

Abstract

We derive a second-order ordinary differential equation (ODE) that is the limit of Nesterov’s accelerated gradient method. This ODE is approximately equivalent to Nesterov’s scheme and can therefore serve as a tool for analyzing it. We show that the continuous-time ODE allows for a better understanding of Nesterov’s scheme. As a byproduct, we obtain a family of schemes with similar convergence rates. The ODE interpretation also suggests restarting Nesterov’s scheme, which leads to an algorithm that can be rigorously proven to converge at a linear rate whenever the objective is strongly convex.
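
The abstract does not write out the scheme or the ODE. As a rough illustration, the sketch below implements the common form of Nesterov's scheme with momentum coefficient (k - 1)/(k + 2), whose small-step limit in the full paper is the ODE X'' + (3/t) X' + ∇f(X) = 0, with time t corresponding roughly to k·sqrt(s) for step size s. The function name `nesterov`, the quadratic test objective, and the step size 1/L are assumptions chosen for this example; they are not taken from this page.

```python
import numpy as np

def nesterov(grad, x0, step, iters):
    """Sketch of Nesterov's accelerated gradient scheme.

    x_k = y_{k-1} - step * grad(y_{k-1})
    y_k = x_k + (k - 1) / (k + 2) * (x_k - x_{k-1})
    Returns the array of iterates x_0, ..., x_iters.
    """
    x_prev = np.asarray(x0, dtype=float)
    y = x_prev.copy()
    history = [x_prev.copy()]
    for k in range(1, iters + 1):
        x = y - step * grad(y)                     # gradient step from y
        y = x + (k - 1) / (k + 2) * (x - x_prev)   # momentum extrapolation
        x_prev = x
        history.append(x.copy())
    return np.array(history)

# Hypothetical test problem: f(x) = 0.5 * x^T A x, minimized at x* = 0.
A = np.diag([1.0, 10.0, 100.0])
f = lambda x: 0.5 * x @ A @ x
grad = lambda x: A @ x

x0 = np.ones(3)
step = 1.0 / 100.0   # 1/L, where L = 100 is the largest eigenvalue of A
xs = nesterov(grad, x0, step, 200)
values = np.array([f(x) for x in xs])
print(values[0], values[-1])
```

For a convex objective with L-Lipschitz gradient and step size at most 1/L, this scheme is known to satisfy f(x_k) - f* <= 2 ||x_0 - x*||^2 / (s (k + 1)^2), so `values` should stay under that curve. The objective values need not decrease monotonically; that oscillation is the behavior the restarting idea in the abstract is aimed at.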