资源论文SSRGD: Simple Stochastic Recursive Gradient Descent for Escaping Saddle Points

SSRGD: Simple Stochastic Recursive Gradient Descent for Escaping Saddle Points

2020-02-19 | |  71 |   50 |   0

Abstract

We analyze stochastic gradient algorithms for optimizing nonconvex problems. In particular, our goal is to find local minima (second-order stationary points) instead of just finding first-order stationary points which may be some bad unstable saddle points. We show that a simple perturbed version of stochastic recursive gradient descent algorithm (called SSRGD) can find an 图片.png -second-order stationary point with 图片.png stochastic gradient complexity for nonconvex finite-sum problems.  As a by-product, SSRGD finds an 图片.png-first-order stationary point with 图片.png stochastic gradients. These  results are almost optimal since Fang et al. [11] provided a lower bound 图片.png for finding even just an -first-order stationary point. We emphasize that SSRGD algorithm for finding second-order stationary points is as simple as for finding first-order stationary points just by adding a uniform perturbation sometimes, while all other algorithms for finding second-order stationary points with similar gradient complexity need to combine with a negative-curvature search subroutine (e.g., Neon2 [4]). Moreover, the simple SSRGD algorithm gets a simpler analysis. Besides, we also extend our results from nonconvex finite-sum problems to nonconvex online (expectation) problems, and prove the corresponding convergence results.

上一篇:RSN: Randomized Subspace Newton

下一篇:Stochastic Shared Embeddings: Data-driven Regularization of Embedding Layers

用户评价
全部评价

热门资源

  • The Variational S...

    Unlike traditional images which do not offer in...

  • Stratified Strate...

    In this paper we introduce Stratified Strategy ...

  • Learning to learn...

    The move from hand-designed features to learned...

  • A Mathematical Mo...

    Direct democracy, where each voter casts one vo...

  • Learning to Predi...

    Much of model-based reinforcement learning invo...