资源论文Algorithms and matching lower bounds for approximately-convex optimization

Algorithms and matching lower bounds for approximately-convex optimization

2020-02-05 | |  51 |   44 |   0

Abstract 

In recent years, a rapidly increasing number of applications in practice requires optimizing non-convex objectives, like training neural networks, learning graphical models, maximum likelihood estimation. Though simple heuristics such as gradient descent with very few modifications tend to work well, theoretical understanding is very weak. We consider possibly the most natural class of non-convex functions where one could hope to obtain provable guarantees: functions that are “approximately convex”, i.e. functions image.png for which there exists a convex function f such that for all x, image.png for a fixed value image.pngWe then want to minimize image.png i.e. output a point image.png such that image.png It is quite natural to conjecture that for fixed image.png the problem gets harder for larger image.png however, the exact dependency of image.png and image.png is not known. In this paper, we significantly improve the known lower bound on image.png as a function of image.png and an algorithm matching this lower bound for a natural class of convex bodies. More precisely, we identify a function T :image.png such that when image.png we can give an algorithm that outputs a point image.png such that image.png within time poly image.png On the other hand, when image.png we also prove an information theoretic lower bound that any algorithm that outputs such a image.png must use super polynomial number of evaluations of image.png

上一篇:Using Social Dynamics to Make Individual Predictions: Variational Inference with a Stochastic Kinetic Model

下一篇:MetaGrad: Multiple Learning Rates in Online Learning

用户评价
全部评价

热门资源

  • Learning to Predi...

    Much of model-based reinforcement learning invo...

  • Stratified Strate...

    In this paper we introduce Stratified Strategy ...

  • The Variational S...

    Unlike traditional images which do not offer in...

  • A Mathematical Mo...

    Direct democracy, where each voter casts one vo...

  • Rating-Boosted La...

    The performance of a recommendation system reli...