资源论文Recovery Guarantees for One-hidden-layer Neural Networks*

Recovery Guarantees for One-hidden-layer Neural Networks*

2020-03-10 | |  119 |   107 |   0

Abstract

In this paper, we consider regression problems with one-hidden-layer neural networks (1NNs). We distill some properties of activation functions that lead to local strong convexity in the neighborhood of the ground-truth parameters for the 1NN squared-loss objective and most popular nonlinear activation functions satisfy the dis tilled properties, including rectified linear unit (ReLUs), leaky ReLUs, squared ReLUs and sigmoids. For activation functions that are also smooth, we show local linear convergence guarantees of gradient descent under a resampling rule. For homogeneous activations, we show tensor methods are able to initialize the parameters to fall into the local strong convexity region. As a result, tensor initialization followed by gradie descent is guaranteed to recover the ground truth with sample complexity d · log(1/ε) · poly(k,λ) and computational complexity n · d · poly(k,λ) for smooth homogeneous activations with high probability, where d is the dimension of the input, k (k 图片.png d) is the number of hidden nodes, λ is a conditioning property of the ground-truth parameter matrix between the input layer and the hidden layer, ε is the targeted precision and n is the number of samples. To the best of our knowledge, this is the first work that provides recovery guarantees for 1NNs with both sample complexity and computational complexity linear in the input dimension and logarithmic in the precision.

上一篇:An Efficient, Sparsity-Preserving, Online Algorithm for Low-Rank Approximation

下一篇:Learning Infinite Layer Networks Without the Kernel Trick

用户评价
全部评价

热门资源

  • Deep Cross-media ...

    Cross-media retrieval is a research hotspot in ...

  • Regularizing RNNs...

    Recently, caption generation with an encoder-de...

  • The Variational S...

    Unlike traditional images which do not offer in...

  • Visual Reinforcem...

    For an autonomous agent to fulfill a wide range...

  • Joint Pose and Ex...

    Facial expression recognition (FER) is a challe...