Limitations of Lazy Training of Two-layers Neural Networks

2020-02-25

Abstract

We study the supervised learning problem under either of the following two models: (1) feature vectors x_i are d-dimensional Gaussians and responses are y_i = f_*(x_i) for f_* an unknown quadratic function; (2) feature vectors x_i are distributed as a mixture of two d-dimensional centered Gaussians, and the y_i's are the corresponding class labels. We use two-layer neural networks with quadratic activations and compare three learning regimes: the random features (RF) regime, in which we train only the second-layer weights; the neural tangent (NT) regime, in which we train a linearization of the neural network around its initialization; and the fully trained neural network (NN) regime, in which we train all the weights in the network. We prove that, even for the simple quadratic model of point (1), there is a potentially unbounded gap between the prediction risks achieved in these three training regimes when the number of neurons is smaller than the ambient dimension. When the number of neurons is larger than the number of dimensions, the problem is significantly easier, and both NT and NN learning achieve zero risk.
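To make the RF and NT regimes concrete, here is a minimal NumPy sketch under model (1). It is an illustration, not the paper's experimental setup. The assumptions are: the RF class is a constant plus a linear combination of the features <w_i, x>^2 with fixed random first-layer weights w_i; the NT class is a constant plus a linear combination of the gradient features <w_i, x> x_j obtained by linearizing a quadratic-activation network in its first-layer weights; and both are fitted by ordinary least squares on a finite sample. The function names, the dimensions, the sample sizes, and the 1/sqrt(d) scaling of the weights are all hypothetical choices. The fully trained NN regime is not simulated.

```python
import numpy as np


def rf_features(X, W):
    """Random-features map: an intercept plus one feature <w_i, x>^2 per neuron."""
    Z = X @ W.T  # shape (n, N): pre-activations <w_i, x>
    return np.hstack([np.ones((X.shape[0], 1)), Z ** 2])


def nt_features(X, W):
    """Neural-tangent map: an intercept plus features <w_i, x> * x_j for all i, j."""
    Z = X @ W.T  # shape (n, N)
    prods = Z[:, :, None] * X[:, None, :]  # shape (n, N, d)
    return np.hstack([np.ones((X.shape[0], 1)), prods.reshape(X.shape[0], -1)])


def normalized_test_risk(feature_map, W, X_tr, y_tr, X_te, y_te):
    """Fit by least squares on the training set, return test MSE / Var(y_test)."""
    coef, *_ = np.linalg.lstsq(feature_map(X_tr, W), y_tr, rcond=None)
    resid = feature_map(X_te, W) @ coef - y_te
    return float(np.mean(resid ** 2) / np.var(y_te))


def run_experiment(d=10, n_train=2000, n_test=1000, seed=0):
    """Compare RF and NT risks for N = d // 2 and N = d neurons."""
    rng = np.random.default_rng(seed)

    # Unknown quadratic target f_*(x) = x^T B x with a random symmetric B.
    A = rng.standard_normal((d, d))
    B = (A + A.T) / 2

    def f_star(X):
        return np.einsum("ni,ij,nj->n", X, B, X)

    # Gaussian feature vectors, as in model (1).
    X_tr = rng.standard_normal((n_train, d))
    X_te = rng.standard_normal((n_test, d))
    y_tr, y_te = f_star(X_tr), f_star(X_te)

    results = {}
    for N in (d // 2, d):
        # Fixed random first-layer weights (the "initialization").
        W = rng.standard_normal((N, d)) / np.sqrt(d)
        results[N] = {
            "RF": normalized_test_risk(rf_features, W, X_tr, y_tr, X_te, y_te),
            "NT": normalized_test_risk(nt_features, W, X_tr, y_tr, X_te, y_te),
        }
    return results


if __name__ == "__main__":
    for N, risks in run_experiment().items():
        print(f"N={N}: RF risk={risks['RF']:.3g}, NT risk={risks['NT']:.3g}")
```

In this sketch the NT features with N = d span every quadratic form in x, so the NT fit is exact up to numerical error, while the RF fit keeps a nonzero risk. With N < d neither class contains a generic quadratic target, which is the regime where the abstract reports a gap between the three regimes.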
