资源论文AF UNCTION SPACE VIEW OF BOUNDED NORM INFI -NITE WIDTH RE LU NETS :T HE MULTIVARIATE CASE

AF UNCTION SPACE VIEW OF BOUNDED NORM INFI -NITE WIDTH RE LU NETS :T HE MULTIVARIATE CASE

2019-12-30 | |  61 |   37 |   0

Abstract
A key element of understanding the efficacy of overparameterized neural networks is characterizing how they represent functions as the number of weights in the network approaches infinity. In this paper, we characterize the norm required to realize a function f : Rd → R as a single hidden-layer ReLU network with an unbounded number of units (infinite width), but where the Euclidean norm of the weights is bounded, including precisely characterizing which functions can be realized with finite norm. This was settled for univariate functions f : R → R in Savarese et al. (2019), where it was shown that the required norm is determined by the L1 -norm of the second derivative of the function. We extend the characterization to multi-variate functions (d ≥ 2, i.e., multiple input units), relating the required norm to the L1 -norm of the Radon transform of a (d + 1)/2-power Laplacian of the function. This characterization allows us to show that all functions in Sobolev spaces W s,1 (Rd ), s ≥ d + 1, can be represented with bounded norm, to calculate the required norm for several specific functions, and to obtain a depth separation result. These results have important implications for understanding generalization performance and the distinction between neural networks and more traditional kernel learning.

上一篇:GEOMETRIC INSIGHTS INTO THE CONVERGENCE OFN ONLINEAR TD LEARNING

下一篇:VL-BERT: PRE -TRAINING OF GENERIC VISUAL -L INGUISTIC REPRESENTATIONS

用户评价
全部评价

热门资源

  • Learning to Predi...

    Much of model-based reinforcement learning invo...

  • Stratified Strate...

    In this paper we introduce Stratified Strategy ...

  • The Variational S...

    Unlike traditional images which do not offer in...

  • A Mathematical Mo...

    Direct democracy, where each voter casts one vo...

  • Rating-Boosted La...

    The performance of a recommendation system reli...