资源论文FUNCTIONAL VS .PARAMETRIC EQUIVALENCEOF RE LU NETWORKS

FUNCTIONAL VS .PARAMETRIC EQUIVALENCEOF RE LU NETWORKS

2020-01-02 | |  56 |   35 |   0

Abstract

F UNCTIONAL VS . PARAMETRIC EQUIVALENCEOF R E LU NETWORKS Anonymous authors Paper under double-blind review We address the following question: How redundant is the parameterisation of ReLU networks? Specifically, we consider transformations of the weight space which leave the function implemented by the network intact. Two such transformations are known for feedforward architectures: permutation of neurons within a layer, and positive scaling of all incoming weights of a neuron coupled with inverse scaling of its outgoing weights. In this work, we show for architectures with non-increasing widths that permutation and scaling are in fact the only function-preserving weight transformations. For any eligible architecture we give an explicit construction of a neural network such that any other network that implements the same function can be obtained from the original one by the application of permutations and rescaling. The proof relies on a geometric understanding of boundaries between linear regions of ReLU networks, and we hope the developed mathematical tools are of independent interest.

上一篇:SMOOTHNESS AND STABILITY IN GAN S

下一篇:GRAPH CONVOLUTIONAL REINFORCEMENT LEARN -ING

用户评价
全部评价

热门资源

  • Learning to Predi...

    Much of model-based reinforcement learning invo...

  • Stratified Strate...

    In this paper we introduce Stratified Strategy ...

  • The Variational S...

    Unlike traditional images which do not offer in...

  • A Mathematical Mo...

    Direct democracy, where each voter casts one vo...

  • Rating-Boosted La...

    The performance of a recommendation system reli...