资源论文THE INTRIGUING ROLE OF MODULE CRITICALITY INTHE GENERALIZATION OF DEEP NETWORKS

THE INTRIGUING ROLE OF MODULE CRITICALITY INTHE GENERALIZATION OF DEEP NETWORKS

2020-01-02 | |  46 |   37 |   0

Abstract

We study the phenomenon that some modules of deep neural networks (DNNs) are more critical than others. Meaning that rewinding their parameter values back to initialization, while keeping other modules fixed at the trained parameters, results in a large drop in the network’s performance. Our analysis reveals interesting properties of the loss landscape which leads us to propose a complexity measure, called module criticality, based on the shape of the valleys that connects the initial and final values of the module parameters. We formulate how generalization relates to the module criticality, and show that this measure is able to explain the superior generalization performance of some architectures over others, whereas earlier measures fail to do so.

上一篇:TOWARDS AD EEP NETWORK ARCHITECTURE FORS TRUCTURED SMOOTHNESS

下一篇:DROP EDGE :T OWARDS DEEP GRAPH CONVOLU -TIONAL NETWORKS ON NODE CLASSIFICATION

用户评价
全部评价

热门资源

  • Learning to Predi...

    Much of model-based reinforcement learning invo...

  • Stratified Strate...

    In this paper we introduce Stratified Strategy ...

  • The Variational S...

    Unlike traditional images which do not offer in...

  • A Mathematical Mo...

    Direct democracy, where each voter casts one vo...

  • Rating-Boosted La...

    The performance of a recommendation system reli...