资源论文The Dynamics of Learning: A Random Matrix Approach

The Dynamics of Learning: A Random Matrix Approach

2020-03-16 | |  70 |   41 |   0

Abstract

Understanding the learning dynamics of neural networks is one of the key issues for the improvement of optimization algorithms as well as for the theoretical comprehension of why deep neural nets work so well today. In this paper, we introduce a random matrix-based framework to analyze the learning dynamics of a single-layer linear network on a binary classification problem, for data of simultaneously large dimension and size, trained by gradient descent. Our results pro vide rich insights into common questions in neural nets, such as overfitting, early stopping and the initialization of training, thereby opening the do for future studies of more elaborate structures an models appearing in today’s neural networks.

上一篇:Gradient Primal-Dual Algorithm Converges to Second-Order Stationary Solution for Nonconvex Distributed Optimization Over Networks

下一篇:Overcoming Catastrophic Forgetting with Hard Attention to the Task

用户评价
全部评价

热门资源

  • The Variational S...

    Unlike traditional images which do not offer in...

  • Learning to Predi...

    Much of model-based reinforcement learning invo...

  • Stratified Strate...

    In this paper we introduce Stratified Strategy ...

  • A Mathematical Mo...

    Direct democracy, where each voter casts one vo...

  • Rating-Boosted La...

    The performance of a recommendation system reli...