资源论文AC ONSTRUCTIVE PREDICTION OF THEG ENERALIZATION ERROR ACROSS SCALES

AC ONSTRUCTIVE PREDICTION OF THEG ENERALIZATION ERROR ACROSS SCALES

2020-01-02 | |  60 |   48 |   0

Abstract

The dependency of the generalization error of neural networks on model and dataset size is of critical importance both in practice and for understanding the theory of neural networks. Nevertheless, the functional form of this dependency remains elusive. In this work, we present a functional form which approximates well the generalization error in practice. Capitalizing on the successful concept of model scaling (e.g., width, depth), we are able to simultaneously construct such a form and specify the exact models which can attain it across model/data scales. Our construction follows insights obtained from observations conducted over a range of model/data scales, in various model types and datasets, in vision and language tasks. We show that the form both fits the observations well across scales, and provides accurate predictions from smallto large-scale models and data.

上一篇:ATARGET-AGNOSTIC ATTACK ON DEEP MODELS :E XPLOITING SECURITY VULNERABILITIES OFT RANSFER LEARNING

下一篇:KALEIDOSCOPE :A NE FFICIENT, LEARNABLE REPRE -SENTATION FOR ALL STRUCTURED LINEAR MAPS

用户评价
全部评价

热门资源

  • Learning to Predi...

    Much of model-based reinforcement learning invo...

  • Stratified Strate...

    In this paper we introduce Stratified Strategy ...

  • The Variational S...

    Unlike traditional images which do not offer in...

  • A Mathematical Mo...

    Direct democracy, where each voter casts one vo...

  • Rating-Boosted La...

    The performance of a recommendation system reli...