资源论文IMPLICIT BIAS OF GRADIENT DESCENT BASED AD -VERSARIAL TRAINING ON SEPARABLE DATA

IMPLICIT BIAS OF GRADIENT DESCENT BASED AD -VERSARIAL TRAINING ON SEPARABLE DATA

2020-01-02 | |  57 |   47 |   0

Abstract

Adversarial training is a principled approach for training robust neural networks. Despite of tremendous successes in practice, its theoretical properties still remain largely unexplored. In this paper, we provide new theoretical insights of gradient descent based adversarial training by studying its computational properties, specifically on its implicit bias. We take the binary classification task on linearly separable data as an illustrative example, where the loss asymptotically attains its infimum as the parameter diverges to infinity along certain directions. Specifically, we show that for any fixed iteration T , when the adversarial perturbation during training has proper bounded 图片.png-norm, the classifier learned by gradient descent based adversarial training converges in direction to the maximum 图片.png -norm margin classifier at the rate of 图片.pngsignificantly faster than the rate 图片.png(1/ log T ) e of training with clean data. In addition, when the adversarial perturbation during training has bounded 图片.png-norm with 图片.pngthe resulting classifier converges in direction to a maximum mixed-norm margin classifier, which has a natural interpretation of robustness, as being the maximum 图片.png  -norm margin classifier under worst-case 图片.png-norm perturbation to the data. Our findings provide theoretical backups for adversarial training that it indeed promotes robustness against adversarial perturbation.

上一篇:STOCHASTIC CONDITIONAL GENERATIVE NETWORKSWITH BASIS DECOMPOSITION

下一篇:DEEP DOUBLE DESCENT:W HERE BIGGER MODELS AND MORE DATA HURT

用户评价
全部评价

热门资源

  • Learning to Predi...

    Much of model-based reinforcement learning invo...

  • Stratified Strate...

    In this paper we introduce Stratified Strategy ...

  • The Variational S...

    Unlike traditional images which do not offer in...

  • A Mathematical Mo...

    Direct democracy, where each voter casts one vo...

  • Rating-Boosted La...

    The performance of a recommendation system reli...