Loss Surfaces

2019-09-19

## AN EMPIRICAL ANALYSIS OF DEEP NETWORK LOSS SURFACES WITH MXNET

An MXNet implementation of "An Empirical Analysis of Deep Network Loss Surfaces".

Important: This code does not yet reproduce the visualization results of the original paper exactly.

### Usage

  1. To train a ResNet-32 network on CIFAR-10 with Adam, run the following command:

$ python train_cifar10.py --model-prefix models/cifar10/resnet32_adam --network resnet --num-layers 32 --optimizer adam --model-period 100 --params models/cifar10/resnet32_sgd-0000.params --gpus 0

  2. To compute the train/val error curve (50 points) between two different models:

$ python linear_interpolate_model.py --net-json models/cifar10/resnet_sgd-symbol.json --params1 models/cifar10/resnet_sgd-0300.params --params2 models/cifar10/resnet_adam-0300.params --alpha-num 50 --batch-size 16 --gpus 0

  3. To compute the train/val error mesh (alpha num = 50, beta num = 50) among three different models using barycentric interpolation:

$ python barycentric_interpolate_model.py --net-json models/cifar10/resnet_sgd-symbol.json --params1 models/cifar10/resnet_rmsprop-0300.params --params2 models/cifar10/resnet_adam-0300.params --params3 models/cifar10/resnet_sgd-0300.params --alpha-num 50 --beta-num 50 --gpus 0

  4. To compute the train/val error mesh (alpha num = 50, beta num = 50) among four different models using bilinear interpolation:

$ python bilinear_interpolate_model.py --net-json models/cifar10/resnet_sgd-symbol.json --params1 models/cifar10/resnet_adam-0000.params --params2 models/cifar10/resnet_sgd-0300.params --params3 models/cifar10/resnet_adam-0300.params --params4 models/cifar10/resnet_rmsprop-0300.params --alpha-num 50 --beta-num 50 --gpus 0

  5. To plot the train/val error curve between two models, use visualize_interpolate_2d.ipynb.

  6. To plot the train/val error surface among three or four models, use visualize_interpolate_3d.ipynb.
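The three interpolation schemes used by the scripts above can be illustrated with a small NumPy sketch. This is not the repository's code. It assumes each model is a plain dict mapping parameter names to arrays of matching shape, and the helper names (`combine`, `linear_points`, `barycentric_weights`, `bilinear_weights`) and the exact way alpha and beta map to weights are hypothetical choices made for illustration. Evaluating the train/val error at each interpolated point is left out.

```python
import numpy as np


def combine(param_sets, weights):
    """Return the weighted sum of parameter dicts that share keys and shapes."""
    names = param_sets[0].keys()
    return {
        name: sum(w * params[name] for w, params in zip(weights, param_sets))
        for name in names
    }


def linear_points(p1, p2, alpha_num):
    """Yield (alpha, params) along the segment from p1 (alpha=0) to p2 (alpha=1)."""
    for alpha in np.linspace(0.0, 1.0, alpha_num):
        yield alpha, combine([p1, p2], [1.0 - alpha, alpha])


def barycentric_weights(alpha, beta):
    """Weights for three models; the point is inside the triangle when alpha + beta <= 1."""
    return (1.0 - alpha - beta, alpha, beta)


def bilinear_weights(alpha, beta):
    """Weights for four models placed at the corners of the unit square."""
    return (
        (1.0 - alpha) * (1.0 - beta),
        alpha * (1.0 - beta),
        (1.0 - alpha) * beta,
        alpha * beta,
    )


# Toy stand-ins for trained weights (assumption: real runs would load .params files).
rng = np.random.default_rng(0)
models = [{"fc_weight": rng.normal(size=(2, 3))} for _ in range(4)]

# Five points on the line between two models.
curve = list(linear_points(models[0], models[1], alpha_num=5))

# Centre of the triangle spanned by three models.
mid_triangle = combine(models[:3], barycentric_weights(1 / 3, 1 / 3))

# Centre of the square spanned by four models.
mid_square = combine(models, bilinear_weights(0.5, 0.5))
```

In all three cases the weights sum to one, so every interpolated point is an affine combination of the trained models. The error curve or mesh is then obtained by evaluating the network at each such point.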

