2020-02-25

Taming MAML: Efficient unbiased meta-reinforcement learning

Reference TensorFlow implementation of Taming MAML: Efficient unbiased meta-reinforcement learning. A PyTorch version will be released later.

Getting started

You can use the provided Dockerfile to build an image that includes a conda environment called tmaml. Inside the container, activate the environment with:

conda activate tmaml

Alternatively, you can use tmaml.yml to create a conda environment called tmaml:

conda env create -f tmaml.yml

Then activate the environment:

conda activate tmaml
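
Before launching an experiment, it can help to confirm that the tmaml environment is actually the active one. The following is a minimal sketch and not part of the repository: it relies on the `CONDA_DEFAULT_ENV` variable that conda sets when an environment is activated, and the helper names `active_conda_env` and `is_tmaml_active` are hypothetical.

```python
import os

EXPECTED_ENV = "tmaml"  # environment name taken from the README


def active_conda_env(environ=None):
    """Return the name of the active conda env, or None if none is active."""
    # Accept an explicit mapping so the check can be exercised without
    # touching the real process environment.
    environ = os.environ if environ is None else environ
    return environ.get("CONDA_DEFAULT_ENV")


def is_tmaml_active(environ=None):
    """True when the active conda env is the one this README creates."""
    return active_conda_env(environ) == EXPECTED_ENV


if __name__ == "__main__":
    if is_tmaml_active():
        print("conda env 'tmaml' is active")
    else:
        print("active conda env: %r (expected 'tmaml')" % active_conda_env())
```

If the check fails, run `conda activate tmaml` again in the same shell before starting the scripts.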

Usage

You can use the tmaml_run_mujoco.py, vpg_run_mujoco.py and dice_vpg_run_mujoco.py scripts to run reinforcement learning experiments with the different algorithms.

MAML:

python vpg_run_mujoco.py --env HalfCheetahRandDirecEnv

MAML + DICE:

python dice_vpg_run_mujoco.py --env HalfCheetahRandDirecEnv

TMAML:

python tmaml_run_mujoco.py --env HalfCheetahRandDirecEnv
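
To compare the three algorithms on the same environment, the commands above can be run back to back. The sketch below is a hypothetical convenience wrapper, not a script shipped with the repository: `SCRIPTS`, `build_command` and `run_all` are names introduced here for illustration, and `--env` is the only flag used because it is the only one this README shows. It defaults to a dry run that only prints the commands.

```python
import subprocess
import sys

# Algorithm -> script mapping, as listed in the README.
SCRIPTS = {
    "MAML": "vpg_run_mujoco.py",
    "MAML+DICE": "dice_vpg_run_mujoco.py",
    "TMAML": "tmaml_run_mujoco.py",
}


def build_command(algorithm, env="HalfCheetahRandDirecEnv"):
    """Return the argv list for one algorithm, using only the --env flag."""
    return [sys.executable, SCRIPTS[algorithm], "--env", env]


def run_all(env="HalfCheetahRandDirecEnv", dry_run=True):
    """Print the three commands in order and, unless dry_run, execute them."""
    commands = [build_command(name, env) for name in SCRIPTS]
    for cmd in commands:
        print(" ".join(cmd))
        if not dry_run:
            # check=True raises CalledProcessError on the first failing run.
            subprocess.run(cmd, check=True)
    return commands


if __name__ == "__main__":
    run_all(dry_run=True)
```

Call `run_all(dry_run=False)` from the repository root, with the tmaml environment active, to actually launch the runs one after another.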

References

To cite TMAML, please use:

@InProceedings{pmlr-v97-liu19g,
  title = 	 {Taming {MAML}: Efficient unbiased meta-reinforcement learning},
  author = 	 {Liu, Hao and Socher, Richard and Xiong, Caiming},
  booktitle = 	 {Proceedings of the 36th International Conference on Machine Learning},
  pages = 	 {4061--4071},
  year = 	 {2019},
  editor = 	 {Chaudhuri, Kamalika and Salakhutdinov, Ruslan},
  volume = 	 {97},
  series = 	 {Proceedings of Machine Learning Research},
  address = 	 {Long Beach, California, USA},
  month = 	 {09--15 Jun},
  publisher = 	 {PMLR},
}

TODOs

  •  Adding TMAML

  •  Adding MAML

  •  Adding DICE

  •  Benchmarking

  •  PyTorch version

Acknowledgements

This repository is based on the ProMP repository.

