资源算法Dist-A3C

Dist-A3C

2020-01-10 | |  31 |   0 |   0

Dist-A3C

MIT License

TODO: Have server use mp - one thread for server, one for testing. Keep counter to know once finished. Also be able to send push notifications to kill running clients once counter done.

Distributed asynchronous advantage actor-critic (A3C) [1] with generalised advantage estimation (GAE) [2]. Run python server.py <options> to start the server and python client.py <options> for as many clients as wanted.

Requirements

To install all dependencies with Anaconda run conda env create -f environment.yml and use source activate dista3c to activate the environment.

Acknowledgements

References

[1] Asynchronous Methods for Deep Reinforcement Learning
[2] High-Dimensional Continuous Control Using Generalized Advantage Estimation


上一篇: A3C-PyTorch

下一篇:tf-a3c-gpu

用户评价
全部评价

热门资源

  • Keras-ResNeXt

    Keras ResNeXt Implementation of ResNeXt models...

  • seetafaceJNI

    项目介绍 基于中科院seetaface2进行封装的JAVA...

  • spark-corenlp

    This package wraps Stanford CoreNLP annotators ...

  • capsnet-with-caps...

    CapsNet with capsule-wise convolution Project ...

  • inferno-boilerplate

    This is a very basic boilerplate example for pe...