登录免费注册

论文
算法
数据集
经验分享
技术动态
行业动态

论文
学习
研究领域

算法
学习
研究领域

数据集
自动驾驶
图片

经验分享
学习
研究领域

技术动态
计算机视觉
自然语言处理

行业动态
教育
语音识别

》资源》算法》Dist-A3C

Dist-A3C

2020-01-10 |

|

31 |

0 |

0

0

Dist-A3C

Dist-A3C

TODO: Have server use mp - one thread for server, one for testing. Keep counter to know once finished. Also be able to send push notifications to kill running clients once counter done.

Distributed asynchronous advantage actor-critic (A3C) [1] with generalised advantage estimation (GAE) [2]. Run python server.py <options> to start the server and python client.py <options> for as many clients as wanted.

Requirements

OpenAI Gym
MessagePack
msgpack-numpy
Plotly
PyTorch
PyZMQ

To install all dependencies with Anaconda run conda env create -f environment.yml and use source activate dista3c to activate the environment.

Acknowledgements

@ikostrikov for pytorch-a3c

References

[1] Asynchronous Methods for Deep Reinforcement Learning
[2] High-Dimensional Continuous Control Using Generalized Advantage Estimation

上一篇： A3C-PyTorch

下一篇：tf-a3c-gpu

用户评价

登录
注册

全部评价

还没有评论，说两句吧！

热门资源

Keras-ResNeXt

Keras ResNeXt Implementation of ResNeXt models...
seetafaceJNI

项目介绍基于中科院seetaface2进行封装的JAVA...
spark-corenlp

This package wraps Stanford CoreNLP annotators ...
capsnet-with-caps...

CapsNet with capsule-wise convolution Project ...
inferno-boilerplate

This is a very basic boilerplate example for pe...

智能在线

400-630-6780
聆听.建议反馈

E-mail: support@tusaishared.com

关于我们
智享云简介联系我们隐私声明
服务与支持
使用帮助联系我们
快速链接
启迪智享官网
咨询电话：010-82353090

工作日早9:00-晚6:00

© 2009-2019 tusaishared.com.cn 版权所有京ICP备19018324号