资源算法TRPO

TRPO

2019-09-18 | |  69 |   0 |   0

RL_toolbox

all the algorithm is running on pycharm IDE, or the package loss error may exist.

implemented algorithm: trpo a3c

  • a3c:for continous action space, use multi processes, but saving model has not been implemented.

  • trpo:for continous and discrete action space

run

  • a3c:run a3c/a3c_continous.py in pycharm IDE

  • trpo:run experiment/trpo_continous.py in pycharm IDE

contain some useful reinforcement learning algorithm and relative tool


上一篇:chainer-gogh

下一篇:pytorch_TDNN

用户评价
全部评价

热门资源

  • Keras-ResNeXt

    Keras ResNeXt Implementation of ResNeXt models...

  • seetafaceJNI

    项目介绍 基于中科院seetaface2进行封装的JAVA...

  • spark-corenlp

    This package wraps Stanford CoreNLP annotators ...

  • capsnet-with-caps...

    CapsNet with capsule-wise convolution Project ...

  • inferno-boilerplate

    This is a very basic boilerplate example for pe...