资源算法TRPO-GAE

TRPO-GAE

2020-01-16 | |  34 |   0 |   0

TRPO with GAE

Tensorflow implementation of TRPO(Trust Region Policy Optimization) with GAE(Generalized Advantage Estimator) on mujoco

Reference


上一篇:TRPO-TensorFlow

下一篇:netflix-categories

用户评价
全部评价

热门资源

  • Keras-ResNeXt

    Keras ResNeXt Implementation of ResNeXt models...

  • seetafaceJNI

    项目介绍 基于中科院seetaface2进行封装的JAVA...

  • spark-corenlp

    This package wraps Stanford CoreNLP annotators ...

  • capsnet-with-caps...

    CapsNet with capsule-wise convolution Project ...

  • inferno-boilerplate

    This is a very basic boilerplate example for pe...