RL-Adventure-2

2019-10-12

RL-Adventure-2: Policy Gradients

PyTorch tutorial of: actor critic / proximal policy optimization / acer / ddpg / twin delayed ddpg / soft actor critic / generative adversarial imitation learning / hindsight experience replay

The deep reinforcement learning community has made several improvements to policy gradient algorithms. This tutorial presents the latest extensions in the following order:

  1. Advantage Actor Critic (A2C)

  2. High-Dimensional Continuous Control Using Generalized Advantage Estimation (GAE)

  3. Proximal Policy Optimization Algorithms (PPO)

  4. Sample Efficient Actor-Critic with Experience Replay (ACER)

  5. Continuous control with deep reinforcement learning (DDPG)

  6. Addressing Function Approximation Error in Actor-Critic Methods (TD3)

  7. Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor (SAC)

  8. Generative Adversarial Imitation Learning (GAIL)

  9. Hindsight Experience Replay (HER)
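
To give a feel for two of the entries above, here is a minimal, framework-free sketch of Generalized Advantage Estimation and the clipped surrogate term from Proximal Policy Optimization. It is not taken from the tutorial notebooks: the function names `compute_gae` and `ppo_clip_objective`, the argument layout, and the default values of `gamma`, `lam` and `clip_eps` are assumptions chosen for illustration, and plain Python lists stand in for PyTorch tensors.

```python
def compute_gae(rewards, values, dones, next_value, gamma=0.99, lam=0.95):
    """Illustrative GAE over one rollout (hypothetical helper).

    rewards, values, dones: per-step lists of equal length.
    next_value: value estimate for the state after the last step.
    Returns (advantages, returns) as lists of floats.
    """
    n = len(rewards)
    advantages = [0.0] * n
    gae = 0.0
    # Walk backwards so each step can reuse the accumulated advantage.
    for t in reversed(range(n)):
        v_next = next_value if t == n - 1 else values[t + 1]
        # mask = 0 cuts bootstrapping across an episode boundary.
        mask = 0.0 if dones[t] else 1.0
        delta = rewards[t] + gamma * v_next * mask - values[t]  # TD error
        gae = delta + gamma * lam * mask * gae
        advantages[t] = gae
    # Value targets are advantage plus the baseline value.
    returns = [a + v for a, v in zip(advantages, values)]
    return advantages, returns


def ppo_clip_objective(ratio, advantage, clip_eps=0.2):
    """Clipped surrogate for one sample (to be maximized).

    ratio: new_policy_prob / old_policy_prob for the taken action.
    """
    clipped = max(1.0 - clip_eps, min(ratio, 1.0 + clip_eps))
    return min(ratio * advantage, clipped * advantage)


if __name__ == "__main__":
    adv, ret = compute_gae(
        rewards=[1.0, 2.0, 3.0],
        values=[0.0, 0.0, 0.0],
        dones=[False, False, False],
        next_value=0.0,
        gamma=1.0,
        lam=1.0,
    )
    print(adv)  # with gamma = lam = 1 and zero values: reward-to-go
    print(ppo_clip_objective(1.5, 1.0))
```

With `lam=0` the estimator reduces to one-step TD errors, and with `lam=1` it becomes the discounted return minus the baseline, which is the bias/variance trade-off the GAE paper studies. In a real training loop the per-sample objective would be averaged over a batch and negated to form a loss.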

If you get stuck…

  • Remember you are not stuck unless you have spent more than a week on a single algorithm. It is perfectly normal if you do not have all the required knowledge of mathematics and CS.

  • Carefully go through the paper. First identify the problem the authors are solving and get a high-level idea of their approach. Then read the code, skipping the proofs for now, and only afterwards go over the mathematical details and proofs.

RL Algorithms

Deep Q Learning tutorial: DQN Adventure: from Zero to State of the Art

Awesome RL libs: rlkit @vitchyr, pytorch-a2c-ppo-acktr @ikostrikov, ACER @Kaixhin

Best RL courses

  • Berkeley deep RL link

  • Deep RL Bootcamp link

  • David Silver's course link

  • Practical RL link
