pytorch-trpo

2019-09-10

PyTorch implementation of TRPO

This repo contains a PyTorch implementation of a Trust Region Policy Optimization agent for an environment with a discrete action space.
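For readers unfamiliar with TRPO, the following is a minimal sketch of the two quantities the algorithm balances for a discrete (categorical) policy: the importance-weighted surrogate objective and the mean KL divergence between the old and new policies. It is written in plain Python for illustration and is not the code from this repo; the function names and the `max_kl=0.01` threshold are assumptions chosen for the example.

```python
import math


def mean_kl(old_probs, new_probs):
    """Mean KL(old || new) over a batch of categorical action distributions."""
    total = 0.0
    for p_old, p_new in zip(old_probs, new_probs):
        total += sum(
            po * math.log(po / pn) for po, pn in zip(p_old, p_new) if po > 0.0
        )
    return total / len(old_probs)


def surrogate(old_probs, new_probs, actions, advantages):
    """Mean of pi_new(a|s) / pi_old(a|s) * advantage over the batch."""
    terms = [
        p_new[a] / p_old[a] * adv
        for p_old, p_new, a, adv in zip(old_probs, new_probs, actions, advantages)
    ]
    return sum(terms) / len(terms)


def accept_step(old_probs, new_probs, actions, advantages, max_kl=0.01):
    """Accept a candidate policy only if the surrogate improves and the
    mean KL stays inside the trust region (max_kl is an assumed value)."""
    baseline = surrogate(old_probs, old_probs, actions, advantages)
    candidate = surrogate(old_probs, new_probs, actions, advantages)
    return candidate > baseline and mean_kl(old_probs, new_probs) <= max_kl


# Two states, two actions; the sampled actions both had positive advantage.
old = [[0.5, 0.5], [0.5, 0.5]]
small_step = [[0.52, 0.48], [0.48, 0.52]]
large_step = [[0.9, 0.1], [0.1, 0.9]]
actions, advantages = [0, 1], [1.0, 1.0]

print(accept_step(old, small_step, actions, advantages))  # True
print(accept_step(old, large_step, actions, advantages))  # False
```

Both candidate policies raise the surrogate, but the large step moves the action probabilities too far from the old policy and is rejected by the KL check. This is the sense in which the update is confined to a trust region.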

Environment Setup

1. Install conda for Python 2.7.

2. Create and activate the conda environment, then install the pip requirements:

conda create --name trpo --file requirements/conda_requirements.txt
source activate trpo
pip install -r requirements/pip_requirements.txt

3. Install PyTorch from source at commit eff5b8b.

Usage

python run_trpo.py --env=GYM_ENV_ID

where GYM_ENV_ID is the environment ID of the gym environment you wish to train on.
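As an illustration of the `--env` flag, here is a minimal sketch of how a script such as run_trpo.py might read it with `argparse`. The actual argument handling in run_trpo.py is not shown in this README, so the `parse_args` helper, the `required=True` setting, and the example ID `Pong-v0` are assumptions made for this sketch.

```python
import argparse


def parse_args(argv=None):
    """Parse the command line; argv=None falls back to sys.argv[1:]."""
    parser = argparse.ArgumentParser(
        description="Train a TRPO agent on a gym environment."
    )
    # --env mirrors the flag in the usage line; making it required is an assumption.
    parser.add_argument(
        "--env", required=True, help="gym environment ID, e.g. Pong-v0"
    )
    return parser.parse_args(argv)


# Equivalent to running: python run_trpo.py --env=Pong-v0
args = parse_args(["--env=Pong-v0"])
print(args.env)  # Pong-v0
```

The parsed string would then typically be passed to `gym.make` to construct the environment.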

Results

Figure (trpo_pong_gif): A game of Pong played with the policy learned by the TRPO agent.

Figure (trpo_pong_png): Total reward per episode of Pong versus episode number.

Related Repos

OpenAI Baselines' implementation of parallel TRPO in TensorFlow

Ilya Kostrikov's implementation of TRPO for continuous control in PyTorch
