zeroshot-imitation

资源分类

zeroshot-imitation

2020-01-09 |

36 |

0 |

zeroshot-imitation

This is the implementation for the ICLR 2018 paper Zero Shot Visual Imitation. We propose an alternative paradigm wherein an agent first explores the world without any expert supervision and then distills its experience into a goal-conditioned skill policy with a novel forward consistency loss. The key insight is the intuition that, for most tasks, reaching the goal is more important than how it is reached.

@inproceedings{pathakICLR18zeroshot,
    Author = {Pathak, Deepak and
    Mahmoudieh, Parsa and Luo, Guanghao and
    Agrawal, Pulkit and Chen, Dian and
    Shentu, Yide and Shelhamer, Evan and
    Malik, Jitendra and Efros, Alexei A. and
    Darrell, Trevor},
    Title = {Zero-Shot Visual Imitation},
    Booktitle = {ICLR},
    Year = {2018}
}

1) Installation and Usage

Requirements

git clone -b master --single-branch https://github.com/pathak22/zeroshot-imitation.gitcd zeroshot-imitation/# (1) Install requirements:sudo apt-get install python-tk
virtualenv venvsource $PWD/venv/bin/activate
pip install --upgrade pip
pip install numpy
pip install -r src/requirements.txt# (2) Install Caffe: http://caffe.berkeleyvision.org/install_apt.htmlgit clone https://github.com/BVLC/caffe.git
sudo apt-get install libprotobuf-dev libleveldb-dev libsnappy-dev libopencv-dev libhdf5-serial-dev protobuf-compiler
sudo apt-get install libatlas-base-dev
sudo apt-get install libgflags-dev libgoogle-glog-dev liblmdb-dev
sudo apt-get install --no-install-recommends libboost-all-devcd caffe/  # edit Makefile.configmake all -j
make pycaffe
make test -j
make runtest -j# Note: If you are using conda, then its easy:# $ conda install -c conda-forge caffe# $ conda install -c conda-forge opencv=3.2.0

Data setup

Data can be downloaded at google drive link. This is the same data as used in Combining Self-Supervised Learning and Imitation for Vision-Based Rope Manipulation.

You will need the rope dataset from this download.

Then, download the AlexNet weights, bvlc_alexnet.npy from here

put rope data in data/datasets/rope9
it is important to name it rope9!
put bvlc_alexnet.npy in nets/bvlc_alexnet.npy

Training

python -i train.py# fwd_consist=True to turn foward consistency loss on,# or leave it False for to just learn the inverse modelr = RopeImitator('name', fwd_consist=True)# to train baseline, turn baseline_reg=True. note that fwd_consist should be turned on as well (historical accident)r = RopeImitator('name', fwd_consist=True, baseline_reg=True)# Restore old models, if any. default of model_name is just current model namer.restore(iteration, model_name='name of old model')# trainingr.train(num_iters)

Note that the accuracies presented is not a good measure of real world performance. The purpose of forward consistency is to learn actions consistent with state transistions, which don't necessarily have to be the ground truth actions.

2) Other resources

上一篇：context-encoder

下一篇：unsupervised-video

用户评价

全部评价

还没有评论，说两句吧！

热门资源

TensorFlow-Course

This repository aims to provide simple and read...
seetafaceJNI

项目介绍基于中科院seetaface2进行封装的JAVA...
mxnet_VanillaCNN

This is a mxnet implementation of the Vanilla C...
tensorflow-sketch...

Discrlaimer: This is not an official Google pro...
vsepp_tensorflow

Improving Visual-Semantic Embeddings with Hard ...

智能在线

400-630-6780
聆听.建议反馈

E-mail: support@tusaishared.com