资源算法seq2seq.pytorch

seq2seq.pytorch

2019-09-09 | |  126 |   0 |   0

Seq2Seq in PyTorch

This is a complete suite for training sequence-to-sequence models in PyTorch. It consists of several models and code to both train and infer using them.

Using this code you can train: * Neural-machine-translation (NMT) models * Language models * Image to caption generation * Skip-thought sentence representations * And more...

Models

Models currently available: * Simple Seq2Seq recurrent model * Recurrent Seq2Seq with attentional decoder * Google neural machine translation (GNMT) recurrent model * Transformer - attention-only model from "Attention Is All You Need" * ByteNet - convolution based encoder+decoder

Datasets

Datasets currently available:

  • WMT16

  • OpenSubtitles 2016

  • COCO image captions

All datasets can be tokenized using 3 available segmentation methods:

  • Character based segmentation

  • Word based segmentation

  • Byte-pair-encoding (BPE) as suggested by bpe with selectable number of tokens.

After choosing a tokenization method, a vocabulary will be generated and saved for future inference.

Training methods

The models can be trained using several methods:

  • Basic Seq2Seq - given encoded sequence, generate (decode) output sequence. Training is done with teacher-forcing.

  • Multi Seq2Seq - where several tasks (such as multiple languages) are trained simultaneously by using the data sequences as both input to the encoder and output for decoder.

  • Image2Seq - used to train image to caption generators.

Usage

Example training scripts are available in scripts folder. Inference examples are available in examples folder.

上一篇:DCSCN Super Resolution

下一篇:PyTorch-progressive_growing_of

用户评价
全部评价

热门资源

  • Keras-ResNeXt

    Keras ResNeXt Implementation of ResNeXt models...

  • seetafaceJNI

    项目介绍 基于中科院seetaface2进行封装的JAVA...

  • spark-corenlp

    This package wraps Stanford CoreNLP annotators ...

  • capsnet-with-caps...

    CapsNet with capsule-wise convolution Project ...

  • inferno-boilerplate

    This is a very basic boilerplate example for pe...