MS-Tacotron2

2020-04-02

Tacotron2-based multi-speaker text-to-speech

Reference

Model

Speaker embedding layers are added in three places; the two listed here are:

  • as the initial state of the text encoder GRU

  • as a speaker code added after the text encoder RNN
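
The two injection points above can be sketched in PyTorch as follows. This is only an illustrative sketch, not the repository's code: the class name, layer sizes, the tanh projection, and the choice of concatenation for the "after the RNN" step are all assumptions.

```python
import torch
import torch.nn as nn


class SpeakerConditionedEncoder(nn.Module):
    """Hypothetical text encoder conditioned on a speaker ID in two places."""

    def __init__(self, n_symbols=50, n_speakers=80, text_dim=64,
                 hidden_dim=64, speaker_dim=16):
        super().__init__()
        self.text_embedding = nn.Embedding(n_symbols, text_dim)
        self.speaker_embedding = nn.Embedding(n_speakers, speaker_dim)
        # Projects the speaker code to the size of the GRU hidden state.
        self.init_state = nn.Linear(speaker_dim, hidden_dim)
        self.gru = nn.GRU(text_dim, hidden_dim, batch_first=True)

    def forward(self, text_ids, speaker_ids):
        speaker_code = self.speaker_embedding(speaker_ids)  # (B, speaker_dim)
        # Place 1: the speaker code sets the GRU initial state,
        # which must have shape (num_layers, B, hidden_dim).
        h0 = torch.tanh(self.init_state(speaker_code)).unsqueeze(0)
        outputs, _ = self.gru(self.text_embedding(text_ids), h0)  # (B, T, hidden_dim)
        # Place 2: the speaker code is attached to every time step after the RNN
        # (concatenation is an assumption; summation would also be possible).
        tiled = speaker_code.unsqueeze(1).expand(-1, outputs.size(1), -1)
        return torch.cat([outputs, tiled], dim=-1)  # (B, T, hidden_dim + speaker_dim)


# Usage with made-up inputs: a batch of 2 sequences of 7 symbol IDs.
encoder = SpeakerConditionedEncoder()
text_ids = torch.randint(0, 50, (2, 7))
speaker_ids = torch.tensor([69, 70])
encoded = encoder(text_ids, speaker_ids)
print(encoded.shape)
```

Setting the initial state lets the speaker identity influence the whole recurrent pass, while attaching the code after the RNN keeps it directly visible to the attention and decoder stages.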

Data preparation

data: dictionaries for the different languages
dataset: where the training and test data are stored

Train step

python -m multiproc train.py -o outdir15/ -l logdir -c ./outdir5/tacotron2_statedict.pt --warm_start

Experiments

E1 (Jan-19)

location: 115 outdir14
data:

  • mixed indirect and direct speech under one speaker code + 39 female speakers from cusent

  • storytelling sentences share the same speaker id

E2 (Jan-24)

location: 114 outdir15
data:

  • mixed indirect and direct speech with one speaker code + 39 female speakers from cusent

  • indirect speech: spk id 69; direct speech: spk id 70
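
The speaker ID scheme used in E2 can be written down as a small lookup. This is a hedged sketch: only the IDs 69 and 70 come from the notes above, while the function name, the example path, and the `path|text|speaker_id` line layout are hypothetical.

```python
# Speaker IDs taken from experiment E2; everything else is illustrative.
STORYTELLING_SPEAKER_IDS = {"indirect": 69, "direct": 70}


def filelist_line(wav_path, text, speech_type):
    """Build one 'path|text|speaker_id' line (assumed filelist layout)."""
    if speech_type not in STORYTELLING_SPEAKER_IDS:
        raise ValueError(f"unknown speech type: {speech_type!r}")
    if "|" in text:
        raise ValueError("text must not contain the '|' separator")
    return f"{wav_path}|{text}|{STORYTELLING_SPEAKER_IDS[speech_type]}"


# Hypothetical usage: tag one direct-speech utterance.
print(filelist_line("wavs/story_0001.wav", "hello there", "direct"))
```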

Install

  • if apex fails with a segmentation fault: conda install -c psi4 gcc-5

TO DO LIST

  • tweak the input format: use a pandas DataFrame instead of filelists
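
One possible shape for this change is sketched below, assuming a `|`-separated filelist with three columns. The column names, the helper `load_filelist`, and the sample rows are assumptions made for illustration, not the project's actual format.

```python
import csv
import io

import pandas as pd

COLUMNS = ["wav_path", "text", "speaker_id"]  # assumed column layout


def load_filelist(source):
    """Read a '|'-separated filelist (path or file-like object) into a DataFrame."""
    return pd.read_csv(
        source,
        sep="|",
        header=None,
        names=COLUMNS,
        quoting=csv.QUOTE_NONE,  # keep quote characters in transcripts as-is
        dtype={"wav_path": str, "text": str, "speaker_id": int},
    )


# Hypothetical two-line filelist, inlined so the example runs without files.
example = io.StringIO(
    "wavs/0001.wav|first sentence|69\n"
    "wavs/0002.wav|second sentence|70\n"
)
df = load_filelist(example)
# Select all utterances of one speaker.
print(df[df.speaker_id == 70].wav_path.tolist())
```

With the data in a DataFrame, filtering by speaker or splitting into training and test sets becomes a one-line operation instead of custom filelist parsing.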

