资源算法dc_tts-transfer-learning

dc_tts-transfer-learning

2019-12-30 | |  27 |   0 |   0

dc_tts-transfer-learning

This repo contains attempts to apply transfer learning to the dc_tts text-to-speech model decribed in the paper Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks with Guided Attention. The code used is a modified version of Kyubyong's dc_tts code. The pretrained model was also provided in Kyubong's repo. It was pretrained on the LJ Speech Dataset. Scarlett Johansson's voice was trained during transfer learning


Transfer Learning is accomplished by selecting the model layers to train in hyperparameters.py


Task List:

  • add selectable list of layers for transfer learning

  • prelim model training

  • add scoring history plots

  • detailed exploration of which layers to train

  • explore data augmentation methods

  • explore post-processing

Prelim Model Training

  • ~6 hrs of training on Tesla V100 GPU

  • Layers trained:

    • SSRN(C_13, C_14, C_15, C_16)

    • Text2Mel/TextEnc(HC_11, HC_12, HC_13, HC_14, HC_15)

    • Text2Mel/AudioEnc(HC_9, HC_10, HC_11, HC_12, HC_13)

    • Text2Mel/AudioDec(HC_7, C_8, C_9, C_10, C_11)

Transfer learning data source:

Scarlett Johansson's audio book

Model Generated Examples (parodies of famous quotes from A.I. in movies):

references:


上一篇:pytorch-dc-tts

下一篇:tts_dc_test

用户评价
全部评价

热门资源

  • Keras-ResNeXt

    Keras ResNeXt Implementation of ResNeXt models...

  • seetafaceJNI

    项目介绍 基于中科院seetaface2进行封装的JAVA...

  • spark-corenlp

    This package wraps Stanford CoreNLP annotators ...

  • capsnet-with-caps...

    CapsNet with capsule-wise convolution Project ...

  • inferno-boilerplate

    This is a very basic boilerplate example for pe...