pytorch-dc-tts

2019-12-30

PyTorch implementation of "Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks with Guided Attention", based partially on other projects.

Online Text-To-Speech Demo

The demo notebooks are executable on https://colab.research.google.com.

Audio samples and pretrained models are available through the notebook links.

Training/Synthesizing English Text-To-Speech

The English TTS uses the LJ-Speech dataset.

  1. Download the dataset: python dl_and_preprop_dataset.py --dataset=ljspeech

  2. Train the Text2Mel model: python train-text2mel.py --dataset=ljspeech

  3. Train the SSRN model: python train-ssrn.py --dataset=ljspeech

  4. Synthesize sentences: python synthesize.py --dataset=ljspeech

    • The WAV files are saved in the samples folder.
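The four steps above can also be chained from a single script. The following is a minimal sketch, not part of the repository: the script names and the --dataset flag come from the steps above, while `STEPS`, `build_commands` and `run_pipeline` are hypothetical helper names. It assumes the scripts sit in the current directory and that each one exits on its own when finished; if a training script runs until interrupted, the steps need to be started by hand instead.

```python
import subprocess
import sys

# Script names and the --dataset flag are taken from the steps above.
# The helper names below are hypothetical, not part of the repository.
STEPS = [
    "dl_and_preprop_dataset.py",  # 1. download and preprocess the dataset
    "train-text2mel.py",          # 2. train the Text2Mel model
    "train-ssrn.py",              # 3. train the SSRN model
    "synthesize.py",              # 4. synthesize sentences
]


def build_commands(dataset):
    """Return one argv list per step, in the order the steps should run."""
    return [[sys.executable, script, f"--dataset={dataset}"] for script in STEPS]


def run_pipeline(dataset, dry_run=True):
    """Print each command; execute it only when dry_run is False."""
    for cmd in build_commands(dataset):
        print(" ".join(cmd))
        if not dry_run:
            # check=True stops the pipeline if a step exits with an error.
            subprocess.run(cmd, check=True)


if __name__ == "__main__":
    # Dry run by default: only prints the commands for the English dataset.
    run_pipeline("ljspeech", dry_run=True)
```

Calling `run_pipeline("ljspeech", dry_run=False)` would actually launch the steps. Defaulting to a dry run keeps an accidental invocation from starting a long training job.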

Training/Synthesizing Mongolian Text-To-Speech

The Mongolian text-to-speech uses 5 hours of audio from the Mongolian Bible.

  1. Download the dataset: python dl_and_preprop_dataset.py --dataset=mbspeech

  2. Train the Text2Mel model: python train-text2mel.py --dataset=mbspeech

  3. Train the SSRN model: python train-ssrn.py --dataset=mbspeech

  4. Synthesize sentences: python synthesize.py --dataset=mbspeech

    • The WAV files are saved in the samples folder.
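Once synthesis has finished, a quick way to check the output is to list the generated files with their durations. This is a minimal sketch using only the Python standard library; `wav_durations` is a hypothetical helper, the folder name `samples` is taken from the note above, and the code assumes the files are uncompressed PCM WAV, which is the only format the `wave` module reads.

```python
import wave
from pathlib import Path


def wav_durations(folder="samples"):
    """Return {file name: duration in seconds} for every .wav file in folder."""
    durations = {}
    for path in sorted(Path(folder).glob("*.wav")):
        # Assumption: PCM-encoded WAV; other encodings raise wave.Error here.
        with wave.open(str(path), "rb") as wav:
            durations[path.name] = wav.getnframes() / wav.getframerate()
    return durations


if __name__ == "__main__":
    for name, seconds in wav_durations().items():
        print(f"{name}: {seconds:.2f} s")
```

An empty result suggests that synthesis wrote no files, or that it wrote them somewhere other than the folder passed in.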

