kaldi-tf

2020-04-07 |

|

43 |

0 |

0

kaldi-tf

`kaldi-tf`

A set of scripts for getting data out of Kaldi and into TensorFlow.

Pipeline

Step	Code Location
1) Generate Kaldi phoneme-level alignments (`*.ali`) via GMMs	Kaldi source
2) Generate Kaldi nnet3 neural net example files (`egs.*.ark`) from alignments	Kaldi source
3) Convert binary Nnet3Egs ark files to text ark files via `nnet3-copy-egs.cc`	Kaldi source
4) Convert text ark file to csv via `egs-to-csv.py`	this repo
5) Convert csv to tfrecords via `csv-to-tfrecords.py`	this repo
6) Read tfrecords, train, and evaluate with `train_and_eval.py`	this repo

Modifying Kaldi egs

Unrelated to TensorFlow, but if you want to open Kaldi egs, make changes, and use those modified egs in training, follow this guide:

convert egs.ark to text: $ nnet3-copy-egs ark:egs.1.ark ark,t:egs.1.ark.txt
make your changes to new ark text file
convert ark text file back to binary with new scp file: $ nnet3-copy-egs ark,t:egs.1.ark.txt ark,scp:egs.1.ark,egs.scp
make changes to scp file paths, because they change depending on where you run the nnet3-copy-egs script!

上一篇：kaldi_tutorial

下一篇：kaldi-unsupervised

用户评价

全部评价

还没有评论，说两句吧！

热门资源

TensorFlow-Course

This repository aims to provide simple and read...
seetafaceJNI

项目介绍基于中科院seetaface2进行封装的JAVA...
mxnet_VanillaCNN

This is a mxnet implementation of the Vanilla C...
DuReader_QANet_BiDAF

Machine Reading Comprehension on DuReader Usin...
Klukshu-Sockeye-...

KLUKSHU SOCKEYE PROJECTS 2016 This repositor...

智能在线

400-630-6780
聆听.建议反馈

E-mail: support@tusaishared.com