
How to train an LSTM for speech recognition

Original 2016-11-25 21:12:43 5 1 tensorflow/ speech-recognition/ keras/ speech-to-text/ lstm

I'm trying to train an LSTM model for speech recognition, but I don't know what training data and target data to use. I'm using the LibriSpeech dataset, which contains both audio files and their transcripts. At this point, I know the target data will be the vectorized transcript text. For the training data, I was thinking of using the frequencies and times from each audio file (or MFCC features). If that is the correct way to approach the problem, the training data for the audio will be multiple arrays. How would I input those arrays into my LSTM model? Will I have to vectorize them?

Thanks!

1 answer

To prepare the speech dataset for feeding into the LSTM model, see the post "Building Speech Dataset for LSTM binary classification", in particular its Data Preparation section.
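As a rough illustration of the "multiple arrays" part of the question: recurrent layers in libraries such as Keras generally expect one 3-D array of shape (utterances, frames, features), so variable-length per-utterance feature matrices are commonly zero-padded to a shared length. The sketch below is not taken from the linked post. It uses random arrays as stand-ins for real MFCC features, and the helper name `pad_feature_sequences` and the choice of 13 coefficients per frame are assumptions made only for this example.

```python
import numpy as np


def pad_feature_sequences(sequences, pad_value=0.0):
    """Stack variable-length (frames, n_features) arrays into one 3-D batch.

    Hypothetical helper for illustration. Returns the padded batch and the
    original frame count of each sequence.
    """
    n_features = sequences[0].shape[1]
    lengths = np.array([seq.shape[0] for seq in sequences])
    max_frames = int(lengths.max())

    # Start from an array filled with the padding value, then copy each
    # utterance into the leading rows of its slot.
    batch = np.full((len(sequences), max_frames, n_features),
                    pad_value, dtype=np.float32)
    for i, seq in enumerate(sequences):
        batch[i, :seq.shape[0], :] = seq
    return batch, lengths


# Stand-in data: three "utterances" with 50, 80 and 65 frames of
# 13 features each (assumed sizes; real MFCCs would come from the audio).
rng = np.random.default_rng(0)
utterances = [rng.standard_normal((frames, 13)) for frames in (50, 80, 65)]

batch, lengths = pad_feature_sequences(utterances)
print(batch.shape)  # (3, 80, 13)
print(lengths)      # [50 80 65]
```

The returned `lengths` array records where the real frames end, which is the information a masking step would need so that the padded frames do not influence training.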

For a good example, see http://danielhnyk.cz/predicting-sequences-vectors-keras-using-rnn-lstm/ , which explains how to predict a sequence of vectors in Keras using an RNN (LSTM).

I believe you will also find this post very helpful: https://stats.stackexchange.com/questions/192014/how-to-implement-a-lstm-based-classifier-to-classify-speech-files-using-keras
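On the target side, the question mentions vectorizing the transcript text. One simple way to do that, sketched here under assumptions, is to map each character to an integer index and pad the resulting sequences to a common length. The alphabet, the reserved padding index 0, and the helper names `encode_transcript` and `pad_targets` are all hypothetical choices for this example and do not come from the linked posts.

```python
import string

# Assumed character vocabulary: space, apostrophe and lower-case letters.
# Index 0 is reserved for padding, so real characters start at 1.
ALPHABET = " '" + string.ascii_lowercase
CHAR_TO_INDEX = {ch: i + 1 for i, ch in enumerate(ALPHABET)}


def encode_transcript(text):
    """Map a transcript to a list of integer character indices.

    Characters outside the assumed alphabet are silently dropped.
    """
    return [CHAR_TO_INDEX[ch] for ch in text.lower() if ch in CHAR_TO_INDEX]


def pad_targets(encoded, pad_index=0):
    """Right-pad every encoded transcript to the longest one."""
    max_len = max(len(seq) for seq in encoded)
    return [seq + [pad_index] * (max_len - len(seq)) for seq in encoded]


# Made-up example transcripts, not taken from the dataset.
transcripts = ["HELLO WORLD", "HI"]
encoded = [encode_transcript(t) for t in transcripts]
targets = pad_targets(encoded)

print(encoded[1])  # [10, 11]
print(len(targets[0]), len(targets[1]))  # 11 11
```

Character-level indices are only one option. Word-level or phoneme-level targets would follow the same pattern with a different vocabulary.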

How to mask paddings in LSTM model for speech emotion recognition

How to train HMM with audio senteces dataset for speech recognition?

How to train different LSTM on the same tensorflow session?

How to Train LSTM Using Multiple Datasets?

How to train a Keras LSTM with a multidimensional input?

How to train LSTM with single label per “batch”

How to train a LSTM with a multivarible input of different lengths?

How to reshape X_train and y_train for LSTM keras

Speech Recognition - how to split a sentence into words?

Tensor flow LSTM for speech recognition slows down when training each subsequent word




solution1 15 ACCEPTED 2016-11-26 00:18:13
