Exploring End-to-End Techniques for Low-Resource Speech Recognition

Vladimir Bataev,Maxim Korenevsky,Ivan Medennikov,Alexander Zatvornitskiy

Exploring End-to-End Techniques for Low-Resource Speech Recognition

2018

Vladimir Bataev
Maxim Korenevsky
Ivan Medennikov
Alexander Zatvornitskiy

In this work we present simple grapheme-based system for low-resource speech recognition using Babel data for Turkish spontaneous speech (80 h). We have investigated different neural network architectures performance, including fully-convolutional, recurrent and ResNet with GRU. Different features and normalization techniques are compared as well. We also proposed CTC-loss modification using segmentation during training, which leads to improvement while decoding with small beam size.

Keywords:

Decoding methods
residual neural network
Normalization (statistics)
Segmentation
Turkish
End-to-end principle
Artificial neural network
Speech recognition
Computer science
low resource
spontaneous speech

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations