Lattice-based discriminative training for large vocabulary speech recognition

Valtcho Valtchev,Julian J. Odell,Philip C. Woodland,Steve J. Young

Lattice-based discriminative training for large vocabulary speech recognition

1996

Valtcho Valtchev
Julian J. Odell
Philip C. Woodland
Steve J. Young

This paper describes a framework for optimising the parameters of a continuous density HMM-based large vocabulary recognition system using a maximum mutual information estimation (MMIE) criterion. To limit the computational complexity arising from the need to find confusable speech segments in the large search space of alternative utterance hypotheses, word lattices generated from the training data are used. Experiments are presented on the Wall Street journal database using up to 66 hours of training data. These show that lattices combined with an improved estimation algorithm makes MMIE training practicable even for very complex recognition systems and large training sets. Furthermore, experimental results show that MMIE training can yield useful increases in recognition accuracy.

Keywords:

Computational complexity theory
Vocabulary
Discriminative model
Speech recognition
Utterance
Machine learning
Mutual information
Pattern recognition
Speech segmentation
Hidden Markov model
Artificial intelligence
Computer science
Context model
Encoding (memory)

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations