Contextual Information Improves OOV Detection in Speech

Carolina Parada,Mark Dredze,Denis Filimonov,Frederick Jelinek

Contextual Information Improves OOV Detection in Speech

2010

Carolina Parada
Mark Dredze
Denis Filimonov
Frederick Jelinek

Out-of-vocabulary (OOV) words represent an important source of error in large vocabulary continuous speech recognition (LVCSR) systems. These words cause recognition failures, which propagate through pipeline systems impacting the performance of downstream applications. The detection of OOV regions in the output of a LVCSR system is typically addressed as a binary classification task, where each region is independently classified using local information. In this paper, we show that jointly predicting OOV regions, and including contextual information from each region, leads to substantial improvement in OOV detection. Compared to the state-of-the-art, we reduce the missed OOV rate from 42.6% to 28.4% at 10% false alarm rate.

Keywords:

Machine learning
Constant false alarm rate
Speech recognition
Binary classification
Artificial intelligence
Computer science
Vocabulary
Pattern recognition
Natural language processing
contextual information

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations