Language model adaptation using Random Forests

Anoop Deoras, Frederick Jelinek, Yi Su

2010

In this paper we investigate random-forest-based language model adaptation. Large amounts of out-of-domain data are used to grow the decision trees, while very small amounts of in-domain data are used to prune them back, so that the structure of the trees is suited to the target domain while the probabilities in the tree nodes are reliably estimated. Extensive experiments are carried out, and results are reported on the task of adapting a Broadcast News language model to the MIT computer science lecture domain. We show absolute word error rate (WER) improvements of 0.80% and 0.60% over language model interpolation and count merging, respectively.
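
For orientation, the two baselines named in the abstract can be sketched on toy data. This is a minimal illustration under stated assumptions, not the authors' implementation: it uses unsmoothed unigram models, and the function names, the interpolation weight `lam`, the in-domain count scale `beta`, and the toy sentences are all hypothetical. Real systems would apply the same ideas to smoothed n-gram models with weights tuned on held-out data.

```python
from collections import Counter


def mle(counts):
    """Maximum-likelihood unigram probabilities from a word-count table."""
    total = sum(counts.values())
    return {w: c / total for w, c in counts.items()}


def interpolate(p_in, p_out, lam):
    """Linear interpolation: lam * P_in(w) + (1 - lam) * P_out(w)."""
    vocab = set(p_in) | set(p_out)
    return {w: lam * p_in.get(w, 0.0) + (1 - lam) * p_out.get(w, 0.0)
            for w in vocab}


def count_merge(c_in, c_out, beta):
    """Count merging: scale in-domain counts by beta, add them to the
    out-of-domain counts, then renormalize."""
    merged = Counter()
    for w, c in c_out.items():
        merged[w] += c
    for w, c in c_in.items():
        merged[w] += beta * c
    return mle(merged)


# Toy corpora (hypothetical): large out-of-domain vs. small in-domain data.
out_counts = Counter("the news anchor read the news".split())
in_counts = Counter("the lecture covered the algorithm".split())

p_interp = interpolate(mle(in_counts), mle(out_counts), lam=0.5)
p_merge = count_merge(in_counts, out_counts, beta=3.0)
```

Both functions return a proper distribution over the union vocabulary. The difference is where the weighting happens: interpolation mixes the two normalized models, whereas count merging mixes the raw counts before normalizing, so the relative sizes of the two corpora also affect the result.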

Keywords:

Interpolation
Random forest
Language model
Decision tree
Computer science
Merge (version control)
Machine learning
Data modeling
Hidden Markov model
Pattern recognition
Artificial intelligence
Broadcasting
Speech processing
