Research on Visual Speech Recognition Based on Local Binary Pattern and Stacked Sparse Autoencoder

Yuanyao Lu, Ke Gu, Shan He

2019

Extracting lip features from images of the human mouth plays an essential role in visual speech recognition. This paper presents a lip feature extraction algorithm based on Local Binary Patterns (LBP) and a Stacked Sparse Autoencoder (SSAE). First, LBP texture features are extracted from the lip images. The SSAE then uses greedy unsupervised learning to extract high-level features from them. Finally, the overall system is fine-tuned to improve its performance, and the extracted features are fed into a Softmax classifier. Compared with traditional methods, the proposed model achieves higher classification accuracy and is more widely applicable.
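The first stage of the pipeline, LBP texture extraction, can be illustrated with a minimal sketch. The abstract does not state which LBP variant, radius, or neighbourhood the authors use, so the code below assumes the basic 3x3 operator (8 neighbours, radius 1) with a normalised 256-bin histogram as the descriptor. The names `OFFSETS`, `lbp_code`, and `lbp_histogram` are hypothetical and chosen only for illustration.

```python
# Basic 3x3 LBP sketch (8 neighbours, radius 1).
# Assumption: the paper's exact LBP variant is not given in the abstract.

# Neighbour offsets as (row, col), clockwise starting at the top-left pixel.
OFFSETS = [(-1, -1), (-1, 0), (-1, 1), (0, 1),
           (1, 1), (1, 0), (1, -1), (0, -1)]


def lbp_code(img, r, c):
    """Return the 8-bit LBP code of the interior pixel at (r, c)."""
    center = img[r][c]
    code = 0
    for bit, (dr, dc) in enumerate(OFFSETS):
        # A neighbour at least as bright as the centre sets its bit.
        if img[r + dr][c + dc] >= center:
            code |= 1 << bit
    return code


def lbp_histogram(img):
    """Return a normalised 256-bin histogram of LBP codes.

    Border pixels are skipped because they lack a full 3x3 neighbourhood.
    """
    hist = [0.0] * 256
    rows, cols = len(img), len(img[0])
    count = 0
    for r in range(1, rows - 1):
        for c in range(1, cols - 1):
            hist[lbp_code(img, r, c)] += 1.0
            count += 1
    if count:
        hist = [h / count for h in hist]
    return hist
```

In a pipeline like the one described, a histogram of this kind (possibly computed per image block and concatenated) would serve as the input vector to the autoencoder stage. That wiring is an assumption here, not a detail confirmed by the abstract.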

Keywords:

Local binary patterns
Autoencoder
Pattern recognition
Computer science
Artificial intelligence
Speech recognition
Lip feature extraction algorithm
Softmax function
Unsupervised learning
Classifier (linguistics)
