Perceptual Time Varying Linear Prediction model for speech applications

2009 
A new perceptual time varying model for non-stationary analysis of speech signals is presented. Some researches have already shown that the Time Varying Linear Prediction Coding (TVLPC) model that was applied to speech signals increases the recognition performance of Automatic Speech Recognition (ASR) systems. This improvement has been achieved due to the incorporation of the speech dynamics information in the model. Another work, Perceptual Linear Prediction (PLP) analysis of speech, has shown that a modified estimation of the Auto Correlation Function (ACF) of stationary speech frame yields major improvement to the recognition rate. The presented model, Perceptual Time Varying Linear Prediction (PTVLP) analysis of speech, adopts the perceptual concepts, of how to estimate the ACF, into the TVLPC model. This research shows that the proposed PTVLP model is more accurate, robust to noise and achieves better recognition rates than PLP and TVLPC over wide SNR range.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    5
    References
    4
    Citations
    NaN
    KQI
    []