Full covariance state duration modeling for HMM-based speech synthesis

Heng Lu,Yi-Jian Wu,Keiichi Tokuda,Li Rong Dai,Ren-Hua Wang

Full covariance state duration modeling for HMM-based speech synthesis

2009

Heng Lu
Yi-Jian Wu
Keiichi Tokuda
Li Rong Dai
Ren-Hua Wang

This paper proposes a state duration modeling method using full covariance matrix for HMM-based speech synthesis. In this method, a full covariance matrix instead of the conventional diagonal covariance matrix is adopted in the multi-dimensional Gaussian distribution to model the state duration of each context-dependent phoneme. At synthesis stage, the state durations are predicted using the clustered context-dependent distributions with full covariance matrices. Experimental results show that the synthesized speech using full-covariance state duration models is more natural than the conventional method when we change the speaking rate of synthesized speech.

Keywords:

Covariance matrix
Covariance intersection
Artificial intelligence
Estimation of covariance matrices
Speech synthesis
Matrix (mathematics)
Covariance function
Mathematics
Pattern recognition
Covariance
Hidden Markov model
Speech recognition

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations