Enhancement of Spectral Tilt in Synthesized Speech

Bidisha Sharma, S. R. Mahadeva Prasanna

2017

Research in statistical parametric speech synthesis aims to improve naturalness and intelligibility. In this work, the spectral tilt of natural and synthesized speech is analyzed, and a large gap between the two is observed. The deviation is further analyzed for different classes of sounds, namely low vowels, mid vowels, high vowels, semivowels, and nasals, and is found to vary with the category of sound unit. Based on this variation, a novel method for spectral tilt enhancement is proposed, in which the amount of enhancement differs across classes of sound units. The proposed method improves the intelligibility, naturalness, and speaker similarity of the synthesized speech.
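
As a rough illustration of the idea of class-dependent tilt adjustment, the Python sketch below measures a simple spectral tilt proxy and applies a first-order filter whose strength depends on the sound class. This is not the authors' method: the tilt measure (normalized lag-1 autocorrelation), the first-order filter, the function names, and the per-class coefficients in `CLASS_ALPHA` are all assumptions chosen for illustration, and the abstract does not specify any of them.

```python
import math

def lag1_tilt(x):
    """Normalized lag-1 autocorrelation r(1)/r(0).

    Assumed tilt proxy: values near 1 indicate energy concentrated at
    low frequencies (steep downward tilt); lower values indicate
    relatively more high-frequency energy.
    """
    r0 = sum(v * v for v in x)
    if r0 == 0.0:
        return 0.0
    r1 = sum(a * b for a, b in zip(x[1:], x[:-1]))
    return r1 / r0

def first_order_filter(x, alpha):
    """Apply y[n] = x[n] - alpha * x[n-1].

    For alpha > 0 this raises high frequencies relative to low ones,
    i.e. it flattens a downward spectral tilt.
    """
    y = []
    prev = 0.0
    for v in x:
        y.append(v - alpha * prev)
        prev = v
    return y

# Hypothetical placeholder coefficients, one per sound class named in
# the abstract. These numbers are NOT taken from the paper.
CLASS_ALPHA = {
    "low_vowel": 0.10,
    "mid_vowel": 0.20,
    "high_vowel": 0.30,
    "semivowel": 0.20,
    "nasal": 0.40,
}

def adjust_segment(x, sound_class):
    """Adjust the tilt of one segment using its class-specific coefficient."""
    return first_order_filter(x, CLASS_ALPHA[sound_class])

def two_tone(n_samples=1600, fs=16000.0, f_low=200.0, f_high=3000.0):
    """Synthetic test signal: a strong low tone plus a weaker high tone."""
    return [
        math.sin(2 * math.pi * f_low * n / fs)
        + 0.5 * math.sin(2 * math.pi * f_high * n / fs)
        for n in range(n_samples)
    ]
```

In such a sketch, a segment would first be labeled with its sound class (for example from the synthesizer's phone alignment, which is again an assumption) and then passed to `adjust_segment`; comparing `lag1_tilt` before and after shows how much the tilt proxy moved.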

Keywords:

Speech enhancement
Parametric statistics
Pattern recognition
Artificial intelligence
Naturalness
Speech recognition
Natural language
Intelligibility (communication)
Speech synthesis
Linear predictive coding
Computer science
Hidden Markov model
Mathematics
