Automatic prosodic modeling for speaker and task adaptation in text-to-speech

E. Lopez-Gonzalo,J.M. Rodriguez-Garcia,L. Hernandez-Gomez,J.M. Villar

Automatic prosodic modeling for speaker and task adaptation in text-to-speech

1997

E. Lopez-Gonzalo
J.M. Rodriguez-Garcia
L. Hernandez-Gomez
J.M. Villar

One of the most important demands for future text-to-speech (TTS) systems is their ability to improve naturalness when embedded in a particular task or application that requires a particular speaking style for a particular speaker. We present a new prosodic modeling procedure for improving naturalness by adapting a TTS system to a new speaker and a new speaking style. The proposed procedure is an extension of our automatic data-driven methodology, to model both fundamental frequency and segmental duration. Automatic linguistic and acoustic analysis are performed on both a task dependent text corpus and the recorded material from the selected speaker.

Keywords:

Speaker diarisation
Task adaptation
Text corpus
Speech processing
Speech recognition
Natural language processing
Loudspeaker
Speech synthesis
Naturalness
Feature extraction
Artificial intelligence
Computer science
electronic mail

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations