Synthetic F0 Can Effectively Convey Speaker ID in Delexicalized Speech

Eric Morley,Esther Klabbers,Jan P. H. van Santen,Alexander Kain,Seyed Hamidreza Mohammadi

Synthetic F0 Can Effectively Convey Speaker ID in Delexicalized Speech

2012

Eric Morley
Esther Klabbers
Jan P. H. van Santen
Alexander Kain
Seyed Hamidreza Mohammadi

We investigate the extent to which F0 can convey speaker ID in the absence of spectral, segmental, and durational information. We propose two methods of F0 synthesis based on the Linear Alignment Model (LAM) [2]: one parametric, the other corpusbased. Through a perceptual experiment, we show that F0 alone is able to convey information about speaker ID. We find that F0 synthesized with either LAM-based method conveys speaker ID almost as effectively as natural F0.

Keywords:

Perception
Parametric statistics
Speech recognition
Pattern recognition
Speech synthesis
Artificial intelligence
Prosody
Computer science

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations