Vowel Conversion by Phonetic Segmentation

Carlos de Obaldía; Udo Zölzer

DAFx-2015 - Trondheim

This paper presents a system for vowel conversion between different speakers using short-time speech segments. The input speech signal is segmented into period-length segments, whose fundamental frequency and first two formants are used to determine the perceived vowel quality. These segments are used to represent a voiced phoneme, i.e. a vowel. The approach relies on pitch-synchronous analysis and uses a modified PSOLA technique to concatenate the vowel segments. Vowel conversion between speakers is achieved by exchanging the phonetic constituents of the source speaker's speech waveform in voiced regions of speech while preserving the source speaker's prosodic features, thus introducing a method for phonetic segmentation, mapping, and reconstruction of vowels.
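As a rough illustration of the segmentation and vowel-quality steps described in the abstract, the following minimal Python sketch splits a signal into period-length segments at given pitch marks, derives a per-segment fundamental frequency, and assigns a vowel label by nearest neighbour in the F1/F2 plane. The function names, the Euclidean distance rule, and the reference formant table (approximate textbook averages for adult male speakers) are assumptions made for this example; the abstract does not specify how the paper performs pitch marking, formant estimation, or the mapping itself.

```python
from math import hypot

# Hypothetical reference table: approximate average (F1, F2) in Hz for
# adult male speakers, as commonly quoted in phonetics textbooks.
# These are illustrative values, not data from the paper.
REFERENCE_FORMANTS = {
    "i": (270.0, 2290.0),
    "a": (730.0, 1090.0),  # open back vowel, as in "father"
    "u": (300.0, 870.0),
}


def period_segments(signal, pitch_marks):
    """Split `signal` into segments spanning consecutive pitch marks.

    `pitch_marks` are sample indices, assumed to be sorted and to come
    from some external pitch-marking step (not shown here).
    """
    return [signal[a:b] for a, b in zip(pitch_marks, pitch_marks[1:])]


def segment_f0(pitch_marks, sample_rate):
    """Fundamental frequency in Hz of each period-length segment."""
    return [sample_rate / (b - a) for a, b in zip(pitch_marks, pitch_marks[1:])]


def nearest_vowel(f1, f2, references=REFERENCE_FORMANTS):
    """Label a segment with the reference vowel closest in the F1/F2 plane.

    Plain Euclidean distance in Hz is an assumption of this sketch.
    """
    return min(
        references,
        key=lambda v: hypot(f1 - references[v][0], f2 - references[v][1]),
    )


if __name__ == "__main__":
    signal = list(range(400))       # placeholder samples
    marks = [0, 100, 200, 300]      # placeholder pitch marks
    print(len(period_segments(signal, marks)))
    print(segment_f0(marks, 16000))
    print(nearest_vowel(280.0, 2200.0))
```

Formant estimation for each segment (for example by linear prediction) and the PSOLA-style concatenation are left out of this sketch; in a full system the labels produced here would select which target-speaker segments replace the source segments.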
