Construction and evaluation of a parametric one-dimensional vocal tract model

Simon Stone; Michael Marxen; Peter Birkholz

doi:10.1109/TASLP.2018.2825601

Publication: Contribution to journal › Research article › Contributed › Peer-reviewed

Contributors

Simon Stone, Chair of Speech Technology and Cognitive Systems, Junior Professorship of Cognitive Systems (Author)
Michael Marxen, Department of Psychiatry and Psychotherapy (Author)
Peter Birkholz, Junior Professorship of Cognitive Systems (Author)

Abstract

Articulatory speech synthesis based on aero-acoustic simulations of the vocal tract is computationally expensive and, therefore, requires simple yet precise models. Modeling the one-dimensional vocal tract area function directly instead of a higher dimensional vocal tract model is an efficient way to minimize the computational overhead of the simulations. In this paper, we propose a new parametric vocal tract model that is controlled by six points and capable of modeling a large variety of vocal tract shapes. We geometrically and perceptually evaluated the model on a set of 22 reference area functions corresponding to German vowels and consonants. The model was able to geometrically approximate the reference area functions with a minimum root-mean-square error of 0.302 cm², a maximum error of 1.142 cm², and a median error of 0.891 cm². After optimizations, a perceptual evaluation of the synthesis using our model in combination with a state-of-the-art aero-acoustic simulation achieved a vowel recognition rate of 90.7% and a consonant recognition rate of 73.2%.
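
The geometric errors quoted in the abstract are root-mean-square errors between area functions, reported in cm². As a minimal sketch of that kind of metric, assuming both area functions are sampled at the same positions along the tract, the computation could look as follows. The function name `rmse_cm2` and the sample values are hypothetical illustrations, not the authors' code or data, and the sampling used in the paper may differ.

```python
import math

def rmse_cm2(reference, approximation):
    """Root-mean-square error between two area functions (in cm^2).

    Assumes both sequences hold cross-sectional areas sampled at the
    same positions along the vocal tract.
    """
    if len(reference) != len(approximation) or not reference:
        raise ValueError("area functions must be non-empty and equally long")
    squared = [(r - a) ** 2 for r, a in zip(reference, approximation)]
    return math.sqrt(sum(squared) / len(squared))

# Hypothetical cross-sectional areas in cm^2 (not data from the paper).
reference = [0.5, 1.0, 2.0, 3.0, 2.0]
approximation = [0.5, 1.5, 2.0, 2.5, 2.0]

error = rmse_cm2(reference, approximation)
print(f"RMSE: {error:.3f} cm^2")
```

Because the result is the square root of a mean of squared area differences, it has the same unit as the areas themselves, which is why the errors above are stated in cm².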

Details

Original language	English
Pages (from - to)	1381-1392
Number of pages	12
Journal	IEEE/ACM Transactions on Audio Speech and Language Processing
Volume	26
Issue number	8
Publication status	Published - Aug. 2018
Peer-review status	Yes

External IDs

ORCID	/0000-0001-8870-0041/work/142251357
ORCID	/0000-0003-0167-8123/work/167214872
