Gesture-based Articulatory Text to Speech Synthesis

Benjamin Weitz; Ingmar Steiner; Peter Birkholz

Research output: Contribution to book/Conference proceedings/Anthology/Report › Conference contribution › Contributed › peer-review

Contributors

Benjamin Weitz - SemVox GmbH, Saarland University (Author)
Ingmar Steiner - Saarland University, German Research Center for Artificial Intelligence (DFKI) (Author)
Peter Birkholz - Junior Professorship in Cognitive Systems (Author)

Abstract

We present work carried out to extend the text-to-speech (TTS) platform MaryTTS with a back-end that serves as an interface to the articulatory synthesizer VocalTractLab (VTL). New processing modules were developed to (a) convert the linguistic and acoustic parameters predicted from orthographic text into a gestural score, and (b) synthesize that score to audio using the VTL software library. We also describe an evaluation of the resulting gesture-based articulatory TTS, using articulatory and acoustic speech data.

Details

Original language	English
Title of host publication	Elektronische Sprachsignalverarbeitung 2017
Editors	Jürgen Trouvain, Ingmar Steiner, Bernd Möbius
Publisher	Dresden : TUDpress
Pages	324-331
Number of pages	8
ISBN (print)	978-3-959080-92-7
Publication status	Published - 1 Mar 2017
Peer-reviewed	Yes

Publication series

Series	Studientexte zur Sprachkommunikation
Volume	86
ISSN	0940-6832

External IDs

ORCID	/0000-0003-0167-8123/work/168716951

Keywords

Poster
