Real-time manipulation of the F0-contour in synthetic speech using the Fujisaki model

Simon Stone; Konrad Schulze; Peter Steiner; Peter Birkholz

Real-time manipulation of the F0-contour in synthetic speech using the Fujisaki model

Research output: Contribution to book/Conference proceedings/Anthology/Report › Conference contribution › Contributed › peer-review

Contributors

Simon Stone - , Chair of Speech Technology and Cognitive Systems, Junior Professorship in Cognitive Systems (Author)
Konrad Schulze - , TUD Dresden University of Technology (Author)
Peter Steiner - , Chair of Speech Technology and Cognitive Systems (Author)
Peter Birkholz - , Junior Professorship in Cognitive Systems (Author)

Abstract

In this paper, we propose a system that allows the user of a real-time speech synthesizer to directly manipulate the F0 contour of an utterance on-line and in real-time. The intonation is generated by the Fujisaki Model, which creates the F0 contour based on accent and phrase commands that the user needs to trigger. These input commands to the model can be generated by the user with the buttons of a wireless Mycestro 3D mouse. To evaluate the usability, a study with 16 subjects was conducted and 10 monotone sentences were manipulated in real-time using the proposed system. The results show that the majority of users were able to produce a natural sounding intonation.

Details

Original language	English
Title of host publication	Elektronische Sprachsignalverarbeitung 2017
Editors	Jürgen Trouvain, Ingmar Steiner, Bernd Möbius
Publisher	Dresden : TUDpress
Pages	278-285
Number of pages	8
ISBN (print)	978-3-959080-92-7
Publication status	Published - 1 Mar 2017
Peer-reviewed	Yes

Publication series

Series	Studientexte zur Sprachkommunikation
Volume	86
ISSN	0940-6832

External IDs

ORCID	/0000-0003-0167-8123/work/168716949
ORCID	/0000-0002-8149-2275/work/168719845

Keywords

Poster

Research Portal of the TU Dresden

Contributors

Abstract

Details

Publication series

External IDs

Keywords

Keywords