Real-time manipulation of the F0-contour in synthetic speech using the Fujisaki model
Research output: Contribution to book/Conference proceedings/Anthology/Report › Conference contribution › Contributed › peer-review
Contributors
Abstract
In this paper, we propose a system that allows the user of a real-time speech synthesizer to directly manipulate the F0 contour of an utterance on-line and in real-time. The intonation is generated by the Fujisaki Model, which creates the F0 contour based on accent and phrase commands that the user needs to trigger. These input commands to the model can be generated by the user with the buttons of a wireless Mycestro 3D mouse. To evaluate the usability, a study with 16 subjects was conducted and 10 monotone sentences were manipulated in real-time using the proposed system. The results show that the majority of users were able to produce a natural sounding intonation.
Details
| Original language | English |
|---|---|
| Title of host publication | Elektronische Sprachsignalverarbeitung 2017 |
| Editors | Jürgen Trouvain, Ingmar Steiner, Bernd Möbius |
| Publisher | Dresden : TUDpress |
| Pages | 278-285 |
| Number of pages | 8 |
| ISBN (print) | 978-3-959080-92-7 |
| Publication status | Published - 1 Mar 2017 |
| Peer-reviewed | Yes |
Publication series
| Series | Studientexte zur Sprachkommunikation |
|---|---|
| Volume | 86 |
| ISSN | 0940-6832 |
External IDs
| ORCID | /0000-0003-0167-8123/work/168716949 |
|---|---|
| ORCID | /0000-0002-8149-2275/work/168719845 |
Keywords
Keywords
- Poster