Modelling vowel acquisition using the Birkholz synthesizer

Research output: Contribution to book/Conference proceedings/Anthology/ReportConference contributionContributedpeer-review

Contributors

Abstract

Human infants have a remarkable ability to learn to speak. To examine theories of some aspects of speech production development we previously developed Elija, a computational model of infant speech acquisition. Elija is an agent that can influence its environment by generating acoustic output by controlling an articulatory synthesizer as well as receiving somatosensory feedback from the environment. We first describe the Elija model more formally within the framework of reinforcement learning. Then we implement Elija’s vocal apparatus using the more sophisticated 3-D articulatory Birkholz synthesizer instead of the Maeda model used previously. Here we focus on vowel learning and show that, despite the increase in synthesizer complexity, the Elija model agent can still learn to generate vocalic speech sounds unassisted. Subsequently, using a selection process by a caregiver, Elija can refine these utterances leading to a set of L1 vowels. We present examples of the discovered vowels and show that they compare favorably to standard vowel configurations made available with the Birkholz synthesizer.

Details

Original languageEnglish
Title of host publicationElektronische Sprachsignalverarbeitung 2019
EditorsPeter Birkholz, Simon Stone
Publisher Dresden : TUDpress
Pages304-311
Number of pages8
ISBN (print)978-3-959081-57-3
Publication statusPublished - 1 Mar 2019
Peer-reviewedYes

Publication series

SeriesStudientexte zur Sprachkommunikation
Volume93
ISSN0940-6832

External IDs

ORCID /0000-0003-0167-8123/work/168716942

Keywords

Keywords

  • Sprachproduktion