Prior multisensory learning can facilitate auditory-only voice-identity and speech recognition in noise

Corrina Therese Maguinness; Sonja Schall; Brian Mathias; Martin Schoemann; Katharina von Kriegstein

doi:10.1177/17470218241278649

Prior multisensory learning can facilitate auditory-only voice-identity and speech recognition in noise

Research output: Contribution to journal › Research article › Contributed › peer-review

Contributors

Corrina Therese Maguinness - , Chair of Cognitive and Clinical Neuroscience, Max Planck Institute for Human Cognitive and Brain Sciences (Author)
Sonja Schall - , Max Planck Institute for Human Cognitive and Brain Sciences (Author)
Brian Mathias - , Chair of Cognitive and Clinical Neuroscience, University of Aberdeen (Author)
Martin Schoemann - , Chair of Psychological Methods and Cognitive Modelling (Author)
Katharina von Kriegstein - , Chair of Cognitive and Clinical Neuroscience, Max Planck Institute for Human Cognitive and Brain Sciences (Author)

Abstract

Seeing the visual articulatory movements of a speaker, while hearing their voice, helps with understanding what is said. This multisensory enhancement is particularly evident in noisy listening conditions. Multisensory enhancement also occurs even in auditory-only conditions: auditory-only speech and voice-identity recognition are superior for speakers previously learned with their face, compared to control learning; an effect termed the “face-benefit.” Whether the face-benefit can assist in maintaining robust perception in increasingly noisy listening conditions, similar to concurrent multisensory input, is unknown. Here, in two behavioural experiments, we examined this hypothesis. In each experiment, participants learned a series of speakers’ voices together with their dynamic face or control image. Following learning, participants listened to auditory-only sentences spoken by the same speakers and recognised the content of the sentences (speech recognition, Experiment 1) or the voice-identity of the speaker (Experiment 2) in increasing levels of auditory noise. For speech recognition, we observed that 14 of 30 participants (47%) showed a face-benefit. 19 of 25 participants (76%) showed a face-benefit for voice-identity recognition. For those participants who demonstrated a face-benefit, the face-benefit increased with auditory noise levels. Taken together, the results support an audio–visual model of auditory communication and suggest that the brain can develop a flexible system in which learned facial characteristics are used to deal with varying auditory uncertainty.

Details

Original language	English
Pages (from-to)	1348-1368
Number of pages	21
Journal	Quarterly Journal of Experimental Psychology
Volume	78
Issue number	7
Early online date	20 Aug 2024
Publication status	Published - Jul 2025
Peer-reviewed	Yes

External IDs

ORCID	/0000-0002-2531-4175/work/166324445
ORCID	/0000-0001-7989-5860/work/166324980
unpaywall	10.1177/17470218241278649
Scopus	85204594463
PubMed	39164830

Keywords

audio-visual, multisensory, person recognition, speech in noise, voice identity, Speech in noise, audio–visual, multisensory learning

Research Portal of the TU Dresden

Contributors

Abstract

Details

External IDs

Keywords

Keywords