Audio-visual speech comprehension in noise with real and virtual speakers

Jens Nirme; Birgitta Sahlén; Viveka Lyberg Åhlander; Jonas Brännström; Magnus Haake

doi:10.1016/j.specom.2019.11.005

Audio-visual speech comprehension in noise with real and virtual speakers

Jens Nirme, Birgitta Sahlén, Viveka Lyberg Åhlander, Jonas Brännström, Magnus Haake

Research output: Contribution to journal › Article › Scientific › peer-review

4 Citations (Scopus)

62 Downloads (Pure)

Abstract

This paper presents a study where a 3D motion-capture animated ‘virtual speaker’ is compared to a video of a real speaker with regards to how it facilitates children's speech comprehension of narratives in background multitalker babble noise. As secondary measures, children self-assess the listening- and attentional effort demanded by the task, and associates words describing positive or negative social traits to the speaker. The results show that the virtual speaker, despite being associated with more negative social traits, facilitates speech comprehension in babble noise compared to a voice-only presentation but that the effect requires some adaptation. We also found the virtual speaker to be at least as facilitating as the video. We interpret these results to suggest that audiovisual integration supports speech comprehension independently of children's social perception of the speaker, and discuss virtual speakers’ potential in research and pedagogical applications.

Original language	English
Pages (from-to)	44–55
Journal	Speech Communication
Volume	116
DOIs	https://doi.org/10.1016/j.specom.2019.11.005
Publication status	Published - 2020
MoE publication type	A1 Journal article-refereed

Access to Document

10.1016/j.specom.2019.11.005

Nirme-Audio-visual speech comprehension in noise preprint (1).pdfAccepted author manuscript, 550 KBLicence: Publisher rights policy

https://urn.fi/URN:NBN:fi-fe202201147570

Cite this

@article{857dc3a786eb449d884a980cbadc21af,

title = "Audio-visual speech comprehension in noise with real and virtual speakers",

abstract = "This paper presents a study where a 3D motion-capture animated {\textquoteleft}virtual speaker{\textquoteright} is compared to a video of a real speaker with regards to how it facilitates children's speech comprehension of narratives in background multitalker babble noise. As secondary measures, children self-assess the listening- and attentional effort demanded by the task, and associates words describing positive or negative social traits to the speaker. The results show that the virtual speaker, despite being associated with more negative social traits, facilitates speech comprehension in babble noise compared to a voice-only presentation but that the effect requires some adaptation. We also found the virtual speaker to be at least as facilitating as the video. We interpret these results to suggest that audiovisual integration supports speech comprehension independently of children's social perception of the speaker, and discuss virtual speakers{\textquoteright} potential in research and pedagogical applications.",

author = "Jens Nirme and Birgitta Sahl{\'e}n and {Lyberg {\AA}hlander}, Viveka and Jonas Br{\"a}nnstr{\"o}m and Magnus Haake",

year = "2020",

doi = "10.1016/j.specom.2019.11.005",

language = "English",

volume = "116",

pages = "44–55",

journal = "Speech Communication",

issn = "0167-6393",

publisher = "Elsevier",

}

TY - JOUR

T1 - Audio-visual speech comprehension in noise with real and virtual speakers

AU - Nirme, Jens

AU - Sahlén, Birgitta

AU - Lyberg Åhlander, Viveka

AU - Brännström, Jonas

AU - Haake, Magnus

PY - 2020

Y1 - 2020

N2 - This paper presents a study where a 3D motion-capture animated ‘virtual speaker’ is compared to a video of a real speaker with regards to how it facilitates children's speech comprehension of narratives in background multitalker babble noise. As secondary measures, children self-assess the listening- and attentional effort demanded by the task, and associates words describing positive or negative social traits to the speaker. The results show that the virtual speaker, despite being associated with more negative social traits, facilitates speech comprehension in babble noise compared to a voice-only presentation but that the effect requires some adaptation. We also found the virtual speaker to be at least as facilitating as the video. We interpret these results to suggest that audiovisual integration supports speech comprehension independently of children's social perception of the speaker, and discuss virtual speakers’ potential in research and pedagogical applications.

AB - This paper presents a study where a 3D motion-capture animated ‘virtual speaker’ is compared to a video of a real speaker with regards to how it facilitates children's speech comprehension of narratives in background multitalker babble noise. As secondary measures, children self-assess the listening- and attentional effort demanded by the task, and associates words describing positive or negative social traits to the speaker. The results show that the virtual speaker, despite being associated with more negative social traits, facilitates speech comprehension in babble noise compared to a voice-only presentation but that the effect requires some adaptation. We also found the virtual speaker to be at least as facilitating as the video. We interpret these results to suggest that audiovisual integration supports speech comprehension independently of children's social perception of the speaker, and discuss virtual speakers’ potential in research and pedagogical applications.

U2 - 10.1016/j.specom.2019.11.005

DO - 10.1016/j.specom.2019.11.005

M3 - Article

SN - 0167-6393

VL - 116

SP - 44

EP - 55

JO - Speech Communication

JF - Speech Communication

ER -

Audio-visual speech comprehension in noise with real and virtual speakers

Abstract

Access to Document

Fingerprint

Cite this