Exploring the impact of noise, language familiarity, and experimental settings on emotion recognition

IntroductionThis work aims to understand the contextual factors affecting speech emotion recognition (SER), more specifically the current research investigates whether the identification of vocal emotional expressions of anger, fear, sadness, joy, and neutrality is affected by three factors: (a) the...

Full description

Saved in:
Bibliographic Details
Main Authors: Terry Amorese, Marialucia Cuciniello, Anna Alterio, Daniele Pepe, Odette Scharenborg, Gennaro Cordasco, Anna Esposito
Format: Article
Language:English
Published: Frontiers Media S.A. 2025-06-01
Series:Frontiers in Psychology
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/fpsyg.2025.1548975/full
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1839655276591972352
author Terry Amorese
Marialucia Cuciniello
Anna Alterio
Daniele Pepe
Odette Scharenborg
Gennaro Cordasco
Anna Esposito
author_facet Terry Amorese
Marialucia Cuciniello
Anna Alterio
Daniele Pepe
Odette Scharenborg
Gennaro Cordasco
Anna Esposito
author_sort Terry Amorese
collection DOAJ
description IntroductionThis work aims to understand the contextual factors affecting speech emotion recognition (SER), more specifically the current research investigates whether the identification of vocal emotional expressions of anger, fear, sadness, joy, and neutrality is affected by three factors: (a) the experimental setting, exploring vocal emotion recognition in both a controlled, soundproof laboratory and a more natural listening environment; (b) the effect of stimuli’s background noise: sentences were presented with three different levels of noise to gradually increase the level of difficulty: one clear (no noise) condition and two noise conditions; (c) language familiarity, since the stimuli comprised Italian sentences, and participants were both native (Italians) and Dutch speakers, who did not know Italian.MethodDutch and Italian participants were involved in a vocal emotion recognition task carried out in two different experimental settings (realistic vs. laboratory). The stimuli were vocal utterances from the Italian EMOVO dataset, conveying emotions like anger, fear, sadness, joy, and neutrality, and were presented in three different noise conditions.ResultsConcerning the effect of the experimental setting, even in higher levels of background noise conditions, individuals possess the remarkable ability to discern emotional nuances conveyed through voice. Regarding familiarity with the language, differences in emotion recognition performance between the Italian and Dutch listeners were observed, but the error magnitude was contingent on the emotional categories. Higher noise levels reduced accuracy, but people could still discern emotions, especially prosody.ConclusionThe study highlighted that emotion recognition is influenced by variables such as listening context, background noise, and language familiarity. These results could be useful for developing robust Speech Emotion Recognition (SER) systems and improving human-computer interaction.
format Article
id doaj-art-dcb64c7f73ec4d5b8fbe84d576e07ce8
institution Matheson Library
issn 1664-1078
language English
publishDate 2025-06-01
publisher Frontiers Media S.A.
record_format Article
series Frontiers in Psychology
spelling doaj-art-dcb64c7f73ec4d5b8fbe84d576e07ce82025-06-25T11:27:57ZengFrontiers Media S.A.Frontiers in Psychology1664-10782025-06-011610.3389/fpsyg.2025.15489751548975Exploring the impact of noise, language familiarity, and experimental settings on emotion recognitionTerry Amorese0Marialucia Cuciniello1Anna Alterio2Daniele Pepe3Odette Scharenborg4Gennaro Cordasco5Anna Esposito6Department of Psychology, Università degli Studi della Campania “L. Vanvitelli”, Caserta, ItalyDepartment of Psychology, Università degli Studi della Campania “L. Vanvitelli”, Caserta, ItalyDepartment of Psychology, Università degli Studi della Campania “L. Vanvitelli”, Caserta, ItalyDepartment of Psychology, Università degli Studi della Campania “L. Vanvitelli”, Caserta, ItalyMultimedia Computing Group, Delft University of Technology, Delft, NetherlandsDepartment of Psychology, Università degli Studi della Campania “L. Vanvitelli”, Caserta, ItalyDepartment of Psychology, Università degli Studi della Campania “L. Vanvitelli”, Caserta, ItalyIntroductionThis work aims to understand the contextual factors affecting speech emotion recognition (SER), more specifically the current research investigates whether the identification of vocal emotional expressions of anger, fear, sadness, joy, and neutrality is affected by three factors: (a) the experimental setting, exploring vocal emotion recognition in both a controlled, soundproof laboratory and a more natural listening environment; (b) the effect of stimuli’s background noise: sentences were presented with three different levels of noise to gradually increase the level of difficulty: one clear (no noise) condition and two noise conditions; (c) language familiarity, since the stimuli comprised Italian sentences, and participants were both native (Italians) and Dutch speakers, who did not know Italian.MethodDutch and Italian participants were involved in a vocal emotion recognition task carried out in two different experimental settings (realistic vs. laboratory). The stimuli were vocal utterances from the Italian EMOVO dataset, conveying emotions like anger, fear, sadness, joy, and neutrality, and were presented in three different noise conditions.ResultsConcerning the effect of the experimental setting, even in higher levels of background noise conditions, individuals possess the remarkable ability to discern emotional nuances conveyed through voice. Regarding familiarity with the language, differences in emotion recognition performance between the Italian and Dutch listeners were observed, but the error magnitude was contingent on the emotional categories. Higher noise levels reduced accuracy, but people could still discern emotions, especially prosody.ConclusionThe study highlighted that emotion recognition is influenced by variables such as listening context, background noise, and language familiarity. These results could be useful for developing robust Speech Emotion Recognition (SER) systems and improving human-computer interaction.https://www.frontiersin.org/articles/10.3389/fpsyg.2025.1548975/fullspeech recognitionvocal emotion recognitionnoiselanguage proficiencylanguage understanding
spellingShingle Terry Amorese
Marialucia Cuciniello
Anna Alterio
Daniele Pepe
Odette Scharenborg
Gennaro Cordasco
Anna Esposito
Exploring the impact of noise, language familiarity, and experimental settings on emotion recognition
Frontiers in Psychology
speech recognition
vocal emotion recognition
noise
language proficiency
language understanding
title Exploring the impact of noise, language familiarity, and experimental settings on emotion recognition
title_full Exploring the impact of noise, language familiarity, and experimental settings on emotion recognition
title_fullStr Exploring the impact of noise, language familiarity, and experimental settings on emotion recognition
title_full_unstemmed Exploring the impact of noise, language familiarity, and experimental settings on emotion recognition
title_short Exploring the impact of noise, language familiarity, and experimental settings on emotion recognition
title_sort exploring the impact of noise language familiarity and experimental settings on emotion recognition
topic speech recognition
vocal emotion recognition
noise
language proficiency
language understanding
url https://www.frontiersin.org/articles/10.3389/fpsyg.2025.1548975/full
work_keys_str_mv AT terryamorese exploringtheimpactofnoiselanguagefamiliarityandexperimentalsettingsonemotionrecognition
AT marialuciacuciniello exploringtheimpactofnoiselanguagefamiliarityandexperimentalsettingsonemotionrecognition
AT annaalterio exploringtheimpactofnoiselanguagefamiliarityandexperimentalsettingsonemotionrecognition
AT danielepepe exploringtheimpactofnoiselanguagefamiliarityandexperimentalsettingsonemotionrecognition
AT odettescharenborg exploringtheimpactofnoiselanguagefamiliarityandexperimentalsettingsonemotionrecognition
AT gennarocordasco exploringtheimpactofnoiselanguagefamiliarityandexperimentalsettingsonemotionrecognition
AT annaesposito exploringtheimpactofnoiselanguagefamiliarityandexperimentalsettingsonemotionrecognition