On efficacy of Meta-Learning for Domain Generalization in Speech Emotion Recognition

Raeshak King Gandhi, Vasilis Tsouvalas, Nirvana Meratnia (Begeleider)

Onderzoeksoutput: Hoofdstuk in Boek/Rapport/CongresprocedureConferentiebijdrageAcademicpeer review

95 Downloads (Pure)

Samenvatting

Speech Emotion Recognition (SER) refers to the recognition of human emotions from natural speech, vital for building human-centered context-aware intelligent systems. Here, domain shift, where models' trained on one domain exhibit performance degradation when exposed to an unseen domain with different statistics, is a major limiting factor in SER applicability, as models have a strong dependence on speakers and languages characteristics used during training. Meta-Learning for Domain Generalization (MLDG) has shown great success in improving models' generalization capacity and alleviate the domain shift problem in the vision domain; yet, its' efficacy on SER remains largely unexplored. In this work, we propose a "domain-shift aware" MLDG approach to learn generalizable models across multiple domains in SER. Based on our extensive evaluation, we identify a number of pitfalls that contribute to poor models' DG ability, and demonstrate that log-mel spectrograms representations lack distinct features required for MLDG in SER. We further explore the use of appropriate features to achieve DG in SER as to provide insides to future research directions for DG in SER.
Originele taal-2Engels
Titel2023 IEEE International Conference on Pervasive Computing and Communications Workshops and other Affiliated Events, PerCom Workshops 2023
UitgeverijInstitute of Electrical and Electronics Engineers
Pagina's421-426
Aantal pagina's6
ISBN van elektronische versie9781665453813
DOI's
StatusGepubliceerd - 2023
Evenement2023 IEEE International Conference on Pervasive Computing and Communications Workshops and other Affiliated Events, PerCom Workshops 2023 - Atlanta, Verenigde Staten van Amerika
Duur: 13 mrt. 202317 mrt. 2023

Congres

Congres2023 IEEE International Conference on Pervasive Computing and Communications Workshops and other Affiliated Events, PerCom Workshops 2023
Land/RegioVerenigde Staten van Amerika
StadAtlanta
Periode13/03/2317/03/23

Vingerafdruk

Duik in de onderzoeksthema's van 'On efficacy of Meta-Learning for Domain Generalization in Speech Emotion Recognition'. Samen vormen ze een unieke vingerafdruk.

Citeer dit