Privacy-preserving Speech Emotion Recognition through Semi-Supervised Federated Learning

Onderzoeksoutput: Hoofdstuk in Boek/Rapport/CongresprocedureConferentiebijdrageAcademicpeer review

21 Citaten (Scopus)

Samenvatting

Speech Emotion Recognition (SER) refers to the recognition of human emotions from natural speech. If done accurately, it can offer a number of benefits in building human-centered context-aware intelligent systems. Existing SER approaches are largely centralized, without considering users’ privacy. Federated Learning (FL) is a distributed machine learning paradigm dealing with decentralization of privacy-sensitive personal data. In this paper, we present a privacy-preserving and data-efficient SER approach by utilizing the concept of FL. To the best of our knowledge, this is the first federated SER approach, which utilizes self-training learning in conjunction with federated learning to exploit both labeled and unlabeled on-device data. Our experimental evaluations on the IEMOCAP dataset shows that our federated approach can learn generalizable SER models even under low availability of data labels and highly non-i.i.d. distributions. We show that our approach with as few as 10% labeled data, on average, can improve the recognition rate by 8.67% compared to the fully-supervised federated counterparts.
Originele taal-2Engels
Titel2022 IEEE International Conference on Pervasive Computing and Communications Workshops and other Affiliated Events (PerCom Workshops)
UitgeverijIEEE/LEOS
Pagina's359-364
Aantal pagina's6
ISBN van elektronische versie9781665416474
ISBN van geprinte versie978-1-6654-1648-1
DOI's
StatusGepubliceerd - 25 mrt. 2022
Evenement2022 IEEE International Conference on Pervasive Computing and Communications Workshops and other Affiliated Events (PerCom Workshops) - Pisa, Italy
Duur: 21 mrt. 202225 mrt. 2022

Congres

Congres2022 IEEE International Conference on Pervasive Computing and Communications Workshops and other Affiliated Events (PerCom Workshops)
Periode21/03/2225/03/22

Vingerafdruk

Duik in de onderzoeksthema's van 'Privacy-preserving Speech Emotion Recognition through Semi-Supervised Federated Learning'. Samen vormen ze een unieke vingerafdruk.

Citeer dit