Federated Self-Training for Data-Efficient Audio Recognition

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

7 Citations (Scopus)
5 Downloads (Pure)

Abstract

Federated learning is a distributed machine learning paradigm dealing with decentralized and personal datasets. Since data reside on devices like smartphones, labeling is entrusted to the clients or labels are extracted in an automated way. Specifically, in the case of audio data, acquiring semantic annotations can be prohibitively expensive and time-consuming. As a result, an abundance of audio data remains unlabeled and unexploited on users’ devices. Existing federated learning approaches largely focus on supervised learning without harnessing the unlabeled data. Here, we study the problem of semi-supervised learning of audio models in conjunction with federated learning. We propose FedSTAR, a self-training approach to exploit large-scale on-device unlabeled data to improve the generalization of audio recognition models. We conduct experiments on diverse public audio classification datasets and investigate the performance of our models under varying percentages of labeled data and show that with as little as 3% labeled data, FedSTAR on average can improve the recognition rate by 13.28% compared to the fully-supervised federated model.
Original languageEnglish
Title of host publicationICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
PublisherInstitute of Electrical and Electronics Engineers
Pages476-480
Number of pages5
ISBN (Electronic)978-1-6654-0540-9
ISBN (Print)978-1-6654-0541-6
DOIs
Publication statusPublished - 27 Apr 2022
Event2022 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2022 - Virtual, Online, Singapore, Singapore
Duration: 23 May 202227 May 2022
Conference number: 47
https://2022.ieeeicassp.org/

Conference

Conference2022 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2022
Abbreviated titleICASSP 2022
Country/TerritorySingapore
CitySingapore
Period23/05/2227/05/22
Internet address

Keywords

  • Supervised learning
  • Semantics
  • Speech recognition
  • Semisupervised learning
  • Signal processing
  • Collaborative work
  • Data models
  • deep learning
  • sound recognition
  • semi-supervised learning
  • audio classification
  • federated learning

Fingerprint

Dive into the research topics of 'Federated Self-Training for Data-Efficient Audio Recognition'. Together they form a unique fingerprint.

Cite this