Abstract
Federated learning is a distributed machine learning paradigm dealing with decentralized and personal datasets. Since data reside on devices like smartphones, labeling is entrusted to the clients or labels are extracted in an automated way. Specifically, in the case of audio data, acquiring semantic annotations can be prohibitively expensive and time-consuming. As a result, an abundance of audio data remains unlabeled and unexploited on users’ devices. Existing federated learning approaches largely focus on supervised learning without harnessing the unlabeled data. Here, we study the problem of semi-supervised learning of audio models in conjunction with federated learning. We propose FedSTAR, a self-training approach to exploit large-scale on-device unlabeled data to improve the generalization of audio recognition models. We conduct experiments on diverse public audio classification datasets and investigate the performance of our models under varying percentages of labeled data and show that with as little as 3% labeled data, FedSTAR on average can improve the recognition rate by 13.28% compared to the fully-supervised federated model.
| Original language | English |
|---|---|
| Title of host publication | ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) |
| Publisher | Institute of Electrical and Electronics Engineers |
| Pages | 476-480 |
| Number of pages | 5 |
| ISBN (Electronic) | 978-1-6654-0540-9 |
| ISBN (Print) | 978-1-6654-0541-6 |
| DOIs | |
| Publication status | Published - 27 Apr 2022 |
| Event | 2022 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2022 - Virtual, Online, Singapore, Singapore Duration: 23 May 2022 → 27 May 2022 Conference number: 47 https://2022.ieeeicassp.org/ |
Conference
| Conference | 2022 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2022 |
|---|---|
| Abbreviated title | ICASSP 2022 |
| Country/Territory | Singapore |
| City | Singapore |
| Period | 23/05/22 → 27/05/22 |
| Internet address |
Keywords
- Supervised learning
- Semantics
- Speech recognition
- Semisupervised learning
- Signal processing
- Collaborative work
- Data models
- deep learning
- sound recognition
- semi-supervised learning
- audio classification
- federated learning