Semi-Supervised Learning with Per-Class Adaptive Confidence Scores for Acoustic Environment Classification with Imbalanced Data

Onderzoeksoutput: Hoofdstuk in Boek/Rapport/CongresprocedureConferentiebijdrageAcademicpeer review

Samenvatting

In this paper, we concentrate on the per-class accuracy of neural network-based classification in the context of identifying acoustic environments. Even a fully supervised learning framework with an equal amount of data for each class can lead to significant differences in class accuracies. This is then amplified by semi-supervised learning using naturally imbalanced data. To address this problem, we propose an adaptive method for pseudo-label selection via a straightforward optimization of the validation accuracy per class, aimed specifically at reducing the variance between different classes. The proposed method is general and can be applied for both maximum probability and entropy-based confidence criteria. Compared to fully supervised learning as well as state-of-the-art methods for pseudo-labeling, it achieves the lowest variances of per-class accuracy and the highest accuracies of the minority classes when tested on common publicly available environment sound databases.
Originele taal-2Engels
TitelICASSP 2023 -2023 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
UitgeverijInstitute of Electrical and Electronics Engineers
Aantal pagina's5
ISBN van elektronische versie978-1-7281-6327-7
DOI's
StatusGepubliceerd - 5 mei 2023
EvenementICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) - Rhodes Island, Griekenland
Duur: 4 jun. 202310 jun. 2023

Congres

CongresICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Verkorte titelICASSP 2023
Land/RegioGriekenland
StadRhodes Island
Periode4/06/2310/06/23

Vingerafdruk

Duik in de onderzoeksthema's van 'Semi-Supervised Learning with Per-Class Adaptive Confidence Scores for Acoustic Environment Classification with Imbalanced Data'. Samen vormen ze een unieke vingerafdruk.

Citeer dit