Abstract
Machine learning models are often trained on data sets subject to selection bias. In particular, selection bias can be hard to avoid in scenarios where the proportion of positives is low and labeling is expensive, such as fraud detection. However, when selection bias is related to sensitive characteristics such as gender and race, it can result in an unequal distribution of burdens across sensitive groups, where marginalized groups are misrepresented and disproportionately scrutinized. Moreover, when the predictions of existing systems affect the selection of new labels, a feedback loop can occur in which selection bias is amplified over time. In this work, we explore the effectiveness of active learning approaches to mitigate fairnessrelated harm caused by selection bias. Active learning approaches aim to select the most informative instances from unlabeled data. We hypothesize that this characteristic steers data collection towards underexplored areas of the feature space and away from overexplored areas – including areas affected
by selection bias. Our preliminary simulation results confirm the intuition that active learning can mitigate the negative consequences of selection bias, compared to both the baseline scenario and random sampling.
by selection bias. Our preliminary simulation results confirm the intuition that active learning can mitigate the negative consequences of selection bias, compared to both the baseline scenario and random sampling.
Original language | English |
---|---|
Title of host publication | Proceedings of the Workshop on Interactive Adaptive Learning |
Subtitle of host publication | co-located with European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML-PKDD 2023) |
Editors | Mirko Bunse, Barbara Hammer, Georg Krempl, Vincent Lemaire, Alaa Tharwat, Amal Saadallah |
Publisher | CEUR-WS.org |
Pages | 74-88 |
Number of pages | 15 |
Publication status | Published - 2023 |
Event | 7th International Workshop & Tutorial on Interactive Adaptive Learning (IAL 2023): Co-Located with ECML-PKDD 2023 - Torino, Italy Duration: 22 Sept 2023 → 22 Sept 2023 |
Publication series
Name | CEUR Workshop Proceedings |
---|---|
Volume | 3470 |
ISSN (Electronic) | 1613-0073 |
Conference
Conference | 7th International Workshop & Tutorial on Interactive Adaptive Learning (IAL 2023) |
---|---|
Abbreviated title | IAL 2023 |
Country/Territory | Italy |
City | Torino |
Period | 22/09/23 → 22/09/23 |
Keywords
- active learning
- algorithmic fairness
- machine learning
- selection bias