Look and You Will Find It: Fairness-Aware Data Collection through Active Learning

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

1 Citation (Scopus)
34 Downloads (Pure)

Abstract

Machine learning models are often trained on data sets subject to selection bias. In particular, selection bias can be hard to avoid in scenarios where the proportion of positives is low and labeling is expensive, such as fraud detection. However, when selection bias is related to sensitive characteristics such as gender and race, it can result in an unequal distribution of burdens across sensitive groups, where marginalized groups are misrepresented and disproportionately scrutinized. Moreover, when the predictions of existing systems affect the selection of new labels, a feedback loop can occur in which selection bias is amplified over time. In this work, we explore the effectiveness of active learning approaches to mitigate fairnessrelated harm caused by selection bias. Active learning approaches aim to select the most informative instances from unlabeled data. We hypothesize that this characteristic steers data collection towards underexplored areas of the feature space and away from overexplored areas – including areas affected
by selection bias. Our preliminary simulation results confirm the intuition that active learning can mitigate the negative consequences of selection bias, compared to both the baseline scenario and random sampling.
Original languageEnglish
Title of host publicationProceedings of the Workshop on Interactive Adaptive Learning
Subtitle of host publicationco-located with European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML-PKDD 2023)
EditorsMirko Bunse, Barbara Hammer, Georg Krempl, Vincent Lemaire, Alaa Tharwat, Amal Saadallah
PublisherCEUR-WS.org
Pages74-88
Number of pages15
Publication statusPublished - 2023
Event7th International Workshop & Tutorial on Interactive Adaptive Learning (IAL 2023): Co-Located with ECML-PKDD 2023 - Torino, Italy
Duration: 22 Sept 202322 Sept 2023

Publication series

NameCEUR Workshop Proceedings
Volume3470
ISSN (Electronic)1613-0073

Conference

Conference7th International Workshop & Tutorial on Interactive Adaptive Learning (IAL 2023)
Abbreviated titleIAL 2023
Country/TerritoryItaly
CityTorino
Period22/09/2322/09/23

Keywords

  • active learning
  • algorithmic fairness
  • machine learning
  • selection bias

Fingerprint

Dive into the research topics of 'Look and You Will Find It: Fairness-Aware Data Collection through Active Learning'. Together they form a unique fingerprint.

Cite this