Abstract
Today, data is an essential part of many decision-making processes in businesses and social life through the use of various machine learning techniques. These methods can easily perpetuate human bias in the data and result in discrimination. Despite a growing interest in data discrimination discovery and removal, to date there is a lack of a general and robust framework to distinguish discriminatory decision-making processes from non-discriminatory ones. In this work, we present a generic framework that helps detect possible discrimination by analyzing historical data and associated decisions using a top-down unsupervised approach, which we refer to as hierarchical clustering. Our approach is highly adaptive as it gradually 'learns' users' inherent groups, and clusters their records using cohesiveness and density of points in the dataset. Moreover, we propose a progressive attribute-selection method to choose statistically relevant attributes, thus reducing the effect of noise. Finally, we adopt a recursive notion of cluster profile that is homogeneous w.r.t. decision labels. This allows for deeper insights on the data and on the decision-making underlying the final user classification. Our framework is able to identify both positive and negative bias resulting in discrimination. We also highlight patterns of discrimination revealed by the homogeneous cluster centroids, which otherwise could not be captured.
Original language | English |
---|---|
Title of host publication | Proceedings - IEEE 2nd International Conference on Artificial Intelligence and Knowledge Engineering, AIKE 2019 |
Place of Publication | Piscataway |
Publisher | Institute of Electrical and Electronics Engineers |
Pages | 187-194 |
Number of pages | 8 |
ISBN (Electronic) | 978-1-7281-1488-0 |
DOIs | |
Publication status | Published - 1 Jun 2019 |
Event | 2nd IEEE International Conference on Artificial Intelligence and Knowledge Engineering, AIKE 2019 - Cagliari, Sardinia, Italy Duration: 3 Jun 2019 → 5 Jun 2019 |
Conference
Conference | 2nd IEEE International Conference on Artificial Intelligence and Knowledge Engineering, AIKE 2019 |
---|---|
Country/Territory | Italy |
City | Cagliari, Sardinia |
Period | 3/06/19 → 5/06/19 |
Keywords
- Bias
- Clustering
- Discrimination Discovery
- KNN
- Measurements