Abstract
Scholars in the machine learning community have recently focused on analyzing the fairness of learning models, including clustering algorithms. In this work we study fair clustering in a probabilistic (soft) setting, where observations may belong to several clusters determined by probabilities. We introduce new probabilistic fairness metrics, which generalize and extend existing non-probabilistic fairness frameworks and propose an algorithm for obtaining a fair probabilistic cluster solution from a data representation known as a fairlet decomposition. Finally, we demonstrate our proposed fairness metrics and algorithm by constructing a fair Gaussian mixture model on three real-world datasets. We achieve this by identifying balanced micro-clusters which minimize the distances induced by the model, and on which traditional clustering can be performed while ensuring the fairness of the solution.
Original language | English |
---|---|
Title of host publication | Proceedings of the 27th International Conference on Artifi- cial Intelligence and Statistics |
Volume | 238 |
Publisher | Proceedings of Machine Learning Research |
Publication date | 2024 |
Pages | 1270-1278 |
Publication status | Published - 2024 |
Event | 27th International Conference on Artificial Intelligence and Statistics - Valencia, Spain Duration: 2 May 2024 → 4 May 2024 |
Conference
Conference | 27th International Conference on Artificial Intelligence and Statistics |
---|---|
Country/Territory | Spain |
City | Valencia |
Period | 02/05/2024 → 04/05/2024 |