Fair Soft Clustering

Rune D. Kjærsgaard*, Pekka Parviainen, Saket Saurabh, Madhumita Kundu, Line K.H. Clemmensen

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

102 Downloads (Orbit)

Abstract

Scholars in the machine learning community have recently focused on analyzing the fairness of learning models, including clustering algorithms. In this work we study fair clustering in a probabilistic (soft) setting, where observations may belong to several clusters determined by probabilities. We introduce new probabilistic fairness metrics, which generalize and extend existing non-probabilistic fairness frameworks and propose an algorithm for obtaining a fair probabilistic cluster solution from a data representation known as a fairlet decomposition. Finally, we demonstrate our proposed fairness metrics and algorithm by constructing a fair Gaussian mixture model on three real-world datasets. We achieve this by identifying balanced micro-clusters which minimize the distances induced by the model, and on which traditional clustering can be performed while ensuring the fairness of the solution.
Original languageEnglish
Title of host publicationProceedings of the 27th International Conference on Artifi- cial Intelligence and Statistics
Volume238
PublisherProceedings of Machine Learning Research
Publication date2024
Pages1270-1278
Publication statusPublished - 2024
Event27th International Conference on Artificial Intelligence and Statistics - Valencia, Spain
Duration: 2 May 20244 May 2024

Conference

Conference27th International Conference on Artificial Intelligence and Statistics
Country/TerritorySpain
CityValencia
Period02/05/202404/05/2024

Fingerprint

Dive into the research topics of 'Fair Soft Clustering'. Together they form a unique fingerprint.

Cite this