Abstract
Scholars in the machine learning community have recently focused on analyzing the fairness of learning models, including clustering algorithms. In this work we study fair clustering in a probabilistic (soft) setting, where observations may belong to several clusters determined by probabilities. We introduce new probabilistic fairness metrics, which generalize and extend existing non-probabilistic fairness frameworks and propose an algorithm for obtaining a fair probabilistic cluster solution from a data representation known as a fairlet decomposition. Finally, we demonstrate our proposed fairness metrics and algorithm by constructing a fair Gaussian mixture model on three real-world datasets. We achieve this by identifying balanced micro-clusters which minimize the distances induced by the model, and on which traditional clustering can be performed while ensuring the fairness of the solution.
Originalsprog | Engelsk |
---|---|
Tidsskrift | Proceedings of Machine Learning Research |
Vol/bind | 238 |
Sider (fra-til) | 1270-1278 |
Antal sider | 9 |
ISSN | 2640-3498 |
Status | Udgivet - 2024 |
Udgivet eksternt | Ja |
Begivenhed | 27th International Conference on Artificial Intelligence and Statistics, AISTATS 2024 - Valencia, Spanien Varighed: 2 maj 2024 → 4 maj 2024 |
Konference
Konference | 27th International Conference on Artificial Intelligence and Statistics, AISTATS 2024 |
---|---|
Land/Område | Spanien |
By | Valencia |
Periode | 02/05/2024 → 04/05/2024 |
Bibliografisk note
Publisher Copyright:Copyright 2024 by the author(s).