Fair Soft Clustering

Rune D. Kjærsgaard*, Pekka Parviainen, Saket Saurabh, Madhumita Kundu, Line K.H. Clemmensen

*Corresponding author af dette arbejde

Publikation: Bidrag til tidsskriftKonferenceartikelForskningpeer review

1 Citationer (Scopus)

Abstract

Scholars in the machine learning community have recently focused on analyzing the fairness of learning models, including clustering algorithms. In this work we study fair clustering in a probabilistic (soft) setting, where observations may belong to several clusters determined by probabilities. We introduce new probabilistic fairness metrics, which generalize and extend existing non-probabilistic fairness frameworks and propose an algorithm for obtaining a fair probabilistic cluster solution from a data representation known as a fairlet decomposition. Finally, we demonstrate our proposed fairness metrics and algorithm by constructing a fair Gaussian mixture model on three real-world datasets. We achieve this by identifying balanced micro-clusters which minimize the distances induced by the model, and on which traditional clustering can be performed while ensuring the fairness of the solution.

OriginalsprogEngelsk
TidsskriftProceedings of Machine Learning Research
Vol/bind238
Sider (fra-til)1270-1278
Antal sider9
ISSN2640-3498
StatusUdgivet - 2024
Udgivet eksterntJa
Begivenhed27th International Conference on Artificial Intelligence and Statistics, AISTATS 2024 - Valencia, Spanien
Varighed: 2 maj 20244 maj 2024

Konference

Konference27th International Conference on Artificial Intelligence and Statistics, AISTATS 2024
Land/OmrådeSpanien
ByValencia
Periode02/05/202404/05/2024

Bibliografisk note

Publisher Copyright:
Copyright 2024 by the author(s).

Citationsformater