Back to Search Start Over

On coresets for fair clustering in metric and Euclidean spaces and their applications.

Authors :
Bandyapadhyay, Sayan
Fomin, Fedor V.
Simonov, Kirill
Source :
Journal of Computer & System Sciences. Jun2024, Vol. 142, pN.PAG-N.PAG. 1p.
Publication Year :
2024

Abstract

Fair clustering is a constrained clustering problem where we need to partition a set of colored points. The fraction of points of each color in every cluster should be more or less equal to the fraction of points of this color in the dataset. The problem was recently introduced by Chierichetti et al. (2017) [1]. We propose a new construction of coresets for fair clustering for Euclidean and general metrics based on random sampling. For the Euclidean space R d , we provide the first coreset whose size does not depend exponentially on the dimension d. The question of whether such constructions exist was asked by Schmidt et al. (2019) [2] and Huang et al. (2019) [5]. For general metrics, our construction provides the first coreset for fair clustering. New coresets appear to be a handy tool for designing better approximation and streaming algorithms for fair and other constrained clustering variants. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
00220000
Volume :
142
Database :
Academic Search Index
Journal :
Journal of Computer & System Sciences
Publication Type :
Academic Journal
Accession number :
175793823
Full Text :
https://doi.org/10.1016/j.jcss.2024.103506