Solving the Correlation Cluster LP in Sublinear Time

Nairen Cao, Vincent Cohen-Addad, Euiwoong Lee, Shi Li, David Rasmussen Lolck, Alantha Newman, Mikkel Thorup, Lukas Vogl*, Shuyi Yan, Hanwen Zhang

*Corresponding author af dette arbejde

Publikation: Bidrag til bog/antologi/rapportKonferencebidrag i proceedingsForskningpeer review

1 Citationer (Scopus)

Abstract

Correlation Clustering is a fundamental and widely-studied problem in unsupervised learning and data mining. The input is a graph and the goal is to construct a clustering minimizing the number of inter-cluster edges plus the number of missing intra-cluster edges. Cao, Cohen-Addad, Lee, Li, Newman, and Vogl [STOC 2024] introduced the cluster LP for Correlation Clustering, which they argued captures the problem much more succinctly than previous linear programming formulations. However, the cluster LP has exponential size, with a variable for every possible set of vertices in the input graph. Nevertheless, they showed how to find a feasible solution for the cluster LP in time O(npoly(1/ϵ)) with objective value at most (1+ϵ) times the value of an optimal solution for the respective Correlation Clustering instance. Furthermore, they showed how to round a solution to the cluster LP, yielding a (1.437+ϵ)-approximation algorithm for the Correlation Clustering problem. The main technical result of this paper is a new approach to find a feasible solution for the cluster LP with objective value at most (1+ϵ) of the optimum in time O(2poly(1/ϵ) n), where n is the number of vertices in the graph. We also show how to implement the rounding within the same time bounds, thus achieving a fast (1.437+ϵ)-approximation algorithm for the Correlation Clustering problem. This bridges the gap between state-of-the-art methods for approximating Correlation Clustering and the recent focus on fast algorithms.

OriginalsprogEngelsk
TitelSTOC 2025 : Proceedings of the 57th Annual ACM Symposium on Theory of Computing
RedaktørerMichal Koucky, Nikhil Bansal
Vol/bind1
ForlagAssociation for Computing Machinery
Publikationsdato2025
Udgave1
Sider1154-1165
KapitelSESSION: Session 7B
ISBN (Elektronisk)9798400715105
DOI
StatusUdgivet - 2025
Begivenhed57th Annual ACM Symposium on Theory of Computing, STOC 2025 - Prague, Tjekkiet
Varighed: 23 jun. 202527 jun. 2025

Konference

Konference57th Annual ACM Symposium on Theory of Computing, STOC 2025
Land/OmrådeTjekkiet
ByPrague
Periode23/06/202527/06/2025
NavnProceedings of the Annual ACM Symposium on Theory of Computing
ISSN0737-8017

Bibliografisk note

Publisher Copyright:
© 2025 Owner/Author.

Citationsformater