Abstract
We consider the numerical taxonomy problem of fitting a positive distance function D:(S2)→R>0 by a tree metric. We want a tree T with positive edge weights and including S among the vertices so that their distances in T match those in D. A nice application is in evolutionary biology where the tree T aims to approximate the branching process leading to the observed distances in D [Cavalli-Sforza and Edwards 1967]. We consider the total error, that is the sum of distance errors over all pairs of points. We present a deterministic polynomial time algorithm minimizing the total error within a constant factor. We can do this both for general trees, and for the special case of ultrametrics with a root having the same distance to all vertices in S.
The problems are APX-hard, so a constant factor is the best we can hope for in polynomial time. The best previous approximation factor was O((logn)(loglogn)) by Ailon and Charikar [2005] who wrote "Determining whether an O(1) approximation can be obtained is a fascinating question".
The problems are APX-hard, so a constant factor is the best we can hope for in polynomial time. The best previous approximation factor was O((logn)(loglogn)) by Ailon and Charikar [2005] who wrote "Determining whether an O(1) approximation can be obtained is a fascinating question".
Original language | English |
---|---|
Title of host publication | 2021 IEEE 62nd Annual Symposium on Foundations of Computer Science (FOCS) |
Publisher | IEEE |
Publication date | 2022 |
Pages | 1-12 |
ISBN (Electronic) | 978-1-6654-2055-6 |
DOIs | |
Publication status | Published - 2022 |
Event | 62nd Annual IEEE Symposium on Foundations of Computer Science (FOCS 2021)) - Virtual Duration: 7 Feb 2022 → 11 Feb 2022 |
Conference
Conference | 62nd Annual IEEE Symposium on Foundations of Computer Science (FOCS 2021)) |
---|---|
City | Virtual |
Period | 07/02/2022 → 11/02/2022 |