TY - JOUR
T1 - Finding the closest ultrametric
AU - Di Summa, Marco
AU - Pritchard, David
AU - Sanità, Laura
PY - 2015/1/10
Y1 - 2015/1/10
N2 - Ultrametrics model the pairwise distances between living species, where the distance is measured by hereditary time. Reconstructing the tree from the ultrametric distance data is easy, but only if our data is exact. We consider the NP-complete problem of finding the closest ultrametric to noisy data, as modeled by multiplicative or additive total distortion, with or without a monotonicity assumption on the noise. We obtain approximation ratio O(logn) for multiplicative distortion where n is the number of species, and O(1+(ρ-1)-1) for additive distortion where ρ is the minimum ratio of any two distinct input distances. As part of proving our approximation bound for additive distortion, we give the first constant-factor approximation algorithm for a previously-studied problem called Cluster Deletion.
AB - Ultrametrics model the pairwise distances between living species, where the distance is measured by hereditary time. Reconstructing the tree from the ultrametric distance data is easy, but only if our data is exact. We consider the NP-complete problem of finding the closest ultrametric to noisy data, as modeled by multiplicative or additive total distortion, with or without a monotonicity assumption on the noise. We obtain approximation ratio O(logn) for multiplicative distortion where n is the number of species, and O(1+(ρ-1)-1) for additive distortion where ρ is the minimum ratio of any two distinct input distances. As part of proving our approximation bound for additive distortion, we give the first constant-factor approximation algorithm for a previously-studied problem called Cluster Deletion.
KW - Approximation algorithm
KW - Cluster deletion
KW - Phylogenetic reconstruction
KW - Ultrametric
UR - http://www.scopus.com/inward/record.url?scp=84922246381&partnerID=8YFLogxK
U2 - 10.1016/j.dam.2014.07.023
DO - 10.1016/j.dam.2014.07.023
M3 - Article
AN - SCOPUS:84922246381
SN - 0166-218X
VL - 180
SP - 70
EP - 80
JO - Discrete Applied Mathematics
JF - Discrete Applied Mathematics
ER -