TY - JOUR
T1 - Preclustering Algorithms for Imprecise Points
AU - Abam, Mohammad Ali
AU - de Berg, Mark
AU - Farahzad, Sina
AU - Haji Mirsadeghi, Mir Omid
AU - Saghafian, Morteza
N1 - Funding Information:
Mark de Berg supported by the Netherlands’ Organisation for Scientific Research (NWO) under project networks (project no. 024.002.003).
Publisher Copyright:
© 2022, The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature.
PY - 2022/6
Y1 - 2022/6
N2 - We study the problem of preclustering a set B of imprecise points in Rd: we wish to cluster the regions specifying the potential locations of the points such that, no matter where the points are located within their regions, the resulting clustering approximates the optimal clustering for those locations. We consider k-center, k-median, and k-means clustering, and obtain the following results. Let B: = { b1, … , bn} be a collection of disjoint balls in Rd, where each ball bi specifies the possible locations of an input point pi. A partition C of B into subsets is called an (f(k) , α) -preclustering (with respect to the specific k-clustering variant under consideration) if (i) C consists of f(k) preclusters, and (ii) for any realization P of the points pi inside their respective balls, the cost of the clustering on P induced by C is at most α times the cost of an optimal k-clustering on P. We call f(k) the size of the preclustering and we call α its approximation ratio. We prove that, even in R1, one may need at least 3 k- 3 preclusters to obtain a bounded approximation ratio—this holds for the k-center, the k-median, and the k-means problem—and we present a (3k, 1) preclustering for the k-center problem in R1. We also present various preclusterings for balls in Rd with d⩾ 2 , including a (3 k, α) -preclustering with α≈ 13.9 for the k-center and the k-median problem, and α≈ 193.9 for the k-means problem.
AB - We study the problem of preclustering a set B of imprecise points in Rd: we wish to cluster the regions specifying the potential locations of the points such that, no matter where the points are located within their regions, the resulting clustering approximates the optimal clustering for those locations. We consider k-center, k-median, and k-means clustering, and obtain the following results. Let B: = { b1, … , bn} be a collection of disjoint balls in Rd, where each ball bi specifies the possible locations of an input point pi. A partition C of B into subsets is called an (f(k) , α) -preclustering (with respect to the specific k-clustering variant under consideration) if (i) C consists of f(k) preclusters, and (ii) for any realization P of the points pi inside their respective balls, the cost of the clustering on P induced by C is at most α times the cost of an optimal k-clustering on P. We call f(k) the size of the preclustering and we call α its approximation ratio. We prove that, even in R1, one may need at least 3 k- 3 preclusters to obtain a bounded approximation ratio—this holds for the k-center, the k-median, and the k-means problem—and we present a (3k, 1) preclustering for the k-center problem in R1. We also present various preclusterings for balls in Rd with d⩾ 2 , including a (3 k, α) -preclustering with α≈ 13.9 for the k-center and the k-median problem, and α≈ 193.9 for the k-means problem.
KW - Clustering
KW - Computational Geometry
KW - Imprecise Points
UR - http://www.scopus.com/inward/record.url?scp=85124459345&partnerID=8YFLogxK
U2 - 10.1007/s00453-022-00929-9
DO - 10.1007/s00453-022-00929-9
M3 - Article
AN - SCOPUS:85124459345
SN - 0178-4617
VL - 84
SP - 1467
EP - 1489
JO - Algorithmica
JF - Algorithmica
IS - 6
ER -