Samenvatting
The Euclidean distance is one of the most commonly used distance metrics. Several approximations have been pro-posed in the literature to reduce the complexity of this metric for high-dimensional or large datasets. In this paper, we propose BitBooster, an approximation to the Euclidean distance that can be efficiently computed using binary operations and which can also be applied to the Manhattan distance. The introduced approximation error is shown to be negligible when BitBooster is used for both convex- and density-based clustering. While obtaining clusters of almost the same quality as those obtained with the exact computation, we require only a fraction of the computation time. We demonstrate the superiority of our method to alternative approximations on 960 synthetic and 13 real-world datasets of varying sizes, dimensions and clusters.
Originele taal-2 | Engels |
---|---|
Titel | Proceedings - 2022 IEEE 46th Annual Computers, Software, and Applications Conference, COMPSAC 2022 |
Redacteuren | Hong Va Leong, Sahra Sedigh Sarvestani, Yuuichi Teranishi, Alfredo Cuzzocrea, Hiroki Kashiwazaki, Dave Towey, Ji-Jiang Yang, Hossain Shahriar |
Uitgeverij | Institute of Electrical and Electronics Engineers |
Pagina's | 201-210 |
Aantal pagina's | 10 |
ISBN van elektronische versie | 9781665488105 |
DOI's | |
Status | Gepubliceerd - 27 jun. 2022 |
Evenement | 46th IEEE Annual Computers, Software, and Applications Conference, COMPSAC 2022 - Virtual, Online, Verenigde Staten van Amerika Duur: 27 jun. 2022 → 1 jul. 2022 |
Congres
Congres | 46th IEEE Annual Computers, Software, and Applications Conference, COMPSAC 2022 |
---|---|
Land/Regio | Verenigde Staten van Amerika |
Stad | Virtual, Online |
Periode | 27/06/22 → 1/07/22 |
Bibliografische nota
Publisher Copyright:© 2022 IEEE.