Adaptive Distributed Streaming Similarity Joins

George Siachamis, Kyriakos Psarakis, Marios Fragkoulis, Odysseas Papapetrou, Arie Van Deursen, Asterios Katsifodimos

Onderzoeksoutput: Hoofdstuk in Boek/Rapport/CongresprocedureConferentiebijdrageAcademicpeer review

1 Citaat (Scopus)

Samenvatting

How can we perform similarity joins of multi-dimensional streams in a distributed fashion, achieving low latency? Can we adaptively repartition those streams in order to retain high performance under concept drifts? Current approaches to similarity joins are either restricted to single-node deployments or focus on set-similarity joins, failing to cover the ubiquitous case of metric-space similarity joins. In this paper, we propose the first adaptive distributed streaming similarity join approach that gracefully scales with variable velocity and distribution of multi-dimensional data streams. Our approach can adaptively rebalance the load of nodes in the case of concept drifts, allowing for similarity computations in the general metric space. We implement our approach on top of Apache Flink and evaluate its data partitioning and load balancing schemes on a set of synthetic datasets in terms of latency, comparisons ratio, and data duplication ratio.

Originele taal-2Engels
TitelDEBS 2023 - Proceedings of the 17th ACM International Conference on Distributed and Event-based Systems
UitgeverijAssociation for Computing Machinery, Inc
Pagina's25-36
Aantal pagina's12
ISBN van elektronische versie9798400701221
DOI's
StatusGepubliceerd - 27 jun. 2023
Evenement17th ACM International Conference on Distributed and Event-based Systems, DEBS 2023 - Neuchatel, Zwitserland
Duur: 27 jun. 202330 jun. 2023

Congres

Congres17th ACM International Conference on Distributed and Event-based Systems, DEBS 2023
Land/RegioZwitserland
StadNeuchatel
Periode27/06/2330/06/23

Bibliografische nota

Publisher Copyright:
© 2023 Owner/Author(s).

Financiering

This publication is part of the project number 19708, of the Vidi research programme partly financed by the Dutch Research Council (NWO). It is also partially funded by European Union’s Horizon Europe research and innovation programme, under grant agreement No. 101070122, and by ICAI AI for Fintech Research Lab.

FinanciersFinanciernummer
European Union’s Horizon Europe research and innovation programme101070122
Nederlandse Organisatie voor Wetenschappelijk Onderzoek

    Vingerafdruk

    Duik in de onderzoeksthema's van 'Adaptive Distributed Streaming Similarity Joins'. Samen vormen ze een unieke vingerafdruk.

    Citeer dit