A survey on graph-based methods for similarity searches in metric spaces.

Larissa Capobianco Shimomura (Corresponding author), Rafael Seidi Oyamada, Marcos R. Vieira, Daniel S. Kaster (Corresponding author)

Research output: Contribution to journalArticleAcademicpeer-review

2 Citations (Scopus)

Abstract

Technology development has accelerated the volume growth of complex data, such as images, videos, time series, and georeferenced data. Similarity search is a widely used approach to retrieve complex data, which aims at retrieving similar data according to intrinsic characteristics of the data. Therefore, to facilitate the retrieval of complex data using similarity searches, one needs to organize large collections of data in a way that similar data can be retrieved efficiently. Many access methods were proposed in the literature to speed up similarity data retrieval from large databases. Recently, graph-based methods have emerged as a very efficient alternative for similarity retrieval, with reports indicating those methods outperformed other non-graph-based methods in several scenarios. However, to the best of our knowledge, there is no previous work with experimental analysis on a comprehensive number of graph-based methods using the same search algorithm and execution environment. Our main contribution is a survey on graph-based methods used for similarity searches. We present a review on graph-based methods (types of graphs and search algorithms) as well as a detailed discussion on the applicability of search algorithms (with exact or approximate answers) in each graph type. Our main focus is on static methods in metric spaces. This survey also includes an experimental evaluation of representative graphs implemented in a common platform. We evaluate the relative performance behavior of these graphs concerning the main construction and query parameters for a variety of real-world datasets. We also show results using synthetic datasets evaluating the performance of different graph types according to different dataset features. Our experimental results reinforce the tradeoff between graph construction cost and search performance according to the construction and search parameters.
Original languageEnglish
Article number101507
Number of pages22
JournalInformation Systems
Volume95
DOIs
Publication statusPublished - Jan 2021

Bibliographical note

DBLP's bibliographic metadata records provided through http://dblp.org/search/publ/api are distributed under a Creative Commons CC0 1.0 Universal Public Domain Dedication. Although the bibliographic metadata records are provided consistent with CC0 1.0 Dedication, the content described by the metadata records is not. Content may be subject to copyright, rights of privacy, rights of publicity and other restrictions.

Keywords

  • Experimental survey
  • Graph-based methods
  • Metric spaces
  • Proximity graphs
  • Similarity searches

Fingerprint Dive into the research topics of 'A survey on graph-based methods for similarity searches in metric spaces.'. Together they form a unique fingerprint.

Cite this