Multiscale Convolutional Descriptor Aggregation for Visual Place Recognition

Raffaele Imbriaco, Egor Bondarev, Peter H.N. de With

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

1 Citation (Scopus)

Abstract

Visual place recognition using query and database images from different sources remains a challenging task in computer vision. Our method exploits global descriptors for efficient image matching and local descriptors for geometric verification. We present a novel, multi-scale aggregation method for local convolutional descriptors, using memory vector construction for efficient aggregation. The method enables to find preliminary set of image candidate matches and remove visually similar but erroneous candidates. We deploy the multi-scale aggregation for visual place recognition on 3 large-scale datasets. We obtain a Recall@10 larger than 94% for the Pittsburgh dataset, outperforming other popular convolutional descriptors used in image retrieval and place recognition. Additionally, we provide a comparison for these descriptors on a more challenging dataset containing query and database images obtained from different sources, achieving over 77% Recall@10.

Original languageEnglish
Title of host publicationProceedings IS&T International Symposium on Electronic Imaging
Subtitle of host publication Image Processing: Algorithms and Systems XVIII, 2020
Place of PublicationSpringfield
PublisherSociety for Imaging Science and Technology (IS&T)
Number of pages7
DOIs
Publication statusPublished - 26 Jan 2020
Event18th Image Processing: Algorithms and Systems Conference, IPAS 2020 - Burlingame, United States
Duration: 26 Jan 202030 Jan 2020

Publication series

NameElectronic Imaging
Volume32

Conference

Conference18th Image Processing: Algorithms and Systems Conference, IPAS 2020
Country/TerritoryUnited States
CityBurlingame
Period26/01/2030/01/20

Fingerprint

Dive into the research topics of 'Multiscale Convolutional Descriptor Aggregation for Visual Place Recognition'. Together they form a unique fingerprint.

Cite this