TIED: A Cycle Consistent Encoder-Decoder Model for Text-to-Image Retrieval

Clint Sebastian, Raffaele Imbriaco, Panagiotis Meletis, Gijs Dubbelman, Egor Bondarev, Peter H.N. De With

Onderzoeksoutput: Hoofdstuk in Boek/Rapport/CongresprocedureConferentiebijdrageAcademicpeer review

5 Citaten (Scopus)

Samenvatting

Retrieving specific vehicle tracks by Natural Language (NL)-based descriptions is a convenient way to monitor vehicle movement patterns and traffic-related events. NL-based image retrieval has several applications in smart cities, traffic control, etc. In this work, we propose TIED, a text-to-image encoder-decoder model for the simultaneous extraction of visual and textual information for vehicle track retrieval. The model consists of an encoder network that enforces the two modalities into a common latent space and a decoder network that performs an inverse mapping to the text descriptions. The method exploits visual semantic attributes of a target vehicle along with a cycle-consistency loss. The proposed method employs both intra-modal and inter-modal relationships to improve retrieval performance. Our system yields competitive performance achieving the 7th position in the Natural Language-Based Vehicle Retrieval public track of the 2021 NVIDIA AI City Challenge. We demonstrate that the proposed TIED model obtains six times higher Mean Reciprocal Rank (MRR) than the baseline, achieving an MRR of 15.48. The code and models will be made publicly available.

Originele taal-2Engels
TitelProceedings - 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2021
UitgeverijInstitute of Electrical and Electronics Engineers
Pagina's4133-4141
Aantal pagina's9
ISBN van elektronische versie9781665448994
DOI's
StatusGepubliceerd - 1 sep. 2021
Evenement2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2021 - Nashville, Verenigde Staten van Amerika
Duur: 19 jun. 202125 jun. 2021

Congres

Congres2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2021
Verkorte titelCVPRW 2021
Land/RegioVerenigde Staten van Amerika
StadNashville
Periode19/06/2125/06/21

Bibliografische nota

Publisher Copyright:
© 2021 IEEE.

Vingerafdruk

Duik in de onderzoeksthema's van 'TIED: A Cycle Consistent Encoder-Decoder Model for Text-to-Image Retrieval'. Samen vormen ze een unieke vingerafdruk.

Citeer dit