Semi-automated Generation of Accurate Ground-Truth for 3D Object Detection

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

1 Downloads (Pure)

Abstract

Visual algorithms for traffic surveillance systems typically locate and observe traffic movement by representing all traffic with 2D boxes. These 2D bounding boxes around vehicles are insufficient to generate accurate real-world locations. However, 3D annotation datasets are not available for training and evaluation of detection for traffic surveillance. Therefore, a new dataset for training the 3D detector is required. We propose and validate seven different annotation configurations for automated generation of 3D box annotations using only camera calibration, scene information (static vanishing points) and existing 2D annotations. The proposed novel Simple Box method does not require segmentation of vehicles and provides a more simple 3D box construction, which assumes a fixed predefined vehicle width and height. The existing KM3D CNN-based 3D detection model is adopted for traffic surveillance, which directly estimates 3D boxes around vehicles in the camera image, by training the detector on the newly generated dataset. The KM3D detector trained with the Simple Box configuration provides the best 3D object detection results, resulting in 51.9% AP3D on this data. The 3D object detector can estimate an accurate 3D box up to a distance of 125 m from the camera, with a median middle point mean error of only 0.5–1.0 m.

Original languageEnglish
Title of host publicationComputer Vision, Imaging and Computer Graphics Theory and Applications
Subtitle of host publication17th International Joint Conference, VISIGRAPP 2022, Virtual Event, February 6–8, 2022, Revised Selected Papers
EditorsA. Augusto de Sousa, Kurt Debattista, Alexis Paljic, Mounia Ziat, Christophe Hurter, Helen Purchase, Giovanni Maria Farinella, Petia Radeva, Kadi Bouatouch
Place of PublicationCham
PublisherSpringer
Pages21-50
Number of pages30
ISBN (Electronic)978-3-031-45725-8
ISBN (Print)978-3-031-45724-1
DOIs
Publication statusPublished - 17 Oct 2023
Event17th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications , VISIGRAPP 2022 - Online, Virtual, Online
Duration: 6 Feb 20228 Feb 2022
Conference number: 17
https://visapp.scitevents.org/?y=2022

Publication series

NameCommunications in Computer and Information Science (CCIS)
Volume1815
ISSN (Print)1865-0929
ISSN (Electronic)1865-0937

Conference

Conference17th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications , VISIGRAPP 2022
Abbreviated titleVISIGRAPP
CityVirtual, Online
Period6/02/228/02/22
Internet address

Keywords

  • 3D object detection
  • Semi-automated annotation
  • Traffic surveillance application

Fingerprint

Dive into the research topics of 'Semi-automated Generation of Accurate Ground-Truth for 3D Object Detection'. Together they form a unique fingerprint.

Cite this