Abstract
Visual algorithms for traffic surveillance systems typically locate and observe traffic movement by representing all traffic with 2D boxes. These 2D bounding boxes around vehicles are insufficient to generate accurate real-world locations. However, 3D annotation datasets are not available for training and evaluation of detection for traffic surveillance. Therefore, a new dataset for training the 3D detector is required. We propose and validate seven different annotation configurations for automated generation of 3D box annotations using only camera calibration, scene information (static vanishing points) and existing 2D annotations. The proposed novel Simple Box method does not require segmentation of vehicles and provides a more simple 3D box construction, which assumes a fixed predefined vehicle width and height. The existing KM3D CNN-based 3D detection model is adopted for traffic surveillance, which directly estimates 3D boxes around vehicles in the camera image, by training the detector on the newly generated dataset. The KM3D detector trained with the Simple Box configuration provides the best 3D object detection results, resulting in 51.9% AP3D on this data. The 3D object detector can estimate an accurate 3D box up to a distance of 125 m from the camera, with a median middle point mean error of only 0.5–1.0 m.
| Original language | English |
|---|---|
| Title of host publication | Computer Vision, Imaging and Computer Graphics Theory and Applications |
| Subtitle of host publication | 17th International Joint Conference, VISIGRAPP 2022, Virtual Event, February 6–8, 2022, Revised Selected Papers |
| Editors | A. Augusto de Sousa, Kurt Debattista, Alexis Paljic, Mounia Ziat, Christophe Hurter, Helen Purchase, Giovanni Maria Farinella, Petia Radeva, Kadi Bouatouch |
| Place of Publication | Cham |
| Publisher | Springer |
| Pages | 21-50 |
| Number of pages | 30 |
| ISBN (Electronic) | 978-3-031-45725-8 |
| ISBN (Print) | 978-3-031-45724-1 |
| DOIs | |
| Publication status | Published - 17 Oct 2023 |
| Event | 17th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications , VISIGRAPP 2022 - Online, Virtual, Online Duration: 6 Feb 2022 → 8 Feb 2022 Conference number: 17 https://visapp.scitevents.org/?y=2022 |
Publication series
| Name | Communications in Computer and Information Science (CCIS) |
|---|---|
| Volume | 1815 |
| ISSN (Print) | 1865-0929 |
| ISSN (Electronic) | 1865-0937 |
Conference
| Conference | 17th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications , VISIGRAPP 2022 |
|---|---|
| Abbreviated title | VISIGRAPP |
| City | Virtual, Online |
| Period | 6/02/22 → 8/02/22 |
| Internet address |
Keywords
- 3D object detection
- Semi-automated annotation
- Traffic surveillance application
Fingerprint
Dive into the research topics of 'Semi-automated Generation of Accurate Ground-Truth for 3D Object Detection'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver