Single network panoptic segmentation for street scene understanding

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

1 Citation (Scopus)
4 Downloads (Pure)

Abstract

In this work, we propose a single deep neural network for panoptic segmentation, for which the goal is to provide each individual pixel of an input image with a class label, as in semantic segmentation, as well as a unique identifier for specific objects in an image, following instance segmentation. Our network makes joint semantic and instance segmentation predictions and combines these to form an output in the panoptic format. This has two main benefits: firstly, the entire panoptic prediction is made in one pass, reducing the required computation time and resources; secondly, by learning the tasks jointly, information is shared between the two tasks, thereby improving performance. Our network is evaluated on two street scene datasets: Cityscapes and Mapillary Vistas. By leveraging information exchange and improving the merging heuristics, we increase the performance of the single network, and achieve a score of 23.9 on the Panoptic Quality (PQ) metric on Mapillary Vistas validation, with an input resolution of 640 × 900 pixels. On Cityscapes validation, our method achieves a PQ score of 45.9 with an input resolution of 512 × 1024 pixels. Moreover, our method decreases the prediction time by a factor of 2 with respect to separate networks.

Original languageEnglish
Title of host publication2019 IEEE Intelligent Vehicles Symposium, IV 2019
Place of PublicationPiscataway
PublisherInstitute of Electrical and Electronics Engineers
Pages709-715
Number of pages7
ISBN (Electronic)978-1-7281-0560-4
DOIs
Publication statusPublished - 1 Jun 2019
Event30th IEEE Intelligent Vehicles Symposium, IV 2019 - Paris, France
Duration: 9 Jun 201912 Jun 2019

Conference

Conference30th IEEE Intelligent Vehicles Symposium, IV 2019
CountryFrance
CityParis
Period9/06/1912/06/19

Fingerprint Dive into the research topics of 'Single network panoptic segmentation for street scene understanding'. Together they form a unique fingerprint.

Cite this