  RGBD Semantic Segmentation Using Spatio-Temporal Data-Driven Pooling

He, Y., Chiu, W.-C., Keuper, M., & Fritz, M. (2016). RGBD Semantic Segmentation Using Spatio-Temporal Data-Driven Pooling. Retrieved from http://arxiv.org/abs/1604.02388.

Files

arXiv:1604.02388.pdf (Preprint), 2MB
Name:
arXiv:1604.02388.pdf
Description:
File downloaded from arXiv at 2016-07-15 12:26
OA status:
Visibility:
Public
MIME type / checksum:
application/pdf / [MD5]
Technical metadata:
Copyright date:
-
Copyright info:
-

Creators

Creators:
He, Yang (1), Author
Chiu, Wei-Chen (1), Author
Keuper, Margret (1), Author
Fritz, Mario (1), Author
Affiliations:
(1) Computer Vision and Multimodal Computing, MPI for Informatics, Max Planck Society, ou_1116547

Content

Keywords: Computer Science, Computer Vision and Pattern Recognition, cs.CV
Abstract: Beyond their success in classification, neural networks have recently shown strong results on pixel-wise prediction tasks such as semantic segmentation of RGBD images. However, the deconvolutional layers commonly used to upsample intermediate representations to the full-resolution output still exhibit several failure modes, such as imprecise segmentation boundaries and label mistakes, particularly on large, weakly textured objects (e.g. fridge, whiteboard, door). We attribute these errors in part to the rigid way in which current networks aggregate information, which can be either too local (missing context) or too global (inaccurate boundaries). Therefore, we propose a data-driven pooling layer that integrates with fully convolutional architectures and utilizes boundary detection from RGBD image segmentation approaches. We extend our approach to leverage region-level correspondences across images with an additional temporal pooling stage. We evaluate our approach on the NYU-Depth-V2 dataset, which comprises indoor RGBD video sequences, and compare it to various state-of-the-art baselines. Besides a general improvement over the state of the art, our approach shows particularly good results in terms of the accuracy of the predicted boundaries and in segmenting previously problematic classes.
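
Illustrative note: the abstract's idea of pooling over data-derived regions rather than fixed windows can be sketched roughly as follows. This is a minimal sketch and not the authors' layer. It assumes that per-pixel class scores and an integer region map (for example from an RGBD superpixel segmentation) are already available, and it uses plain averaging as the pooling operation. The function names `region_average_pool` and `predict_labels` are hypothetical.

```python
from collections import defaultdict


def region_average_pool(scores, regions):
    """Average per-pixel class scores inside each region, then write the
    regional mean back to every pixel of that region.

    scores:  H x W x C nested lists of per-pixel class scores
    regions: H x W nested lists of integer region ids (assumed input,
             e.g. produced by an RGBD superpixel segmentation)
    """
    sums = {}
    counts = defaultdict(int)
    for score_row, region_row in zip(scores, regions):
        for pixel_scores, rid in zip(score_row, region_row):
            if rid not in sums:
                sums[rid] = [0.0] * len(pixel_scores)
            for c, s in enumerate(pixel_scores):
                sums[rid][c] += s
            counts[rid] += 1
    # Regional mean of the class scores
    means = {rid: [s / counts[rid] for s in total] for rid, total in sums.items()}
    # Broadcast each regional mean back to the pixels of its region
    return [[list(means[rid]) for rid in region_row] for region_row in regions]


def predict_labels(pooled):
    """Pick the highest-scoring class index at every pixel."""
    return [[max(range(len(p)), key=p.__getitem__) for p in row] for row in pooled]
```

Because every pixel in a region receives the same pooled scores, predicted label changes can only occur at region boundaries, which is the intuition behind tying the pooling to boundary detection. A temporal stage in the same spirit would additionally average over regions matched across frames; that step is not shown here.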

Details

Language(s): eng - English
Date: 2016-04-08, 2016-06-09, 2016
Publication status: Published online
Pages: 16 p.
Place, publisher, edition: -
Table of contents: -
Review type: -
Identifiers: arXiv: 1604.02388
URI: http://arxiv.org/abs/1604.02388
BibTex Citekey: He_arXiv2016
Degree type: -
