Analysis and Improvement of the Visual Object Detection Pipeline

Hosang, Jan

DetailsÜbersicht

Analysis and Improvement of the Visual Object Detection Pipeline

Hosang, J. (2017). Analysis and Improvement of the Visual Object Detection Pipeline. PhD Thesis, Universität des Saarlandes, Saarbrücken.

Item is Freigegeben

einblenden: alle ausblenden: alle

Basisdaten

einblenden: ausblenden:

Datensatz-Permalink: https://hdl.handle.net/11858/00-001M-0000-002D-8CC9-B Versions-Permalink: https://hdl.handle.net/11858/00-001M-0000-002D-8F91-F

Genre: Hochschulschrift

Dateien

einblenden: Dateien

Externe Referenzen

einblenden:

ausblenden:

externe Referenz:
http://scidok.sulb.uni-saarland.de/doku/lic_ohne_pod.php?la=de (Verlagsvertrag) Open Access Status unbekannt

Beschreibung:
-

OA-Status:
Keine Angabe

externe Referenz:
http://scidok.sulb.uni-saarland.de/volltexte/2017/6908/ (beliebiger Volltext) Open Access Grün

Beschreibung:
-

OA-Status:
Grün

Urheber

einblenden:

ausblenden:

Urheber:
Hosang, Jan^{1, 2}, Autor
Schiele, Bernt¹, Ratgeber
Ferrari, Vittorio³, Gutachter

Affiliations:
1Computer Vision and Multimodal Computing, MPI for Informatics, Max Planck Society, ou_1116547
2International Max Planck Research School, MPI for Informatics, Max Planck Society, Campus E1 4, 66123 Saarbrücken, DE, ou_1116551
3External Organizations, ou_persistent22

Inhalt

einblenden:

ausblenden:

Schlagwörter: -

Zusammenfassung: Visual object detection has seen substantial improvements during the last years due to the possibilities enabled by deep learning. While research on image classification provides continuous progress on how to learn image representations and classifiers jointly, object detection research focuses on identifying how to properly use deep learning technology to effectively localise objects. In this thesis, we analyse and improve different aspects of the commonly used detection pipeline. We analyse ten years of research on pedestrian detection and find that improvement of feature representations was the driving factor. Motivated by this finding, we adapt an end-to-end learned detector architecture from general object detection to pedestrian detection. Our deep network outperforms all previous neural networks for pedestrian detection by a large margin, even without using additional training data. After substantial improvements on pedestrian detection in recent years, we investigate the gap between human performance and state-of-the-art pedestrian detectors. We find that pedestrian detectors still have a long way to go before they reach human performance, and we diagnose failure modes of several top performing detectors, giving direction to future research. As a side-effect we publish new, better localised annotations for the Caltech pedestrian benchmark. We analyse detection proposals as a preprocessing step for object detectors. We establish different metrics and compare a wide range of methods according to these metrics. By examining the relationship between localisation of proposals and final object detection performance, we define and experimentally verify a metric that can be used as a proxy for detector performance. Furthermore, we address a structural weakness of virtually all object detection pipelines: non-maximum suppression. We analyse why it is necessary and what the shortcomings of the most common approach are. To address these problems, we present work to overcome these shortcomings and to replace typical non-maximum suppression with a learnable alternative. The introduced paradigm paves the way to true end-to-end learning of object detectors without any post-processing. In summary, this thesis provides analyses of recent pedestrian detectors and detection proposals, improves pedestrian detection by employing deep neural networks, and presents a viable alternative to traditional non-maximum suppression.

Details

einblenden:

ausblenden:

Sprache(n): eng - English

Datum: Angenommen: 2017-05-02Online veröffentlicht: 2017Erschienen: 2017

Publikationsstatus: Erschienen

Seiten: 205 p.

Ort, Verlag, Ausgabe: Saarbrücken : Universität des Saarlandes

Inhaltsverzeichnis: -

Art der Begutachtung: -

Identifikatoren: BibTex Citekey: Hosangphd17
URN: urn:nbn:de:bsz:291-scidok-69080

Art des Abschluß: Doktorarbeit

Datensatz

Basisdaten

Dateien

Externe Referenzen

Urheber

Inhalt

Details

Veranstaltung

Entscheidung

Projektinformation

Quelle