Learning Smooth Pooling Regions for Visual Recognition

Malinowski, Mateusz; Fritz, Mario

doi:10.5244/C.27.118

Lokale TagsFreigabegeschichteDetailsÜbersicht

Learning Smooth Pooling Regions for Visual Recognition

Malinowski, M., & Fritz, M. (2013). Learning Smooth Pooling Regions for Visual Recognition. In T. Burghardt, D. Damen, W. Mayol-Cuevas, & M. Mirmehdi (Eds.), Electronic Proceedings of the British Machine Vision Conference 2013 (pp. 1-11). Durham: BMVA Press. doi:10.5244/C.27.118.

Item is Freigegeben

einblenden: alle ausblenden: alle

Basisdaten

einblenden: ausblenden:

Datensatz-Permalink: https://hdl.handle.net/11858/00-001M-0000-0018-0C60-C Versions-Permalink: https://hdl.handle.net/11858/00-001M-0000-0024-A214-D

Genre: Konferenzbeitrag

Dateien

einblenden: Dateien

ausblenden: Dateien

:

paper0118.pdf (beliebiger Volltext), 251KB

Öffnen Speichern

Datei-Permalink:
https://hdl.handle.net/11858/00-001M-0000-0018-0CC3-C

Name:
paper0118.pdf

Beschreibung:
-

OA-Status:

Sichtbarkeit:
Öffentlich

MIME-Typ / Prüfsumme:
application/pdf / [MD5]

Technische Metadaten:

Öffnen

Copyright Datum:
2013

Copyright Info:
The copyright of this document resides with its authors. It may be distributed unchanged freely in print or electronic forms.

Lizenz:
-

Externe Referenzen

einblenden:

Urheber

einblenden:

ausblenden:

Urheber:
Malinowski, Mateusz¹, Autor
Fritz, Mario¹, Autor

Affiliations:
1Computer Vision and Multimodal Computing, MPI for Informatics, Max Planck Society, ou_1116547

Inhalt

einblenden:

ausblenden:

Schlagwörter: -

Zusammenfassung: From the early HMAX model to Spatial Pyramid Matching, spatial pooling has played an important role in visual recognition pipelines. By aggregating local statistics, it equips the recognition pipelines with a certain degree of robustness to translation and deformation yet preserving spatial information. Despite of its predominance in current recognition systems, we have seen little progress to fully adapt the pooling strategy to the task at hand. In this paper, we propose a flexible parameterization of the spatial pooling step and learn the pooling regions together with the classifier. We investigate a smoothness regularization term that in conjuncture with an efficient learning scheme makes learning scalable. Our framework can work with both popular pooling operators: sum-pooling and max-pooling. Finally, we show benefits of our approach for object recognition tasks based on visual words and higher level event recognition tasks based on object-bank features. In both cases, we improve over the hand-crafted spatial pooling step showing the importance of its adaptation to the task.

Details

einblenden:

ausblenden:

Sprache(n): eng - English

Datum: Online veröffentlicht: 2013

Publikationsstatus: Online veröffentlicht

Seiten: -

Ort, Verlag, Ausgabe: -

Inhaltsverzeichnis: -

Art der Begutachtung: -

Identifikatoren: BibTex Citekey: 757
DOI: 10.5244/C.27.118

Art des Abschluß: -

Veranstaltung

einblenden:

ausblenden:

Titel: 24th British Machine Vision Conference

Veranstaltungsort: Bristol, UK

Start-/Enddatum: 2013-09-09 - 2013-09-13

ausblenden:

Titel: Electronic Proceedings of the British Machine Vision Conference 2013

Kurztitel : BMVC 2013

Genre der Quelle: Konferenzband

Urheber:
Burghardt, Tilo¹, Herausgeber
Damen, Dima¹, Herausgeber
Mayol-Cuevas, Walterio¹, Herausgeber
Mirmehdi, Majid¹, Herausgeber

Affiliations:
1 External Organizations, ou_persistent22

Ort, Verlag, Ausgabe: Durham : BMVA Press

Seiten: - Band / Heft: - Artikelnummer: 118 Start- / Endseite: 1 - 11 Identifikator: -

Datensatz

Basisdaten

Dateien

Externe Referenzen

Urheber

Inhalt

Details

Veranstaltung

Entscheidung

Projektinformation

Quelle 1