Deutsch
 
Hilfe Datenschutzhinweis Impressum
  DetailsucheBrowse

Datensatz

DATENSATZ AKTIONENEXPORT
  Generating Visual Explanations

Hendricks, L. A., Akata, Z., Rohrbach, M., Donahue, J., Schiele, B., & Darrell, T. (2016). Generating Visual Explanations. In B. Leibe, J. Matas, N. Sebe, & M. Welling (Eds.), Computer Vision -- ECCV 2016 (pp. 3-19). Berlin: Springer. doi:10.1007/978-3-319-46493-0_1.

Item is

Externe Referenzen

einblenden:

Urheber

einblenden:
ausblenden:
 Urheber:
Hendricks, Lisa Anne1, Autor
Akata, Zeynep2, Autor           
Rohrbach, Marcus1, Autor           
Donahue, Jeff1, Autor
Schiele, Bernt2, Autor           
Darrell, Trevor1, Autor
Affiliations:
1External Organizations, ou_persistent22              
2Computer Vision and Multimodal Computing, MPI for Informatics, Max Planck Society, ou_1116547              

Inhalt

einblenden:
ausblenden:
Schlagwörter: Computer Science, Computer Vision and Pattern Recognition, cs.CV,Computer Science, Artificial Intelligence, cs.AI,Computer Science, Computation and Language, cs.CL
 Zusammenfassung: Clearly explaining a rationale for a classification decision to an end-user can be as important as the decision itself. Existing approaches for deep visual recognition are generally opaque and do not output any justification text; contemporary vision-language models can describe image content but fail to take into account class-discriminative image aspects which justify visual predictions. We propose a new model that focuses on the discriminating properties of the visible object, jointly predicts a class label, and explains why the predicted label is appropriate for the image. We propose a novel loss function based on sampling and reinforcement learning that learns to generate sentences that realize a global sentence property, such as class specificity. Our results on a fine-grained bird species classification dataset show that our model is able to generate explanations which are not only consistent with an image but also more discriminative than descriptions produced by existing captioning methods.

Details

einblenden:
ausblenden:
Sprache(n): eng - English
 Datum: 201620162016
 Publikationsstatus: Erschienen
 Seiten: 17 p.
 Ort, Verlag, Ausgabe: -
 Inhaltsverzeichnis: -
 Art der Begutachtung: -
 Identifikatoren: BibTex Citekey: Hendricks2016
DOI: 10.1007/978-3-319-46493-0_1
 Art des Abschluß: -

Veranstaltung

einblenden:
ausblenden:
Titel: 14th European Conference on Computer Vision
Veranstaltungsort: Amsterdam, The Netherlands
Start-/Enddatum: 2016-10-11 - 2016-10-14

Entscheidung

einblenden:

Projektinformation

einblenden:

Quelle 1

einblenden:
ausblenden:
Titel: Computer Vision -- ECCV 2016
  Kurztitel : ECCV 2016
  Untertitel : 14th European Conference ; Amsterdam, The Netherlands, October 11–14, 2016 ; Proceedings, Part IV
Genre der Quelle: Konferenzband
 Urheber:
Leibe, Bastian1, Herausgeber           
Matas, Jiri1, Herausgeber
Sebe, Nicu1, Herausgeber
Welling, Max1, Herausgeber
Affiliations:
1 External Organizations, ou_persistent22            
Ort, Verlag, Ausgabe: Berlin : Springer
Seiten: - Band / Heft: - Artikelnummer: - Start- / Endseite: 3 - 19 Identifikator: ISBN: 978-3-319-46492-3

Quelle 2

einblenden:
ausblenden:
Titel: Lecture Notes in Computer Science
  Kurztitel : LNCS
Genre der Quelle: Reihe
 Urheber:
Affiliations:
Ort, Verlag, Ausgabe: -
Seiten: - Band / Heft: 9908 Artikelnummer: - Start- / Endseite: - Identifikator: -