  Predicting the Category and Attributes of Mental Pictures Using Deep Gaze Pooling

Sattar, H., Bulling, A., & Fritz, M. (2016). Predicting the Category and Attributes of Mental Pictures Using Deep Gaze Pooling. Retrieved from http://arxiv.org/abs/1611.10162.

Files

arXiv:1611.10162.pdf (Preprint), 10MB
Name: arXiv:1611.10162.pdf
Description: File downloaded from arXiv at 2016-12-01 11:00
OA-Status:
Visibility: Public
MIME-Type / Checksum: application/pdf / [MD5]
Technical Metadata:
Copyright Date: -
Copyright Info: -

Creators

 Creators:
Sattar, Hosnieh [1], Author
Bulling, Andreas [1], Author
Fritz, Mario [1], Author
Affiliations:
[1] Computer Vision and Multimodal Computing, MPI for Informatics, Max Planck Society, ou_1116547

Content

Free keywords: Quantitative Biology, Neurons and Cognition, q-bio.NC, Computer Science, Computer Vision and Pattern Recognition, cs.CV
 Abstract: Previous work focused on predicting visual search targets from human fixations but, in the real world, a specific target is often not known, e.g. when searching for a present for a friend. In this work we instead study the problem of predicting the mental picture, i.e. only an abstract idea instead of a specific target. This task is significantly more challenging given that mental pictures of the same target category can vary widely depending on personal biases, and given that characteristic target attributes can often not be verbalised explicitly. We instead propose to use gaze information as implicit information on users' mental picture and present a novel gaze pooling layer to seamlessly integrate semantic and localized fixation information into a deep image representation. We show that we can robustly predict both the mental picture's category and its attributes on a novel dataset containing fixation data of 14 users searching for targets on a subset of the DeepFashion dataset. Our results have important implications for future search interfaces and suggest deep gaze pooling as a general-purpose approach for gaze-supported computer vision systems.

Details

Language(s): eng - English
 Dates: 2016-11-27, 2016
 Publication Status: Published online
 Pages: 9 p.
 Publishing info: -
 Table of Contents: -
 Rev. Type: -
 Identifiers: arXiv: 1611.10162
URI: http://arxiv.org/abs/1611.10162
BibTex Citekey: Sattar1611.10162
 Degree: -
