Learning mid-level motion features for the recognition of body movements

Sigala, R; Serre, T; Poggio, T; Giese, MA

doi:10.1167/5.8.26

Sigala, R., Serre, T., Poggio, T., & Giese, M. (2005). Learning mid-level motion features for the recognition of body movements. Poster presented at Fifth Annual Meeting of the Vision Sciences Society (VSS 2005), Sarasota, FL, USA.

Item is Released

Basic

Item Permalink: https://hdl.handle.net/11858/00-001M-0000-0013-D46B-7
Version Permalink: https://hdl.handle.net/11858/00-001M-0000-0013-D46C-5

Genre: Poster

Creators

Creators:
Sigala, R¹, Author
Serre, T, Author
Poggio, T, Author
Giese, MA², Author

Affiliations:
¹ Department Physiology of Cognitive Processes, Max Planck Institute for Biological Cybernetics, Max Planck Society, ou_1497798
² Max Planck Institute for Biological Cybernetics, Max Planck Society, ou_1497794

Content

Free keywords: -

Abstract: Body movements are characterized by specific sequences of complex optic flow patterns. Computational models for the perception of static shapes have demonstrated that recognition performance can be significantly improved by choosing an appropriate dictionary of mid-level shape components (see abstract by Serre & Poggio, 2005). Preliminary results suggest that such shape-tuned units are consistent with recent physiological data collected in V4 (see abstract by Cadieu et al., 2005). We test whether the visual recognition of complex body movements from optic flow might also benefit from optimized motion-component units. METHOD: We employ a physiologically inspired learning algorithm to optimize the mid-level motion detectors of a hierarchical model for the recognition of human actions (Giese & Poggio, 2003). In the proposed algorithm, each competing unit is associated with a memory trace that reflects its recent synaptic activity. The model is presented with movies showing a human action (e.g., walking): the trace of units that are behaviorally relevant is increased, while the trace of all other units is decreased. Units whose memory trace falls below a critical threshold are randomly replaced. RESULTS: When presented with movies showing human actions, the model generates a dictionary of mid-level motion-component units that leads to a significant improvement in recognition performance. For the special case of walking, many of the units' preferred stimuli were characterized by horizontal opponent motion, consistent with a recent experimental study showing that opponent horizontal motion is a critical feature for the recognition of these stimuli (Casile & Giese, 2003). CONCLUSION: As for the categorization of static shapes, recognition performance for human actions is improved by choosing optimized mid-level motion features. In addition, the extracted features might predict receptive field properties of complex motion-selective neurons, e.g. in areas MT and MSTl.
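
The trace-based selection step described under METHOD can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the update rule, so the saturating increase, the multiplicative decay, the parameter values (`gain`, `decay`, `threshold`, `initial_trace`) and the helper names `is_relevant` and `new_unit` are all assumptions introduced here for illustration. Units are treated as opaque objects, and behavioral relevance is reduced to a yes/no decision per presentation.

```python
def update_traces(traces, relevant, gain=0.2, decay=0.1):
    """Increase the trace of relevant units and decrease all others.

    Assumed rule: relevant traces move toward 1.0, the rest decay toward 0.
    """
    return [
        t + gain * (1.0 - t) if r else t * (1.0 - decay)
        for t, r in zip(traces, relevant)
    ]


def replace_weak_units(units, traces, threshold, new_unit, initial_trace=0.5):
    """Re-draw every unit whose trace fell below the critical threshold."""
    units, traces = list(units), list(traces)
    for i, t in enumerate(traces):
        if t < threshold:
            units[i] = new_unit()      # hypothetical sampler for a fresh unit
            traces[i] = initial_trace  # the replacement starts with a reset trace
    return units, traces


def learn_dictionary(units, is_relevant, new_unit,
                     n_presentations=50, threshold=0.2, initial_trace=0.5):
    """Run the trace update and replacement once per stimulus presentation."""
    units = list(units)
    traces = [initial_trace] * len(units)
    for _ in range(n_presentations):
        # is_relevant is a hypothetical stand-in for "behaviorally relevant"
        relevant = [is_relevant(u) for u in units]
        traces = update_traces(traces, relevant)
        units, traces = replace_weak_units(
            units, traces, threshold, new_unit, initial_trace
        )
    return units, traces
```

In this sketch a unit that is never relevant loses 10% of its trace per presentation and is re-drawn once the trace drops below the threshold, so the dictionary gradually fills with units that the relevance criterion keeps selecting. In the model described above, `new_unit` would presumably sample a new motion-component template at random, but the abstract does not specify how.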

Details

Language(s):

Dates: Date issued: 2005-09

Publication Status: Issued

Pages: -

Publishing info: -

Table of Contents: -

Rev. Type: -

Identifiers: URI: http://www.journalofvision.org/5/8/26/
DOI: 10.1167/5.8.26
BibTex Citekey: 5546

Degree: -

Event

Title: Fifth Annual Meeting of the Vision Sciences Society (VSS 2005)

Place of Event: Sarasota, FL, USA

Start-/End Date: -
