Hand Shape Recognition Using a ToF Camera : An Application to Sign Language

Simonovsky, Martin

Local TagsRelease HistoryDetailsSummary

Hand Shape Recognition Using a ToF Camera : An Application to Sign Language

Simonovsky, M. (2011). Hand Shape Recognition Using a ToF Camera: An Application to Sign Language. Master Thesis, Universität des Saarlandes, Saarbrücken.

Item is Released

show all hide all

Basic

show hide

Item Permalink: https://hdl.handle.net/11858/00-001M-0000-0010-11B9-2 Version Permalink: https://hdl.handle.net/21.11116/0000-000E-3282-6

Genre: Thesis

Latex : Hand Shape Recognition Using a {ToF} Camera : An Application to Sign Language

Files

show Files

hide Files

:

2011_Martin_Simonovsky_Thesis.pdf (Any fulltext), 4MB

File Permalink:
-

Name:
2011_Martin_Simonovsky_Thesis.pdf

Description:
-

OA-Status:

Visibility:
Restricted (Max Planck Institute for Informatics, MSIN; )

MIME-Type / Checksum:
application/pdf

Technical Metadata:

Copyright Date:
-

Copyright Info:
-

License:
-

Locators

show

Creators

show

hide

Creators:
Simonovsky, Martin^{1, 2}, Author
Theobalt, Christian², Advisor
Müller, Meinard³, Referee

Affiliations:
1International Max Planck Research School, MPI for Informatics, Max Planck Society, ou_1116551
2Computer Graphics, MPI for Informatics, Max Planck Society, ou_40047
3External Organizations, ou_persistent22

Content

show

hide

Free keywords: -

Abstract: This master's thesis investigates the benefit of utilizing depth information acquired by a time-of-flight (ToF) camera for hand shape recognition from unrestricted viewpoints. Specifically, we assess the hypothesis that classical 3D content descriptors might be inappropriate for ToF depth images due to the 2.5D nature and noisiness of the data and possible expensive computations in 3D space. Instead, we extend 2D descriptors to make use of the additional semantics of depth images. Our system is based on the appearance-based retrieval paradigm, using a synthetic 3D hand model to generate its database. The system is able to run at interactive frame rates. For increased robustness, no color, intensity, or time coherence information is used. A novel, domain-specific algorithm for segmenting the forearm from the upper body based on reprojecting the acquired geometry into the lateral view is introduced. Moreover, three kinds of descriptors exploiting depth data are proposed and the made design choices are experimentally supported. The whole system is then evaluated on an American sign language fingerspelling dataset. However, the retrieval performance still leaves room for improvements. Several insights and possible reasons are discussed.

Details

show

hide

Language(s): eng - English

Dates: Modified: 2012-03-22Accepted: 2011-03-30Published Online: 2011Date issued: 2011

Publication Status: Issued

Pages: III, 64 p.

Publishing info: Saarbrücken : Universität des Saarlandes

Table of Contents: -

Rev. Type: -

Identifiers: eDoc: 618894
Other: Local-ID: C125675300671F7B-F6F3ECA002CC444FC1257970006B4EF3-Simonovsky2010

Degree: Master

Event

show

Legal Case

show

Project information

show

Source

show