Monocular 3D Pose Estimation and Tracking by Detection

Andriluka, Mykhaylo; Roth, Stefan; Schiele, Bernt

doi:10.1109/CVPR.2010.5540156

DetailsSummary

Monocular 3D Pose Estimation and Tracking by Detection

Andriluka, M., Roth, S., & Schiele, B. (2010). Monocular 3D Pose Estimation and Tracking by Detection. In 2010 IEEE Conference on Computer Vision and Pattern Recognition (pp. 623-630). Piscataway, NJ: IEEE. doi:10.1109/CVPR.2010.5540156.

Item is Released

show all hide all

Basic

show hide

Item Permalink: https://hdl.handle.net/11858/00-001M-0000-000F-15C6-7 Version Permalink: https://hdl.handle.net/11858/00-001M-0000-0024-30F6-9

Genre: Conference Paper

Latex : Monocular {3D} Pose Estimation and Tracking by Detection

Files

show Files

hide Files

:

cvpr10andriluka.pdf (Any fulltext), 8MB

File Permalink:
-

Name:
cvpr10andriluka.pdf

Description:
-

OA-Status:

Visibility:
Private

MIME-Type / Checksum:
application/pdf

Technical Metadata:

Copyright Date:
-

Copyright Info:
-

License:
-

Locators

show

Creators

show

hide

Creators:
Andriluka, Mykhaylo¹, Author
Roth, Stefan¹, Author
Schiele, Bernt², Author

Affiliations:
1External Organizations, ou_persistent22
2Computer Vision and Multimodal Computing, MPI for Informatics, Max Planck Society, ou_1116547

Content

show

hide

Free keywords: -

Abstract: Automatic recovery of 3D human pose from monocular image sequences is a challenging and important research topic with numerous applications. Although current methods are able to recover 3D pose for a single person in controlled environments, they are severely challenged by real-world scenarios, such as crowded street scenes. To address this problem, we propose a three-stage process building on a number of recent advances. The first stage obtains an initial estimate of the 2D articulation and viewpoint of the person from single frames. The second stage allows early data association across frames based on tracking-by-detection. These two stages successfully accumulate the available 2D image evidence into robust estimates of 2D limb positions over short image sequences (= tracklets). The third and final stage uses those tracklet-based estimates as robust image observations to reliably recover 3D pose. We demonstrate state-of-the-art performance on the HumanEva II benchmark, and also show the applicability of our approach to articulated 3D tracking in realistic street conditions.

Details

show

hide

Language(s): eng - English

Dates: Published Online: 2010Date issued: 2010

Publication Status: Issued

Pages: -

Publishing info: -

Table of Contents: -

Rev. Type: -

Identifiers: eDoc: 536682
BibTex Citekey: 420
DOI: 10.1109/CVPR.2010.5540156
Other: Local-ID: C12576EE0048963A-205A6C785D49C581C125781B0044271E-andriluka10cvpr

Degree: -

Event

show

hide

Title: 2010 IEEE Conference on Computer Vision and Pattern Recognition

Place of Event: San Francisco, USA

Start-/End Date: 2010-06-15 - 2010-06-17

Legal Case

show

Project information

show

Source 1

show

hide

Title: 2010 IEEE Conference on Computer Vision and Pattern Recognition

Abbreviation : CVPR 2010

Source Genre: Proceedings

Creator(s):

Affiliations:

Publ. Info: Piscataway, NJ : IEEE

Pages: - Volume / Issue: - Sequence Number: - Start / End Page: 623 - 630 Identifier: ISBN: 978-1-4244-6984-0