Learning Depth

Sinz, F; Franz, MO; H.H.Bülthoff R.Ulrich F.A.Wichmann, H.A.Mallot

DetailsSummary

Learning Depth

Sinz, F., & Franz, M. (2004). Learning Depth. Poster presented at 7th Tübingen Perception Conference (TWK 2004), Tübingen, Germany.

Item is Released

show all hide all

Basic

show hide

Item Permalink: https://hdl.handle.net/11858/00-001M-0000-0013-D9F7-1 Version Permalink: https://hdl.handle.net/11858/00-001M-0000-0013-D9F8-0

Genre: Poster

Files

show Files

Locators

show

Creators

show

hide

Creators:
Sinz, F¹, Author
Franz, MO², Author
H.H.Bülthoff R.Ulrich F.A.Wichmann, H.A.Mallot, Editor

Affiliations:
1Research Group Computational Vision and Neuroscience, Max Planck Institute for Biological Cybernetics, Max Planck Society, ou_1497805
2Department Empirical Inference, Max Planck Institute for Biological Cybernetics, Max Planck Society, ou_1497795

Content

show

hide

Free keywords: -

Abstract: The depth of a point in space can be estimated by observing its image position from two different viewpoints. The classical approach to stereo vision calculates depth from the two projection equations which together form a stereocamera model. An unavoidable preparatory work for this solution is a calibration procedure, i.e., estimating the external (position and orientation) and internal (focal length, lens distortions etc.) parameters of each camera from a set of points with known spatial position and their corresponding image positions. This is normally done by iteratively linearizing the single camera models and reestimating their parameters according to the error on the known datapoints. The advantage of the classical method is the maximal usage of prior knowledge about the underlying physical processes and the explicit estimation of meaningful model parameters such as focal length or camera position in space. However, the approach neglects the nonlinear nature of the problem such that the results critically depend on the choice of the initial values for the parameters. In this study, we approach the depth estimation problem from a different point of view by applying generic machine learning algorithms to learn the mapping from image coordinates to spatial position. These algorithms do not require any domain knowledge and are able to learn nonlinear functions by mapping the inputs into a higher-dimensional space. Compared to classical calibration, machine learning methods give a direct solution to the depth estimation problem which means that the values of the stereocamera parameters cannot be extracted from the learned mapping. On the poster, we compare the performance of classical camera calibration to that of different machine learning algorithms such as kernel ridge regression, gaussian processes and support vector regression. Our results indicate that generic learning approaches can lead to higher depth accuracies than classical calibration although no domain knowledge is used.

Details

show

hide

Language(s):

Dates: Date issued: 2004-02

Publication Status: Issued

Pages: -

Publishing info: -

Table of Contents: -

Rev. Type: -

Identifiers: URI: http://www.twk.tuebingen.mpg.de/twk04/index.php
BibTex Citekey: 2462

Degree: -

Event

show

hide

Title: 7th Tübingen Perception Conference (TWK 2004)

Place of Event: Tübingen, Germany

Start-/End Date: -

Legal Case

show

Project information

show

Source

show