Sparse Multiscale Gaussian Process Regression

Walder, C; Kim, KI; Schölkopf, B; Cohen,; W., W.; McCallum, A.; Roweis, S. T.

doi:10.1145/1390156.1390296

DetailsSummary

Sparse Multiscale Gaussian Process Regression

Walder, C., Kim, K., & Schölkopf, B. (2008). Sparse Multiscale Gaussian Process Regression. Proceedings of the 25th International Conference on Machine Learning (ICML 2008), 1112-1119.

Item is Released

show all hide all

Basic

show hide

Item Permalink: https://hdl.handle.net/11858/00-001M-0000-0013-C841-F Version Permalink: https://hdl.handle.net/11858/00-001M-0000-0013-C842-D

Genre: Conference Paper

Files

show Files

Locators

show

Creators

show

hide

Creators:
Walder, C^{1, 2}, Author
Kim, KI¹, Author
Schölkopf, B¹, Author
Cohen, Editor
W., W., Editor
McCallum, A., Editor
Roweis, S. T., Editor

Affiliations:
1Department Empirical Inference, Max Planck Institute for Biological Cybernetics, Max Planck Society, ou_1497795
2Department Human Perception, Cognition and Action, Max Planck Institute for Biological Cybernetics, Max Planck Society, ou_1497797

Content

show

hide

Free keywords: -

Abstract: Most existing sparse Gaussian process (g.p.) models seek computational advantages by basing their computations on a set of m basis functions that are the covariance function of the g.p. with one of its two inputs fixed. We generalise this for the case of Gaussian covariance function, by basing our computations on m Gaussian basis functions with arbitrary diagonal covariance matrices (or length scales). For a fixed number of basis functions and any given criteria, this additional flexibility permits approximations no worse and typically better than was previously possible. We perform gradient based optimisation of the marginal likelihood, which costs O(m2n) time where n is the number of data points, and compare the method to various other sparse g.p. methods. Although we focus on g.p. regression, the central idea is applicable to all kernel based algorithms, and we also provide some results for the support vector machine (s.v.m.) and kernel ridge regression (k.r.r.). Our approach outperforms the other methods, particularly for the case of very few basis functions, i.e. a very high sparsity ratio.

Details

show

hide

Language(s):

Dates: Date issued: 2008-07

Publication Status: Issued

Pages: -

Publishing info: -

Table of Contents: -

Rev. Type: -

Identifiers: URI: http://icml2008.cs.helsinki.fi/papers/icml2008proceedings.pdf
DOI: 10.1145/1390156.1390296
BibTex Citekey: 5121

Degree: -

Event

show

hide

Title: 25th International Conference on Machine Learning

Place of Event: Helsinki, Finland

Start-/End Date: -

Legal Case

show

Project information

show

Source 1

show

hide

Title: Proceedings of the 25th International Conference on Machine Learning (ICML 2008)

Source Genre: Journal

Creator(s):

Affiliations:

Publ. Info: New York, NY, USA : ACM Press

Pages: - Volume / Issue: - Sequence Number: - Start / End Page: 1112 - 1119 Identifier: -