Gaussian Processes in Reinforcement Learning

Rasmussen, CE; Kuss, M; Thrun, S.; Saul, L. K.; Schölkopf, B.

Please note that a newer version of this item is available:
https://pure.mpg.de/pubman/item/item_1791914_2

Rasmussen, C., & Kuss, M. (2004). Gaussian Processes in Reinforcement Learning. Advances in Neural Information Processing Systems 16, 751-759.

Item is Released

Basic

Item Permalink: https://hdl.handle.net/11858/00-001M-0000-0013-D8E5-F

Version Permalink: https://hdl.handle.net/11858/00-001M-0000-0013-D8E6-D

Genre: Conference Paper

Creators

Creators:
Rasmussen, CE¹, Author
Kuss, M¹, Author
Thrun, S., Editor
Saul, L. K., Editor
Schölkopf, B., Editor

Affiliations:
¹ Department Empirical Inference, Max Planck Institute for Biological Cybernetics, Max Planck Society, ou_1497795

Content

Free keywords: -

Abstract: We exploit some useful properties of Gaussian process (GP) regression models for reinforcement learning in continuous state spaces and discrete time. We demonstrate how the GP model allows evaluation of the value function in closed form. The resulting policy iteration algorithm is demonstrated on a simple problem with a two-dimensional state space. Further, we speculate that the intrinsic ability of GP models to characterise distributions of functions would allow the method to capture entire distributions over future values instead of merely their expectation, which has traditionally been the focus of much of reinforcement learning.

Details

Language(s):

Dates: Date issued: 2004-06

Publication Status: Issued

Pages: -

Publishing info: -

Table of Contents: -

Rev. Type: -

Identifiers: ISBN: 0-262-20152-6
URI: http://nips.cc/Conferences/2003/
BibTex Citekey: 2287

Degree: -

Event

Title: Seventeenth Annual Conference on Neural Information Processing Systems (NIPS 2003)

Place of Event: Vancouver, BC, Canada

Start-/End Date: -

Source 1

Title: Advances in Neural Information Processing Systems 16

Source Genre: Journal

Creator(s):

Affiliations:

Publ. Info: Cambridge, MA, USA : MIT Press

Pages: -

Volume / Issue: -

Sequence Number: -

Start / End Page: 751 - 759

Identifier: -