English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT
 
 
DownloadE-Mail
  Optimal Reinforcement Learning for Gaussian Systems

Hennig, P. (2012). Optimal Reinforcement Learning for Gaussian Systems. In Advances in Neural Information Processing Systems 24 (pp. 325-333). Red Hook, NY, USA: Curran.

Item is

Files

show Files

Locators

show

Creators

show
hide
 Creators:
Hennig, P1, Author           
Shawe-Taylor, Editor
J., Editor
Zemel, R.S., Editor
Bartlett, P., Editor
Pereira, F., Editor
Weinberger, K.Q., Editor
Affiliations:
1Department Empirical Inference, Max Planck Institute for Biological Cybernetics, Max Planck Society, ou_1497795              

Content

show
hide
Free keywords: -
 Abstract: The exploration-exploitation trade-off is among the central challenges of reinforcement learning. The optimal Bayesian solution is intractable in general. This paper studies to what extent analytic statements about optimal learning are possible if all beliefs are Gaussian processes. A first order approximation of learning of both loss and dynamics, for nonlinear, time-varying systems in continuous time and space, subject to a relatively weak restriction on the dynamics, is described by an infinite-dimensional partial differential equation. An approximate finitedimensional projection gives an impression for how this result may be helpful.

Details

show
hide
Language(s):
 Dates: 2012-01
 Publication Status: Issued
 Pages: -
 Publishing info: -
 Table of Contents: -
 Rev. Type: -
 Identifiers: ISBN: 978-1-618-39599-3
URI: http://nips.cc/Conferences/2011/
BibTex Citekey: Hennig2011
 Degree: -

Event

show
hide
Title: Twenty-Fifth Annual Conference on Neural Information Processing Systems (NIPS 2011)
Place of Event: Granada, Spain
Start-/End Date: -

Legal Case

show

Project information

show

Source 1

show
hide
Title: Advances in Neural Information Processing Systems 24
Source Genre: Proceedings
 Creator(s):
Affiliations:
Publ. Info: Red Hook, NY, USA : Curran
Pages: - Volume / Issue: - Sequence Number: - Start / End Page: 325 - 333 Identifier: -