English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT
  Hierarchical Relative Entropy Policy Search

Daniel, C., Neumann, G., & Peters, J. (2012). Hierarchical Relative Entropy Policy Search. In N. Lawrence (Ed.), Proceedings of the 15th International Conference on Artificial Intelligence and Statistics (AISTATS) (pp. 273-281). Cambridge, MA, USA: Microtome Publ.

Item is

Files

show Files

Locators

show

Creators

show
hide
 Creators:
Daniel, C, Author
Neumann, G, Author
Peters, J1, Author           
Affiliations:
1Dept. Empirical Inference, Max Planck Institute for Intelligent Systems, Max Planck Society, ou_1497647              

Content

show
hide
Free keywords: Abt. Schölkopf
 Abstract: {Many real hierarchically structured. The use of this structure in an agent's policy may well be the key to improved scalability and higher performance. However, such hierarchical structures cannot be exploited by current policy search algorithms. We will concentrate on a basic, but highly relevant hierarchy - the `mixed option' policy. Here, a gating network first decides which of the options to execute and, subsequently, the option-policy determines the action. In this paper, we reformulate learning a hierarchical policy as a latent variable estimation problem and subsequently extend th Relative Entropy Policy Search (REPS) to the latent variable case. We show that our Hierarchical REPS can learn versatile solutions while also showing an increased performance in terms of learning speed and quality of the found policy in comparison to the nonhierarchical approach.}

Details

show
hide
Language(s):
 Dates: 2012-04
 Publication Status: Issued
 Pages: -
 Publishing info: -
 Table of Contents: -
 Rev. Type: -
 Identifiers: BibTex Citekey: DanielNP2012
URN: http://jmlr.csail.mit.edu/proceedings/papers/v22/
 Degree: -

Event

show
hide
Title: Fifteenth International Conference on Artificial Intelligence and Statistics (AISTATS 2012)
Place of Event: La Palma, Canary Islands, Spain
Start-/End Date: 2012-04-21 - 2012-04-23

Legal Case

show

Project information

show

Source 1

show
hide
Title: Proceedings of the 15th International Conference on Artificial Intelligence and Statistics (AISTATS)
Source Genre: Proceedings
 Creator(s):
Lawrence, N., Editor
Girolami, M., Author
Affiliations:
-
Publ. Info: Cambridge, MA, USA : Microtome Publ.
Pages: - Volume / Issue: 22 Sequence Number: - Start / End Page: 273 - 281 Identifier: -

Source 2

show
hide
Title: JMLR: Workshop and Conference Proceedings
  Abbreviation : JMLR: W&P
Source Genre: Series
 Creator(s):
Affiliations:
Publ. Info: -
Pages: - Volume / Issue: 22 Sequence Number: - Start / End Page: - Identifier: -