  Adaptive Importance Sampling with Automatic Model Selection in Value Function Approximation

Hachiya, H., Akiyama, T., Sugiyama, M., & Peters, J. (2008). Adaptive Importance Sampling with Automatic Model Selection in Value Function Approximation. Proceedings of the Twenty-Third Conference on Artificial Intelligence (AAAI 2008), 1351-1356.

Creators

 Creators:
Hachiya, H. (1), Author
Akiyama, T., Author
Sugiyama, M., Author
Peters, J. (1, 2), Author
Fox, D., Editor
Gomes, C. P., Editor
Affiliations:
(1) Department Empirical Inference, Max Planck Institute for Biological Cybernetics, Max Planck Society, ou_1497795
(2) Dept. Empirical Inference, Max Planck Institute for Intelligent Systems, Max Planck Society, ou_1497647

Content

Free keywords: -
 Abstract: Off-policy reinforcement learning aims to efficiently reuse data samples gathered in the past, which is essential for physically grounded AI because experiments are usually prohibitively expensive. A common approach is to use importance sampling to compensate for the bias caused by the difference between the data-sampling policies and the target policy. However, existing off-policy methods often do not explicitly take the variance of the value function estimators into account, and their performance therefore tends to be unstable. To cope with this problem, we propose an adaptive importance sampling technique that allows us to actively control the trade-off between bias and variance. We further provide a method for optimally determining the trade-off parameter based on a variant of cross-validation. We demonstrate the usefulness of the proposed approach through simulations.
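
The abstract does not spell out the estimator, so the following is only a minimal sketch of the general idea under stated assumptions, not the authors' method. It assumes the trade-off parameter is an exponent nu in [0, 1] applied to the importance ratio (nu = 0 ignores the policy mismatch, nu = 1 is ordinary importance sampling), that the value function is linear in some features, and that nu is chosen by K-fold cross-validation with held-out errors weighted by the full ratio. All function names (`flattened_weights`, `fit_weighted_least_squares`, `select_nu`) are hypothetical.

```python
import numpy as np


def flattened_weights(pi_target, pi_behavior, nu):
    """Importance ratios pi_target / pi_behavior raised to the power nu.

    Assumption: nu = 0 gives uniform weights (low variance, biased),
    nu = 1 gives ordinary importance weights (unbiased, high variance).
    Behavior probabilities are assumed to be strictly positive.
    """
    if not 0.0 <= nu <= 1.0:
        raise ValueError("nu must lie in [0, 1]")
    ratio = np.asarray(pi_target, dtype=float) / np.asarray(pi_behavior, dtype=float)
    return ratio ** nu


def fit_weighted_least_squares(features, targets, weights, ridge=1e-8):
    """Minimise sum_i w_i * (phi_i . theta - y_i)^2 with a tiny ridge term."""
    phi = np.asarray(features, dtype=float)
    y = np.asarray(targets, dtype=float)
    w = np.asarray(weights, dtype=float)
    a = phi.T @ (w[:, None] * phi) + ridge * np.eye(phi.shape[1])
    b = phi.T @ (w * y)
    return np.linalg.solve(a, b)


def select_nu(features, targets, pi_target, pi_behavior, candidates,
              n_folds=5, seed=0):
    """Choose nu by K-fold cross-validation.

    Training uses the flattened weights; the held-out squared error is
    weighted by the full (nu = 1) ratio so it targets the target policy.
    """
    phi = np.asarray(features, dtype=float)
    y = np.asarray(targets, dtype=float)
    full = flattened_weights(pi_target, pi_behavior, 1.0)
    rng = np.random.default_rng(seed)
    folds = np.array_split(rng.permutation(len(y)), n_folds)
    scores = {}
    for nu in candidates:
        w = full ** nu
        fold_errors = []
        for held_out in folds:
            train = np.setdiff1d(np.arange(len(y)), held_out)
            theta = fit_weighted_least_squares(phi[train], y[train], w[train])
            residual = phi[held_out] @ theta - y[held_out]
            fold_errors.append(np.mean(full[held_out] * residual ** 2))
        scores[nu] = float(np.mean(fold_errors))
    best = min(scores, key=scores.get)
    return best, scores
```

Calling `select_nu(features, targets, pi_target, pi_behavior, candidates=[0.0, 0.5, 1.0])` returns the candidate with the lowest cross-validated error together with the score for each candidate. A grid over [0, 1] is used here purely for simplicity.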

Details

Language(s):
 Dates: 2008-07
 Publication Status: Issued
 Pages: -
 Publishing info: -
 Table of Contents: -
 Rev. Type: -
 Identifiers: URI: http://www.aaai.org/Conferences/AAAI/aaai08.php
BibTex Citekey: 5096
 Degree: -

Event

Title: Twenty-Third Conference on Artificial Intelligence
Place of Event: Chicago, IL, USA
Start-/End Date: -

Source 1

Title: Proceedings of the Twenty-Third Conference on Artificial Intelligence (AAAI 2008)
Source Genre: Proceedings
 Creator(s):
Affiliations:
Publ. Info: Menlo Park, CA, USA : AAAI Press
Pages: -
Volume / Issue: -
Sequence Number: -
Start / End Page: 1351 - 1356
Identifier: -