English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT
  PAC-Bayesian Analysis of Contextual Bandits

Seldin, Y., Auer P, Laviolette F, Shawe-Taylor, J., & Ortner, R. (2012). PAC-Bayesian Analysis of Contextual Bandits. In Advances in Neural Information Processing Systems 24 (pp. 1683-1691). Red Hook, NY, USA: Curran.

Item is

Files

show Files

Locators

show

Creators

show
hide
 Creators:
Seldin, Y1, Author           
Auer P, Laviolette F, Shawe-Taylor, J, Author
Ortner, R, Author
Shawe-Taylor, Editor
J., Editor
Zemel, R.S., Editor
Bartlett, P., Editor
Pereira, F., Editor
Weinberger, K.Q., Editor
Affiliations:
1Department Empirical Inference, Max Planck Institute for Biological Cybernetics, Max Planck Society, ou_1497795              

Content

show
hide
Free keywords: -
 Abstract: We derive an instantaneous (per-round) data-dependent regret bound for stochastic multiarmed bandits with side information (also known as contextual bandits). The scaling of our regret bound with the number of states (contexts) N goes as \sqrtN I_{ρ_t}(S;A)}, where I_{ρ_t}(S;A) is the mutual information between states and actions (the side information) used by the algorithm at round t. If the algorithm uses all the side information, the regret bound scales as \sqrt{N \ln K}, where K is the number of actions (arms). However, if the side information I_{ρ_t}(S;A) is not fully used, the regret bound is significantly tighter. In the extreme case, when I_{ρ_t(S;A) = 0, the dependence on the number of states reduces from linear to logarithmic. Our analysis allows to provide the algorithm large amount of side information, let the algorithm to decide which side information is relevant for the task, and penalize the algorithm only for the side information that it is using de facto. We also present an algorithm for multiarmed bandits with side information with computational complexity that is a linear in the number of actions.

Details

show
hide
Language(s):
 Dates: 2012-01
 Publication Status: Issued
 Pages: -
 Publishing info: -
 Table of Contents: -
 Rev. Type: -
 Identifiers: ISBN: 978-1-618-39599-3
URI: http://nips.cc/Conferences/2011/
BibTex Citekey: SeldinALSO2011
 Degree: -

Event

show
hide
Title: Twenty-Fifth Annual Conference on Neural Information Processing Systems (NIPS 2011)
Place of Event: Granada, Spain
Start-/End Date: -

Legal Case

show

Project information

show

Source 1

show
hide
Title: Advances in Neural Information Processing Systems 24
Source Genre: Proceedings
 Creator(s):
Affiliations:
Publ. Info: Red Hook, NY, USA : Curran
Pages: - Volume / Issue: - Sequence Number: - Start / End Page: 1683 - 1691 Identifier: -