Peters, J. (Department Empirical Inference, Max Planck Institute for Biological Cybernetics, Max Planck Society)
https://papers.nips.cc/paper/3501-fitted-q-iteration-by-advantage-weighted-regression.pdf (Publisher version)
Neumann, G., & Peters, J. (2009). Fitted Q-iteration by Advantage Weighted Regression. In D. Koller, D. Schuurmans, Y. Bengio, & L. Bottou (Eds.), Advances in neural information processing systems 21 (pp. 1177-1184). Red Hook, NY, USA: Curran.