Peters, J., Department Empirical Inference, Max Planck Institute for Biological Cybernetics, Max Planck Society
http://www.scholarpedia.org/article/Policy_gradient_methods (publisher's version)
Peters, J. (2010). Policy gradient methods. Scholarpedia, 5(11), 3698. doi:10.4249/scholarpedia.3698.