Wierstra, D., Förster, A., Peters, J., & Schmidhuber, J. (2007). Solving Deep Memory
POMDPs with Recurrent Policy Gradients. In J. Marques de Sá, L. Alexandre, W. Duch, & D. Mandic (Eds.),
Artificial Neural Networks – ICANN 2007: 17th International Conference, Porto, Portugal, September
9-13, 2007 (pp. 697-706). Berlin, Germany: Springer.