Gu, S., Lillicrap, T., Turner, R. E., Ghahramani, Z., Schölkopf, B., & Levine, S. (2017).
Interpolated Policy Gradient: Merging On-Policy and Off-Policy Gradient Estimation for Deep Reinforcement Learning. In I. Guyon,
U. von Luxburg, S. Bengio, H. Wallach, R. Fergus, S. Vishwanathan, & R. Garnett (Eds.),
Advances in Neural Information Processing Systems 30 (pp. 3849-3858). Curran Associates,
Inc. Retrieved from https://papers.nips.cc/paper/6974-interpolated-policy-gradient-merging-on-policy-and-off-policy-gradient-estimation-for-deep-reinforcement-learning.pdf.