Shetty, R., Tavakoli, H. R., & Laaksonen, J. (2018). Image and Video Captioning with Augmented Neural Architectures. IEEE MultiMedia, Early Access. doi:10.1109/MMUL.2018.112135923.