Shetty, R., Rohrbach, M., Hendricks, L. A., Fritz, M., & Schiele, B. (2017). Speaking the Same Language: Matching Machine to Human Captions by Adversarial Training. In IEEE International Conference on Computer Vision (pp. 4155-4164). Piscataway, NJ: IEEE. doi:10.1109/ICCV.2017.398.