Theobald, Martin Databases and Information Systems, MPI for Informatics, Max Planck Society;
Theobald, M., Siddharth, J., & Paepcke, A. (2008). SpotSigs: robust and efficient near duplicate detection in large web collections. In S.-H. Myaeng, D. W. Oard, F. Sebastiani, T.-S. Chua, & M.-K. Leong (Eds.), ACM SIGIR 2008: Thirty-First Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (pp. 563-570). New York, NY: ACM.