A Distributed Algorithm for Large-scale Generalized Matching

Makari, Faraz; Awerbuch, Baruch; Gemulla, Rainer; Khandekar, Rohit; Mestre, Julian; Sozio, Mauro

Datensatz

DATENSATZ AKTIONENEXPORT

Zur Ablage hinzufügen

Lokale TagsFreigabegeschichteDetailsÜbersicht

Freigegeben

Bericht

A Distributed Algorithm for Large-scale Generalized Matching

MPG-Autoren

/persons/resource/persons44974

Makari, Faraz
Databases and Information Systems, MPI for Informatics, Max Planck Society;

/persons/resource/persons44484

Gemulla, Rainer
Databases and Information Systems, MPI for Informatics, Max Planck Society;

/persons/resource/persons44769

Khandekar, Rohit
Algorithms and Complexity, MPI for Informatics, Max Planck Society;

/persons/resource/persons45030

Mestre, Julian
Algorithms and Complexity, MPI for Informatics, Max Planck Society;

/persons/resource/persons45527

Sozio, Mauro
Databases and Information Systems, MPI for Informatics, Max Planck Society;

Externe Ressourcen

Es sind keine externen Ressourcen hinterlegt

Volltexte (beschränkter Zugriff)

Für Ihren IP-Bereich sind aktuell keine Volltexte freigegeben.

Volltexte (frei zugänglich)

MPI-I-2013-5-002.pdf
(beliebiger Volltext), 548KB

Ergänzendes Material (frei zugänglich)

Es sind keine frei zugänglichen Ergänzenden Materialien verfügbar

Zitation

Makari, F., Awerbuch, B., Gemulla, R., Khandekar, R., Mestre, J., & Sozio, M.(2013). A Distributed Algorithm for Large-scale Generalized Matching (MPI-I-2013-5-002). Saarbrücken: Max-Planck-Institut für Informatik.

Zitierlink: https://hdl.handle.net/11858/00-001M-0000-0024-03B4-3

Zusammenfassung

Generalized matching problems arise in a number of applications, including computational advertising, recommender systems, and trade markets. Consider, for example, the problem of recommending multimedia items (e.g., DVDs) to users such that (1) users are recommended items that they are likely to be interested in, (2) every user gets neither too few nor too many recommendations, and (3) only items available in stock are recommended to users. State-of-the-art matching algorithms fail at coping with large real-world instances, which may involve millions of users and items. We propose the first distributed algorithm for computing near-optimal solutions to large-scale generalized matching problems like the one above. Our algorithm is designed to run on a small cluster of commodity nodes (or in a MapReduce environment), has strong approximation guarantees, and requires only a poly-logarithmic number of passes over the input. In particular, we propose a novel distributed algorithm to approximately solve mixed packing-covering linear programs, which include but are not limited to generalized matching problems. Experiments on real-world and synthetic data suggest that our algorithm scales to very large problem sizes and can be orders of magnitude faster than alternative approaches.