Partout: A Distributed Engine for Efficient RDF Processing

Galárraga, Luis; Hose, Katja; Schenkel, Ralf

Datensatz

DATENSATZ AKTIONENEXPORT

Zur Ablage hinzufügen

Lokale TagsFreigabegeschichteDetailsÜbersicht

Freigegeben

Zeitschriftenartikel

Partout: A Distributed Engine for Efficient RDF Processing

MPG-Autoren

/persons/resource/persons44469

Galárraga, Luis
Databases and Information Systems, MPI for Informatics, Max Planck Society;

/persons/resource/persons44645

Hose, Katja
Databases and Information Systems, MPI for Informatics, Max Planck Society;

/persons/resource/persons45380

Schenkel, Ralf
Databases and Information Systems, MPI for Informatics, Max Planck Society;

Externe Ressourcen

Es sind keine externen Ressourcen hinterlegt

Volltexte (beschränkter Zugriff)

Für Ihren IP-Bereich sind aktuell keine Volltexte freigegeben.

Volltexte (frei zugänglich)

Es sind keine frei zugänglichen Volltexte in PuRe verfügbar

Ergänzendes Material (frei zugänglich)

Es sind keine frei zugänglichen Ergänzenden Materialien verfügbar

Zitation

Galárraga, L., Hose, K., & Schenkel, R. (2012). Partout: A Distributed Engine for Efficient RDF Processing. arXiv, abs/1212.5636, 1-12. Retrieved from http://arxiv.org/abs/1212.5636.

Zitierlink: https://hdl.handle.net/11858/00-001M-0000-0014-58D2-3

Zusammenfassung

The increasing interest in Semantic Web technologies has led not only to a rapid growth of semantic data on the Web but also to an increasing number of backend applications with already more than a trillion triples in some cases. Confronted with such huge amounts of data and the future growth, existing state-of-the-art systems for storing RDF and processing SPARQL queries are no longer sufficient. In this paper, we introduce Partout, a distributed engine for efficient RDF processing in a cluster of machines. We propose an effective approach for fragmenting RDF data sets based on a query log, allocating the fragments to nodes in a cluster, and finding the optimal configuration. Partout can efficiently handle updates and its query optimizer produces efficient query execution plans for ad-hoc SPARQL queries. Our experiments show the superiority of our approach to state-of-the-art approaches for partitioning and distributed SPARQL query processing.