  Design Alternatives for Large-Scale Web Search: Alexander was Great, Aeneas a Pioneer, and Anakin has the Force

Bender, M., Michel, S., Triantafillou, P., & Weikum, G. (2007). Design Alternatives for Large-Scale Web Search: Alexander was Great, Aeneas a Pioneer, and Anakin has the Force. In LSDS-IR: 1st Workshop on Large-Scale Distributed (pp. 16-22).

Files

LSDSIR2007.pdf (Any fulltext), 5KB
 
File Permalink: -
Name: LSDSIR2007.pdf
Description: -
OA-Status:
Visibility: Private
MIME-Type / Checksum: application/pdf
Technical Metadata:
Copyright Date: -
Copyright Info: -
License: -

Creators

 Creators:
Bender, Matthias (1), Author
Michel, Sebastian (1), Author
Triantafillou, Peter (1), Author
Weikum, Gerhard (1), Author
Affiliations:
(1) Databases and Information Systems, MPI for Informatics, Max Planck Society, ou_24018

Content

Free keywords: -
 Abstract: Indexing the Web and meeting the throughput, response-time, and failure-resilience requirements of a search engine require massive storage and computational resources and a careful system design for scalability. This is exemplified by the big data centers of the leading commercial search engines. Various proposals and debates have appeared in the literature as to whether Web indexes can be implemented in a fully distributed or even peer-to-peer manner without impeding scalability, and different partitioning strategies have been worked out. In this paper, we resume this ongoing discussion by analyzing the design space for distributed Web indexing, considering the influence of partitioning strategies as well as different storage technologies including Flash-RAM. We outline and discuss the pros and cons of three fundamental alternatives, and characterize their total costs for meeting all performance and availability requirements. We give arguments in favor of a system design based on term partitioning over a DHT-based peer-to-peer network with modern top-k query processing and a judiciously designed combination of disk and Flash-RAM storage, and we show that this design has intriguing properties and a very attractive cost/performance ratio.

Details

Language(s): eng - English
 Dates: 2008-02-28; 2007
 Publication Status: Issued
 Pages: -
 Publishing info: n/a
 Table of Contents: -
 Rev. Type: -
 Identifiers: eDoc: 356445
Other: Local-ID: C12573CC004A8E26-58F3F04DDF58733BC125730F003A6FF3-LSDSIR2007
 Degree: -

Event

Title: Untitled Event
Place of Event: Amsterdam, The Netherlands
Start-/End Date: 2007-07-27 - 2007-07-27

Source 1

Title: LSDS-IR : 1st Workshop on Large-Scale Distributed
Source Genre: Proceedings
 Creator(s):
Affiliations:
Publ. Info: n/a
Pages: -
Volume / Issue: -
Sequence Number: -
Start / End Page: 16 - 22
Identifier: -