English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT
 
 
DownloadE-Mail
  P2P Directories for Distributed Web Search: From Each According to His Ability, to Each According to His Needs

Bender, M., Michel, S., & Weikum, G. (2006). P2P Directories for Distributed Web Search: From Each According to His Ability, to Each According to His Needs. In Proceedings of the 22nd International Conference on Data Engineering Workshops (ICDEW'06) (pp. 1-10). Los Alamitos, USA: IEEE.

Item is

Files

show Files
hide Files
:
WIRI.pdf (Any fulltext), 283KB
 
File Permalink:
-
Name:
WIRI.pdf
Description:
-
OA-Status:
Visibility:
Private
MIME-Type / Checksum:
application/pdf
Technical Metadata:
Copyright Date:
-
Copyright Info:
-
License:
-

Locators

show

Creators

show
hide
 Creators:
Bender, Matthias1, Author           
Michel, Sebastian1, Author           
Weikum, Gerhard1, Author           
Barga, Roger S., Editor
Zhou, Xiaofang, Editor
Affiliations:
1Databases and Information Systems, MPI for Informatics, Max Planck Society, ou_24018              

Content

show
hide
Free keywords: -
 Abstract: A compelling application of peer-to-peer (P2P) system technology would be distributed Web search, where each peer autonomously runs a search engine on a personalized local corpus (e.g., built from a thematically focused Web crawl) and peers collaborate by routing queries to remote peers that can contribute many or particularly good results for these specific queries. Such systems typically rely on a decentralized directory, e.g., built on top of a distributed hash table (DHT), that holds compact, aggregated statistical metadata about the peers which is used to identify promising peers for a particular query. To support an a-priori unlimited number of peers, it is crucial to keep the load on the distributed directory low. Moreover, each peer should ideally tailor its postings to the directory to reflect its particular strengths, such as rich information about specialized topics that no or only few other peers would also cover. This paper addresses this problem by proposing strategies for peers that identify suitable subsets of the most beneficial statistical metadata. We argue that posting a carefully selected subset of metadata can achieve almost the same result quality as a complete metadata directory, for only the most relevant peers are eventually involved in the execution of a given query. Additionally, asking only relevant peers will result in higher precision, as the noise introduced by poor peers is reduced. We have implemented these strategies in our fully operational P2P Web search prototype Minerva, and present experimental results on real-world Web data that show the viability of the strategies and their gains in terms of high search result quality at low networking costs.

Details

show
hide
Language(s): eng - English
 Dates: 2007-04-272006
 Publication Status: Issued
 Pages: -
 Publishing info: Los Alamitos, USA : IEEE
 Table of Contents: -
 Rev. Type: -
 Identifiers: eDoc: 314518
Other: Local-ID: C1256DBF005F876D-994B56EFCB9347D7C12570EA00418847-WIRI2006
 Degree: -

Event

show
hide
Title: Untitled Event
Place of Event: Atlanta, GA, USA
Start-/End Date: 2006-04-03

Legal Case

show

Project information

show

Source 1

show
hide
Title: Proceedings of the 22nd International Conference on Data Engineering Workshops (ICDEW'06)
Source Genre: Proceedings
 Creator(s):
Affiliations:
Publ. Info: Los Alamitos, USA : IEEE
Pages: - Volume / Issue: - Sequence Number: - Start / End Page: 1 - 10 Identifier: ISBN: 0-7695-2571-7