P2P Directories for Distributed Web Search: From Each According to His Ability, 
to Each According to His Needs

Bender, Matthias; Michel, Sebastian; Weikum, Gerhard; Barga, Roger S.; Zhou, Xiaofang

Local TagsRelease HistoryDetailsSummary

P2P Directories for Distributed Web Search: From Each According to His Ability, to Each According to His Needs

Bender, M., Michel, S., & Weikum, G. (2006). P2P Directories for Distributed Web Search: From Each According to His Ability, to Each According to His Needs. In Proceedings of the 22nd International Conference on Data Engineering Workshops (ICDEW'06) (pp. 1-10). Los Alamitos, USA: IEEE.

Item is Released

show all hide all

Basic

show hide

Item Permalink: https://hdl.handle.net/11858/00-001M-0000-000F-23B3-2 Version Permalink: https://hdl.handle.net/11858/00-001M-0000-000F-23B4-F

Genre: Conference Paper

Files

show Files

hide Files

:

WIRI.pdf (Any fulltext), 283KB

File Permalink:
-

Name:
WIRI.pdf

Description:
-

OA-Status:

Visibility:
Private

MIME-Type / Checksum:
application/pdf

Technical Metadata:

Copyright Date:
-

Copyright Info:
-

License:
-

Locators

show

Creators

show

hide

Creators:
Bender, Matthias¹, Author
Michel, Sebastian¹, Author
Weikum, Gerhard¹, Author
Barga, Roger S., Editor
Zhou, Xiaofang, Editor

Affiliations:
1Databases and Information Systems, MPI for Informatics, Max Planck Society, ou_24018

Content

show

hide

Free keywords: -

Abstract: A compelling application of peer-to-peer (P2P) system technology would be distributed Web search, where each peer autonomously runs a search engine on a personalized local corpus (e.g., built from a thematically focused Web crawl) and peers collaborate by routing queries to remote peers that can contribute many or particularly good results for these specific queries. Such systems typically rely on a decentralized directory, e.g., built on top of a distributed hash table (DHT), that holds compact, aggregated statistical metadata about the peers which is used to identify promising peers for a particular query. To support an a-priori unlimited number of peers, it is crucial to keep the load on the distributed directory low. Moreover, each peer should ideally tailor its postings to the directory to reflect its particular strengths, such as rich information about specialized topics that no or only few other peers would also cover. This paper addresses this problem by proposing strategies for peers that identify suitable subsets of the most beneficial statistical metadata. We argue that posting a carefully selected subset of metadata can achieve almost the same result quality as a complete metadata directory, for only the most relevant peers are eventually involved in the execution of a given query. Additionally, asking only relevant peers will result in higher precision, as the noise introduced by poor peers is reduced. We have implemented these strategies in our fully operational P2P Web search prototype Minerva, and present experimental results on real-world Web data that show the viability of the strategies and their gains in terms of high search result quality at low networking costs.

Details

show

hide

Language(s): eng - English

Dates: Modified: 2007-04-27Date issued: 2006

Publication Status: Issued

Pages: -

Publishing info: Los Alamitos, USA : IEEE

Table of Contents: -

Rev. Type: -

Identifiers: eDoc: 314518
Other: Local-ID: C1256DBF005F876D-994B56EFCB9347D7C12570EA00418847-WIRI2006

Degree: -

Event

show

hide

Title: Untitled Event

Place of Event: Atlanta, GA, USA

Start-/End Date: 2006-04-03

Legal Case

show

Project information

show

Source 1

show

hide

Title: Proceedings of the 22nd International Conference on Data Engineering Workshops (ICDEW'06)

Source Genre: Proceedings

Creator(s):

Affiliations:

Publ. Info: Los Alamitos, USA : IEEE

Pages: - Volume / Issue: - Sequence Number: - Start / End Page: 1 - 10 Identifier: ISBN: 0-7695-2571-7