Semantic Similarity Search on Semistructured Data with the XXL Search Engine

Schenkel, Ralf; Theobald, Anja; Weikum, Gerhard

Schenkel, R., Theobald, A., & Weikum, G. (2005). Semantic Similarity Search on Semistructured Data with the XXL Search Engine. Information Retrieval, 8, 521-545.

Item is Released

Basic

Item Permalink: https://hdl.handle.net/11858/00-001M-0000-000F-27A5-F
Version Permalink: https://hdl.handle.net/11858/00-001M-0000-000F-27A6-D

Genre: Journal Article

Files

SchenkelTW05a.pdf (Any fulltext), 289KB

File Permalink:
-

Name:
SchenkelTW05a.pdf

Description:
-

OA-Status:

Visibility:
Private

MIME-Type / Checksum:
application/pdf

Technical Metadata:

Copyright Date:
-

Copyright Info:
-

License:
-

Locators

Creators

Creators:
Schenkel, Ralf¹, Author
Theobald, Anja¹, Author
Weikum, Gerhard¹, Author

Affiliations:
¹ Databases and Information Systems, MPI for Informatics, Max Planck Society, ou_24018

Content

Free keywords: -

Abstract: Query languages for XML such as XPath or XQuery support Boolean retrieval: a query result is a (possibly restructured) subset of XML elements or entire documents that satisfy the search conditions of the query. This search paradigm works for highly schematic XML data collections such as electronic catalogs. However, for searching information in open environments such as the Web or intranets of large corporations, ranked retrieval is more appropriate: a query result is a ranked list of XML elements in descending order of (estimated) relevance. Web search engines, which are based on the ranked retrieval paradigm, do not, however, consider the additional information and rich annotations provided by the structure of XML documents and their element names. This article presents the XXL search engine, which supports relevance ranking on XML data. XXL is particularly geared toward path queries with wildcards that can span multiple XML collections and contain both exact-match and semantic-similarity search conditions. In addition, ontological information and suitable index structures are used to improve the search efficiency and effectiveness. XXL is fully implemented as a suite of Java classes and servlets. Experiments in the context of the INEX benchmark demonstrate the efficiency of the XXL search engine and underline its effectiveness for ranked retrieval.

Details

Language(s): eng - English

Dates: Modified: 2006-01-17; Date issued: 2005

Publication Status: Issued

Pages: -

Publishing info: -

Table of Contents: -

Rev. Type: Peer

Identifiers: eDoc: 278889
Other: Local-ID: C1256DBF005F876D-FC65F98892BD9DB4C1256F8E006D8AD5-SchenkelTW05a

Degree: -

Source 1

Title: Information Retrieval

Source Genre: Journal

Creator(s):

Affiliations:

Publ. Info: -

Pages: -
Volume / Issue: 8
Sequence Number: -
Start / End Page: 521 - 545
Identifier: ISSN: 1386-4564