English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT
  Classification and Intelligent Search on Information in XML

Fuhr, N., & Weikum, G. (2002). Classification and Intelligent Search on Information in XML. IEEE Data Engineering Bulletin, 25(1), 51-58.

Item is

Files

show Files
hide Files
:
fuhrW02.pdf (Any fulltext), 48KB
 
File Permalink:
-
Name:
fuhrW02.pdf
Description:
-
OA-Status:
Visibility:
Private
MIME-Type / Checksum:
application/pdf
Technical Metadata:
Copyright Date:
-
Copyright Info:
-
License:
-

Locators

show

Creators

show
hide
 Creators:
Fuhr, Norbert, Author
Weikum, Gerhard1, Author           
Affiliations:
1Databases and Information Systems, MPI for Informatics, Max Planck Society, ou_24018              

Content

show
hide
Free keywords: -
 Abstract: {XML} will be the method of choice for representing all kinds of documents in product catalogs, digital libraries, scientific data repositories, and across the Web. This observation creates high expectations that {XML} will be a major catalyst in constructing the “Semantic Web”. However, merely casting all documents into {XML} format does not necessarily make a document’s semantics explicit and more amenable for effective information searching. Rather, to fully leverage {XML} on a global scale, significant progress is needed on the following issues: 1. providing an easy-to-use yet powerful and efficient search language that combines concepts from current {XML} pattern-matching languages (e.g., {XP}ath, {XQ}uery, etc.) with ontology-backed information-retrievalstyle search result ranking, 2. extracting more semantics from existing document collections by constructing structural and ontological skeletons (e.g., in the form of {DTD}s or {XML} schemas) that describe the data at a higher semantic level and can also facilitate new forms of indexing for efficiency, and 3. classifying existing documents according to a given thematic or personalized, hierarchical ontology to make searching more effective (e.g., exploit relevance feedback) and efficient (e.g., limit the search focus). {CLASSIX}, a joint project of the Universities of Dortmund and the Saarland in Germany, addresses these three issues. We describe our approaches for each of these topics in the remainder of this paper.

Details

show
hide
Language(s): eng - English
 Dates: 2007-04-132002
 Publication Status: Issued
 Pages: -
 Publishing info: -
 Table of Contents: -
 Rev. Type: Peer
 Identifiers: eDoc: 520372
Other: Local-ID: C1256DBF005F876D-49001CE30715F797C12572BC00404DAC-FuhrW02
 Degree: -

Event

show

Legal Case

show

Project information

show

Source 1

show
hide
Title: IEEE Data Engineering Bulletin
Source Genre: Journal
 Creator(s):
Affiliations:
Publ. Info: -
Pages: - Volume / Issue: 25 (1) Sequence Number: - Start / End Page: 51 - 58 Identifier: -