English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT
 
 
DownloadE-Mail
  YAWN: A Semantically Annotated Wikipedia XML Corpus

Schenkel, R., Suchanek, F. M., & Kasneci, G. (2007). YAWN: A Semantically Annotated Wikipedia XML Corpus. In A. Kemper, H. Schöning, T. Rose, M. Jarke, T. Seidl, C. Quix, et al. (Eds.), Datenbanksysteme in Business, Technologie und Web (BTW): 12. Fachtagung des GI-Fachbereichs "Datenbanken und Informationssysteme" (pp. 277-291). Bonn, Germany: Gesellschaft für Informatik.

Item is

Files

show Files
hide Files
:
BTW2007_WikiXML.pdf (Any fulltext), 5KB
 
File Permalink:
-
Name:
BTW2007_WikiXML.pdf
Description:
-
OA-Status:
Visibility:
Private
MIME-Type / Checksum:
application/pdf
Technical Metadata:
Copyright Date:
-
Copyright Info:
-
License:
-

Locators

show

Creators

show
hide
 Creators:
Schenkel, Ralf1, Author           
Suchanek, Fabian M.1, Author           
Kasneci, Gjergji1, Author           
Affiliations:
1Databases and Information Systems, MPI for Informatics, Max Planck Society, ou_24018              

Content

show
hide
Free keywords: -
 Abstract: The paper presents YAWN, a system to convert the well-known and widely used Wikipedia collection into an XML corpus with semantically rich, self-explaining tags. We introduce algorithms to annotate pages and links with concepts from the WordNet thesaurus. This annotation process exploits categorical information in Wikipedia, which is a high-quality, manually assigned source of information, extracts additional information from lists, and utilizes the invocations of templates with named parameters. We give examples how such annotations can be exploited for high-precision queries.

Details

show
hide
Language(s): eng - English
 Dates: 2007
 Publication Status: Issued
 Pages: -
 Publishing info: -
 Table of Contents: -
 Rev. Type: -
 Identifiers: eDoc: 356487
Other: Local-ID: C12573CC004A8E26-A8194D1AAD809825C1257233004885E9-SchenkelSK07
 Degree: -

Event

show
hide
Title: Untitled Event
Place of Event: Aachen, Germany
Start-/End Date: 2007-03-07 - 2007-03-09

Legal Case

show

Project information

show

Source 1

show
hide
Title: Datenbanksysteme in Business, Technologie und Web (BTW) : 12. Fachtagung des GI-Fachbereichs "Datenbanken und Informationssysteme"
Source Genre: Proceedings
 Creator(s):
Kemper, Alfons, Editor
Schöning, Harald, Editor
Rose, Thomas, Editor
Jarke, Matthias, Editor
Seidl, Thomas, Editor
Quix, Christoph, Editor
Brochhaus, Christoph, Editor
Affiliations:
-
Publ. Info: Bonn, Germany : Gesellschaft für Informatik
Pages: - Volume / Issue: - Sequence Number: - Start / End Page: 277 - 291 Identifier: ISBN: 978-3-88579-197-3

Source 2

show
hide
Title: GI-Edition / Proceedings
Source Genre: Series
 Creator(s):
Affiliations:
Publ. Info: -
Pages: - Volume / Issue: 103 Sequence Number: - Start / End Page: - Identifier: -