English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT
  Word Sense Disambiguation for Exploiting Hierarchical Thesauri in Text Classification

Mavroeidis, D., Tsatsaronis, G., Vazirgiannis, M., Theobald, M., & Weikum, G. (2005). Word Sense Disambiguation for Exploiting Hierarchical Thesauri in Text Classification. In Knowledge discovery in databases: PKDD 2005: 9th European Conference on Principles and Practice of Knowledge Discovery in Databases (pp. 181-192). Berlin, Germany: Springer.

Item is

Files

show Files
hide Files
:
MavroeidisTVTW-PKDD05.pdf (Any fulltext), 248KB
 
File Permalink:
-
Name:
MavroeidisTVTW-PKDD05.pdf
Description:
-
OA-Status:
Visibility:
Private
MIME-Type / Checksum:
application/pdf
Technical Metadata:
Copyright Date:
-
Copyright Info:
-
License:
-

Locators

show

Creators

show
hide
 Creators:
Mavroeidis, Dimitrios, Author
Tsatsaronis, George, Author
Vazirgiannis, Michalis1, Author           
Theobald, Martin1, Author           
Weikum, Gerhard1, Author           
Jorge, Alípio, Editor
Torgo, Luís, Editor
Brazdil, Pavel, Editor
Camacho, Rui, Editor
Joao, Gama2, Editor
Affiliations:
1Databases and Information Systems, MPI for Informatics, Max Planck Society, ou_24018              
2Max Planck Society, ou_persistent13              

Content

show
hide
Free keywords: -
 Abstract: The introduction of hierarchical thesauri (HT) that contain significant semantic information, has led researchers to investigate their potential for improving performance of the text classification task, extending the traditional “bag of words” representation, incorporating syntactic and semantic relationships among words. In this paper we address this problem by proposing a Word Sense Disambiguation (WSD) approach based on the intuition that word proximity in the document implies proximity also in the HT graph. We argue that the high precision exhibited by our WSD algorithm in various humanly-disambiguated benchmark datasets, is appropriate for the classification task. Moreover, we define a semantic kernel, based on the general concept of GVSM kernels, that captures the semantic relations contained in the hierarchical thesaurus. Finally, we conduct experiments using various corpora achieving a systematic improvement in classification accuracy using the SVM algorithm, especially when the training set is small.

Details

show
hide
Language(s): eng - English
 Dates: 2006-06-142005
 Publication Status: Issued
 Pages: -
 Publishing info: -
 Table of Contents: -
 Rev. Type: -
 Identifiers: eDoc: 278964
Other: Local-ID: C1256DBF005F876D-D51CD9A3529F43CDC12570450049E6BD-MavroeidisTVTW05
 Degree: -

Event

show
hide
Title: Untitled Event
Place of Event: Porto, Portugal
Start-/End Date: 2005-10-03

Legal Case

show

Project information

show

Source 1

show
hide
Title: Knowledge discovery in databases: PKDD 2005 : 9th European Conference on Principles and Practice of Knowledge Discovery in Databases
Source Genre: Proceedings
 Creator(s):
Affiliations:
Publ. Info: Berlin, Germany : Springer
Pages: - Volume / Issue: - Sequence Number: - Start / End Page: 181 - 192 Identifier: ISBN: 3-540-29244-6

Source 2

show
hide
Title: Lecture Notes in Computer Science
Source Genre: Series
 Creator(s):
Affiliations:
Publ. Info: -
Pages: - Volume / Issue: 3721 Sequence Number: - Start / End Page: - Identifier: -