Word Sense Disambiguation for Exploiting Hierarchical Thesauri in Text 
Classification

Mavroeidis, Dimitrios; Tsatsaronis, George; Vazirgiannis, Michalis; Theobald, Martin; Weikum, Gerhard; Jorge, Alípio; Torgo, Luís; Brazdil, Pavel; Camacho, Rui; Joao, Gama

Lokale TagsFreigabegeschichteDetailsÜbersicht

Word Sense Disambiguation for Exploiting Hierarchical Thesauri in Text Classification

Mavroeidis, D., Tsatsaronis, G., Vazirgiannis, M., Theobald, M., & Weikum, G. (2005). Word Sense Disambiguation for Exploiting Hierarchical Thesauri in Text Classification. In Knowledge discovery in databases: PKDD 2005: 9th European Conference on Principles and Practice of Knowledge Discovery in Databases (pp. 181-192). Berlin, Germany: Springer.

Item is Freigegeben

einblenden: alle ausblenden: alle

Basisdaten

einblenden: ausblenden:

Datensatz-Permalink: https://hdl.handle.net/11858/00-001M-0000-000F-2846-E Versions-Permalink: https://hdl.handle.net/11858/00-001M-0000-000F-2848-A

Genre: Konferenzbeitrag

Dateien

einblenden: Dateien

ausblenden: Dateien

:

MavroeidisTVTW-PKDD05.pdf (beliebiger Volltext), 248KB

Datei-Permalink:
-

Name:
MavroeidisTVTW-PKDD05.pdf

Beschreibung:
-

OA-Status:

Sichtbarkeit:
Privat

MIME-Typ / Prüfsumme:
application/pdf

Technische Metadaten:

Copyright Datum:
-

Copyright Info:
-

Lizenz:
-

Externe Referenzen

einblenden:

Urheber

einblenden:

ausblenden:

Urheber:
Mavroeidis, Dimitrios, Autor
Tsatsaronis, George, Autor
Vazirgiannis, Michalis¹, Autor
Theobald, Martin¹, Autor
Weikum, Gerhard¹, Autor
Jorge, Alípio, Herausgeber
Torgo, Luís, Herausgeber
Brazdil, Pavel, Herausgeber
Camacho, Rui, Herausgeber
Joao, Gama², Herausgeber

Affiliations:
1Databases and Information Systems, MPI for Informatics, Max Planck Society, ou_24018
2Max Planck Society, ou_persistent13

Inhalt

einblenden:

ausblenden:

Schlagwörter: -

Zusammenfassung: The introduction of hierarchical thesauri (HT) that contain significant semantic information, has led researchers to investigate their potential for improving performance of the text classification task, extending the traditional “bag of words” representation, incorporating syntactic and semantic relationships among words. In this paper we address this problem by proposing a Word Sense Disambiguation (WSD) approach based on the intuition that word proximity in the document implies proximity also in the HT graph. We argue that the high precision exhibited by our WSD algorithm in various humanly-disambiguated benchmark datasets, is appropriate for the classification task. Moreover, we define a semantic kernel, based on the general concept of GVSM kernels, that captures the semantic relations contained in the hierarchical thesaurus. Finally, we conduct experiments using various corpora achieving a systematic improvement in classification accuracy using the SVM algorithm, especially when the training set is small.

Details

einblenden:

ausblenden:

Sprache(n): eng - English

Datum: Geändert: 2006-06-14Erschienen: 2005

Publikationsstatus: Erschienen

Seiten: -

Ort, Verlag, Ausgabe: -

Inhaltsverzeichnis: -

Art der Begutachtung: -

Identifikatoren: eDoc: 278964
Anderer: Local-ID: C1256DBF005F876D-D51CD9A3529F43CDC12570450049E6BD-MavroeidisTVTW05

Art des Abschluß: -

Veranstaltung

einblenden:

ausblenden:

Titel: Untitled Event

Veranstaltungsort: Porto, Portugal

Start-/Enddatum: 2005-10-03

ausblenden:

Titel: Knowledge discovery in databases: PKDD 2005 : 9th European Conference on Principles and Practice of Knowledge Discovery in Databases

Genre der Quelle: Konferenzband

Urheber:

Affiliations:

Ort, Verlag, Ausgabe: Berlin, Germany : Springer

Seiten: - Band / Heft: - Artikelnummer: - Start- / Endseite: 181 - 192 Identifikator: ISBN: 3-540-29244-6

Quelle 2

einblenden:

ausblenden:

Titel: Lecture Notes in Computer Science

Genre der Quelle: Reihe

Urheber:

Affiliations:

Ort, Verlag, Ausgabe: -

Seiten: - Band / Heft: 3721 Artikelnummer: - Start- / Endseite: - Identifikator: -

Datensatz

Basisdaten

Dateien

Externe Referenzen

Urheber

Inhalt

Details

Veranstaltung

Entscheidung

Projektinformation

Quelle 1

Quelle 2