English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT
 
 
DownloadE-Mail
  Automatic identification of language varieties: The case of Portuguese

Zampieri, M., & Gebre, B. G. (2012). Automatic identification of language varieties: The case of Portuguese. In J. Jancsary (Ed.), Proceedings of the Conference on Natural Language Processing 2012, September 19-21, 2012, Vienna (pp. 233-237). Vienna: Österreichischen Gesellschaft für Artificial Intelligende (ÖGAI).

Item is

Files

show Files
hide Files
:
Zampieri_Konvens_2012.pdf (Publisher version), 178KB
Name:
Zampieri_Konvens_2012.pdf
Description:
-
OA-Status:
Visibility:
Public
MIME-Type / Checksum:
application/pdf / [MD5]
Technical Metadata:
Copyright Date:
-
Copyright Info:
-
License:
-

Locators

show

Creators

show
hide
 Creators:
Zampieri, Marcos1, Author
Gebre, Binyam Gebrekidan2, Author           
Affiliations:
1University of Cologne Albertus-Magnus-Platz 1, 50931 Cologne, Germany, ou_persistent22              
2The Language Archive, MPI for Psycholinguistics, Max Planck Society, ou_530892              

Content

show
hide
Free keywords: -
 Abstract: Automatic Language Identification of written texts is a well-established area of research in Computational Linguistics. State-of-the-art algorithms often rely on n-gram character models to identify the correct language of texts, with good results seen for European languages. In this paper we propose the use of a character n-gram model and a word n-gram language model for the automatic classification of two written varieties of Portuguese: European and Brazilian. Results reached 0.998 for accuracy using character 4-grams.

Details

show
hide
Language(s): eng - English
 Dates: 201220122012
 Publication Status: Issued
 Pages: -
 Publishing info: -
 Table of Contents: -
 Rev. Type: -
 Identifiers: -
 Degree: -

Event

show
hide
Title: KONVENS2012 - The 11th Conference on Natural Language Processing
Place of Event: Vienna, Austria
Start-/End Date: 2012-09-19 - 2012-09-21

Legal Case

show

Project information

show

Source 1

show
hide
Title: Proceedings of the Conference on Natural Language Processing 2012, September 19-21, 2012, Vienna
Source Genre: Proceedings
 Creator(s):
Jancsary, Jeremy, Editor
Affiliations:
-
Publ. Info: Vienna : Österreichischen Gesellschaft für Artificial Intelligende (ÖGAI)
Pages: - Volume / Issue: - Sequence Number: - Start / End Page: 233 - 237 Identifier: ISBN: 3-85027-005-X