English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT
  Improving Native Language Identification with TF-IDF weighting

Gebre, B. G., Zampieri, M., Wittenburg, P., & Heskes, T. (2013). Improving Native Language Identification with TF-IDF weighting. In Proceedings of the Eighth Workshop on Innovative Use of NLP for Building Educational Applications (pp. 216-223).

Item is

Files

show Files
hide Files
:
W13-1728.pdf (Publisher version), 136KB
Name:
W13-1728.pdf
Description:
-
OA-Status:
Visibility:
Public
MIME-Type / Checksum:
application/pdf / [MD5]
Technical Metadata:
Copyright Date:
-
Copyright Info:
-
License:
-

Locators

show

Creators

show
hide
 Creators:
Gebre, Binyam Gebrekidan1, Author           
Zampieri, Marcos2, Author
Wittenburg, Peter1, Author           
Heskes, Tom3, Author
Affiliations:
1The Language Archive, MPI for Psycholinguistics, Max Planck Society, ou_530892              
2University of Cologne, Cologne, Germany, ou_persistent22              
3Radboud University, Nijmegen, The Netherlands, ou_persistent22              

Content

show
hide
Free keywords: L1 identification,native language
 Abstract: This paper presents a Native Language Identification (NLI) system based on TF-IDF weighting schemes and using linear classifiers - support vector machines, logistic regressions and perceptrons. The system was one of the participants of the 2013 NLI Shared Task in the closed-training track, achieving 0.814 overall accuracy for a set of 11 native languages. This accuracy was only 2.2 percentage points lower than the winner's performance. Furthermore, with subsequent evaluations using 10-fold cross-validation (as given by the organizers) on the combined training and development data, the best average accuracy obtained is 0.8455 and the features that contributed to this accuracy are the TF-IDF of the combined unigrams and bigrams of words.

Details

show
hide
Language(s): eng - English
 Dates: 20132013
 Publication Status: Published online
 Pages: -
 Publishing info: -
 Table of Contents: -
 Rev. Type: -
 Identifiers: -
 Degree: -

Event

show
hide
Title: the 8th NAACL Workshop on Innovative Use of NLP for Building Educational Applications (BEA8)
Place of Event: Atlanta, GA
Start-/End Date: 2013-06-09 - 2013-06-15

Legal Case

show

Project information

show

Source 1

show
hide
Title: Proceedings of the Eighth Workshop on Innovative Use of NLP for Building Educational Applications
Source Genre: Proceedings
 Creator(s):
Affiliations:
Publ. Info: -
Pages: - Volume / Issue: - Sequence Number: - Start / End Page: 216 - 223 Identifier: ISBN: 978-1-937284-47-3