de.mpg.escidoc.pubman.appbase.FacesBean
English
 
Help Guide Disclaimer Contact us Login
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT

Released

Talk

ePETaLS: Online Annotation Tool for Emotional Text Labelling

MPS-Authors
http://pubman.mpdl.mpg.de/cone/persons/resource/persons84285

Volkova,  E
Department Human Perception, Cognition and Action, Max Planck Institute for Biological Cybernetics, Max Planck Society;

http://pubman.mpdl.mpg.de/cone/persons/resource/persons84417

Mstislavski,  A
Department Human Perception, Cognition and Action, Max Planck Institute for Biological Cybernetics, Max Planck Society;

Locator
There are no locators available
Fulltext (public)
There are no public fulltexts available
Supplementary Material (public)
There is no public supplementary material available
Citation

Volkova, E., & Mstislavski, A. (2012). ePETaLS: Online Annotation Tool for Emotional Text Labelling. Talk presented at 22. Tagung der Computerlinguistik-Studierenden (TaCoS 2012). Trier, Germany.


Cite as: http://hdl.handle.net/11858/00-001M-0000-0013-B706-4
Abstract
Text annotation is one of the most popular methods of linguistic data collection. The quality of the resulting corpus is one of the major concerns for the researchers, especially when the annotation process is performed by participants who have not received any specic task-related training. One important factor that can help to ensure high resulting quality is a user-friendly annotation environment. In this talk we present a new annotation system ePETaLS [1] that can help researchers to collect texts annotated for various emotions. Pre-formatted texts can be uploaded onto the system and their annotation can be assigned to a participant, whose task is to mark each phrase in the text with a specic emotion or leave it neutral. For each phrase, the annotator is also asked to assign the emotional forse and mark the word on which the emotional emphasis falls. Before submission the annotation is checked and the user is informed of any missing values. This step help to ensure higher quality of the resulting texts. The time spent on each annotation is also logged which helps to detect outliers who spend extremely little or too much time on their annotation tasks. The resulting annotation is saved in the XML format and is ready for data extraction. Before an annotation procedure can begin, each text is automatically split into small annotation units. These units correspond to short phrases that people would usually pronounce without pausing when they read the text out loud. Each sentence in the text can contain one and more of such units, a typical unit length is three to seven word tokens. This component of ePETaLS is based on supervised machine learning system TiMBL [2] and uses WebLicht [3] for linguistic data extraction, e.g. lemmas, POS, dependency relation, etc. The machine learning algorithm uses a small corpus of texts that were split into phrases by nave participants. The annotation system is at present used for collecting a corpus of fairy tales in English written down by Andrew Lang [4]. Each text is annotated for ten to thirteen emotions. The nal goal of the project is to create an automatic sentiment analysis system for emotional virtual character animation.