  A Multimodal Corpus of Expert Gaze and Behavior during Phonetic Segmentation Tasks

Khan, A., Steiner, I., Sugano, Y., Bulling, A., & Macdonald, R. (2017). A Multimodal Corpus of Expert Gaze and Behavior during Phonetic Segmentation Tasks. Retrieved from http://arxiv.org/abs/1712.04798.


Files

arXiv:1712.04798.pdf (Preprint), 490KB
Name: arXiv:1712.04798.pdf
Description: File downloaded from arXiv at 2018-01-31 14:58
Visibility: Public
MIME-Type: application/pdf
Copyright Date: -
Copyright Info: -

Creators

Creators:
Khan, Arif [1], Author
Steiner, Ingmar [1], Author
Sugano, Yusuke [1], Author
Bulling, Andreas [2], Author
Macdonald, Ross [1], Author
Affiliations:
[1] External Organizations, ou_persistent22
[2] Computer Vision and Multimodal Computing, MPI for Informatics, Max Planck Society, ou_1116547

Content

Free keywords: Computer Science, Human-Computer Interaction, cs.HC; Computer Science, Computation and Language, cs.CL
Abstract: Phonetic segmentation is the process of splitting speech into distinct phonetic units. Human experts routinely perform this task manually, analyzing auditory and visual cues in analysis software, which is an extremely time-consuming process. Methods exist for automatic segmentation, but these are not always accurate enough. In order to improve automatic segmentation, we need to model it as closely on manual segmentation as possible. This corpus is an effort to capture human segmentation behavior by recording experts performing a segmentation task. We believe that this data will enable us to highlight the important aspects of manual segmentation, which can be used in automatic segmentation to improve its accuracy.

Details

Language(s): eng - English
Dates: 2017-12-13; 2017
 Publication Status: Published online
 Pages: 4 p.
 Publishing info: -
 Table of Contents: -
 Rev. Type: -
 Identifiers: arXiv: 1712.04798
URI: http://arxiv.org/abs/1712.04798
BibTex Citekey: Khan2017
 Degree: -
