English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT
 
 
DownloadE-Mail
  A New Workflow for Semi-automatized Annotations: Tests with Long-Form Naturalistic Recordings of Childrens Language Environments

Casillas, M., Bergelson, E., Warlaumont, A. S., Cristia, A., Soderstrom, M., VanDam, M., et al. (2017). A New Workflow for Semi-automatized Annotations: Tests with Long-Form Naturalistic Recordings of Childrens Language Environments. In Proceedings of Interspeech 2017 (pp. 2098-2102). doi:10.21437/Interspeech.2017-1418.

Item is

Files

show Files
hide Files
:
Casillas_etal_2017a.PDF (Publisher version), 307KB
Name:
Casillas_etal_2017a.PDF
Description:
-
OA-Status:
Visibility:
Public
MIME-Type / Checksum:
application/pdf / [MD5]
Technical Metadata:
Copyright Date:
-
Copyright Info:
-
License:
-

Locators

show

Creators

show
hide
 Creators:
Casillas, Marisa1, Author           
Bergelson, Elika2, Author
Warlaumont, Anne S.3, Author
Cristia, Alejandrina4, Author
Soderstrom, Melanie5, Author
VanDam, Mark6, 7, Author
Sloetjes, Han8, Author           
Affiliations:
1Language Development Department, MPI for Psycholinguistics, Max Planck Society, ou_2340691              
2Duke University, ou_persistent22              
3Department of Cognitive and Information Sciences, University of California Merced, ou_persistent22              
4Laboratoire de Sciences Cognitives et Psycholinguistique (ENS, EHESS, CNRS), Département d’Etudes Cognitives, Ecole Normale Supérieure, PSL Research University, ou_persistent22              
5University of Manitoba, ou_persistent22              
6Washington State University, ou_persistent22              
7Hearing Oral Program of Excellence (HOPE) of Spokane, USA, ou_persistent22              
8The Language Archive, MPI for Psycholinguistics, Max Planck Society, ou_530892              

Content

show
hide
Free keywords: daylong recordings, language acquisition, annotation, speech recognition, speaker diarization
 Abstract: Interoperable annotation formats are fundamental to the utility, expansion, and sustainability of collective data repositories.In language development research, shared annotation schemes have been critical to facilitating the transition from raw acoustic data to searchable, structured corpora. Current schemes typically require comprehensive and manual annotation of utterance boundaries and orthographic speech content, with an additional, optional range of tags of interest. These schemes have been enormously successful for datasets on the scale of dozens of recording hours but are untenable for long-format recording corpora, which routinely contain hundreds to thousands of audio hours. Long-format corpora would benefit greatly from (semi-)automated analyses, both on the earliest steps of annotation—voice activity detection, utterance segmentation, and speaker diarization—as well as later steps—e.g., classification-based codes such as child-vs-adult-directed speech, and speech recognition to produce phonetic/orthographic representations. We present an annotation workflow specifically designed for long-format corpora which can be tailored by individual researchers and which interfaces with the current dominant scheme for short-format recordings. The workflow allows semi-automated annotation and analyses at higher linguistic levels. We give one example of how the workflow has been successfully implemented in a large cross-database project.

Details

show
hide
Language(s): eng - English
 Dates: 2017-03-212017-06-022017-03-142017-05-222017
 Publication Status: Published online
 Pages: -
 Publishing info: -
 Table of Contents: -
 Rev. Type: Peer
 Identifiers: DOI: 10.21437/Interspeech.2017-1418
 Degree: -

Event

show
hide
Title: Interspeech 2017
Place of Event: Stockholm, Sweden
Start-/End Date: 2017-08-20 - 2017-08-24

Legal Case

show

Project information

show

Source 1

show
hide
Title: Proceedings of Interspeech 2017
Source Genre: Proceedings
 Creator(s):
Affiliations:
Publ. Info: -
Pages: - Volume / Issue: - Sequence Number: - Start / End Page: 2098 - 2102 Identifier: -