The Language Archive: A new hub for language resources

Drude, Sebastian; Broeder, Daan; Trilsbeek, Paul; Wittenburg, Peter

アイテム詳細

登録内容を編集ファイル形式で保存

一時保存へ追加

タグ情報を表示リリース履歴を表示詳細要約

公開

会議論文

The Language Archive: A new hub for language resources

MPS-Authors

/persons/resource/persons39383

Drude, Sebastian
The Language Archive, MPI for Psycholinguistics, Max Planck Society;

/persons/resource/persons14

Broeder, Daan
The Language Archive, MPI for Psycholinguistics, Max Planck Society;

/persons/resource/persons184

Trilsbeek, Paul
The Language Archive, MPI for Psycholinguistics, Max Planck Society;

/persons/resource/persons216

Wittenburg, Peter
The Language Archive, MPI for Psycholinguistics, Max Planck Society;

External Resource

There are no locators available

Fulltext (restricted access)

There are currently no full texts shared for your IP range.

フルテキスト (公開)

Drude_LREC_2012.pdf
(出版社版), 522KB

付随資料 (公開)

There is no public supplementary material available

引用

Drude, S., Broeder, D., Trilsbeek, P., & Wittenburg, P. (2012). The Language Archive: A new hub for language resources. In N., Calzolari (Ed.), Proceedings of LREC 2012: 8th International Conference on Language Resources and Evaluation (pp. 3264-3267). European Language Resources Association (ELRA).

引用: https://hdl.handle.net/11858/00-001M-0000-000F-850E-D

要旨

This contribution presents “The Language Archive” (TLA), a new unit at the MPI for Psycholinguistics, discussing the current developments in management of scientific data, considering the need for new data research infrastructures. Although several initiatives worldwide in the realm of language resources aim at the integration, preservation and mobilization of research data, the state of such scientific data is still often problematic. Data are often not well organized and archived and not described by metadata ― even unique data such as field-work observational data on endangered languages is still mostly on perishable carriers. New data centres are needed that provide trusted, quality-reviewed, persistent services and suitable tools and that take legal and ethical issues seriously. The CLARIN initiative has established criteria for suitable centres. TLA is in a good position to be one of such centres. It is based on three essential pillars: (1) A data archive; (2) management, access and annotation tools; (3) archiving and software expertise for collaborative projects. The archive hosts mostly observational data on small languages worldwide and language acquisition data, but also data resulting from experiments