Home Linguistics & Semiotics Sharing data in small and endangered languages
Chapter
Licensed
Unlicensed Requires Authentication

Sharing data in small and endangered languages

Cataloging and metadata, formats, and encodings
  • Nicholas Thieberger and Michel Jacobson
View more publications by John Benjamins Publishing Company
Language Documentation
This chapter is in the book Language Documentation

Abstract

Speakers of small or ‘under-resourced’ languages often first contact the world of Information Technology via the effort of field linguists. Good practices in linguistic data management include the separation of structure and content and of data and metadata formats. Primary outputs of field research (lexicon, transcripts and interlinear glossed text collections, and their associated media) need to be coded and preserved. Long-term access to these data is addressed by the establishment of archives that also act as the locus for training and advocacy for well-formed data. In this paper we discuss two such archives, one in Australia, the Pacific and Regional Archive for Digital Sources in Endangered Cultures (PARADISEC), and the other in France, the “Archiving Project” from the LACITO/CNRS.

Abstract

Speakers of small or ‘under-resourced’ languages often first contact the world of Information Technology via the effort of field linguists. Good practices in linguistic data management include the separation of structure and content and of data and metadata formats. Primary outputs of field research (lexicon, transcripts and interlinear glossed text collections, and their associated media) need to be coded and preserved. Long-term access to these data is addressed by the establishment of archives that also act as the locus for training and advocacy for well-formed data. In this paper we discuss two such archives, one in Australia, the Pacific and Regional Archive for Digital Sources in Endangered Cultures (PARADISEC), and the other in France, the “Archiving Project” from the LACITO/CNRS.

Downloaded on 21.2.2026 from https://www.degruyterbrill.com/document/doi/10.1075/z.158.15thi/html
Scroll to top button