Home Linguistics & Semiotics Challenges in the compilation, annotation, and analysis of learner corpus data
Chapter
Licensed
Unlicensed Requires Authentication

Challenges in the compilation, annotation, and analysis of learner corpus data

  • Marcus Callies
View more publications by John Benjamins Publishing Company
Challenges in Corpus Linguistics
This chapter is in the book Challenges in Corpus Linguistics

Abstract

This chapter highlights and discusses the special characteristics of learner corpus data and the challenges they may present for corpus compilation, annotation, and analysis. Because learner corpus and SLA researchers use their data to study L2 production and development, it is of utmost importance that the data are valid, that is, they represent “authentic” L2 production, which means that the data must stem from the studied learners’ own language production. I discuss challenges in three areas: (1) multilingual practices and metalinguistic language use, (2) lexical and constructional bias, often brought about by the wording of task instructions or writing prompts that learners are asked to respond to, and (3) learner corpus annotation in view of the “discourse of deficit” in SLA. For each of these challenges solutions as to how they can be met are offered.

Abstract

This chapter highlights and discusses the special characteristics of learner corpus data and the challenges they may present for corpus compilation, annotation, and analysis. Because learner corpus and SLA researchers use their data to study L2 production and development, it is of utmost importance that the data are valid, that is, they represent “authentic” L2 production, which means that the data must stem from the studied learners’ own language production. I discuss challenges in three areas: (1) multilingual practices and metalinguistic language use, (2) lexical and constructional bias, often brought about by the wording of task instructions or writing prompts that learners are asked to respond to, and (3) learner corpus annotation in view of the “discourse of deficit” in SLA. For each of these challenges solutions as to how they can be met are offered.

Downloaded on 16.2.2026 from https://www.degruyterbrill.com/document/doi/10.1075/scl.118.04cal/html
Scroll to top button