The LANGSNAP longitudinal learner corpus
Nicole Tracy-Ventura
Abstract
This chapter describes a new longitudinal corpus of L2 Spanish, which includes spoken and written data from the same learners and was gathered in a residence/study abroad setting. The corpus was created as part of the Languages and Social Networks Abroad Project (LANGSNAP), which traced the learning of both Spanish and French during an academic year’s residence/study abroad by British Anglophone university students majoring in languages. Parallel longitudinal learner corpora totalling 300,000+ words were collected in each language. In this chapter we first describe the main features of the L2 Spanish corpus, and then illustrate how we have been using it to investigate questions about language development, taking as an example the longitudinal development of lexical diversity (Jarvis 2013).
Chapters in this book
- Prelim pages i
- Table of contents v
Section 1. Introduction
- Spanish learner corpus research 3
- What is missing in learner corpus design? 33
Section 2. Compilation, annotation and exploitation of learner corpus data
- Learner Spanish on computer 55
- PoS-tagging a Spanish oral learner corpus 89
- The LANGSNAP longitudinal learner corpus 117
- The Aprescrilov corpus, or broadening the horizon of Spanish language learning in Flanders 143
- Spanish Corpus Proficiency Level Training Website and Corpus 169
Section 3. Analysis of learner corpus data
- Factors that can have an impact on the processes of perceiving Spanish/L2 199
- Pragmatic principles in anaphora resolution at the syntax-discourse interface 235
- Discourse markers in CEDEL2 and SPLLOC corpora of learner Spanish 267
- A corpus study of Spanish as a Foreign Language learners’ collocation production 299
- Index 333