Applying computing innovations to bilingual corpus analysis
-
Diana Carter
, Mirjam Broersma and Kevin Donnelly
Abstract
With current innovations in corpus analysis, it is now possible to extract and analyze large amounts of monolingual and bilingual data in minutes, as opposed to the numerous hours previously needed to manually analyze a much smaller quantum of data. In this chapter, we review innovative techniques in bilingual corpus building and analysis, which include the use of automated glossing to allow the extraction of data that can then be statistically analyzed using mixed-effects models. We discuss the application of these techniques, among others, and provide examples from three bilingual corpora. We end by suggesting how researchers may benefit from the increasingly powerful computing capability that is now available.
Abstract
With current innovations in corpus analysis, it is now possible to extract and analyze large amounts of monolingual and bilingual data in minutes, as opposed to the numerous hours previously needed to manually analyze a much smaller quantum of data. In this chapter, we review innovative techniques in bilingual corpus building and analysis, which include the use of automated glossing to allow the extraction of data that can then be statistically analyzed using mixed-effects models. We discuss the application of these techniques, among others, and provide examples from three bilingual corpora. We end by suggesting how researchers may benefit from the increasingly powerful computing capability that is now available.
Chapters in this book
- Prelim pages i
- Table of contents v
- Preface and acknowledgements 1
- L2 acquisition 3
-
Part I. Parameters and beyond
- Pro-drop then and now 17
- Reference comprehension and production in bilingual Spanish 37
-
Part II. Features and crossroads L1/L2
- When masculine as default supercedes L1 transfer 73
- L2 knowledge of gender and number agreement in Spanish noun ellipsis 99
- Not just algunos , but indeed unos L2ers can acquire scalar implicatures in L2 Spanish 125
-
Part III. Word order and complex structures
- Comprehension of subject and object relative clauses by second language learners of Spanish 149
- On the production of differential object marking and wh -question formation in native and non-native Spanish 187
-
Part IV. Monolingual and bilingual first language acquisition
- In search of the perfect tense 213
- The acquisition of grammatical gender in L1 bilingual Spanish 237
- Applying computing innovations to bilingual corpus analysis 281
- Index 303
Chapters in this book
- Prelim pages i
- Table of contents v
- Preface and acknowledgements 1
- L2 acquisition 3
-
Part I. Parameters and beyond
- Pro-drop then and now 17
- Reference comprehension and production in bilingual Spanish 37
-
Part II. Features and crossroads L1/L2
- When masculine as default supercedes L1 transfer 73
- L2 knowledge of gender and number agreement in Spanish noun ellipsis 99
- Not just algunos , but indeed unos L2ers can acquire scalar implicatures in L2 Spanish 125
-
Part III. Word order and complex structures
- Comprehension of subject and object relative clauses by second language learners of Spanish 149
- On the production of differential object marking and wh -question formation in native and non-native Spanish 187
-
Part IV. Monolingual and bilingual first language acquisition
- In search of the perfect tense 213
- The acquisition of grammatical gender in L1 bilingual Spanish 237
- Applying computing innovations to bilingual corpus analysis 281
- Index 303