John Benjamins Publishing Company
Developing corpus interoperability for phonetic investigation of learner corpora
-
and
Abstract
Although automatic analysis and computer-aided annotation tools are being developed, spoken learner corpora are still smaller and less numerous than written learner corpora. This chapter gives a critical overview of some of the phonetic research questions addressed by spoken learner corpora in relation to their annotation schemes and software. Some of their annotation schemes and guidelines are presented and assessed. Corpus design and tools are discussed in relation to some two of the challenges of spoken learner corpora: comparability of data and the potential contribution to prosodic modeling. It is argued that reusability of annotated spoken data and critical statistics should be the real order of the day.
Abstract
Although automatic analysis and computer-aided annotation tools are being developed, spoken learner corpora are still smaller and less numerous than written learner corpora. This chapter gives a critical overview of some of the phonetic research questions addressed by spoken learner corpora in relation to their annotation schemes and software. Some of their annotation schemes and guidelines are presented and assessed. Corpus design and tools are discussed in relation to some two of the challenges of spoken learner corpora: comparability of data and the potential contribution to prosodic modeling. It is argued that reusability of annotated spoken data and critical statistics should be the real order of the day.
Chapters in this book
- Prelim pages i
- Table of contents v
-
Section 1. Introduction
- Introduction 3
- Learner corpora 9
-
Section 2. Compilation, annotation and exchangeability of learner corpus data
- Developing corpus interoperability for phonetic investigation of learner corpora 33
- Learner corpora and second language acquisition 65
- Competing target hypotheses in the Falko corpus 101
-
Section 3. Automatic approaches to the identification of learner language features in learner corpus data
- Using learner corpora for automatic error detection and correction 127
- Automatic suprasegmental parameter extraction in learner corpora 151
- Criterial feature extraction using parallel learner corpora and machine learning 169
-
Section 4. Analysis of learner corpus data
- Phonological acquisition in the French-English interlanguage 207
- Prosody in a contrastive learner corpus 227
- A corpus-based comparison of syntactic complexity in NNS and NS university students’ writing 249
- Analysing coherence in upper-intermediate learner writing 265
- Statistical tests for the analysis of learner corpus data 287
- Index 311
Chapters in this book
- Prelim pages i
- Table of contents v
-
Section 1. Introduction
- Introduction 3
- Learner corpora 9
-
Section 2. Compilation, annotation and exchangeability of learner corpus data
- Developing corpus interoperability for phonetic investigation of learner corpora 33
- Learner corpora and second language acquisition 65
- Competing target hypotheses in the Falko corpus 101
-
Section 3. Automatic approaches to the identification of learner language features in learner corpus data
- Using learner corpora for automatic error detection and correction 127
- Automatic suprasegmental parameter extraction in learner corpora 151
- Criterial feature extraction using parallel learner corpora and machine learning 169
-
Section 4. Analysis of learner corpus data
- Phonological acquisition in the French-English interlanguage 207
- Prosody in a contrastive learner corpus 227
- A corpus-based comparison of syntactic complexity in NNS and NS university students’ writing 249
- Analysing coherence in upper-intermediate learner writing 265
- Statistical tests for the analysis of learner corpus data 287
- Index 311