
John Benjamins Publishing Company


Chapter

Developing corpus interoperability for phonetic investigation of learner corpora


Abstract

Although automatic analysis and computer-aided annotation tools are being developed, spoken learner corpora are still smaller and less numerous than written learner corpora. This chapter gives a critical overview of some of the phonetic research questions addressed by spoken learner corpora in relation to their annotation schemes and software. Some of their annotation schemes and guidelines are presented and assessed. Corpus design and tools are discussed in relation to two of the challenges of spoken learner corpora: comparability of data and the potential contribution to prosodic modeling. It is argued that reusability of annotated spoken data and critical statistics should be the real order of the day.


Chapters in this book

Prelim pages i
Table of contents v
Section 1. Introduction
Introduction 3
Learner corpora 9
Section 2. Compilation, annotation and exchangeability of learner corpus data
Developing corpus interoperability for phonetic investigation of learner corpora 33
Learner corpora and second language acquisition 65
Competing target hypotheses in the Falko corpus 101
Section 3. Automatic approaches to the identification of learner language features in learner corpus data
Using learner corpora for automatic error detection and correction 127
Automatic suprasegmental parameter extraction in learner corpora 151
Criterial feature extraction using parallel learner corpora and machine learning 169
Section 4. Analysis of learner corpus data
Phonological acquisition in the French-English interlanguage 207
Prosody in a contrastive learner corpus 227
A corpus-based comparison of syntactic complexity in NNS and NS university students’ writing 249
Analysing coherence in upper-intermediate learner writing 265
Statistical tests for the analysis of learner corpus data 287
Index 311

Automatic Treatment and Analysis of Learner Corpus Data

This chapter is in the book Automatic Treatment and Analysis of Learner Corpus Data

https://doi.org/10.1075/scl.59.05bal

