Chapter

Statistical tests for the analysis of learner corpus data

  • Stefan Th. Gries
Publisher: John Benjamins Publishing Company

Abstract

This paper gives an overview of several basic statistical tools in corpus-based SLA research. I first discuss a few issues relevant to the analysis of learner corpus data. Then, I illustrate a few widespread quantitative techniques and statistical visualizations and exemplify them on the basis of corpus data on the genitive alternation (the of-genitive vs. the s-genitive) from German learners and native speakers of English. The statistical methods discussed include a test for differences between frequencies (the chi-squared test), tests for differences between means/medians (the U-test), and a more advanced multifactorial extension, binary logistic regression.
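As a rough illustration of the first method named in the abstract, the following minimal sketch runs a Pearson chi-squared test of independence on a 2x2 table of genitive choice by speaker group. The counts are hypothetical and are not taken from the chapter; the function name `chi_squared_2x2` is likewise an assumption made for this example, and no continuity correction is applied.

```python
import math


def chi_squared_2x2(table):
    """Pearson chi-squared test of independence for a 2x2 table.

    Returns (chi2, p_value). No continuity correction is applied.
    """
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)

    chi2 = 0.0
    for i in range(2):
        for j in range(2):
            # Expected count under independence of rows and columns
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (table[i][j] - expected) ** 2 / expected

    # With df = 1, the chi-squared statistic is a squared standard normal,
    # so the upper-tail probability is erfc(sqrt(chi2 / 2)).
    p_value = math.erfc(math.sqrt(chi2 / 2))
    return chi2, p_value


# Hypothetical counts, NOT from the chapter.
# Rows: German learners, native speakers; columns: of-genitive, s-genitive.
counts = [[60, 40],
          [45, 55]]

chi2, p = chi_squared_2x2(counts)
print(f"chi-squared = {chi2:.3f}, df = 1, p = {p:.4f}")
```

A small p-value here would suggest that the choice of genitive is not independent of speaker group in the toy table. It says nothing about other predictors of the alternation, which is where a multifactorial method such as binary logistic regression comes in.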


Downloaded on 8.9.2025 from https://www.degruyterbrill.com/document/doi/10.1075/scl.59.17gri/html