Statistics in R: First steps

Joost van de Weijer; Dylan Glynn

Chapter

Statistics in R

First steps

Joost van de Weijer and Dylan Glynn

Published by

View more publications by John Benjamins Publishing Company

To Publisher Page

This chapter is in the book Corpus Methods for Semantics

Abstract

The R Project for Statistical Computing is one of the most comprehensive and widely used software options for statistical analysis. Moreover, it is open source, freely available and entirely cross-platform. It is for these reasons that the following chapters all employ R to demonstrate the application and interpretation of statistics. Like the commercially available software SAS, but unlike three other widely used suites (SPSS, Stata, and Statistica), R is principally used in command line. The need to work with commands rather than a graphical user interface can be a challenge for novice users, especially when combined with the task of learning statistics. However, commands given in a step-by-step fashion is arguably simpler than a graphic interface, which can overwhelm the novice user with options. This chapter is an introduction to R focusing on how to import data and make sure those data are in the correct format for analysis. Knowledge of each of these steps is assumed in the following chapters.

You are currently not able to access this content.

Abstract

You are currently not able to access this content.

Chapters in this book

Prelim pages i
Table of contents v
Contributors vii
Outline 1
Section 1. Polysemy and synonymy
Polysemy and synonymy 7
Competing ‘transfer’ constructions in Dutch 39
Rethinking constructional polysemy 61
Quantifying polysemy in Cognitive Sociolinguistics 87
The many uses of run 117
Visualizing distances in a set of near-synonyms 145
A case for the multifactorial assessment of learner language 179
Dutch causative constructions 205
The semasiological structure of Polish myśleć ‘to think’ 223
A multifactorial corpus analysis of grammatical synonymy 253
A diachronic corpus-based multivariate analysis of “I think that” vs. “I think zero” 279
Section 2. Statistical techniques
Techniques and tools 307
Statistics in R 343
Frequency tables 365
Collostructional analysis 391
Cluster analysis 405
Correspondence analysis 443
Logistic regression 487
Name index 535
Subject index 541

https://doi.org/10.1075/hcp.43.13wei

Chapters in this book

Prelim pages i
Table of contents v
Contributors vii
Outline 1
Section 1. Polysemy and synonymy
Polysemy and synonymy 7
Competing ‘transfer’ constructions in Dutch 39
Rethinking constructional polysemy 61
Quantifying polysemy in Cognitive Sociolinguistics 87
The many uses of run 117
Visualizing distances in a set of near-synonyms 145
A case for the multifactorial assessment of learner language 179
Dutch causative constructions 205
The semasiological structure of Polish myśleć ‘to think’ 223
A multifactorial corpus analysis of grammatical synonymy 253
A diachronic corpus-based multivariate analysis of “I think that” vs. “I think zero” 279
Section 2. Statistical techniques
Techniques and tools 307
Statistics in R 343
Frequency tables 365
Collostructional analysis 391
Cluster analysis 405
Correspondence analysis 443
Logistic regression 487
Name index 535
Subject index 541

Statistics in R

Abstract

Chapter PDF View

Abstract

Chapters in this book

Chapters in this book

Chapters in this book