Book
Frequency, Dispersion, Association, and Keyness
Revising and tupleizing corpus-linguistic measures
Stefan Th. Gries
Language:
English
Published/Copyright:
2024
About this book
This book attempts to revisit the main specifically corpus-linguistic statistics/measures the field has been relying on for decades: frequency, dispersion, association, and keyness. It first discusses the purpose of these measures and how they have been measured. It then makes three main proposals. First, many measures of dispersion, association, and keyness are too confounded with frequency, and the book shows how to 'take frequency out of them' to obtain conceptually cleaner and more interpretable measures. Second, many existing measures can be replaced by a simple information-theoretic measure, the Kullback-Leibler divergence, which can likewise have frequency 'removed' from it. Third, corpus linguistics should abandon the tradition of trying to describe its findings with a single number and adopt a tupleization approach instead, using several separate dimensions of information for description and interpretation. The book is written in an informal, hands-on style and comes with its own R package featuring functions, example data, and several thousand lines of code exemplifying all applications.
Table of contents
Chapter 1. Introduction (p. 1)
Chapter 2. A review (p. 12)
Chapter 3. Unification of measures (p. 80)
Chapter 4. The role, and the ‘partialing out’, of frequency (p. 170)
Chapter 5. Tupleization (p. 229)
Chapter 6. What should be next (p. 269)
Chapter 7. Conclusion (p. 304)
References (p. 308)
Index (p. 319)
Publishing information
Pages and Images/Illustrations in book
eBook published on:
July 4, 2024
eBook ISBN:
9789027246813
Main content:
321
Keywords for this book
Corpus linguistics; Computational & corpus linguistics; Theoretical linguistics
Audience(s) for this book
Professional and scholarly