Home Linguistics & Semiotics Cluster Analysis for Corpus Linguistics
book: Cluster Analysis for Corpus Linguistics
Book
Licensed
Unlicensed Requires Authentication

Cluster Analysis for Corpus Linguistics

  • Hermann Moisl
Language: English
Published/Copyright: 2015
Become an author with De Gruyter Brill

About this book

The standard scientific methodology in linguistics is empirical testing of falsifiable hypotheses. As such the process of hypothesis generation is central, and involves formulation of a research question about a domain of interest and statement of a hypothesis relative to it. In corpus linguistics the domain is text, and generation involves abstraction of data from text, data analysis, and formulation of a hypothesis based on inference from the results. Traditionally this process has been paper-based, but the advent of electronic text has increasingly rendered it obsolete both because the size of digital corpora is now at or beyond the limit of what can efficiently be used in the traditional way, and because the complexity of data abstracted from them can be impenetrable to understanding. Linguists are increasingly turning to mathematical and statistical computational methods for help, and cluster analysis is such a method. It is used across the sciences for hypothesis generation by identification of structure in data which are too large or complex, or both, to be interpretable by direct inspection. This book aims to show how cluster analysis can be used for hypothesis generation in corpus linguistics, thereby contributing to a quantitative empirical methodology for the discipline.

  • Describes a range of clustering methods for analysis of data derived from language corpora.
  • Gives an intuitively accessible account of the mathematical concepts which underlie data creation, data transformation, and cluster analysis.

Author / Editor information

Hermann Moisl, Newcastle University, Newcastle upon Tyne, UK.

Supplementary Materials


Publicly Available Download PDF
I

Requires Authentication Unlicensed

Licensed
V

Publicly Available Download PDF
VII

Requires Authentication Unlicensed

Licensed
IX

Requires Authentication Unlicensed

Licensed
1

Requires Authentication Unlicensed

Licensed
7

Requires Authentication Unlicensed

Licensed
17

Requires Authentication Unlicensed

Licensed
153

Requires Authentication Unlicensed

Licensed
251

Requires Authentication Unlicensed

Licensed
277

Requires Authentication Unlicensed

Licensed
301

Requires Authentication Unlicensed

Licensed
303

Requires Authentication Unlicensed

Licensed
311

Requires Authentication Unlicensed

Licensed
359

Publishing information
Pages and Images/Illustrations in book
eBook published on:
February 24, 2015
eBook ISBN:
9783110363814
Hardcover published on:
January 19, 2015
Hardcover ISBN:
9783110350258
Pages and Images/Illustrations in book
Front matter:
15
Main content:
381
Downloaded on 8.12.2025 from https://www.degruyterbrill.com/document/doi/10.1515/9783110363814/html
Scroll to top button