Morphological complexity and unsupervised learning
-
Dunstan Brown
Abstract
This paper addresses the question of whether it is possible to use machine learning techniques on linguistic data to validate linguistic theory. We determine how readily inflectional classes recognized by linguists can be inferred by an unsupervised learning method when it is presented with the paradigms of a small number (80) of high frequency Russian noun lexemes. We interpret this as a measure of the validity of the linguistic theory. Inflectional classes are of particular interest, because they constitute a kind of autonomous morphological complexity that has no direct relationship to other levels of linguistic description, and hence there is no other objective way of assessing a theoretical characterization of them. Using the same method, we also examine the status of principal parts and defaults in inflectional classes, and the relationship between inflectional classes and stress in Russian nominal morphology. Our experiments suggest that this is an effective and interesting technique for shedding additional light on theoretical claims.
Abstract
This paper addresses the question of whether it is possible to use machine learning techniques on linguistic data to validate linguistic theory. We determine how readily inflectional classes recognized by linguists can be inferred by an unsupervised learning method when it is presented with the paradigms of a small number (80) of high frequency Russian noun lexemes. We interpret this as a measure of the validity of the linguistic theory. Inflectional classes are of particular interest, because they constitute a kind of autonomous morphological complexity that has no direct relationship to other levels of linguistic description, and hence there is no other objective way of assessing a theoretical characterization of them. Using the same method, we also examine the status of principal parts and defaults in inflectional classes, and the relationship between inflectional classes and stress in Russian nominal morphology. Our experiments suggest that this is an effective and interesting technique for shedding additional light on theoretical claims.
Chapters in this book
- Prelim pages i
- Table of contents v
- Foreword & acknowledgments vii
- Editors’ introduction ix
-
Part I. Regularity, irregularity, and analogy
- Arguments from Lovari loan-verb adaptation for an analogy-based analysis of verbal systems 3
- Possible and impossible variation in Hungarian 23
- Variation in the possessive allomorphy of Hungarian 51
- Revisiting exocentricity in compounding 65
- A constructionist account of the Modern Dutch adnominal genitive 83
-
Part II. The role of frequency in morphological complexity, morphological change and language acquisition
- Perspectives on morphological complexity 107
- Morphological complexity and unsupervised learning 135
- A working typology of multiple exponence 163
- Linguistic self-regulation 189
- Suffix predictability and stem transparency in the acquisition of German noun plurals 217
- Acquisition of German diminutive formation and compounding in a comparative perspective 237
- Index 265
Chapters in this book
- Prelim pages i
- Table of contents v
- Foreword & acknowledgments vii
- Editors’ introduction ix
-
Part I. Regularity, irregularity, and analogy
- Arguments from Lovari loan-verb adaptation for an analogy-based analysis of verbal systems 3
- Possible and impossible variation in Hungarian 23
- Variation in the possessive allomorphy of Hungarian 51
- Revisiting exocentricity in compounding 65
- A constructionist account of the Modern Dutch adnominal genitive 83
-
Part II. The role of frequency in morphological complexity, morphological change and language acquisition
- Perspectives on morphological complexity 107
- Morphological complexity and unsupervised learning 135
- A working typology of multiple exponence 163
- Linguistic self-regulation 189
- Suffix predictability and stem transparency in the acquisition of German noun plurals 217
- Acquisition of German diminutive formation and compounding in a comparative perspective 237
- Index 265