N -grams of grammatical functions and their significant order in the Japanese clause
-
Haruko Sanada
Abstract
The present study investigates the statistically significant order of grammatical functions in Japanese clauses by employing n-gram frequency data of grammatical functions. There are broad rules for the order of grammatical functions, though Japanese is an agglutinative SOV language and complements can be elliptic. I conclude that the time and the place appear between the subject and object with statistical significance. The occasion takes a position before the subject, between the subject and object, or after the object. Therefore, the occasion shows that Japanese is a free word order language. The subject and object play the role of ‘anchors’ in the clause. By using the ‘two-sample test for equality of proportions without continuity correction data’, the study introduces a descriptive verification method of implicit speaker-hearer knowledge.
Abstract
The present study investigates the statistically significant order of grammatical functions in Japanese clauses by employing n-gram frequency data of grammatical functions. There are broad rules for the order of grammatical functions, though Japanese is an agglutinative SOV language and complements can be elliptic. I conclude that the time and the place appear between the subject and object with statistical significance. The occasion takes a position before the subject, between the subject and object, or after the object. Therefore, the occasion shows that Japanese is a free word order language. The subject and object play the role of ‘anchors’ in the clause. By using the ‘two-sample test for equality of proportions without continuity correction data’, the study introduces a descriptive verification method of implicit speaker-hearer knowledge.
Chapters in this book
- Prelim pages i
- Table of contents v
- Introduction 1
- Part I. Theory and models 7
- On the impact of the initial phrase length on the position of enclitics in Old Czech 9
- Term distance, frequency and collocations 21
- A method for the comparison of general sequences via type-token ratio 37
- Quantitative analysis of syllable properties in Croatian, Serbian, Russian, and Ukrainian 55
- N -grams of grammatical functions and their significant order in the Japanese clause 69
- Linking the dependents 93
- Grammar efficiency and the One-Meaning–One-Form Principle 109
- Distribution and characteristics of commonly used words across different texts in Japanese 121
- Part II. Empirical studies 135
- The perils of big data 137
- From distinguishability to informativity 145
- A Modern Greek readability tool 163
- Phonological properties as predictors of text success 177
- Calculating the victory chances 195
- Topological mapping for visualisation of high-dimensional historical linguistic data 209
- Book genre and author’s gender recognition based on titles 225
- Quantitative analysis of bibliographic corpora 239
- Analysis of English text genre classification based on dependency types 257
- In memory of Gabriel Altmann 271
- Index 277
Chapters in this book
- Prelim pages i
- Table of contents v
- Introduction 1
- Part I. Theory and models 7
- On the impact of the initial phrase length on the position of enclitics in Old Czech 9
- Term distance, frequency and collocations 21
- A method for the comparison of general sequences via type-token ratio 37
- Quantitative analysis of syllable properties in Croatian, Serbian, Russian, and Ukrainian 55
- N -grams of grammatical functions and their significant order in the Japanese clause 69
- Linking the dependents 93
- Grammar efficiency and the One-Meaning–One-Form Principle 109
- Distribution and characteristics of commonly used words across different texts in Japanese 121
- Part II. Empirical studies 135
- The perils of big data 137
- From distinguishability to informativity 145
- A Modern Greek readability tool 163
- Phonological properties as predictors of text success 177
- Calculating the victory chances 195
- Topological mapping for visualisation of high-dimensional historical linguistic data 209
- Book genre and author’s gender recognition based on titles 225
- Quantitative analysis of bibliographic corpora 239
- Analysis of English text genre classification based on dependency types 257
- In memory of Gabriel Altmann 271
- Index 277