Chapter 4. How to compare speed and accuracy of syntactic parsers
Gertjan van Noord
Abstract
The paper introduces a methodological innovation and a practical one. First, two scenarios are introduced for comparing accurate but slow parsers with faster but less accurate ones. Second, a corpus-based technique is described to improve the efficiency of wide-coverage, high-accuracy parsers. By keeping track of the derivation steps that lead to the best parse for a very large collection of sentences, the parser learns which parse steps can be filtered out without significant loss in parsing accuracy, but with a substantial gain in parsing efficiency. Experimental results with the Alpino parser for Dutch indicate that the technique yields much faster parsers that perform at almost the same level of accuracy. An interesting characteristic of our approach is that it is self-learning, in the sense that it uses unannotated corpora.
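The filtering idea in the abstract can be illustrated with a small sketch. It is not the Alpino implementation: the representation of a derivation step as a `(category, rule)` pair, the function names `learn_useful_steps` and `filter_steps`, and the `min_count` threshold are all hypothetical choices made for illustration. The sketch only assumes what the abstract states, namely that the steps used in the parser's own best parse are recorded for many sentences and that steps rarely or never seen there can be filtered.

```python
from collections import Counter
from typing import Hashable, Iterable, List, Set

# Hypothetical: a derivation step is any hashable identifier,
# here a (category, rule) pair. The chapter may use a richer context.
Step = Hashable


def learn_useful_steps(best_parse_steps: Iterable[List[Step]],
                       min_count: int = 1) -> Set[Step]:
    """Return the steps that occur in the best parse of at least
    `min_count` sentences (threshold is an assumed parameter)."""
    counts: Counter = Counter()
    for steps in best_parse_steps:
        counts.update(set(steps))  # count each step once per sentence
    return {step for step, n in counts.items() if n >= min_count}


def filter_steps(candidates: Iterable[Step], useful: Set[Step]) -> List[Step]:
    """Keep only the candidate steps that were seen in earlier best parses."""
    return [step for step in candidates if step in useful]


# Toy data: steps of the best parse for three sentences. No manual
# annotation is involved; the parser's own best parse is the training signal.
corpus = [
    [("np", "det_n"), ("s", "np_vp")],
    [("np", "det_n"), ("vp", "v_np"), ("s", "np_vp")],
    [("np", "adj_n"), ("s", "np_vp")],
]

useful = learn_useful_steps(corpus, min_count=2)
kept = filter_steps([("np", "det_n"), ("np", "adj_n"), ("vp", "v_pp")], useful)
print(kept)
```

Raising `min_count` filters more aggressively, which in this toy setting mirrors the trade-off the abstract describes: fewer steps to explore, at the risk of discarding a step that a correct parse needs.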
Chapters in this book
- Prelim pages i
- Dedication v
- Table of contents vii
- Introduction 1
- Chapter 1. Bridging theoretical and experimental linguistic research 9
- Data and its use
- Chapter 2. Experimental research 23
- Chapter 3. Finding long-distance dependencies in the Lassy Corpus 39
- Chapter 4. How to compare speed and accuracy of syntactic parsers 57
- Chapter 5. Adposition clusters in Dutch 77
- Chapter 6. Polarity licensing and intervention by conjunction 93
- Chapter 7. Frequential test of (S)OV as unmarked word order in Dutch and German clauses 107
- Chapter 8. Kratzer’s effect in the nominal domain 125
- Chapter 9. Is bilingual speech production language-specific or non-specific? 139
- Chapter 10. Prosody of restrictive and appositive relative clauses in Dutch and German 155
- Chapter 11. Licensing distributivity 177
- Implementation and theory building
- Chapter 12. Extending categorial grammar to phonology 193
- Chapter 13. Stacking up for the long way down 207
- Chapter 14. Meaning between algebra and culture 227
- Chapter 15. Whether you like it or not, this is a paper about or not 249
- Chapter 16. Between desire and necessity 263
- Chapter 17. Inner aspect and the comparative quantifiers 281
- Chapter 18. The expressive en maar-construction 305
- Index 327