Home What matters more: The size of the corpora or their quality?
Chapter
Licensed
Unlicensed Requires Authentication

What matters more: The size of the corpora or their quality?

The case of automatic translation of multiword expressions using comparable corpora
  • Ruslan Mitkov and Shiva Taslimipoor
View more publications by John Benjamins Publishing Company
Computational Phraseology
This chapter is in the book Computational Phraseology

Abstract

This study investigates (and compares) the impact of the size and the similarity/quality of comparable corpora on the specific task of extracting translation equivalents of verb-noun collocations from such corpora. The comprehensive evaluation of different configurations of English and Spanish corpora sheds some light on the more general and perennial question: what matters more – the quantity or quality of corpora?

Abstract

This study investigates (and compares) the impact of the size and the similarity/quality of comparable corpora on the specific task of extracting translation equivalents of verb-noun collocations from such corpora. The comprehensive evaluation of different configurations of English and Spanish corpora sheds some light on the more general and perennial question: what matters more – the quantity or quality of corpora?

Downloaded on 29.9.2025 from https://www.degruyterbrill.com/document/doi/10.1075/ivitra.24.09mit/html
Scroll to top button