Chapter 2. The Eurolect Observatory Multilingual Corpus
-
Marco Stefano Tomatis
Abstract
This chapter aims to explain the corpus design of the Eurolect Observatory Multilingual Corpus and the steps required to build all the different monolingual corpora the project needed to accomplish its research objectives. The first two paragraphs after the general introduction will point out the differences and the overlaps that characterize all the corpora that the author of this paper was in charge of producing as a member of the UNINT research team and that were used in the Eurolect Observatory Project for text mining. After accurately defining the data collection and corpus building strategies adopted, this paper will describe the corpus search tool that was developed in order to help scholars look for and save samples of text from the whole corpus in a convenient and easy way.
Abstract
This chapter aims to explain the corpus design of the Eurolect Observatory Multilingual Corpus and the steps required to build all the different monolingual corpora the project needed to accomplish its research objectives. The first two paragraphs after the general introduction will point out the differences and the overlaps that characterize all the corpora that the author of this paper was in charge of producing as a member of the UNINT research team and that were used in the Eurolect Observatory Project for text mining. After accurately defining the data collection and corpus building strategies adopted, this paper will describe the corpus search tool that was developed in order to help scholars look for and save samples of text from the whole corpus in a convenient and easy way.
Chapters in this book
- Prelim pages i
- Table of contents v
- Preface vii
- Acknowledgements xiii
- Chapter 1. Introduction 1
- Chapter 2. The Eurolect Observatory Multilingual Corpus 27
- Chapter 3. Observing Eurolects 47
- Chapter 4. Observing Eurolects 63
- Chapter 5. Observing Eurolects 93
- Chapter 6. Observing Eurolects 121
- Chapter 7. Observing Eurolects 147
- Chapter 8. Observing Eurolects 169
- Chapter 9. Observing Eurolects 199
- Chapter 10. Observing Eurolects 243
- Chapter 11. Observing Eurolects 267
- Chapter 12. Observing Eurolects 295
- Chapter 13. Observing Eurolects 329
- Chapter 14. Conclusions 369
- Subject index 393
- Language index 395
Chapters in this book
- Prelim pages i
- Table of contents v
- Preface vii
- Acknowledgements xiii
- Chapter 1. Introduction 1
- Chapter 2. The Eurolect Observatory Multilingual Corpus 27
- Chapter 3. Observing Eurolects 47
- Chapter 4. Observing Eurolects 63
- Chapter 5. Observing Eurolects 93
- Chapter 6. Observing Eurolects 121
- Chapter 7. Observing Eurolects 147
- Chapter 8. Observing Eurolects 169
- Chapter 9. Observing Eurolects 199
- Chapter 10. Observing Eurolects 243
- Chapter 11. Observing Eurolects 267
- Chapter 12. Observing Eurolects 295
- Chapter 13. Observing Eurolects 329
- Chapter 14. Conclusions 369
- Subject index 393
- Language index 395