Indexation and analysis of a parallel corpus using CQPweb: The COVALT PAR_ES Corpus (EN/FR/DE > ES)

Teresa Molés-Cases; Ulrike Oster

Published by John Benjamins Publishing Company

This chapter is in the book Parallel Corpora for Contrastive and Translation Studies

Abstract

This contribution presents a section of the Corpus Valencià de Literatura Traduïda (COVALT), created by the research group of the same name (Department of Translation and Communication, Universitat Jaume I, Spain). COVALT is a four-million-word corpus made up of narrative works originally written in English, French, and German and their Catalan translations published in the autonomous community of Valencia between 1990 and 2000. Because the members of the COVALT group are interested in translation research, and more specifically in the investigation of translated Catalan and Spanish, the corpus has recently been extended to include translations into Spanish published in Spain (the COVALT PAR_ES corpus). This chapter presents the COVALT PAR_ES corpus, together with the process of compiling it and analysing it with CQPweb.

Chapters in this book

Prelim pages i
Table of contents v
Acknowledgments ix
Parallel corpora in focus 1
Part I. Parallel corpora
Comparable parallel corpora 19
Living with parallel corpora 39
Working with parallel corpora 57
Innovations in parallel corpus alignment and retrieval 79
Part II. Parallel corpora
InterCorp 93
Corpus PaGeS 103
Building EPTIC 123
Enriching parallel corpora with multimedia and lexical semantics 141
Discourse annotation in the MULTINOT corpus 159
PEST 183
Indexation and analysis of a parallel corpus using CQPweb 197
P-ACTRES 2.0 215
An overview of Basque corpora and the extraction of certain multi-word expressions from a translational corpus 233
Part III. Parallel corpora
Strategies for building high quality bilingual lexicons from comparable corpora 251
Discovering bilingual collocations in parallel corpora 267
Normalization of shorthand forms in French text messages using word embedding and machine translation 281
Index 299

https://doi.org/10.1075/scl.90.12mol
