Building EPTIC: A many-sided, multi-purpose corpus of EU parliament proceedings

Adriano Ferraresi; Silvia Bernardini

Kapitel

Building EPTIC

A many-sided, multi-purpose corpus of EU parliament proceedings

Adriano Ferraresi und Silvia Bernardini

Veröffentlicht von

John Benjamins Publishing Company

Weitere Titel anzeigen von John Benjamins Publishing Company

Suche

Ein Kapitel aus dem Buch Parallel Corpora for Contrastive and Translation Studies

Abstract

This chapter describes the steps involved in the construction of EPTIC, an intermodal corpus of European Parliament speeches. Despite its limited size, this corpus has features that justify its labour-intensive building process, in particular its multiple alignments. The text-to-text alignments allow users to compare interpretations and translations of source speeches and their written-up reports, while text-to-video alignments allow them to access the multimedia components from concordance lines. To illustrate the potential of EPTIC, a case study is presented of English loan words in original, translated and interpreted Italian and French. Results suggest that borrowing is more likely to occur in translated Italian than in any of the other corpus components.

Sie haben derzeit keinen Zugang zu diesem Inhalt.

Abstract

Sie haben derzeit keinen Zugang zu diesem Inhalt.

Kapitel in diesem Buch

Prelim pages i
Table of contents v
Acknowledgments ix
Parallel corpora in focus 1
Part I. Parallel corpora
Comparable parallel corpora 19
Living with parallel corpora 39
Working with parallel corpora 57
Innovations in parallel corpus alignment and retrieval 79
Part II. Parallel corpora
InterCorp 93
Corpus PaGeS 103
Building EPTIC 123
Enriching parallel corpora with multimedia and lexical semantics 141
Discourse annotation in the MULTINOT corpus 159
PEST 183
Indexation and analysis of a parallel corpus using CQPweb 197
P-ACTRES 2.0 215
An overview of Basque corpora and the extraction of certain multi-word expressions from a translational corpus 233
Part III. Parallel corpora
Strategies for building high quality bilingual lexicons from comparable corpora 251
Discovering bilingual collocations in parallel corpora 267
Normalization of shorthand forms in French text messages using word embedding and machine translation 281
Index 299

https://doi.org/10.1075/scl.90.08fer

Kapitel in diesem Buch

Prelim pages i
Table of contents v
Acknowledgments ix
Parallel corpora in focus 1
Part I. Parallel corpora
Comparable parallel corpora 19
Living with parallel corpora 39
Working with parallel corpora 57
Innovations in parallel corpus alignment and retrieval 79
Part II. Parallel corpora
InterCorp 93
Corpus PaGeS 103
Building EPTIC 123
Enriching parallel corpora with multimedia and lexical semantics 141
Discourse annotation in the MULTINOT corpus 159
PEST 183
Indexation and analysis of a parallel corpus using CQPweb 197
P-ACTRES 2.0 215
An overview of Basque corpora and the extraction of certain multi-word expressions from a translational corpus 233
Part III. Parallel corpora
Strategies for building high quality bilingual lexicons from comparable corpora 251
Discovering bilingual collocations in parallel corpora 267
Normalization of shorthand forms in French text messages using word embedding and machine translation 281
Index 299

Building EPTIC

Abstract

Kapitel PDF Ansicht

Abstract

Kapitel in diesem Buch

Kapitel in diesem Buch

Kapitel in diesem Buch