How do students cope with machine translation output of multiword units? An exploratory study
-
Joke Daems
Abstract
In this chapter, we take a closer look at students’ post-editing of multiword units (MWUs) from English into Dutch. The data consists of newspaper articles post-edited by translation students as collected by means of advanced keystroke logging tools.
We discuss the quality of the machine translation (MT) output for various types of MWUs, and compare this with the final post-edited quality. In addition, we examine the external resources consulted for each type of MWU. Results indicate that contrastive MWUs are harder to translate for the MT system, and harder to correct by the student post-editors than non-contrastive MWUs. We further find that consulting a variety of external resources helps student post-editors solve MT problems.
Abstract
In this chapter, we take a closer look at students’ post-editing of multiword units (MWUs) from English into Dutch. The data consists of newspaper articles post-edited by translation students as collected by means of advanced keystroke logging tools.
We discuss the quality of the machine translation (MT) output for various types of MWUs, and compare this with the final post-edited quality. In addition, we examine the external resources consulted for each type of MWU. Results indicate that contrastive MWUs are harder to translate for the MT system, and harder to correct by the student post-editors than non-contrastive MWUs. We further find that consulting a variety of external resources helps student post-editors solve MT problems.
Kapitel in diesem Buch
- Prelim pages i
- Table of contents v
- About the editors vii
- Multiword units in machine translation and translation technology 1
-
Part 1. Multiword units in machine translation
- Analysing linguistic information about word combinations for a Spanish-Basque rule-based machine translation system 41
- How do students cope with machine translation output of multiword units? An exploratory study 61
- Aligning verb + noun collocations to improve a French-Romanian FSMT system 81
-
Part 2. Multiword units in multilingual NLP applications
- Multiword expressions in multilingual information extraction 103
- A multilingual gold standard for translation spotting of German compounds and their corresponding multiword units in English, French, Italian and Spanish 125
- Dutch compound splitting for bilingual terminology extraction 147
-
Part 3. Identification and translation of multiword units
- A flexible framework for collocation retrieval and translation from parallel and comparable corpora 165
- On identification of bilingual lexical bundles for translation purposes 181
- The quest for croatian idioms as multiword units 201
- Corpus analysis of croatian constructions with the verb doći ‘to come’ 223
- Anaphora resolution, collocations and translation 243
- Index 257
Kapitel in diesem Buch
- Prelim pages i
- Table of contents v
- About the editors vii
- Multiword units in machine translation and translation technology 1
-
Part 1. Multiword units in machine translation
- Analysing linguistic information about word combinations for a Spanish-Basque rule-based machine translation system 41
- How do students cope with machine translation output of multiword units? An exploratory study 61
- Aligning verb + noun collocations to improve a French-Romanian FSMT system 81
-
Part 2. Multiword units in multilingual NLP applications
- Multiword expressions in multilingual information extraction 103
- A multilingual gold standard for translation spotting of German compounds and their corresponding multiword units in English, French, Italian and Spanish 125
- Dutch compound splitting for bilingual terminology extraction 147
-
Part 3. Identification and translation of multiword units
- A flexible framework for collocation retrieval and translation from parallel and comparable corpora 165
- On identification of bilingual lexical bundles for translation purposes 181
- The quest for croatian idioms as multiword units 201
- Corpus analysis of croatian constructions with the verb doći ‘to come’ 223
- Anaphora resolution, collocations and translation 243
- Index 257