Chapter 10. Domain-adapting and evaluating machine translation for institutional German in South Tyrol
Antonio Giovanni Contarino
Abstract
Building on a prior small-scale study on machine translation adaptation in the same language pair and domain (De Camillis 2021), this chapter reports on an experiment in (i) domain adaptation, (ii) quality assessment, and (iii) automatic evaluation of legal terminology for legal South Tyrolean German. After collecting, sentence-aligning, and cleaning the LEXB parallel corpus, we used this bilingual resource to domain-adapt a ModernMT engine. Performance improvements were measured with automatic quality metrics. The machine translation of South Tyrolean legal terms was evaluated using an ad hoc automatic terminology evaluation tool. We observed a significant boost in performance and term accuracy in the output of the adapted ModernMT engine, but the improvement in legal terminology translation was deemed unsatisfactory.
Chapters in this book
- Prelim pages i
- Table of contents v
- Cross-linguistic research and corpora 1
- Chapter 1. Light Verb Constructions as a testing ground for the Gravitational Pull Hypothesis 12
- Chapter 2. Light Verb Constructions in English-Spanish translation 34
- Chapter 3. Reporting direct speech in Spanish and German 51
- Chapter 4. “Ich bekomme es erklärt” 67
- Chapter 5. Exploring near-synonyms through translation corpora 91
- Chapter 6. run away! 108
- Chapter 7. Film dialogue synchronization and statistical dubbese 124
- Chapter 8. Opera audio description in the spoken-written language continuum 142
- Chapter 9. Using a multilingual parallel corpus for Journalistic Translation Research 157
- Chapter 10. Domain-adapting and evaluating machine translation for institutional German in South Tyrol 179
- Chapter 11. Word alignment in the Russian-Chinese parallel corpus 195
- Chapter 12. Building corpus-based writing aids from Spanish into English 216
- Index 235