Chapter 5. Semi-automatic approaches to Anglicism detection in Norwegian corpus data
Gisle Andersen
Abstract
This article describes corpus-based research methods and language processing tools used for the systematic study of the influence of English on Norwegian lexis. The tools were developed in connection with the Norwegian Newspaper Corpus (NNC) project. The study surveys the types of phenomena that an Anglicism detection tool should aim to identify and the problems associated with the orthographic and morphological variability of Anglicisms. It also describes the development of an Anglicism detection tool and reports on a set of experiments using lexicon-based, n-gram-based and combinatory methods. Finally, it describes machine learning techniques recently developed by the NNC team, arguing that the computational approach to Anglicism identification is a fruitful one.
Chapters in this book
- Prelim pages i
- Table of contents v
- List of contributors vii
- Acknowledgements ix
- The lexical influence of English on European languages 1
Section I. Exploring Anglicisms
- Chapter 1. Fair play to them 27
- Chapter 2. Proposing a pragmatic distinction for lexical Anglicisms 43
- Chapter 3. Investigating gender variation of English loanwords in German 65
- Chapter 4. The collection of Anglicisms 91
- Chapter 5. Semi-automatic approaches to Anglicism detection in Norwegian corpus data 111
- Chapter 6. Lexicographic description of recent Anglicisms in Serbian 131
- Chapter 7. Anglicisms in Armenian 149
Section II. English-induced phraseology
- Chapter 8. Phraseology in flux 169
- Chapter 9. Multi-word loan translations and semantic borrowings from English in French journalistic discourse 199
- Chapter 10. Newly-coined Anglicisms in contemporary Spanish 217
- Chapter 11. Der Elefant im Raum… 239
- Chapter 12. English influence on Polish proverbial language 261
Section III. Anglicisms in specialized discourse
- Chapter 13. English direct loans in European football lexis 281
- Chapter 14. Incorporation degrees of selected economics-related Anglicisms in Italian 305
- Chapter 15. Anglicisms in the discourse of Alitalia’s bailout in the Italian press 325
- Author index 343
- Language index 347
- Subject index 349