Chapter
Open Access
A workflow for creating, harmonizing and analyzing structured corpora of multimodal interaction
-
Anne Ferger
Chapters in this book
- Frontmatter I
- Table of contents V
- From CMC to DMC: Digital writing beyond the keyboard 1
- Utilizing Text Dispersion Keyness on Turkish web registers: The case of Informational Description and Opinion 33
- “Also ehrlich” – From adjectival use to interactive discourse marker 61
- Digital punctuation from a contrastive perspective: Corpus-based investigations of ellipsis points in German and Chinese messaging interactions 83
- A multivariate register perspective on Reddit: Exploring lexicogrammatical variation in online communities 115
- Novel methods of intensification in young people’s digitally-mediated communication 137
- Collecting minority language data from Twitter (X): A case study of Karelian 163
- What’s New, Switzerland? Collecting and sharing half a million WhatsApp messages in French 187
- A workflow for creating, harmonizing and analyzing structured corpora of multimodal interaction 207
- Machine Learning is heading to the SUD (Socially Unacceptable Discourse) analysis: From Shallow Learning to Large Language Models to the rescue, where do we stand? 225
- An automatic pipeline for processing streamed content: New horizons for corpus linguistics and phonetics 257
- IDA – Incel Data Archive 275
- Not an expert, but not a fan either. A corpus-based study of negative self-identification in web forum interaction 305
- Social media corpora for analyzing linguistic variation 329
- Computer-Mediated Communication to facilitate inclusion: Digital corpus analysis on disability diversity on social media 349
- The representation of the Jew as enemy in French public Telegram channels within an identitarian-conspiratorial milieu 371
- CoDEC-M: The multi-lingual manosphere subcorpus of the Corpus of Digital Extremism and Conspiracies 395
- The negotiation of pronominal address on talk pages of the German, French, and Italian Wikipedia 421
- Investigating extreme cases in Wikipedia talk pages: Some insights on user behaviours 453
- Index
Chapters in this book
- Frontmatter I
- Table of contents V
- From CMC to DMC: Digital writing beyond the keyboard 1
- Utilizing Text Dispersion Keyness on Turkish web registers: The case of Informational Description and Opinion 33
- “Also ehrlich” – From adjectival use to interactive discourse marker 61
- Digital punctuation from a contrastive perspective: Corpus-based investigations of ellipsis points in German and Chinese messaging interactions 83
- A multivariate register perspective on Reddit: Exploring lexicogrammatical variation in online communities 115
- Novel methods of intensification in young people’s digitally-mediated communication 137
- Collecting minority language data from Twitter (X): A case study of Karelian 163
- What’s New, Switzerland? Collecting and sharing half a million WhatsApp messages in French 187
- A workflow for creating, harmonizing and analyzing structured corpora of multimodal interaction 207
- Machine Learning is heading to the SUD (Socially Unacceptable Discourse) analysis: From Shallow Learning to Large Language Models to the rescue, where do we stand? 225
- An automatic pipeline for processing streamed content: New horizons for corpus linguistics and phonetics 257
- IDA – Incel Data Archive 275
- Not an expert, but not a fan either. A corpus-based study of negative self-identification in web forum interaction 305
- Social media corpora for analyzing linguistic variation 329
- Computer-Mediated Communication to facilitate inclusion: Digital corpus analysis on disability diversity on social media 349
- The representation of the Jew as enemy in French public Telegram channels within an identitarian-conspiratorial milieu 371
- CoDEC-M: The multi-lingual manosphere subcorpus of the Corpus of Digital Extremism and Conspiracies 395
- The negotiation of pronominal address on talk pages of the German, French, and Italian Wikipedia 421
- Investigating extreme cases in Wikipedia talk pages: Some insights on user behaviours 453
- Index