Chapter 12. Using Twitter to build a corpus for linguistic variation
Antonio Ruiz Tinoco
Abstract
Social media data are being used for linguistic analysis for a variety of reasons. One of the most attractive is the possibility of gathering very large quantities of data in digital form. However, collecting data from these sources is not always a simple task. This chapter introduces a few methods for harvesting data from Twitter, such as FireAnt, the 140dev Streaming API Framework, and the Elastic Stack, as well as visualization tools such as QGIS for mapping geotagged tweets. Finally, some examples of tweet analyses are discussed.
Chapters in this book
- Prelim pages i
- Table of contents v
- Introduction 1
Section I. Dialectology
- Chapter 1. The syntactic tradition in the Spanish linguistic atlases 15
- Chapter 2. Using linguistic atlases to explore syntactic issues 35
- Chapter 3. The negative expressions in three dialectal repertoires 77
- Chapter 4. A microsyntactic study of Pyrenean negative emphatic polarity particles with the help of data from linguistic atlases 109
Section II. Current perspectives on variation
- Chapter 5. Syntactic features and dialect areas in European Spanish 149
- Chapter 6. Feature analysis of neuter gender in Spanish and Asturian languages 175
- Chapter 7. Parameters of clitic combination 203
- Chapter 8. Gerund structures in Ecuadorian Spanish 225
- Chapter 9. On the role of prosody in wh-in-situ 263
Section III. New tools to approach syntactic variation
- Chapter 10. ASinEs 297
- Chapter 11. The Corpus del español del siglo XXI (CORPES XXI) 319
- Chapter 12. Using Twitter to build a corpus for linguistic variation 347
- Language index 381
- Subject index 383