Chapter 12. Using Twitter to build a corpus for linguistic variation

Collecting tweets and mapping them out
  • Antonio Ruiz Tinoco
Publisher: John Benjamins Publishing Company
This chapter is in the book Syntactic Geolectal Variation

Abstract

Social media data are used in linguistic analysis for a variety of reasons. One of the most attractive is the possibility of gathering very large quantities of data in digital form. However, collecting data from these sources is not always a simple task. This chapter introduces a few methods for harvesting data from Twitter, such as FireAnt, the 140dev Streaming API Framework, and the Elastic Stack, as well as visualization tools such as QGIS for mapping geotagged tweets. Finally, some examples of tweet analyses are discussed.

Downloaded on 24.2.2026 from https://www.degruyterbrill.com/document/doi/10.1075/ihll.34.12tin/html