Abstract
As social media becomes ever more present in our lives, so does the language of its platforms. One of the most salient features of social media discourse is the hashtag. Starting its life on Twitter (X) about 15 years ago, the hashtag has seeped from online to offline communication. Yet it is not clear whether hashtags are words, tags, or something else altogether, nor is it clear what morphological process gives rise to them. This study presents an extensive analysis of 3,423 hashtags from 1,216 English-language tweets, each manually coded for various linguistic features, including position in the tweet, grammatical function, and syntactic integration. Our findings suggest that hashtags are extremely varied, and we propose that they are indeed words, arising through a process of hashtagging (which is distinct from compounding). We also argue that some hashtags are syntactically integrated while others constitute parenthetical material.
Acknowledgements
We are grateful to the University of Waikato ALPSS Research Fund, to David Trye for comments on the draft and for his Python code and help in extracting the data, and to the anonymous referees for their helpful comments. All remaining errors are of course our own.
© 2024 Walter de Gruyter GmbH, Berlin/Boston
Articles in this issue
- Frontmatter
- Editorial
- Editorial 2024
- Phonetics & Phonology
- The role of recoverability in the implementation of non-phonemic glottalization in Hawaiian
- Epenthetic vowel quality crosslinguistically, with focus on Modern Hebrew
- Japanese speakers can infer specific sub-lexicons using phonotactic cues
- Articulatory phonetics in the market: combining public engagement with ultrasound data collection
- Investigating the acoustic fidelity of vowels across remote recording methods
- The role of coarticulatory tonal information in Cantonese spoken word recognition: an eye-tracking study
- Tracking phonological regularities: exploring the influence of learning mode and regularity locus in adult phonological learning
- Morphology & Syntax
- #AreHashtagsWords? Structure, position, and syntactic integration of hashtags in (English) tweets
- The meaning of morphomes: distributional semantics of Spanish stem alternations
- A refinement of the analysis of the resultative V-de construction in Mandarin Chinese
- L2 cognitive construal and morphosyntactic acquisition of pseudo-passive constructions
- Semantics & Pragmatics
- “All women are like that”: an overview of linguistic deindividualization and dehumanization of women in the incelosphere
- Counterfactual language, emotion, and perspective: a sentence completion study during the COVID-19 pandemic
- Constructing elderly patients’ agency through conversational storytelling
- Language Documentation & Typology
- Conative animal calls in Macha Oromo: function and form
- The syntax of African American English borrowings in the Louisiana Creole tense-mood-aspect system
- Syntactic pausing? Re-examining the associations
- Bibliographic bias and information-density sampling
- Historical & Comparative Linguistics
- Revisiting the hypothesis of ideophones as windows to language evolution
- Verifying the morpho-semantics of aspect via typological homogeneity
- Psycholinguistics & Neurolinguistics
- Sign recognition: the effect of parameters and features in sign mispronunciations
- Influence of translation on perceived metaphor features: quality, aptness, metaphoricity, and familiarity
- Effects of grammatical gender on gender inferences: Evidence from French hybrid nouns
- Processing reflexives in adjunct control: an exploration of attraction effects
- Language Acquisition & Language Learning
- How do L1 glosses affect EFL learners’ reading comprehension performance? An eye-tracking study
- Modeling L2 motivation change and its predictive effects on learning behaviors in the extramural digital context: a quantitative investigation in China
- Ongoing exposure to an ambient language continues to build implicit knowledge across the lifespan
- On the relationship between complexity of primary occupation and L2 varietal behavior in adult migrants in Austria
- The acquisition of speaking fundamental frequency (F0) features in Cantonese and English by simultaneous bilingual children
- Sociolinguistics & Anthropological Linguistics
- A computational approach to detecting the envelope of variation
- Attitudes toward code-switching among bilingual Jordanians: a comparative study
- “Let’s ride this out together”: unpacking multilingual top-down and bottom-up pandemic communication evidenced in Singapore’s coronavirus-related linguistic and semiotic landscape
- Across time, space, and genres: measuring probabilistic grammar distances between varieties of Mandarin
- Navigating linguistic ideologies and market dynamics within China’s English language teaching landscape
- Streetscapes and memories of real socialist anti-fascism in south-eastern Europe: between dystopianism and utopianism
- What can NLP do for linguistics? Towards using grammatical error analysis to document non-standard English features
- From sociolinguistic perception to strategic action in the study of social meaning
- Minority genders in quantitative survey research: a data-driven approach to clear, inclusive, and accurate gender questions
- Variation is the way to perfection: imperfect rhyming in Chinese hip hop
- Shifts in digital media usage before and after the pandemic by Rusyns in Ukraine
- Computational & Corpus Linguistics
- Revisiting the automatic prediction of lexical errors in Mandarin
- Finding continuers in Swedish Sign Language
- Conversational priming in repetitional responses as a mechanism in language change: evidence from agent-based modelling
- Construction grammar and procedural semantics for human-interpretable grounded language processing
- Through the compression glass: language complexity and the linguistic structure of compressed strings
- Could this be next for corpus linguistics? Methods of semi-automatic data annotation with contextualized word embeddings
- The Red Hen Audio Tagger
- Code-switching in computer-mediated communication by Gen Z Japanese Americans
- Supervised prediction of production patterns using machine learning algorithms
- Introducing Bed Word: a new automated speech recognition tool for sociolinguistic interview transcription
- Decoding French equivalents of the English present perfect: evidence from parallel corpora of parliamentary documents
- Enhancing automated essay scoring with GCNs and multi-level features for robust multidimensional assessments
- Sociolinguistic auto-coding has fairness problems too: measuring and mitigating bias
- The role of syntax in hashtag popularity
- Language practices of Chinese doctoral students studying abroad on social media: a translanguaging perspective
- Cognitive Linguistics
- Metaphor and gender: are words associated with source domains perceived in a gendered way?
- Crossmodal correspondence between lexical tones and visual motions: a forced-choice mapping task on Mandarin Chinese