Outsourcing teenage language: a participatory approach for exploring speech and text messaging

Kadri Koreinik; Aive Mandel; Maarja-Liisa Pilvik; Kristiina Praakli; Virve-Anneli Vihman

doi:10.1515/lingvan-2021-0152

Article Open Access

Outsourcing teenage language: a participatory approach for exploring speech and text messaging

Kadri Koreinik , Aive Mandel , Maarja-Liisa Pilvik , Kristiina Praakli and Virve-Anneli Vihman

Published/Copyright: June 5, 2023

Published by

Become an author with De Gruyter Brill

Submit Manuscript Author Information Explore this Subject

From the journal Linguistics Vanguard Volume 9 Issue s4

Abstract

This paper presents a remote method used for engaging teenagers as citizen sociolinguists within the research project Teen Speak in Estonia. The project, launched in January 2020, aims to investigate young people’s language by creating the first systematic dual corpus of Estonian teenagers’ spoken language and text messaging. Previously, youth language in Estonia has not been the subject of much research. Yet scholarly attention to teenage language usage has been on the rise internationally, with corpora compiled in English and several other languages. The article describes the process of recruiting and training participants, as well as adjustments made to data collection in connection with the outbreak of COVID-19 and the restrictions which followed. While the pandemic did modestly interfere with the participatory approach, the data collection was successfully carried out, thanks primarily to well-networked and dedicated 9- to 18-year-old participants. Notwithstanding a gender imbalance, the corpus of 97 h of speech and nearly 60,000 words of text messages will facilitate research into Estonian teenagers’ Estonian–English codeswitching and other linguistic features, across two registers. The remote method is recommended for use elsewhere, especially in places where mutual trust prevails, and teenagers claim a fair amount of independence, responsibility and facility with online applications.

Keywords: citizen sociolinguistics; Estonian; remote data collection; variation studies; youth language

1 Introduction

Teenagers are avid linguistic innovators and have been shown to be the drivers of various sorts of linguistic changes (Androutsopoulos 2005; Cheshire 2005; Eckert 2003; Tagliamonte 2016). Accessing samples of their language poses a methodological challenge, yet it is crucial for a better understanding of youth language and its role in variation and change. Importantly, most of the research in this area has focused on the language of L1 English-speaking youth, but linguistic practices in English are influencing young people’s language use much more broadly. English is a lingua franca used in entertainment and social media channels, and social media has grown to play a major role in young people’s interactions. Hence, examining teenage language outside of the English-speaking world is important for understanding new language contact phenomena that derive both from direct spoken communication between interlocutors and from computer-mediated communication, an increasingly central form of communication.

This paper presents a remote method for engaging teenagers in the research of a phenomenon they encounter hourly: their own language use. The method was devised as part of a research project (Teen Speak in Estonia, 2019–2022) dedicated to compiling the first corpus of Estonian teenagers’ spoken language and text messaging conversations.^[1] The project aims to identify features of language use by age, gender, and geographic area, to map the nature of the use of English among Estonian teens, and to analyse differences between texted and spoken language. The timing was such that society shut down for the initial period of the COVID-19 pandemic just as the team was preparing to recruit participants, and so the project moved to an entirely remote model of communication and data-gathering. Although this was indispensable for the success of the project in this period, we also discuss other benefits, as well as drawbacks, of this method.

We sketch the motivation and mechanics of the method and how it intersected with the great social challenges associated with the outbreak of COVID-19 in March 2020. We provide some descriptive information on the participants and data resulting from the study and discuss the lessons we learnt from the process of remote collection of teenagers’ language.

2 Background

2.1 Youth language and citizen sociolinguistics

The period of adolescence is characterized by a developmental transition from childhood to adulthood, including rapid physical and cognitive changes, as well as risk-taking and novelty-seeking behaviour (Spear 2000). Teenagers’ tendencies to seek ways to differentiate themselves, and to define themselves in opposition to earlier generations, may underlie the frequent stigmatization of their language by speech communities (Labov 2001: 514). The language spoken by teenagers in English-language cultures has been investigated from various angles; it is also strikingly familiar on a global stage, thanks to the popularity of anglophone entertainment. Language change has been linked broadly to sociocultural change, and more specifically to the processes of globalization and mediatization, understood as “the pervasive spread of media contents and platforms through all types of context and practice”, that is, technically mediated communication with permanent access to media content (Androutsopoulos 2014: 7; Couldry and Hepp 2013). Mediatization has shaped the lives of the younger cohorts of today, raised in a culture characterized by omnipresent digital technology (Calvo-Porral and Pesqueira-Sanchez 2020; Kalmus and Opermann 2019). From our perspective, not only does media provide a 24/7 context for peer-oriented and meaning-making teenagers, but it also facilitates English-language contact. Estonian teenagers have frequent and intense contact with English, and participate in interactive global media communities, but the immediate social context consists in their peer groups and Estonian society (cf. Kerswill et al. 2013). This duality points to the need for much more research on the non-English sociolinguistic contexts emerging out of the globally connected media environment (Stanford 2016), to better describe and understand the factors shaping today’s multilingual teenagers’ language use and its characteristic features (e.g., vogue words, swear words, discourse markers; see Stenström et al. 2002: 64–65).

Over the last decade, sociolinguistics has joined other disciplines in pursuing citizen science, “a methodology suitable for gathering large amounts of data”, which, in the case of language studies, also challenges the notions of authority and source (Svendsen 2018: 139). Advocates of this approach in sociolinguistics highlight the unique affordance of social media behaviour in embracing speakers’ expertise and understanding the role of language in social life (Rymes 2020). The need for citizen science in variation studies comes about for (a) epistemological reasons, including efforts to counteract the neglect of mixed speech and contact languages in earlier research; (b) technological reasons deriving from developments in information and communications technology; and (c) applied reasons, accounting for the need to broaden the picture of language usage for teaching and communication (Rymes and Leone 2014). Outsourcing the collection of language data by way of citizen sociolinguistics empowers language informants by giving them the ultimate say in what is shared with researchers. It also has the virtue of simultaneously promoting the two-way street underlying best research practices, in which informants possess full knowledge of the goals and potential benefits of the research and allowing ethical access to behaviour otherwise inaccessible to out-group researchers, such as talk about intimate issues or the use of taboo language (Drange et al. 2014; Sturtz Sreetharan et al. 2019).

The citizen sociolinguistic approach is used here to reach Estonian teenagers, a demographic experiencing rapid societal and linguistic change, whose language is strikingly underrepresented in corpus data. Millennial and post-millennial cohorts in Estonia, as elsewhere, are both experiencers and promoters of an influx of English and the rise of digital technology as primary communication channels. The easily accessible global market and discourse space has brought with it the relativization of values and relaxation of restrictions, as well as a countervailing rise in prescriptivist language attitudes.

2.2 Research into youth language in Estonia

Scholarly interest in youth language in Estonia can be traced to the mid-1920s, when Andrus Saareste, extraordinary professor of linguistics at the University of Tartu, called for investigation of social variants. While Saareste (1927: 167) considered youth especially prone to language innovation, he also warned that not all innovations were welcome and considered the use of foreign loans to be affective rather than “rational”. His views corresponded to the dominant stance of the era and set the tone in research for the coming decades.

Generally, however, youth language remained a marginal research subject until the 1970s and 1980s, when Labovian urban dialectology reached Estonian linguists.^[2] Even then, youth language research remained situated within slang studies or focused on urban youth for decades (Loog 1992; Peegel 1976; Tender 1984). Data collection methods included word lists, known from (descriptive) dialectology, written questionnaires, and interviews where informants were asked to respond to stimuli (e.g., Keevallik 1998). Only one study aimed at collecting the unmonitored speech of urban youth, involving secondary students in the data collection (see Loog 1988a, 1988b).

While the corpus of spoken Estonian was launched in 1997 (Hennoste 2003), that corpus does not include any samples dedicated to teenagers. This dearth of teenage language samples was true in other languages as well: the last three decades have seen the compilation of youth language corpora in English (e.g., COLT, see Stenström et al. 2002; Toronto Teen Corpus, Toronto Instant Messaging Corpus, Tagliamonte 2016); Spanish (COLA, Stenström 2006); French (Secova 2017); and Portuguese (Mata et al. 2014). In the Estonian context, the project described here is thus far the only one devoted to the compilation of a corpus on this scale. A bilingual English–Estonian corpus mainly comprising the language of bloggers and vloggers, mostly in their twenties, has also recently been assembled (Bahtina et al. 2021). However, no representative corpus of teen language has been available for systematic study.

3 The participatory approach of Teen Speak in Estonia

The current project initiated a novel approach to investigating young people’s language: the compilation of a systematic, dual corpus of teenagers’ language, with samples of both spoken and texted conversations for each speaker. The data was collected by means of a participatory, citizen linguistics approach, engaging the target group to document the language used by themselves and their friends. In doing so, we gave the young participants more agency and less of a sense of being monitored. While speakers may always be conscious of their listeners, the sense of freedom from the immediate presence of investigators may also have a freeing effect from self-monitoring. It is not deemed possible to collect truly “unmonitored” speech, nor is it needed (see, e.g., Wertheim 2006), but the project targeted a language register marked by casual interaction. Hence, we wished to record young speakers in their most familiar contexts, at home or with friends, where they felt comfortable and at ease.

Delegating responsibility for data collection to the young participants also gave us a chance to speak directly with them in a set of training sessions, encouraging them to reflect on the goals of linguistics, their own language usage, and their interactions with peers. Our contact with students through schools was scheduled to begin in the second half of March 2020. Because of the timing of the global pandemic and complete lockdown in Estonia, our reliance on remote means of contact, training, and data collection turned out to be much greater than initially planned, and the only way the project could have proceeded.

3.1 Recruitment and training

We recruited participants through schools, supplemented by personal contacts to redress imbalances and target groups who proved more difficult to access. Targeted students were in fourth, sixth, eighth, and tenth grade, which in Estonia spans the ages of 10–17. The youngest, pre-teen group was a smaller sample, included for apparent-time comparison, as we expected them to show fewer characteristic features of teenage language use, and to talk about different topics from the older groups (cf. Stenström et al. 2002: 28–29). To compare language use across regions differing in the presence of languages other than Estonian and its dialects, four locations were selected. First, we chose the capital city, Tallinn (approximately 438,000 inhabitants), where residents include Estonian and Russian speakers in similar proportions, and more tourists and migrants than elsewhere in Estonia. Second, we selected Tartu, Estonia’s second largest city (about 98,000 inhabitants), where standard Estonian is spoken as the dominant native language. Finally, we included one smaller town where the insular dialect is spoken, and another in the South Estonian dialect area.^[3] In each location, we targeted a gender-balanced sample, with two fourth graders and four participants from each of the older age groups (sixth, eighth, and tenth grades), for a total of 14 lead informants from each location, or 56 informants altogether, who additionally brought their friends into the project.

We first contacted teachers and heads of target schools via email. After positive replies from schools, we scheduled school visits to introduce the project and recruit interested students as lead participants, or ‘language ambassadors’ (keelesaadikud in Estonian). However, the outbreak of COVID-19 and subsequent lockdown led us to re-envision the recruitment and training process, using an entirely online format. We created a 5-min video [associated video-1-koreinik.mp4] describing the project, discussing the value of young people’s language for research and society, and inviting interested students to send us their contact details, personal interests, and reasons for wanting to join the project. We asked teachers to share the video among their students and posted it on the project website. Some teachers declined, reporting that their students were already challenged by the transition to remote learning (see Section 3.2), but others felt that participation in this project was an excellent opportunity for a rare, permissible interactive experience, as it involved pairing students with only one or two friends who were part of the student’s social contact “bubble”. Those teachers transmitted the project invitation, and in the first, springtime round we received applications from various age groups and regions, totalling 18 applicants.

Applicants were invited to participate in two 45-minute online training sessions via Zoom, run on different days by members of the research group. The first training session was used to inform trainees about the project, to encourage reflection and mediate discussion about trainees’ own language use, and to discuss ethical issues in collecting language data, including trust, privacy, and data accessibility. We explained the need for explicit, written consent from participants and their parents. The second training session focused on technical instruction in recording spoken language using audio recorders, accessing and submitting chat conversations and the required metadata, and other necessary steps for completing the project. Trainees were also given a homework assignment between the two sessions involving asking for consent and recording a couple minutes of conversation with a family member, listening to the recording, and responding to discussion questions. Their experiences and observations sparked an often spirited discussion at the second session.

Once they had completed training, the language ambassadors were given two primary tasks: (a) recording and submitting spontaneous spoken conversations and (b) submitting conversations held over text messaging applications, the most popular of which were Facebook Messenger and Discord. As the ambassadors and their friends were minors, ethics approval was secured from the Research Ethics Committee of the University of Tartu before contacting schools, and informed consent was obtained from all participants (including their friends and interlocutors in texted conversations) and their parents or guardians.

For the spoken language samples, participants were asked to recruit at least one friend, or co-informant, who would be willing to engage in an hour-long spoken conversation, and to organize and record two such conversations. Zoom H2N audio recorders were provided by the research group via a parcel delivery service, as direct contact through schools was hampered during lockdown. After the recordings were completed, the recorders and their memory cards were collected directly from the informants or teachers.

For the texted chats, informants were asked to look for digital texted conversations which they were willing to share with the research team, for a minimum of 1,000 words of conversation. We took a retrospective approach, advising participants to collect preferably pre-existing conversations in order to reduce effects of the project in the chats. We aimed to collect 112 h of spoken language and a minimum of 56,000 words of text message data. Contextual metadata on the recordings (e.g., participants, recording situation, place, and time) was collected through an online form. Background data on each participant in the conversations was collected via an online survey asking for school, place of residence, language skills and use, and social media use. Forms for collecting metadata and instructions on how to use the project recorders and how to download and submit text messages were available on the project website during data collection. Instructions in Estonian are now accessible on the page for students and teachers (https://sisu.ut.ee/teke/opetajale_ja_opilasele), under the heading TeKE projektis kasutatud ankeedid ja juhendid ‘Documents and instructions used in the TeKE project’, along with information on outreach classes dedicated to youth language and linguistics.

Participants were compensated for their time. The ambassadors were given a monetary stipend upon completion of all the required tasks, and each participant received a gift card per one hour-long spoken conversation (ambassadors received gift cards in addition to the stipend).

3.2 Remote data collection during COVID-19 restrictions

The outbreak of the COVID-19 pandemic considerably affected the data collection process. As Estonia implemented a full lockdown in mid-March, just as our data collection was meant to begin, we were unable to make any face-to-face contact with participants. The lack of direct contact made recruitment more difficult, and various teachers reported difficulty in contacting students remotely. Some parents reported concerns about their children’s sense of being overwhelmed, and this topic arises in the recordings as well. In addition, it proved to be more difficult to reach students online than at a face-to-face project introduction in the classroom, as initially planned. For the students, contacting the research group by email is likely to have been more of an obstacle than a question-and-answer session at the school. Online, written instructions via email and the internet were more difficult for students to follow than direct instruction in classrooms, especially for younger participants.

All participants were encouraged to contact the project members with any questions or concerns, and the project coordinator monitored their progress closely and offered assistance where possible. In the case of younger participants, some parents took a more active role in helping their children and contacting the project representatives. The natural filter this provided likely impacted on the sample of participants, those joining the project having already demonstrated an amount of drive and persistence, in addition to the dedication and responsibility required to attend and actively participate in both online training sessions.

It was clear that the lockdown affected the participants, especially due to a lack of direct, in-person contact with their friends and teachers. Some participants reported difficulties in arranging meetings with their friends for recording conversations; in a small minority of instances, we allowed the option of recording conversations held online, over a video call. As could be expected, the pandemic and concomitant lockdown had profound effects on the topics discussed in conversations. Due to these challenges, both recruitment of participants and the data collection process were delayed and slowed down.

Yet the acute sense of living through an extraordinary experience during the pandemic may also have fuelled interest in the project, as an outlet for productive energy, a diversion from the stress of remote schooling, and a channel for communicating socio-emotional difficulties or demonstrating the frustrations and sense of isolation of the pandemic era.

3.3 Engaged participants, collected data

After the first trial wave of participants in April and September of 2020, the project undertook a second recruitment wave at the end of 2020, when schools were partially engaged in contact teaching and social life had some semblance of normalcy, although both individuals and entire schools were in isolation for parts of the 2020–2021 academic year as well. From our first wave, we had amassed experience in how to run the recruitment, training, and monitoring over the internet, and the second wave proved successful in recruiting a pool of participants similar to the original aims. A total of 47 ambassadors completed the tasks (with 56 originally targeted); see Figure 1.

Figure 1:

Participants (ambassadors and total participants) in the corpus, by age and gender.

Gender balance was not achieved: the set of ambassadors included over twice as many girls as boys (32 female, 15 male). Each ambassador invited one or more friends to participate in the spoken and text conversations; here, the gender imbalance became even more pronounced: a total of 84 friends joined the participant sample, with 62 females and 22 males. Altogether, the corpus includes language use from 131 participants (72 percent female), ranging in age from 9 to 18 (mean = 13.68, SD = 2.40).

Informants provided a total of 97 h and 13 min of spoken conversation and 59,688 words of texted conversations, surpassing the minimal expectation of 56,000 words. No upper limit on texted conversations was set, and many informants contributed more than 1,000 words, with the notably large contributions of two sixth graders who sent 9,346 and 37,288 words.^[4] Setting aside these two outliers, the agents sent a mean of 1,296 words each (SD = 991). Although our minimum of 1,000 words was specifically intended to make it easier for less text-friendly participants to fulfil the task, some younger participants were not as used to chatting with their friends and seemed to devise chats especially for project purposes, as in the two lines in (1a) and (1b), written by a fourth-grade girl to her best friend.

(1)

mina kes ei oska messi kasutada ime et ma sind üles leidsin

‘me who doesn’t know how to use mess[enger] [it’s a] wonder that I even found you’

teeme kõne ma ei viitsi kirjutada

‘let’s make a call I can’t be bothered to write’

A team of mostly student transcribers are, at the time of writing, in the process of transcribing the spoken conversations using ELAN software,^[5] according to transcription guidelines worked out by the project team. A user-friendly application is nearly complete for help with easy, parallel searches of both corpora, but the texted chats require more manual tagging to make them accessible and searchable in a similar way. As of 1 May 2023, about 78 h of spoken conversation had been transcribed, amounting to 646,351 words. The transcription process includes coding for language used, pseudonymization, and omission of identifying information. To provide access to the data, transcriptions and metadata will be deposited in the University of Tartu’s DataDOI repository. Stable versions of the corpus will be assigned digital object identifiers (DOIs) in order to facilitate searching and to encourage the reuse of the data. Because the data derives from the spontaneous language of minors, and hence requires protected access, researchers interested in the corpus will be able to apply for access to the corpus.

4 Reflections on remotely accessing teen language

What have we learnt during the process of remote data collection? Some critical questions deserve closer reflection: How did teenagers perform as citizen sociolinguists? How balanced are the samples, how valid the compiled data, and what can be expected from further linguistic analysis? How transferable is the participatory approach to other contexts?

Overall, participants were dedicated and responsible language scientists. Some ambassadors positioned themselves as interviewers when spontaneous conversation didn’t seem to kick off, as seen in (2).

(2)

jah et ma alustan ee Priidu sul ühe hea küsimusega

et mis on üks asi mida sa kahetsed et oled elus teinud või tegemata jätnud

‘yes, I’ll begin, uh, Priidu with a question,

what is one thing that you’ve regretted doing or not doing in [your] life?’

(female, 12 years old)

The youngest group (aged 9 to 11) experienced more difficulty following instructions, requiring more guidance from the research team and their parents, who often interceded. As expected, ambassadors and their friends remained aware of being recorded, as demonstrated by comments such that in (3, boldface indicates code-switching with English):

(3)

imagine see inimene kes seda ümber peab kirjutama

‘Imagine the person who has to write this [all] down!’

(male, 14 years old)

Nevertheless, conversations included a variety of topics, including friends, teachers, social media influencers, school subjects, COVID-19 restrictions, domestic problems, and entertainment media. In fact, although this project aims to analyse language usage, a more interdisciplinary approach drawing on the social sciences more broadly is needed in order to do justice to the range and depth of conversations.

Our recruitment process resulted in significant gender asymmetry, which raises the question: if boys were harder to recruit than girls, were they also less communicative when recording conversations? We calculated the share of speech in recordings and ran an ANOVA for this measure to compare gender and age groups. We found no statistically significant difference in amount of speech by gender (p = 0.531), hence we do not need to be concerned that boys spoke less once they joined the project. Differences were found in reported use of texting apps: boys use Discord, designed for gamers, more often than girls, who use Instagram more widely. This leads us to a further question – how to tackle the problem of involving male participants – and perhaps to its answer. Rather than insisting on holding conversations in person, allowing young people to participate via conversations held online, despite the compromise in sound quality, would allow researchers to target young people who interact with each other primarily via gaming or other video applications; this is replacing face-to-face conversations in some demographics, and was especially noticeable during the pandemic.

Our survey findings provide some indication that participants gave credible responses and took the research in earnest. Figure 2 presents the mean scores of self-rated language skills, which demonstrate realistic estimates by teenagers of the relative dominance of their languages (known from census data; see Statistics Estonia 2021). English skills received remarkably high ratings, only slightly lower than those of Estonian, the first language of the bulk of our informants. Russian, German, and Finnish are the most frequent second foreign languages learnt at school, after English, from grade 6 or earlier. English has become the single outstanding language with which young people have early, intense contact, usually beginning before they begin to learn it at school (Praakli and Koreinik 2020). A preliminary analysis of the code-switching (CS) in our data found a remarkably high individual maximum rate of 23.4 s (39 percent) of CS insertions per speech minute, with an average of 2.4 s (4 percent) with a standard deviation of 3, which demonstrates significant individual variation in the use of English (see also Vihman et al. 2022). The function of English in the data sets promises rich material for analysis.

Figure 2:

Self-reported language skills of participants, by age group (9–12, 13–15 and 16–18 years old), responding to the question “How well do you speak this language?” on a five-point scale (1 = not at all, 5 = fluently).

Finally, while outsourcing teenage language served the purposes of our research well in the Estonian context, how easily can it be applied elsewhere? In many ways, Estonia is an ideal research context. There is growing interest in studying language use in social media, demonstrated by school research projects and initiatives by the Estonian Research Council. Young people in Estonia are allowed a fair amount of autonomy, and are generally given a good deal of independence in the hours after the school day ends, as well as in their online habits. Young Estonians have been shown to be avid, autonomous internet users, with online activity embedded in the daily lives of Estonian children. A reported 97 percent of young people aged 9–17 access the internet every day through at least one device (Sukk et al. 2019) and, in our project, only the very youngest age group had any concerns about producing examples of texted language usage. The method is recommended for use in settings where teenagers are independent and responsible agents both online and offline, and where a generalized trust can be built with schools and/or parents.

5 Conclusions

The participatory approach adopted in this study has a number of qualities which recommend it for broader use, as well as points of caution. Remote data collection allowed the teenage language project described in the paper to continue during the period of societal restrictions in 2020, but it was part of our research design even before the pandemic began. Many participants found our flexible approach to online training sessions to be congenial: less time-consuming and simpler to arrange than in-person sessions, and more familiar as a learning environment. It extended the researchers’ reach, while also making the data collection process messier and more difficult to monitor. This, of course, is part of the point of citizen linguistics: the project takes the baton of authority away from the researcher and hands it to the informants. Outsourcing teenage language means much more than delegating the task of data collection to reach a wider public or to control costs: it empowers speakers as the producers, receivers, and managers of their own data. By highlighting the agency of the speakers in production and analysis of their language samples, linguistic research has a better claim to social relevance and depth.

Corresponding author: Kadri Koreinik, Institute of Estonian and General Linguistics, University of Tartu, Ülikooli 18, Tartu 50090, Estonia, E-mail: kadri.koreinik@ut.ee

Funding source: Estonian Language and Culture in the Digital Age programme, Estonian Ministry of Research and Education

Award Identifier / Grant number: EKKD3

Acknowledgements

We gratefully acknowledge funding for the Teen Speak in Estonia project from the “Estonian Language and Culture in the Digital Age (2019–2027)” R&D programme of the Estonian Ministry of Education and Research (grant nr. EKKD3), and from the Kadri, Nikolai, and Gerda Rõuk research fund at the University of Tartu. In addition, we are very grateful to all the teenagers who participated in the project as language ambassadors and friends, as well as their parents for approving and supporting their participation and their schools for assistance with the recruitment process and ongoing interest in the results. Finally, we extend our heartfelt thanks also to our team of transcribers.

References

Alumäe, Tanel, Ottokar Tilk & Asadullah. 2018. Advanced rich transcription system for Estonian speech. In Kadri Muischnek & Kaili Müürisep (eds.), Human language technologies – the Baltic perspective: Proceedings of the Eighth International Conference, Baltic HLT 2018 (Frontiers in Artificial Intelligence and Applications 307), 1–8. Amsterdam: IOS Press.Search in Google Scholar

Androutsopoulos, Jannis. 2005. Research on youth-language. In Ammon Ulrich, Norbert Dittmar, Klaus J. Mattheier & Trudgill Peter (eds.), An international handbook of the science of language and society, vol. 2, 1496–1505. Berlin & New York: De Gruyter Mouton.Search in Google Scholar

Androutsopoulos, Jannis. 2014. Mediatization and sociolinguistic change: Key concepts, research traditions, open issues. In Jannis Androutsopoulos (ed.), Mediatization and sociolinguistic change (Linguae & Litterae: Publications of the School of Language & Literature Freiburg Institute for Advanced Studies 36), 3–48. Berlin & Boston: De Gruyter.10.1515/9783110346831.3Search in Google Scholar

Bahtina, Daria, Helin Kask & Anna Verschik. 2021. English adjectives and Estonian nouns: Looking for agreement? Frontiers in Psychology 12. 1–14. https://doi.org/10.3389/fpsyg.2021.735232.Search in Google Scholar

Calvo-Porral, Cristina & Rogelio Pesqueira-Sanchez. 2020. Generational differences in technology behavior: Comparing millennials and Generation X. Kybernetes 49(11). 2755–2772. https://doi.org/10.1108/k-09-2019-0598.Search in Google Scholar

Cheshire, Jenny. 2005. Age- and generation-specific use of language. In Ulrich Ammon, Norbert Dittmar, Klaus Mattheier & Peter Trudgill (eds.), An international handbook of the science of language and society, vol. 2, 1552–1563. Berlin and New York: De Gruyter Mouton.Search in Google Scholar

Couldry, Nick & Andreas Hepp. 2013. Conceptualizing mediatization: Contexts, traditions, arguments. Communication Theory 3(3). 191–202. https://doi.org/10.1111/comt.12019.Search in Google Scholar

Drange, Eli-Marie, Ingrid Hasund & Anna-Brita Stenström. 2014. “Your mum!” Teenagers’ swearing by mother in English, Spanish and Norwegian. International Journal of Corpus Linguistics 19(1). 29–59. https://doi.org/10.1075/ijcl.19.1.02dra.Search in Google Scholar

Eckert, Penelope. 2003. Language and adolescent peer groups. Journal of Language and Social Psychology 22(1). 112–118. https://doi.org/10.1177/0261927x02250063.Search in Google Scholar

Hennoste, Tiit. 2003. Suulise eesti keele uurimine: Korpus [Investigating spoken Estonian: The corpus]. Keel ja Kirjandus 7. 481–500.Search in Google Scholar

Kalmus, Veronika & Signe Opermann. 2019. Operationalising Mannheim: Empirical building blocks of generational identity. Comunicazioni Sociali 2. 232–246.Search in Google Scholar

Keevallik, Leelo. 1998. Sinatamisest ja teietamisest koolilaste arvamuse põhjal [Usage of informal and formal second person forms, based on the opinions of schoolchildren]. Keel ja Kirjandus 8. 541–553.Search in Google Scholar

Kerswill, Paul, Jenny Cheshire, Susan Fox & Eivind Torgersen. 2013. English as a contact language: The role of children and adolescents. In Daniel Schreier & Marianne Hundt (eds.), English as a contact language: Studies in English language, 258–282. Cambridge: Cambridge University Press.10.1017/CBO9780511740060.015Search in Google Scholar

Koreinik, Kadri & Tõnu Tender. 2013. Eesti keeltest rahvaloendustel [On Estonian languages in censuses]. Emakeele Seltsi Aastaraamat 59. 77–102. https://doi.org/10.3176/esa59.04.Search in Google Scholar

Labov, William. 2001 [1994]. Principles of linguistic change, vol. 2, Social factors. Oxford: Blackwell.Search in Google Scholar

Loog, Mai. 1988a. Soome mõjusid Tallinna noorte keelekasutuses [Effects of Finnish on the language usage of Tallinn’s youth]. Keel ja Kirjandus 2. 83–89.Search in Google Scholar

Loog, Mai. 1988b. Soome mõjusid Tallinna noorte keelekasutuses (Järg) [Effects of Finnish on the language usage of Tallinn’s youth, continued]. Keel ja Kirjandus 3. 144–148.Search in Google Scholar

Loog, Mai. 1992. Nende sõned, sõnad: Uurimus eriealiste tüdrukute ja poiste kõnekeelest [Their word forms, their words: A study of the spoken language of girls and boys of different ages]. Tampere: Tampere University Dissertation.Search in Google Scholar

Mata, Ana Isabel, Helena Moniz, Fernando Batista & Julia Hirschberg. 2014. Teenage and adult speech in school context: Building and processing a corpus of European Portuguese. In Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk & Stelios Piperidis (eds.), Proceedings of the Ninth International Conference on Language Resources and Evaluation, 3914–3919. Reykjavik: European Language Resources Association. https://aclanthology.org/L14-1 (accessed 23 April 2023).Search in Google Scholar

Paunonen, Heikki. 1977. Uurimisobjektiks Helsingi kõnekeel [Helsinki colloquial language as an object of research]. Keel ja Kirjandus 12. 729–735.Search in Google Scholar

Peegel, Juhan. 1976. Eesti rahvuskorpuse sõduriargoost [Soldiers’ slang in the Estonian Rifle Corps of the Red Army]. Keel ja Kirjandus 6. 339–342.Search in Google Scholar

Population and Housing Census. 2021. RL214462: Estonian-speaking population by ability to speak a dialect, age group, sex, educational attainment, and place of residence (settlement region), 31 December 2021. Tallinn: Statistics Estonia.Search in Google Scholar

Praakli, Kristiina & Kadri Koreinik. 2020. Keelemuutus vs. sotsiolingvistiline muutus: Eesti keele sotsioperioodid re-revisited [Language change vs. sociolinguistic change: The socio-periods of Estonian re-revisited]. Keel ja Kirjandus 11. 915–934. https://doi.org/10.54013/kk756a1.Search in Google Scholar

Rymes, Betsy. 2020. How we talk about language: Exploring citizen sociolinguistics. Cambridge: Cambridge University Press.10.1017/9781108770194Search in Google Scholar

Rymes, Betsy & Andrea R. Leone. 2014. Citizen sociolinguistics: A new media methodology for understanding language and social life. Working Papers in Educational Linguistics 29(2). 25–43.Search in Google Scholar

Saareste, Andrus. 1927. Tundmused tegurina keelearengus [Affective factors in language change]. Eesti Keel: Akadeemilise Emakeele Seltsi ajakiri 6(7). 161–184.Search in Google Scholar

Secova, Maria. 2017. Discourse-pragmatic variation in Paris French and London English: Insights from general extenders. Journal of Pragmatics 114. 1–15. https://doi.org/10.1016/j.pragma.2017.03.014.Search in Google Scholar

Spear, Linda P. 2000. The adolescent brain and age-related behavioral manifestations. Neuroscience & Biobehavioral Reviews 24(4). 417–463. https://doi.org/10.1016/s0149-7634(00)00014-2.Search in Google Scholar

Stanford, James N. 2016. A call for more diverse sources of data: Variationist approaches in non-English contexts. Journal of Sociolinguistics 20(4). 525–541. https://doi.org/10.1111/josl.12190.Search in Google Scholar

Statistics Estonia. 2021. RL21439: Population by command of foreign languages, sex, age group, and place of residence (administrative unit), 31 December 2021. Population and Housing Census. Available at: https://andmed.stat.ee/en/stat/rahvaloendus__rel2021__rahvastiku-demograafilised-ja-etno-kultuurilised-naitajad__voorkeeleoskus-murded/RL21439.Search in Google Scholar

Stenström, Anna-Brita, Gisle Andersen & Ingrid Kristine Hasund. 2002. Trends in teenage talk: Corpus compilation, analysis and findings (Studies in Corpus Linguistics 8). Amsterdam: John Benjamins.10.1075/scl.8Search in Google Scholar

Stenström, Anna-Brita. 2006. Taboo words in teenage talk. Spanish in Context 3(1). 115–138. https://doi.org/10.1075/sic.3.1.08ste.Search in Google Scholar

Sturtz Sreetharan, Cindi L., Gina Agostini, Alexandra Brewis & Amber Wutich. 2019. Fat talk: A citizen sociolinguistic approach. Journal of Sociolinguistics 23(3). 263–283. https://doi.org/10.1111/josl.12342.Search in Google Scholar

Sukk, Marit, Kadri Soo, Veronika Kalmus, Andra Siibak & Roosmarii Kurvits. 2019. Preliminary findings of the EU kids online 2018 Estonian survey: Summary. Tartu: University of Tartu, Institute of Social Studies.Search in Google Scholar

Svendsen, Bente A. 2018. The dynamics of citizen sociolinguistics. Journal of Sociolinguistics 22. 137–160. https://doi.org/10.1111/josl.12276.Search in Google Scholar

Tagliamonte, Sali A. 2016. Teen talk: The language of adolescents. Cambridge: Cambridge University Press.10.1017/CBO9781139583800Search in Google Scholar

Tender, Tõnu. 1984. Tänapäeva õpilasslängi [Contemporary student slang]. Keel ja Kirjandus 3. 168–171.Search in Google Scholar

Vihman, Virve-Anneli, Kristiina Praakli, Maarja-Liisa Pilvik & Mari-Liis Korkus. 2022. Kas noored on inglise keelega obsessed? Millest räägivad korpusandmed? [Are Estonian teenagers obsessed with English? What do the corpus data tell us? Philologia Estonica Tallinnensis 7. 292–322.10.22601/PET.2022.07.11Search in Google Scholar

Wertheim, Suzanne. 2006. Cleaning up for company: Using participant roles to understand fieldworker effect. Language in Society 35(5). 707–727. https://doi.org/10.1017/s0047404506060337.Search in Google Scholar