Home Plastic Mandarin tones: regional identity in prosody
Article Open Access

Plastic Mandarin tones: regional identity in prosody

  • Chenzi Xu ORCID logo EMAIL logo
Published/Copyright: October 28, 2025
Phonetica
From the journal Phonetica

Abstract

Local differentiation and innovation in spoken Mandarin are now ubiquitous in metropolises in China, accelerated by the widespread promotion of the national lingua franca Standard Mandarin. This paper examines the lexical tones of Plastic Mandarin, a newly crystallised urban Mandarin dialect in Changsha, where the dominant regional variety has been Changsha Xiang. This paper establishes the lexical tone system for Plastic Mandarin using Growth Curve Analysis and explores its development by comparing its tones with those of Changsha produced by the same group of multilingual speakers, employing Generalised Additive Mixed Modelling. The findings show that, while Plastic Mandarin shares the same tone categories as Standard Mandarin, it features distinctive f0 patterns, some of which closely resemble their corresponding Changsha tones. These f0 patterns, imbued with regional identity, are likely motivated by phonetic variation, systemic constraints, social biases, and language contact. The case of Plastic Mandarin may exemplify contact-induced tonal contour changes in forming new Mandarin varieties.

1 Introduction

Millennials and younger generations in downtown Changsha, Hunan, China often speak what is called Plastic Mandarin (sùliào pǔtōnghuà, 塑料普通话). Although the term ‘Plastic’ carries a seemingly negative undertone of inauthenticity, it has increasingly become preferred accent of young locals in daily life. Broadly recognised as a non-standard Mandarin with a heavy Xiang accent, Plastic Mandarin in this paper refers to a particular Mandarin accent that features the speech of young urban residents in Changsha and that has crystallised over the past few decades. Plastic Mandarin of the young generally differs from the Mandarin accents of their parents or grandparents’ generation, many of whom failed to master or rarely use Standard Mandarin. The use of Plastic Mandarin is largely not a matter of Mandarin proficiency, but a matter of choice for young people who grew up learning Standard Mandarin since kindergarten yet choose to speak Plastic Mandarin.

Plastic Mandarin is a new variety of Mandarin that may still be quite variable but that features a core of distinctive pitch patterns. One central question of this paper is: What are the citation tones of Plastic Mandarin? Considering the linguistic repertoire of the population in the city of Changsha, this paper hypothesises that the development of the tonal system of Plastic Mandarin is closely related to the two varieties in contact, Standard Mandarin and Changsha.

This paper examines the acoustics of citation tones in Plastic Mandarin and Changsha produced by high school students in downtown Changsha and explores the development of the lexical tones of Plastic Mandarin through a comparative approach. Such a case study of crystallising regional mandarin contributes to our understanding of dialect levelling and variation in contact. This article also offers a systematic approach to characterise and compare f0 contours of lexical tones in tone languages.

1.1 The city of Changsha and its accents

Changsha is the capital of Hunan province, in south central China. Historically, Changsha has been a regional centre for administration, transportation, commerce, and culture. Nowadays, it is one of the core cities in central China with a total population up to 10 million (Hunan Provincial Bureau of Statistics 2021), covered by extensive train and bus networks, waterways, and air travel.

Changsha Xiang has been the main variety spoken in the city of Changsha, especially in the downtown area. It is largely mutually unintelligible with Standard Mandarin. Other Xiang varieties such as Ningxiang and Liuyang are also spoken in the counties under the jurisdiction of Changsha. The integration of Changsha and two neighbouring cities, Zhuzhou and Xiangtan, into the Greater Changsha Metropolitan Region (also known as the Chang-Zhu-Tan Megapolis) has also brought speakers of different Xiang varieties into Changsha. Changsha is the only city in Hunan Province that has positive net migration rate over the years (Hunan Provincial Bureau of Statistics 2017), and the great majority of migrants are from other regions within the province, thanks to its impressive economic growth in recent years. The surge of migrants in the labour market yields increasing complexity of the local linguistic repertoire, since there are numerous linguistic varieties within Hunan, many of which are mutually unintelligible.

While Standard Mandarin is required and omnipresent in television, films, radio, lectures, and formal meetings, Changsha has been dominating everyday communication and the local opera Huāgǔxì (also known as Flower Drum Opera). Contact between Standard Mandarin and Changsha in this multilingual society has resulted in a spectrum of non-standard varieties blending elements of both languages in varying degrees and manners. It is thus possible to identify two broad categories of non-standard Mandarin produced by Changsha speakers: Hunan-accented Mandarin and Plastic Mandarin. Hunan-accented (or Xiang-accented) Mandarin is characterised by perceptual and production differences in segmental phonology between Xiang vernaculars and Standard Mandarin, while largely retaining the tone patterns of Standard Mandarin. For example, some Changsha speakers are not perceptually sensitive to the distinction between /ʂ/ and /s/ in Mandarin, a non-native contrast in many Xiang vernaculars including Changsha. This is illustrated in two Hunan-accented Mandarin examples [1] shī shēng and [2] dú shū by a Changsha speaker taken from a transcript in Wu (2005: 29). The corresponding Standard Mandarin, Changsha, and Plastic Mandarin pronunciations are provided for comparison. [1] and [2] demonstrate that the Changsha speaker employs either the alveolar fricative [s] or the alveolo-palatal fricative [ɕ], acoustically similar segments available in the vernacular Changsha, as alternatives to the retroflex fricative [ʂ] in Standard Mandarin, while showing no difficulty in producing Standard Mandarin tones. Such proficiency-based variation also includes the realisation of syllable onsets /n/ as [l], /tʂ/ as [ts], / tʂʰ/ as [tsʰ], /x/ as [f], and the syllable coda /ŋ/ as [n], patterns that are commonly observed among adult Xiang learners of Mandarin. These segmental features are often stigmatised or regarded as production errors, while compliance with the Standard Mandarin segmental features is associated with ‘educatedness’ and oral competence in contexts where Mandarin is expected.

[1]
师 生

teacher(s) and student(s)
shī shēng
Standard Mandarin ʂəŋ]
Changsha [sz̩ sən]
Hunan-accented Mandarin [sz̩ sən]
Plastic Mandarin ʂəŋ]
[2]
读 书

read books
dú shū
Standard Mandarin [tu ʂu]
Changsha [təʊ ɕy]
Hunan-accented Mandarin [tu ɕy]
Plastic Mandarin [tu ʂu]

In contrast, we can see that in [1] and [2] the segments in Plastic Mandarin align with those in Standard Mandarin, but the tones differ. The phoneme inventory of Plastic Mandarin is largely identical to that of Standard Mandarin (Jing and Niu 2010), though there may be individual variation. The distinctive tonal patterns serve as the primary perceptual and social cues for Plastic Mandarin. Interestingly, prosodic deviations from Standard Mandarin (as in Plastic Mandarin) appear to be considerably less stigmatised than segmental deviations (as in Hunan-accented Mandarin).

1.2 The crystallisation of Plastic Mandarin

One of the earliest mentions of this term Plastic Mandarin in academic work appears in Wu (2005: 22), referring to “a variety that combines the features of the local Changsha dialect with features of Standard Mandarin”. Translated literally from the term 塑料普通话 used by local speakers, it was originally an umbrella term covering various Mandarin-vernacular hybrid varieties, including L2 Mandarin accents spoken by adult Xiang learners of Mandarin. Hence, the boundary between Plastic Mandarin and Hunan-accented Mandarin is not always clearly delineated.

In recent decades, the referent of Plastic Mandarin has become crystallised and increasingly identified as a particular non-standard Mandarin with distinct and consistent tonal patterns spoken by young people. This was validated in Xu et al. (2012)’s study: about 90 % of their 104 survey respondents in Changsha correctly identified the recording of Plastic Mandarin among recordings of four major local varieties, including Standard Mandarin, Plastic Mandarin, Changsha-accented L2 Mandarin, and Changsha; more than half (55.8 %) of them differentiated Plastic Mandarin from Changsha-accented L2 Mandarin. The gap between identification and differentiation is largely attributable to the fact that some participants considered Changsha-accented L2 Mandarin as a variant of Plastic Mandarin. Despite the absence of authoritative definition or standard, most respondents could recognise and were highly aware of Plastic Mandarin. Furthermore, Jing and Niu (2010) observed that the most speakers of Plastic Mandarin were below 30 years old, and those with high academic qualifications were capable of and inclined to code-switching between Standard Mandarin, Plastic Mandarin, and Changsha.

Plastic Mandarin is spreading to broader geographic and social space and to a wider range of age groups, which has come to be linked increasingly to Changsha. Considering that older generations in Changsha do not speak Plastic Mandarin, it was likely formed in the past two or three decades, during a period when Standard Mandarin was extensively promoted through education and media under national language legislation and strong language policy. This push toward linguistic standardisation can be seen in the designation of the third week of every September, since 1999, as the “Putonghua (Standard Mandarin) Promotion Week” across the country, an annual national event jointly guided by nine central government bureaus including the Ministry of Education. Activities and competitions for the annual event are organised through the public school system and by various state and cultural institutions.

1.2.1 From de facto lingua franca in schools to public service

Plastic Mandarin has been predominantly used in schools, particularly in secondary schools, in urban Changsha. It thrives on the gap between policy and practice – despite the formal expectation that Standard Mandarin be used in schools, the Hunan-accented Mandarin produced by many experienced local teachers, who are themselves adult learners of Standard Mandarin, is tolerated and in many cases, has to be. From the perspective of Standard Mandarin acquisition, Plastic Mandarin is generally considered more standard and intelligible than Hunan-accented Mandarin. As such, speaking Plastic Mandarin is widely accepted as a comfortable and covertly legitimised form of professional conduct.

Plastic Mandarin is in fact the de facto lingua franca in schools. Wang (2018) reported that 36 out of 37 subjects in their study attended secondary schools where the major medium of communication was Plastic Mandarin. According to a brief survey reported in Xu (2022), which draws on the same participant group as the present study, 20 out of 21 high school students always or often using Plastic Mandarin to communicate with their classmates and friends at school (one participant did not complete the relevant item). Notably, 13 of these 21 students reported always or often speaking to their teachers in Plastic Mandarin – even during class, one of the most formal settings within the school environment. In Xu (2022), considerably fewer subjects reported speaking to their parents (8 out of 21) or grandparents (3 out of 21) in Plastic Mandarin than to their classmates and friends, suggesting that Plastic Mandarin functions as a sociolect of the younger generation. Plastic Mandarin may not be the dominant variety students encounter on university campuses unless their social networks are locally-based, since university admissions are typically a mixed intake of students from different parts of China.

The use of Plastic Mandarin extends beyond the realm of education and co-exists with Standard Mandarin and Changsha in many public spaces. About half the adolescent participants in Xu (2022) always or often speak Plastic Mandarin in public space too. Jing and Niu (2010) interviewed 42 Changsha locals who are competent ‘trilinguals’ in Standard Mandarin, Plastic Mandarin and Changsha, and found that for these speakers, Plastic Mandarin was also frequently used in workplace and public sphere. Especially in high-end department stores, restaurants, hotels, post offices, banks, airports, and train stations, more than three-quarters of them preferred to communicate in Plastic Mandarin.

1.2.2 From adolescents to wider age groups

Plastic Mandarin is first widely discernible in the speech of adolescents and young adults, based on the author’s personal observation. Xu et al. (2012) surveyed 104 Changsha residents aged from 15 to 55 years old, and less than 30 % of them used Plastic Mandarin in various contexts. The lower usage rate of Plastic Mandarin in their study compared to Jing and Niu (2010) is mostly likely attributed to the lower percentage of young people in the participants (only one secondary school student).

Plastic Mandarin, however, is not confined to teenagers. The generation born after 1980s and 1990s and proficient in Plastic Mandarin may have been reshaping the linguistic landscape of urban Changsha when they entered the workplace and started families. For instance, the presence of Plastic Mandarin was noticeable in the speech of primary school students on the reality television show Grade One produced by Hunan Broadcasting System, which was aired in 2014. The show featured seven students, aged 5 and 6, as they embarked on their first semester in a Changsha elementary boarding school. Through 120 cameras, the show documented interactions between these young children and their parents, classmates, and teachers. A significant number of these interactions featured Plastic Mandarin, even in moments at home with their parents or during emotionally charged situations such as crying or losing their temper in conflicts. Children who acquired Plastic Mandarin from their caregivers were naturally accustomed to using it in everyday communication. While it is important to note that the students on the reality television show may not be representative of all young children in Changsha, and that the potential effects of camera lenses and editing should be taken into consideration, the show does afford a window on the language practices of young children from relatively well-off families in Changsha. Additional evidence comes from Mao (2009), in which 93 out of 120 urban Changsha children aged 3 to 12 were reported to use Plastic Mandarin in their conversational speech.

1.3 Contact and acquisition: the formation of a new Mandarin dialect

The scholarly discourse on the nature of Plastic Mandarin revolved around two theoretical strands: (1) Plastic Mandarin as a fossilised interlanguage (Jing and Niu 2010; Mao 2009) and (2) Plastic Mandarin as a contact-induced new variety (Fu 2010). The former concept of interlanguage in the field of second language acquisition is specific to individual non-native speakers, while the latter concept in the field of sociolinguistics places the focus on collective linguistic behaviour arising in multilingual communities. Both contact and acquisition are essential factors in the formation of Plastic Mandarin.

Interlanguage refers to the language system that learners develop in the process of acquiring a target language and has a structurally intermediate status between the native language and the target language (Selinker 1972). The accented Mandarin produced by older adults in Changsha tends to exhibit characteristics of interlanguage, i.e. permeability, dynamism, and systematicity. There are some uniform patterns alongside considerable individual variability in the accented Mandarin, as evident in the classification of heavy, medium, and light accents in Fu (2010)’s examination of accented Mandarin produced by 200 native Changsha speakers aged from 16 to 55. The homogeneous linguistic forms deviant from the norm of Standard Mandarin in Plastic Mandarin, nevertheless, should not be analysed as persistent errors or failed approximations as in the fossilised interlanguage account, since most Plastic Mandarin speakers in this study were aware of the mismatch between Plastic Mandarin and Standard Mandarin and capable of producing more standard forms.

The sociolinguistic perspective of contact and new dialect formation is highly relevant to the case of Plastic Mandarin. For the millennial generation, the norms of Plastic Mandarin were not introduced by their parents nor grandparents. The lack of continuity between generations serves as a marker of a new dialect (Kerswill 2020). The contact between Changsha and Standard Mandarin in the city of Changsha may not be as apparent, given that there has not been large-scale migration of Standard Mandarin speakers to Changsha in recent years. The contact accompanied the process of adopting Standard Mandarin under the language policy and was sustained and accelerated in the digital space including mass media and digital communication. Considering Changsha and Standard Mandarin, one might regard Changsha as a case of extended diglossia (Fishman 1967) with bilingualism, where two varieties coexist in the society but are used in different domains: Standard Mandarin with a highly codified system is the High variety used in written form and formal domains and Changsha is the Low variety marginalised in formal education and thriving in ordinary conversations at home and wet market. The compartmentalisation by usage domains of the two varieties in a diglossic situation is often not clear-cut or stable in reality (Sayahi 2020), especially when coupled with non-native competence. Changsha presents a fluid diglossic situation, or diaglossia in Auer (2005)’s term, as the local linguistic landscape is also filled up with intermediate variants.

The rise of homogeneous Plastic Mandarin during the contact, akin to the focusing stage in Trudgill’s three-stage theory of new-dialect formation (Trudgill 2004), complicates the functional distribution of varieties. Plastic Mandarin tends to encroach upon the High variety in contexts such as education and corporate meetings as well as infiltrate the Low variety in some informal contexts. Zhang et al. (2024) have observed an ongoing change in language use in Changsha, identifying vernacular-preserving and vernacular-lost groups of speakers.

1.4 The citation tones of Standard Mandarin and Changsha

Given that tone plays a central role in distinguishing Plastic Mandarin from other varieties, it is essential to first review the citation tones of Standard Mandarin and Changsha, as displayed in Tables 1 and 2, where each tone category is illustrated with at least one [ma] syllable, using both tone letters and numerals (Chao 1930). While the tones of Standard Mandarin are well-established (e.g. Lee and Zee 2003), the description of Changsha tones is based on the seminal work of Firth and Rogers (1937), Beijing University (1995), Li (1998), Wu (2005), and Bao and Chen (2005). Recent work by Wu (2023) suggests slightly different Changsha tone values (e.g. T1 is 34 instead of 33; T2 is 223 instead of 13; T3 is 43 instead of 41; T5 is 31 instead of 21), though the findings are based on data from a single speaker. In the ‘Tone’ column, the Arabic tone numbers are arbitrary but conventional, and their historical tone category is also labelled. The tone categories in Changsha and Standard Mandarin that evolved from the same Middle Chinese category are mostly labelled with the same number to indicate their connection. For instance, Tone 2 in Standard Mandarin and Tone 2 in Changsha are both evolved from the yáng píng category of Middle Chinese.

Table 1:

Citation tones of Standard Mandarin.

Tone Description Example
yīn píng ˥ 55 high – level [ma] ‘mother’ 妈 ‘wipe’ 抹
yáng píng 35 mid – rising [ma] ‘hemp’ 麻
shǎng 214 low mid – dipping [ma] ‘horse’ 马
21 low – falling
51 high – falling [ma] ‘abuse’ 骂 [pʰa] ‘fear’ 怕
Table 2:

Citation tones of Changsha.

Tone Description Example
yīn píng 33 mid – level [ma˧] ‘mother’ 妈
yáng píng 13 low – rising [ma] ‘hemp’ 麻
yīn shǎng 41 mid high – falling [ma] ‘horse’ 马
yīn qù 45 mid high – rising [pʰa] ‘fear’ 怕
yáng qù 21 low – falling [ma] ‘abuse’ 骂
24 low mid – rising [ma] ‘wipe’ 抹

In Standard Mandarin (Table 1), Tone 3 has two common variants, excluding the sandhi form: the full form and the reduced form , although the former is the prescriptive form in textbooks. Tone 3 is typically reduced to a simple low fall tone [ 21] without the rising tail in non-utterance final positions (Chao 1968). Changsha has a more complicated tone system than Standard Mandarin, with 6 distinct citation tones. The tonotactic gap in the [ma] set in Table 2 reflects that Changsha evolved from the ‘register distinction’ of tone (Norman 1988) in Middle Chinese. In other words, Tone 4 in Changsha usually co-occur with syllables with a voiceless obstruent onset. Therefore, [pʰa] with a voiceless bilabial stop onset is employed instead to illustrate the yīn qù tone (Tone 4) in Changsha. Changsha is not standardised and sometimes the /a/ vowel may be realised in a more retracted and raised manner as [ɐ] or [ɔ] by some speakers for instance.

1.5 Research questions

Tone has been identified as one of the prosodic aspects of language that is highly diffusible through language contact (Ratliff 2015). Plastic Mandarin manifests Mandarin tonal variation and change in a multilingual community.

The primary objective of this study is to establish the tone system of Plastic Mandarin through phonetic-acoustic analysis. A few prior studies, such as Jing and Niu (2010), Xu et al. (2012), Wang (2018), and Shen (2019), have reported the four citation tones of Plastic Mandarin, but these were either based on impressionistic descriptions or fundamental frequency (f0) measurements of data from a single speaker. Building upon these studies, this empirical study examines the prototypical f0 contours of citation tones in Plastic Mandarin employing speech data from a group of high school students, the primary social group associated with this variety and for whom Plastic Mandarin constitutes a predominant part of their everyday linguistic repertoire.

The second objective of this study is to explore the potential mechanisms underlying the formation of Plastic Mandarin tones. During language contact, tonal features from the two source varieties – Standard Mandarin and Changsha – coexist. Adolescents may have selectively adopted certain Changsha features associated with local identity into their Mandarin, contributing to the emergence of a distinct tonal system. In order to attest the hypothesis that Plastic Mandarin has borrowed certain tone patterns from Changsha, one has to at least demonstrate that similarities in pitch contours between Plastic Mandarin and Changsha tones are strongly present. A parallel dataset of Plastic Mandarin and Changsha citation tones, produced by the same group of multilingual participants, serves as a perfectly controlled research ground where we can track both group-level patterns and speaker-specific variation. This dataset enables us to evaluate the potential correspondence between each individual’s Changsha and Plastic Mandarin tone production.

Specifically, the present study addresses the following specific questions:

  1. What is the prototypical pitch contour of each tone in Plastic Mandarin?

  2. How similar the pitch contours are between the tones of Plastic Mandarin and Changsha?

2 Methods

Citation tone data in Plastic Mandarin and Changsha were elicited from high school students in Changsha, who represent ideal sources of Plastic Mandarin data due to their frequent and fluent use of Plastic Mandarin. The data collection, conducted in 2020, was part of a larger project (Xu 2022) that included several speech production tasks and a brief sociolinguistic survey on language use. In the spontaneous conversation task, participants also made explicit comments regarding speaking Plastic Mandarin.

2.1 Participants

Twenty-one young adults contributed to the elicitation of Plastic Mandarin tones (sixteen females and five males; mean age = 17.24 years, SD = 0.70 years). Their demographic profiles are presented in Table 3. They were all students at the Changsha Nanya Middle School and recruited from pre-existing friendship groups. Changsha Nanya Middle School is a boarding school where students are connected not only through classes and academic work, but also through extracurricular, social, and daily activities. This environment fosters a speech community characterised by dense and multiplex social networks and high frequencies of Plastic Mandarin interaction.

Table 3:

Demographic profile of Plastic Mandarin participants. Self-assessed Changsha proficiency was measured on a five-point Likert scale, ranging from 1 (“I do not speak Changsha”) to 5 (“I speak Changsha fluently”).

Speaker ID Sex Age Place of birth Years in Changsha Self-rated Changsha proficiency
101 Female 17 Zhangjiajie, Hunan 12 4
102 Female 17 Changsha, Hunan 17 5
103 Female 17 Changsha, Hunan 17 4
104 Female 17 Changsha, Hunan 17 4
105 Male 17 Loudi, Hunan 17 4
106 Male 18 Loudi, Hunan 18 4
107 Male 17 Changsha, Hunan 12 5
108 Female 17 Yongzhou, Hunan 17 2
109 Female 17 Zhuzhou, Hunan 6 3
110 Female 17 Yueyang, Hunan 17 4
111 Female 17 Changsha, Hunan 17 4
112 Female 17 Changsha, Hunan 17 4
113 Female 17 Changsha, Hunan 17 5
114 Female 17 Changsha, Hunan 17 4
115 Female 17 Changsha, Hunan 17 4
116 Female 17 Jishou, Hunan 6 2
117 Female 17 Changsha, Hunan 17 5
118 Male 17 Changsha, Hunan 17 2
119 Male 17 Changsha, Hunan 17 2
120 Female 18 Changsha, Hunan 18 2
121 Female 20 Changsha, Hunan 20 3

Fourteen participants were born and raised in Changsha; the others were born in nearby cities in Hunan province and moved to Changsha at an early age. On average, the participants had spent 15.71 years in Changsha at the time of elicitation. At home, many boarding school students also speak Changsha or other Xiang vernaculars, particularly with their grandparents; 16 out of 21 participants (76 %) reported always or often doing so. Thirteen of them who grew up in downtown Changsha, reported Changsha as their only Xiang vernacular, and self-rated their proficiency as at least a moderately high (a score of 4 or 5 on a 5-point Likert scale; highlighted in grey in Table 3) contributed to the elicitation of Changsha tones. This subgroup consisted of ten females and three males (mean age = 17.08 years, SD = 0.28 years). No history of speech or hearing disorders was reported. Written informed consent was acquired from subjects prior to recording.

2.2 Materials

The materials for Plastic Mandarin recordings contain four sets of Mandarin monosyllables composed of different segments; within each set these syllables differ only in tone (see Table 4). These sets of Mandarin syllables were selected because (1) their Changsha counterparts (cognates) share a similar or matched segmental composition, facilitating tonal comparison; (2) they are high-frequency words; and (3) their syllable structures are relatively simple – these syllables are either vowel-only or consist of a consonant-monophthong structure, featuring three vowel contexts /i/, /u/, /a/. The majority of them are fully voiced to minimise potential f0 perturbation introduced by voiceless consonants. The corresponding Changsha sets in Table 5 were used for Changsha recordings.

Table 4:

Speech materials for Plastic Mandarin citation tones.

IPA Pinyin Tone
T1 T2 T3 T4
[iː] yi

clothes


aunt


ant


desire
[tiː] di

low


flute


bottom


earth
[uː] wu

house


Wu (surname)


martial


fog
[maː] ma

mother


hemp


horse


scold
Table 5:

Speech materials for Changsha citation tones.

IPA Pinyin Tone
T1 T2 T3 T4 T5 T6
[iː] yi

clothes


aunt


ant


desire


easy


one
[tiː] di

low


flute


bottom


emperor


earth


flute
[uː] wu

black


Wu


martial


task


fog


house
[maː] (*[pʰa:]) ma (*pa)

mother


hemp


horse
怕*

fear


scold


wipe

2.3 Procedure

The elicitation of Plastic Mandarin and Changsha was mostly conducted in a sound-attenuated music room in the Changsha Nanya middle school, China. Two additional Plastic Mandarin speakers were recorded in a sound studio in the Phonetics Laboratory, University of Oxford. Each participant had a Rode Lavalier microphone clipped to their collar. The recording was made on one channel at a sampling rate of 44.1 kHz and 16-bit quantization using a Marantz (PMD661 MK.II) digital audio recorder.

Participants were instructed to articulate the monosyllables and repeat each six times, with pauses in between. The instructions and materials were presented on a piece of paper and on a computer screen in written Chinese. Participants were self-paced without interruptions and all utterances of a participant were recorded in one WAV audio file.

The participants were not trained to be aware of the citation tones of Plastic Mandarin since the tones have not been explicitly established. Instead, they had overt knowledge of the four tones of Standard Mandarin due to years of formal learning and training in schools. To prevent participants habitually switching to Standard Mandarin, they were instructed to think of or think aloud words, phrases, or any utterances that contain the target syllable in Plastic Mandarin before the monosyllabic articulation. Prior to the monosyllabic elicitation, participants engaged in other tasks conducted in conversational Plastic Mandarin with their friend (e.g. Xu 2024). The researcher, a speaker of both Plastic Mandarin and Changsha, conducted the sessions in the language corresponding to the elicitation task.

2.4 Acoustic measurement

The recordings were time-aligned to Chinese characters at the syllable level using the Penn Phonetics Lab Forced Aligner (P2FA) for Chinese (Yuan and Liberman 2008, 2015). The output TextGrid files containing the temporal boundaries of each syllable were checked against the corresponding audio files and misalignment errors were manually corrected with the aid of spectrograms and waveforms. The interval of each target syllable, padded with 50 ms silence before and after, were extracted and saved as individual WAV files. Such zero-padding prevented information loss due to windowing in subsequent acoustic analysis.

The f0 contour of the entire voiced portion of each syllable was initially extracted in the first pass, revealing that f0 estimates at the syllable edges were often inaccurate, primarily due to creaky voice associated with speech initiation and termination. As discussed in Xu and Zhang (2024), f0 was extracted over the entire voiced portion of each syllable in citation form, without excluding any segment a priori based on certain phonological assumptions. This study then adopted an intensity-dependent semi-automatic approach to trim the edges of f0 tracks of monosyllables, whereby the intensity was measured along with each f0 point. For monosyllables, the intensity curve typically resembles an inverse U-shape: the beginning and end exhibit a steep rise and a plunge respectively, while the middle portion reaches a plateau. The algorithm filters out f0 measurements in edge intervals of low intensity, by excluding edge f0 estimates whose intensity falls more than one standard deviation below the mean intensity of the token. This process was achieved by a customised Python script that employs the Parselmouth library (Jadoul et al. 2018) to implement Praat measurements (Boersma 1993). As demonstrated in Figure 1, f0 measurements outside the specified interval were excluded from further analysis. Empirically tested, this method was conducive to excluding inaccurate f0 estimates at edges as well as attenuating effects attributed to a neighbouring voiceless consonant. The plateau intervals were visually checked and revised if it excluded a significant portion of a token, which would result in information loss. Within each f0 track interval, f0 estimates were obtained at 21 equidistant points, corresponding to intervals of every 5 % of the duration. f0 measurements were further checked and manually corrected for artifacts such as octave errors. Utterances with a creaky portion in the middle of the vowel were discarded due to missing f0 measurements. In total, there are 2,618 f0 contours of Plastic Mandarin tones and 1,848 f0 contours of Changsha tones.

Figure 1: 
Trimmed f0 track (black solid) and intensity (pink dashed) of the syllable 雾 wu in Plastic Mandarin. f0 track segments outside the marked interval were not considered.
Figure 1:

Trimmed f0 track (black solid) and intensity (pink dashed) of the syllable 雾 wu in Plastic Mandarin. f0 track segments outside the marked interval were not considered.

2.5 Statistical analysis

2.5.1 f0 normalisation

To allow inter-speaker comparison, all f0 values of a language variety were normalised to semitones above/below each speaker’s mean f0 of that variety. Given the slightly imbalanced data, the speaker’s mean f0 was defined as the equally weighted mean f0 across all tones. In other words, the speaker’s mean f0 represents the centre of the tonal space, comprising a balanced set of all citation tones (Xu and Zhang 2024). Each f0 measurement in Hz was transformed into semitone using the following equation:

f 0 semitone = 12 log 2 1 f s

where the reference frequency value in this semitone transformation is the speaker mean f s . Each normalised f0 measurement is essentially the semitone difference between the measured f0 and the speaker’s mean f0. This normalisation method was considered the best among many others in preserving phonemic variation as well as sociolinguistic variation (Zhang 2018).

2.5.2 Modelling the prototypical f0 contours

This study utilised Growth Curve Analysis (GCA) to model the prototypical tone contours in Plastic Mandarin and Changsha. The normalised f0 contours were modelled with second-order orthogonal polynomials and fixed effects of tone category on all time terms. Since the f0 contour of a monosyllable is usually not complicated, it is desirable to keep the degree of polynomials low to ensure that all coefficients are straightforward to interpret, and to potentially filter out unstructured information. For orthogonal polynomials, the intercept term corresponds to the overall average of a curve, the linear coefficient captures the steepness of the linear change, and the quadratic coefficient captures the sharpness of the centred parabolic peak (Mirman 2014).

The models included random effects of participants on all times terms and of vowel types on intercept and quadratic terms. In the Changsha model, correlations between the specified terms in estimating random effects are retained. In the Plastic Mandarin model, the correlation between the intercept and quadratic terms for the random effect of vowel type was removed due to the failure to converge with the maximal random effect structure.

To arrive at a tone letter or numeral representation (Chao 1930), the predicted f0 values in semitone from the growth curve models are further scaled to the range of [0, 5], where five one-unit intervals in-between represent the five pitch height zones, and each zone corresponds to a number in the tone stave. A rescaled f0 close to the boundary between two pitch zones may be assigned flexibly, depending on the contrastiveness among the tones. Representing Plastic Mandarin tones using Chao’s system aligns the quantitative f0 contour analysis with the conventions of citation tone studies, situating the findings within the existing body of literature and enabling cross-varietal discussions.

2.5.3 Comparing f0 contours in Plastic Mandarin and Changsha

Leveraging their greater flexibility over GCA, Generalised additive mixed models (GAMMs) were then used to visually assess the nonlinear difference over time between two similar tone patterns. Data from the thirteen speakers who contributed to recordings in both Changsha and Plastic Mandarin were aggregated and formed a comparative within-speaker dataset. The speaker mean was calibrated using data from both Plastic Mandarin and Changsha for the same speaker.

In GAMMs, a baseline smooth function over time indexes was fitted to the normalised f0 contours of one variety and another smooth function represented the non-linear f0 differences of the other variety to the baseline. Considering the intrinsic vowel f0 differences, vowel is also included as another predictor of the normalised f0. Following Sóskuthy (2017), the variables coding the difference between the two sets of contours were converted to ordered factors with contrast coding before the model fitting. For the random structures, the random reference smooth by speaker plus random difference smooths of language variety × vowel were used. Sóskuthy (2021) suggests that the random reference/difference smooths are the appropriate tool for addressing within-speaker dependencies in the presence of effects that vary within speakers. After initial model-fitting, diagnostic tests were performed. When analysing time series data such as f0, the residuals at time t (i.e. the difference between the observed and the estimated values) are often correlated with residuals at preceding time steps, especially at t−1 and hence an autoregressive at lag 1 (AR1) component was incorporated in the models (Xu 2024). The Tone 4 model was further modelled with a scaled-t distribution due to that the distribution of the residuals clearly has two heavy tails.

All analyses were carried out in R version 4.4.0 (R Core Team 2024) using the lme4 1.1-35.3 (Bates et al. 2015), lmerTest 3.1-3 (Kuznetsova et al. 2017), mgcv 1.9-1 (Wood 2022), and itsadug 2.4.1 (van Rij et al. 2022) packages. Details of the model specifications and diagnostic tests are provided in the supplementary materials available at the Open Science Framework repository https://osf.io/x4jq2/.

3 The prototypical Plastic Mandarin tones

Prior to the GCA analysis, it is helpful to visualise and compare the variation exhibited in the f0 data throughout the course of a tone. Figure 2 presents the kernel density estimation of normalised f0 data at every 5 % intervals across the voicing portion of syllables grouped by tone category. Most f0 distributions have a single peak, indicating the mode. The data indicate that the Plastic Mandarin tone patterns produced by these participants are highly homogenous. The ridge, formed by the concatenation of these peaks at all times, represents the most likely f0 contour for each tone. Tone 1 and Tone 2 have relatively similar f0 distribution at the end of a syllable, but Tone 2 is characterised by a distinct curvy shape. The f0 distribution of Tone 4 predominantly occupies a higher frequency range, with minimal overlap with other tones. Its f0 values display a wider spread towards the end of the syllable, compared to the other tones. Tone 3 exhibits a downward f0 trend from the onset and likewise shows slightly increased dispersion in f0 in the latter portion of the syllable.

Figure 2: 
Distribution of f0 (semitone) of citation tones in Plastic Mandarin across time. (a) Superimposed f0 distributions of all four tones of Plastic Mandarin, for comparison; (b) f0 distribution for each tone separately, for clarity.
Figure 2:

Distribution of f0 (semitone) of citation tones in Plastic Mandarin across time. (a) Superimposed f0 distributions of all four tones of Plastic Mandarin, for comparison; (b) f0 distribution for each tone separately, for clarity.

The data and model fits of GCA are shown in Figure 3 and Table 6 shows the fixed effect parameter estimates and their standard errors along with corresponding t- and p-values. In Table 6, the Tone 1 condition was treated as the baseline and parameters were estimated for other tone conditions. Parameter-specific p-values were estimated using the Satterthwaite’s approximation via lmerTest library (Kuznetsova et al. 2017) in R. We can see that the four citation tones of Plastic Mandarin are statistically significantly different from one another, even when considering only their means (intercept term). Tone 1 tends to be in the centre of the tonal space comprising the four tones, while Tones 2 and 3 are lower. Tone 4 is the highest tone, on average over 4 semitones higher than the speaker mean, whereas Tone 3 is the lowest tone, on average about 3.5 semitones lower than the speaker mean. The slope of Tone 1 is very close to 0, indicating its flat linear trend. Both Tone 2 and Tone 4 are rising tones, while Tone 3 has a falling f0 trajectories. Tone 2 has the largest positive quadratic term, indicating a concave upwards shape in the centre of the utterance. In contrast, Tone 4, while also a rising tone, exhibits a concave downward shape. Other tones are generally much less curvy than Tones 2 and 4.

Figure 3: 
Observed data (error bars indicate ± standard error) and growth curve model fits (lines) for f0 contour of each tone in Plastic Mandarin.
Figure 3:

Observed data (error bars indicate ± standard error) and growth curve model fits (lines) for f0 contour of each tone in Plastic Mandarin.

Table 6:

Parameter estimates for analysis of f0 contour of each tone in Plastic Mandarin.

Estimate Std. error t p
Intercept 0.166 0.182 0.913 0.422
Linear 0.150 0.280 0.535 0.598
Quadratic 0.289 0.361 0.801 0.453
Tone 2 −2.180 0.021 −102.274 ***
Tone 3 −3.712 0.022 −171.629 ***
Tone 4 3.954 0.021 186.296 ***
Linear: Tone 2 3.255 0.098 33.322 ***
Linear: Tone 3 −4.602 0.099 −46.430 ***
Linear: Tone 4 1.849 0.097 19.014 ***
Quadratic: Tone 2 2.461 0.098 25.194 ***
Quadratic: Tone 3 0.128 0.099 1.293 0.196
Quadratic: Tone 4 −1.781 0.097 −18.309 ***
  1. ***p < 0.001.

The prototypical contour of the four tones is illustrated in Figure 3. To summarise, Plastic Mandarin comprises the following four distinct tones: a mid-level tone [], a low-to-mid rising tone [], a low falling tone [], and a high rising tone [].

4 Changsha tones and their comparison with Plastic Mandarin

4.1 Changsha tones

Figure 4 visualises the distribution of normalised f0 data at every 5 % intervals across the voicing portion of syllables grouped by tone category in Changsha. It shows that the tonal realisations among participants are largely internally consistent. For Tones 1, 2, 3, and 6, the distributions are unimodal, with the ridge representing the most likely f0 contour. The f0 values of Tone 4 are primarily distributed above 0, although there are some much thinner bands below 0 in Figure 4(b), representing deviations. Upon check, two additional elicited variants of the Tone 4 morpheme [uː] 务 ‘task’ were identified. However, these variants were not consistent across the group, with each one produced by only a single participant. The f0 distribution of Tone 5 bifurcates and there are two different f0 patterns: one resembles the high rising Tone 4 and the other largely occupies a lower f0 region below 0. The bimodality exhibited in their production of Tone 5 is mostly attributed to the alternative pronunciation of the Tone 5 morpheme [uː] 雾 ‘fog’. Both pronunciations are acceptable, and this is related to a more ubiquitous phenomenon among Chinese varieties that some characters have differentiated literary and vernacular readings (Ho 2015). The doublets of readings result from language contact and reflect lexical stratification under the influence of the standard language of education and literary instruction at various periods (Norman 1979), especially for non-Mandarin regions such as Changsha. The two patterns in Figure 4(b) align with Li (1998)’s observations: the vernacular reading of many Tone 5 syllables is a low falling tone (the majority of Tone 5, historical yáng qù tone), while their literary reading is a high rising tone (the majority of Tone 4, historical yīn qù tone).

Figure 4: 
Distribution of f0 (semitone) of citation tones in Changsha across time. (a) Superimposed f0 distributions of all six tones of Changsha, for comparison; (b) f0 distribution for each tone separately, for clarity.
Figure 4:

Distribution of f0 (semitone) of citation tones in Changsha across time. (a) Superimposed f0 distributions of all six tones of Changsha, for comparison; (b) f0 distribution for each tone separately, for clarity.

The variations in Changsha tones shown in Figure 4 suggest that Changsha is not as strictly prescribed as Standard Mandarin and that Changsha may be undergoing tonal change. In many modern Mandarin varieties, the historical Tone has been maintained as a single tone category (Mandarin Tone 4). In contrast, it was split into two registers that became modern Tones 4 and 5 in Changsha. Peng (2006) thus predicted a trend toward the merging Tone 4 and Tone 5 in Changsha, reflecting the levelling effect of Changsha becoming more similar to Mandarin.

Figure 5 illustrates the prototypical contour of the six tones, excluding the less common variants. Changsha tones produced by these young Changsha participants can be represented as 33, 23, 43, 45, 21, 24. It is worth noting that a 5-level system may not precisely depict the exact f0 height but provides a simplified and stylized representation that preserves the phonological contrasts. Although both Tone 3 and Tone 4 begin with the numeral “4”, the actual starting pitch for Tone 4 is much higher than that of Tone 3. Tone 3 is represented as beginning with “4” to maintain a clear contrast with Tone 1. A comparison between the current Changsha data and tone values documented in the literature in Table 2 suggests that Tone 3 in Changsha may have shifted from /41/ to /43/.

Figure 5: 
Observed data (error bars indicate ± standard error) and growth curve model fits (lines) for f0 contour of each tone in Changsha.
Figure 5:

Observed data (error bars indicate ± standard error) and growth curve model fits (lines) for f0 contour of each tone in Changsha.

4.2 Cross-varietal comparison

Visually inspecting the prototypical tone patterns in Figures 3 and 5, several similar tone pairs of the same (historical) tone category can be identified. For instance, Plastic Mandarin and Changsha have comparable Tone 4s, Tone 2s, and Tone 1s. In addition, Plastic Mandarin Tone 3 and Changsha Tone 5 have a similar contour.

4.2.1 The high rising Tone 4 and level Tone 1

Figure 6 illustrates the model predictions for the two groups of Tone 4 contours with 95 % pointwise confidence intervals. Overall, Changsha Tone 4 is higher than Plastic Mandarin Tone 4. The parametric term of language variety from the GAMM is statistically different from zero (t = −3.45, p < 0.001), indicating that they have different f0 heights. By removing such effect of language variety on overall f0 contour height through centring the predicted f0 (see Figure 6b), we can find that the contour shape between Tone 4 of the two varieties does not statistically significant differ between the two varieties over the course of the tone.

Figure 6: 
Estimated f0 trajectories of high rising Tone 4s and comparison. (a) Estimated f0 trajectories (GAMM fits) of Plastic Mandarin and Changsha Tone 4, with 95 % pointwise confidence intervals. (b) Same trajectories, centred to facilitate shape comparison.
Figure 6:

Estimated f0 trajectories of high rising Tone 4s and comparison. (a) Estimated f0 trajectories (GAMM fits) of Plastic Mandarin and Changsha Tone 4, with 95 % pointwise confidence intervals. (b) Same trajectories, centred to facilitate shape comparison.

There is some variation in the individual realisation of Tone 4 in Plastic Mandarin. Figure 7 shows f0 contours of Tone 4 in Plastic Mandarin and Changsha by each speaker in an overlaying manner. Speaker d104 produced mostly high falling Tone 4 in Plastic Mandarin, distinct from the corresponding high rising Tone 4 in Changsha. For speakers d103, d105, and d106, some of their Plastic Mandarin Tone 4 contours also feature a falling tail. In contrast, the remaining speakers, in particular speakers d102, d110 and d112, produced Plastic Mandarin and Changsha Tone 4 contours that were highly similar and often overlapping.

Figure 7: 
f0 contours of Tone 4 in Plastic Mandarin (purple) and Changsha (orange) by speaker.
Figure 7:

f0 contours of Tone 4 in Plastic Mandarin (purple) and Changsha (orange) by speaker.

For Tone 1, we are mainly interested in f0 heights since these are level tones. Tone 1s in both varieties sit mid-range in the tonal space. Plastic Mandarin Tone 1 is generally a bit higher than the Changsha counterpart, illustrated by the model predictions of the GAMM in Figure 8. The estimated height difference based on the parametric term in the GAMM is 0.68 semitone (t = 3.00, p = 0.0027), which is statistically significant.

Figure 8: 
Estimated f0 trajectories (GAMM fits) of Plastic Mandarin and Changsha Tone 1, with 95 % pointwise confidence intervals (duration: 0.27 s; vowel: /i/).
Figure 8:

Estimated f0 trajectories (GAMM fits) of Plastic Mandarin and Changsha Tone 1, with 95 % pointwise confidence intervals (duration: 0.27 s; vowel: /i/).

4.2.2 The low rising and low falling tones

Figure 9 shows the comparison of the two groups of low rising tones (Plastic Mandarin and Changsha Tone 2) and low falling tones (Plastic Mandarin Tone 3 and Changsha Tone 5). Plastic Mandarin Tone 2 is closely resembles its Changsha counterpart throughout the course of the tone and the estimated curve difference between them in Figure 9(a) is non-significant for the first 89 % of the tone duration. Plastic Mandarin Tone 3 and Changsha Tone 5 tend to reach similar pitch heights in the last 37 % of their tone contours.

Figure 9: 
Estimated f0 trajectories of low rising and low falling tones and comparison. (a) Estimated f0 trajectories (GAMM fits) of Plastic Mandarin and Changsha Tone 2, with 95 % pointwise confidence intervals. (b) Estimated difference between the two groups in (a) with the associated 95 % pointwise confidence interval. (c) Estimated f0 trajectories (GAMM fits) of Plastic Mandarin Tone 3 and Changsha Tone 5, with 95 % pointwise confidence intervals. (d) Estimated difference between the two groups in (c) with the associated 95 % pointwise confidence interval. The highlighted red area in (b) and (d) indicates where the confidence interval excludes zero.
Figure 9:

Estimated f0 trajectories of low rising and low falling tones and comparison. (a) Estimated f0 trajectories (GAMM fits) of Plastic Mandarin and Changsha Tone 2, with 95 % pointwise confidence intervals. (b) Estimated difference between the two groups in (a) with the associated 95 % pointwise confidence interval. (c) Estimated f0 trajectories (GAMM fits) of Plastic Mandarin Tone 3 and Changsha Tone 5, with 95 % pointwise confidence intervals. (d) Estimated difference between the two groups in (c) with the associated 95 % pointwise confidence interval. The highlighted red area in (b) and (d) indicates where the confidence interval excludes zero.

5 Potential formation of Plastic Mandarin tones

The phonetic characteristics of Mandarin accents have shifted between the older and younger generations of speakers in Changsha, as have attitudes towards these non-standard Mandarin accents. Rather than perceive Plastic Mandarin as a ‘bad Mandarin’ that needs to be corrected, young people generally have a favourable regard for Plastic Mandarin. Through informal conversations with these teenager participants, I learnt that speaking Plastic Mandarin in school is seen as a gesture of friendliness and amiability, as well as a sign of solidarity and reduced social distance. As the norm of communication within the social networks in school, Plastic Mandarin is considered an inclusive variety spoken by both Changsha local and non-local students – many of the non-locals are from other cities in Hunan province and may speak a different Xiang variety from Changsha. Plastic Mandarin is acknowledged as a linguistic marker of Changsha and, more broadly, of Hunan (Xiang) province. Through speaking Plastic Mandarin, young people signal a shared cultural heritage and make more self-conscious regional identity claims. The positive associations of Plastic Mandarin may also be a concomitant of the gradual status shift of Mandarin from an L2 to an L1 in Changsha.

One possible explanation for the formation of Plastic Mandarin tone patterns is that, in the process of new dialect formation, variation in Standard Mandarin tones may have arisen by pure chance, and these randomly mutated tone variants eventually became generalised. Another possibility, though not exhaustive, is that Standard Mandarin tones may have undergone tonal contour changes through a phonetically biased chain shift, similar to what has been reported for Bangkok Thai over the past century (see Pittayaporn 2018: 259). Drawing on cross-linguistic evidence, the following sections discuss several factors that may have contributed to the development of Plastic Mandarin tone patterns.

5.1 The phonetic bias account: recurrent tone contour changes

In his analysis of Bangkok Thai tone changes, Pittayaporn (2018) suggested that most phonetic variation is not random but biased and that tone variants arising from the production and perception of lexical tones tended to be crystallised and phonologised. For instance, f0 peaks tend to be delayed rather than early (e.g. Hao et al. 2017; Xu and Wang 2001), introducing a bias toward the rightward shift of tone peaks. This tendency may be attributed to common articulatory mechanisms such as carryover effects and truncation of tone production due to time pressure (Yang and Xu 2019).

Table 7 summarises the Standard Mandarin-Plastic Mandarin tonal variation by tone categories, by comparing the tone patterns of Plastic Mandarin and Standard Mandarin. When tone patterns are conceptualised as probabilistic approximation of high or low pitch target(s) that anchored to specific time points relative to the onset of a syllable, a varied tone pattern can be generated when the pitch target moves in the temporal domain (leftward or rightward) and/or the pitch domain (upward or downward). These tone variation patterns summarised in Table 7 are in fact among the most common and recurrent patterns of tonal change observed cross-linguistically, as identified in Yang and Xu (2019)’s review of 52 studies on tone change in East and Southeast Asia. For example, that the mid-rising tone becomes the low-rising tone (Tone 2) is also observed in Dong’gan, Gaotian Zhuang, Tai Phuan, and Lalo; the contour reduction of the dipping tone, low fall-rise to low falling tone (Tone 3), is also observed in Taiwan Mandarin, Ma’ya, Libo, Taipusi. Interestingly, the rightward sliding and flattening of Tone 4 is also reminiscent of the changes observed in Thai Tone 4 between the early 20th century and 1962. Many unrelated languages have parallel development of certain types of tone patterns. The fact that Standard–Plastic Mandarin tonal variation largely aligns with the crosslinguistic tendencies strengthens the phonetically-based account of tonal contour changes.

Table 7:

Summary of tone changes from Standard Mandarin to Plastic Mandarin. ‘Rightward’ is referenced to the left syllable boundary.

Tone Standard Mandarin > Plastic Mandarin Tone variation
1 high level > mid level downward shifting
2 mid rising > low rising downward shifting
3 low fall-rise > low falling rightward sliding of valley, contour reduction
4 high falling > high rising rightward sliding of peak (peak delay)

5.2 The borrowing account: contour similarity between Plastic Mandarin and Changsha tones

The comparison of the tone patterns of Plastic Mandarin and Changsha leads to an alternative promising account of the mechanism of tone change, the contact-induced borrowing. There is a striking similarity in the shape of the f0 patterns between the Plastic Mandarin Tones 1, 2, and 4 and their corresponding Changsha tones. This suggests that Plastic Mandarin may have borrowed certain tone patterns from the corresponding Changsha tones, in which the historical tone category continue to play a mediating role. The average f0 heights can differ slightly between these similar pairs of tones, which may imply that speakers borrow the tone shapes and adjust the tone heights when creating a new tonal system instead of direct copying. The high rising tone tends to be higher in Changsha than in Plastic Mandarin, which may be attributed to Changsha’s more complex tone system, potentially requiring a larger tonal space for distinction.

In Standard Mandarin, syllables with Tone 4, the high falling tone, tend to be judged the most prominent among other tones in both prosodically strong and weak positions (Deng 2010). Tone 4 is also the most frequent tone in Standard Mandarin (Wu et al. 2020). The change in the Tone 4 tone pattern is, therefore, highly perceptually salient, and it puts the borrowed Changsha Tone 4 with a distinct high rising pattern in the spotlight. Furthermore, the high rising tone pattern can serve as a signature of Xiang varieties. The majority of Xiang vernaculars retain a tone category evolved from the high (yīn) register of tone (Bao and Chen 2005), typically manifesting as a high rising or mid-to-high rising tone, as seen in varieties such as Changsha, Xiangtan, Xiangxiang, Yueyang, Shaoyang, Shuangfeng, Loudi, and others. Rather rarely we find such high rising tone in other most spoken Chinese varieties. For instance, there is no such high rising tone pattern in the tone inventories of Guangzhou Cantonese, Xiamen Min, Suzhou Wu, and Meixian Hakka.

The adaptation of Changsha tone patterns as new canonical tone patterns has allowed Plastic Mandarin to express a shared local and cultural identity and social solidarity. While specialised lexicon associated with local food, festivals, activities, and so on is often the site or marker of cultural identity in a local Mandarin variety, the case of Plastic Mandarin shows that identity can be marked prosodically through pitch.

5.3 The hybrid account: Mandarin lexical tone variation

While the phonetic bias account does not fully account for the crystallisation of the Plastic Mandarin tone patterns, the contact-induced borrowing account does not explain why Tone 3 in Plastic Mandarin does not borrow from Changsha Tone 3. It is likely that other biases or constraints also exert some influence in the process. These accounts may not be mutually exclusive. In addition to the phonetic biases, Pittayaporn (2018) identified three systemic biases in tonal contour changes: contour maximisation, contour accentuation, and avoidance of similar tones, which favoured variants of dynamic tones with a greater and more dramatic f0 excursion, and tone variants dispersed in phonetic space. Such biases were supported by perception experiments. For example, dynamic pitch excursions improve speech recognition in noise, and speech with dynamic pitch is consistently better perceived than monotone speech (Shen and Souza 2018). The Tone 3 in Changsha [] lies in the mid-high space of the pitch range, which would not be optimal in maximising perceptual contrast if adopted by Plastic Mandarin, given another two mid-range tones, Tone 1 [] and Tone 2 [], and a high-range Tone 4 [] in Plastic Mandarin. The low falling Tone 3 [] in Plastic Mandarin, instead, magnify the perceptual distance between Tone 3 and other tones.

The phonetic and systemic biases, as well as the contact-induced borrowing effects, altogether can offer a coherent explanation for the synchronic Plastic Mandarin variation. In addition, I propose that social biases play a crucial role in the formation of Plastic Mandarin, with social identity formation guiding this change. Figure 10 schematises the potential phonologisation of the high rising Tone 4 contour, following Pittayaporn’s (2018) approach. A pool of structured and random variants motivated by phonetic biases, by borrowing, or by chance is first generated. The variants illustrated in Figure 10 are not exhaustive but are theoretically plausible. Most of them were chosen based on common cross-linguistic patterns identified in Yang and Xu (2019, Table 8). Specifically, a high falling tone /51/ may transform into a falling tone with a rising offset [53] (e.g. Old Beijing) , a high level tone [55] (e.g. Dalian, Lanzhou, Bangkok Thai) , a mid rise-fall tone [341] (e.g. Chongqing) , or mid rising tone [45] (e.g. Penang Hokkien) . Some of these variants are reflected in individual Tone 4 productions (see participants d103 – d106 in Figure 7 for examples). All of these variants are characterised by the clockwise tone shift pattern (Yang and Xu 2019; Zhu 2018), a well-attested cross-linguistic tendency in tone change observed across East and Southeast Asia. The clockwise pattern, essentially a temporal realignment of pitch movements, can be considered as a phonetic bias that influences the directionality of tonal variation. We may also consider [41], with a lowering onset, as a random phonetically similar variant, and [45] as a borrowed variant. Among all these variants, [45] was preferred and prevailed as the primary variant, likely because it closely resembles or embodies the Changsha Tone 4 (yīn qù), the ‘signature’ Xiang tone, which, in turn, serves as a socially marked variant associated with regional identity. This high pitch tone is uniquely distinct from the other Plastic Mandarin tones, which primarily occupy the low-to-mid pitch range, thereby maximising perceptual contrast and revealing a systemic bias towards avoiding similar tones. The bottom part of Figure 10 shows that [45] is selected as the new canonical tone value in Plastic Mandarin, reflecting selectional biases that involve both systemic and social factors.

Figure 10: 
Schematic illustration of the phonologisation of the high rising contour of Tone 4 in Plastic Mandarin. Variants [53], [55], [341], and [45] reflect the clockwise bias in tonal variation. [45] may also be a borrowed variant and [41] may be a phonetically similar random variant.
Figure 10:

Schematic illustration of the phonologisation of the high rising contour of Tone 4 in Plastic Mandarin. Variants [53], [55], [341], and [45] reflect the clockwise bias in tonal variation. [45] may also be a borrowed variant and [41] may be a phonetically similar random variant.

Table 8:

Summary of Plastic Mandarin lexical tones.

Tone Phonetic description IPA representation
1 a mid level tone [] 33
2 a low to mid rising tone [] 23
3 a low falling tone [] 21
4 a high rising tone [] 45

6 Conclusions

This study characterises the lexical tones of Plastic Mandarin through empirical data and phonetic-acoustic analysis, demonstrating the application of Growth Curve Analysis in citation tone production research. Plastic Mandarin has four citation tone categories, and their patterns are summarised in Table 8 and illustrated in Figure 3. The resemblance of Plastic Mandarin Tones 1, 2, and 4 to their corresponding Changsha tones suggests the influence of contact on tonal variation and change. These resembling or borrowed tone patterns carry a characteristic regional flavour and can serve as sociophonetic indexical markers of speakers’ regional identity. The formation of Plastic Mandarin tones can be attributed to contact-induced borrowing, along with phonetic, systemic, and social biases, all of which interact to shape the distinctive tonal patterns.


Corresponding author: Chenzi Xu, Faculty of Linguistics, Philology and Phonetics, University of Oxford, Oxford, OX1 2HG, UK, E-mail:

Funding source: Leverhulme Trust

Award Identifier / Grant number: ECF-2023-079

Acknowledgment

This work was supported by the Leverhulme Trust Early Career Fellowship and the Birla Graduate Studentship, Faculty of Linguistics, Philology, and Phonetics, University of Oxford. My special thanks go to Professors John Coleman, Aditi Lahiri, and Elinor Payne, and the Associate Editor and reviewers for their valuable suggestions. Any errors are mine.

  1. Research ethics: This study was conducted in accordance with the ethical guidelines outlined in the Declaration of Helsinki. The research protocol was reviewed and approved by the Central University Research Ethics Committee (CUREC) and Social Sciences and Humanities Interdivisional Research Ethics Committee at the University of Oxford (Approval Reference: R67136/RE001). Written informed consent was obtained from all participants prior to their participation in the study.

  2. Author contributions: This is a single-authored paper.

  3. Conflict of interest: The author has no conflicts of interest to declare.

References

Auer, P. 2005. Europe’s sociolinguistic unity, or: A typology of European dialect/standard constellations. In N. Delbecque, J. van der Auwera & D. Geeraerts (eds.), Perspectives on variation. Sociolinguistic, historical, comparative, 7–42. Berlin, New York: De Gruyter Mouton.10.1515/9783110909579.7Search in Google Scholar

Bao, H. X. & H. Chen. 2005. The classification of Xiang group. Fang Yan 3. 261–270.Search in Google Scholar

Bates, D., M. Mächler, B. Bolker & S. Walker. 2015. Fitting linear mixed-effects models using lme4. Journal of Statistical Software 67(1). https://doi.org/10.18637/jss.v067.i01.Search in Google Scholar

Beijing University. 1995. Hanyu fangyan cihui 汉语方言词汇 (Chinese dialects vocabularies), 2nd edn. Beijing: Yuwen chubanshe.Search in Google Scholar

Boersma, P. 1993. Accurate short-term analysis of the fundamental frequency and the harmonics-to-noise ratio of a sampled sound. In Proceedings of the institute of phonetic sciences, 17 (1193), 97–110.Search in Google Scholar

Chao, Y. R. 1930. A system of tone letters. Le Maitre Phonetique 30. 24–27.Search in Google Scholar

Chao, Y. R. 1968. A grammar of spoken Chinese. Berkeley and Los Angeles: University of California Press.Search in Google Scholar

Deng, D. 2010. Hanyu yunlvci yanjiu 汉语韵律词研究 (Experimental study of Chinese prosodic word). Beijing: Peking University Press.Search in Google Scholar

Firth, J. R. & B. B. Rogers. 1937. The structure of the Chinese monosyllable in a Hunanese dialect (Changsha). Bulletin of the School of Oriental and African Studies 8(4). 1055–1074. https://doi.org/10.1017/S0041977X00082380.Search in Google Scholar

Fishman, J. A. 1967. Bilingualism with and without diglossia; diglossia with and without bilingualism. Journal of Social Issues 23(2). 29–38. https://doi.org/10.1111/j.1540-4560.1967.tb00573.x.Search in Google Scholar

Fu, L. 2010. Fangyan yu Putonghua de jiechu yanjiu 方言与普通话的接触研究 (A study of the contact between Mandarin and dialects) [Soochow University]. Available at: https://cnki.cgl.org.cn/kcms/detail/detail.aspx?DbCode=CDFD&filename=2010147048.nh.Search in Google Scholar

Hao, L., W. Zhang, Y. Xie & J. Zhang. 2017. Reanalyze fundamental frequency peak delay in Mandarin. INTERSPEECH, 3186–3190.10.21437/Interspeech.2017-1235Search in Google Scholar

Ho, D. 2015. Chinese dialects. In W. S. Y. Wang & C. Sun (eds.), The Oxford handbook of Chinese linguistics, 1, 149–159. Oxford: Oxford University Press.Search in Google Scholar

Hunan Provincial Bureau of Statistics. 2017. Changsha population summary and prospect 长沙人口规模现状和发展趋势. Available at: https://tjj.hunan.gov.cn/tjfx/sxfx/zss/201706/t20170627_4327150.html.Search in Google Scholar

Hunan Provincial Bureau of Statistics. 2021. Hunan province statistical communiqué for seventh national population census 2020 湖南省第七次全国人口普查公报(第二号). Available at: https://tjj.hunan.gov.cn/hntj/tjfx/tjgb/rkpc/202105/t20210519_19050124.html.Search in Google Scholar

Jadoul, Y., B. Thompson & B. de Boer. 2018. Introducing Parselmouth: A Python interface to Praat. Journal of Phonetics 71. 1–15. https://doi.org/10.1016/j.wocn.2018.07.001.Search in Google Scholar

Jing, S. & F. Niu. 2010. Changsha difang putonghua guhua yanjiu 长沙地方普通话固化研究:地方普通话固的个案调查 (The fossilization of local Putonghua in Changsha: A sociolinguistic survey). Yuyan Wenzi Yingyong 语言文字应用 4. 41–49. https://doi.org/10.16499/j.cnki.1003-5397.2010.04.005.Search in Google Scholar

Kerswill, P. 2020. Contact and new varieties. The handbook of language contact, 2nd edn. 241–259. Hoboken, NJ: Wiley-Blackwell.10.1002/9781119485094.ch12Search in Google Scholar

Kuznetsova, A., P. B. Brockhoff & R. H. B. Christensen. 2017. lmerTest package: Tests in linear mixed effects models. Journal of Statistical Software 82(13). https://doi.org/10.18637/jss.v082.i13.Search in Google Scholar

Lee, W.-S. & E. Zee. 2003. Standard Chinese (Beijing). Journal of the International Phonetic Association 33(1). 109–112. https://doi.org/10.1017/s0025100303001208.Search in Google Scholar

Li, R. 1998. In H. X. Bao, R. Shen, Y. Wu & Zhenhua Cui (eds.), Chang sha fang yan ci dian. (Changsha dialect dictionary), 2nd edn. Nanjing: Jiang su jiao yu chu ban she.Search in Google Scholar

Mao, L. 2009. Experimental study on the acquisition of tones in Changsha childrens Mandarin [Master Dissertation]. Beijing Language and Culture University.Search in Google Scholar

Mirman, D. 2014. Growth curve analysis and visualization using R. Boca Raton: CRC Press.Search in Google Scholar

Norman, J. 1979. Chronological strata in the Min dialects. Fang Yan 4. 268–274.Search in Google Scholar

Norman, J. 1988. Chinese. Cambridge: Cambridge University Press.Search in Google Scholar

Peng, J. 2006. Xiang yu yin yun li shi ceng ci yan jiu. (Analysis of the historical strata of Xiang) 湘语音韵历史层次研究 [Shanghai Normal University]. Available at: https://www.wanfangdata.com.cn/details/detail.do?_type=degree&id=Y932132.Search in Google Scholar

Pittayaporn, P. 2018. Phonetic and systemic biases in tonal contour changes in Bangkok Thai. In M. Giriko & H. Kubozono (eds.), Tonal change and neutralization, 249–278. De Gruyter Mouton. Available at: https://ezproxy-prd.bodleian.ox.ac.uk/login?url=https://doi.org/10.1515/9783110567502.10.1515/9783110567502-010Search in Google Scholar

R Core Team. 2024. R: A language and environment for statistical computing. [Computer software]. R Foundation for Statistical Computing. Available at: https://www.R-project.org/.Search in Google Scholar

Ratliff, M. 2015. Tonoexodus, tonogenesis, and tone change (Oxford Handbooks in Linguistics), 1. Oxford: Oxford University Press.10.1093/oxfordhb/9780199232819.013.021Search in Google Scholar

Sayahi, L. 2020. Contact, bilingualism, and diglossia. The handbook of language contact, 51–66. Hoboken: John Wiley & Sons, Ltd.10.1002/9781119485094.ch2Search in Google Scholar

Selinker, L. 1972. Interlanguage. International Review of Applied Linguistics in Language Teaching 10(3). 209–232. https://doi.org/10.1515/iral.1972.10.1-4.209.Search in Google Scholar

Shen, S. 2019. An acoustical analysis of citation tones in Mandarin with Changsha dialect 长沙“塑普”单字调分析. Zhongguo Gaoxiao Renwen Shehui Kexue Xinxi Wang. Available at: https://www.sinoss.net/c/2019-09-06/557055.shtml.Search in Google Scholar

Shen, J. & P. E. Souza. 2018. On dynamic pitch benefit for speech recognition in speech masker. Frontiers in Psychology 9. https://doi.org/10.3389/fpsyg.2018.01967.Search in Google Scholar

Sóskuthy, M. 2017. Generalised additive mixed models for dynamic analysis in linguistics: A practical introduction [Monograph]. York. Available at: https://eprints.whiterose.ac.uk/113858/.Search in Google Scholar

Sóskuthy, M. 2021. Evaluating generalised additive mixed modelling strategies for dynamic speech analysis. Journal of Phonetics 84. 101017. https://doi.org/10.1016/j.wocn.2020.101017.Search in Google Scholar

Trudgill, P. 2004. New-dialect formation: The inevitability of colonial Englishes. Edinburgh University Press. Available at: https://www.jstor.org/stable/10.3366/j.ctv2f4vkzd.Search in Google Scholar

van Rij, J., M. Wieling, R. H. Baayen & H. van Rijn. 2022. itsadug: Interpreting time series and autocorrelated data using GAMMs (Version 2.4.1) [Computer software]. Available at: https://CRAN.R-project.org/package=itsadug.Search in Google Scholar

Wang, Y. 2018. Changsha ‘suliao’ putonghua yanjiu 长沙 “塑料” 普通话研究 A study of Changsha Plastic Mandarin [Shanghai Jiao Tong University]. Available at: https://wap.cnki.net/touch/web/Dissertation/Article/10248-1019635302.nh.html.Search in Google Scholar

Wood, S. 2022. mgcv: Mixed GAM computation vehicle with automatic smoothness estimation (Version 1.8-40) [Computer software]. Available at: https://CRAN.R-project.org/package=mgcv.Search in Google Scholar

Wu, N. 2023. Changsha Xiang Chinese. Journal of the International Phonetic Association. 1–15. https://doi.org/10.1017/S0025100323000075.Search in Google Scholar

Wu, Y. 2005. A synchronic and diachronic study of the grammar of the Chinese Xiang dialects. Berlin: De Gruyter Mouton.10.1515/9783110927481Search in Google Scholar

Wu, Y., M. Adda-Decker & L. Lamel. 2020. Mandarin lexical tones: A corpus-based study of word length, syllable position and prosodic position on duration. Interspeech 2020. 1908–1912. https://doi.org/10.21437/Interspeech.2020-1614.Search in Google Scholar

Xu, C. 2022. Investigating the tonal system of Plastic Mandarin: A cross-varietal comparison [Http://purl.org/dc/dcmitype/Text, University of Oxford]. Available at: https://ora.ox.ac.uk/objects/uuid:df4f01ec-a867-4d96-89f8-6a17f1de0d82.Search in Google Scholar

Xu, C. 2024. Cross-dialectal perspectives on Mandarin neutral tone. Journal of Phonetics 106. 101341. https://doi.org/10.1016/j.wocn.2024.101341.Search in Google Scholar

Xu, Y. & Q. E. Wang. 2001. Pitch targets and their realization: Evidence from Mandarin Chinese. Speech Communication 33(4). 319–337. https://doi.org/10.1016/S0167-6393(00)00063-7.Search in Google Scholar

Xu, C. & C. Zhang. 2024. A cross-linguistic review of citation tone production studies: Methodology and recommendations. The Journal of the Acoustical Society of America 156(4). 2538–2565. https://doi.org/10.1121/10.0032356.Search in Google Scholar

Xu, X., Z. Chen & X. Peng. 2012. Dialect enregisterment during the popularization process of Putonghua – a sociolinguistic study on Changsha “plastic” Putonghua 普通话普及过程中新方言的 “登(enregisterment)”-长沙塑料普通话的社会语言学调查研宄. In Beijingshi yuyan wenzi gongzuo yanjiu luncong 北京市语言文字工作研究论丛 (Collection of linguistic studies in BeiJing). Shoudu shifan daxue chubanshe. Available at: https://read01.com/zQRjEP.html.Search in Google Scholar

Yang, C. & Y. Xu. 2019. Cross-linguistic trends in tone change: A review of tone change studies in East and Southeast Asia. Diachronica 36. 417–459. https://doi.org/10.1075/dia.18002.yan.Search in Google Scholar

Yuan, J. & M. Liberman. 2008. Speaker identification on the SCOTUS corpus. The Journal of the Acoustical Society of America 123(5). 3878. https://doi.org/10.1121/1.2935783.Search in Google Scholar

Yuan, J. & M. Liberman. 2015. Investigating consonant reduction in Mandarin Chinese with improved forced alignment. ISCA Archive, 2675–2678. Available at: https://www.isca-speech.org/archive/interspeech_2015.10.21437/Interspeech.2015-401Search in Google Scholar

Zhang, J. 2018. A comparison of tone normalization methods for language variation research. In Proceedings of the 32nd Pacific Asia conference on language, information and computation. PACLIC 2018, Hong Kong. Available at: https://www.aclweb.org/anthology/Y18-1095.Search in Google Scholar

Zhang, Y., L. Li, W. Lai & X. Xu. 2024. The influence of dialect loss on tone perception: Diminishing voice quality cues in preserved tone contrast. The Journal of the Acoustical Society of America 156(6). 3707–3722. https://doi.org/10.1121/10.0034444.Search in Google Scholar

Zhu, X. 2018. 演化比较法: 如何进行声调演化的研究 (The evolutionary comparative method: How to conduct studies of tonal evolution?). 语言科学 (Linguistic Science) 17(2). 2. https://doi.org/10.7509/j.linsci.201703.031391.Search in Google Scholar

Received: 2025-01-07
Accepted: 2025-09-05
Published Online: 2025-10-28

© 2025 the author(s), published by De Gruyter, Berlin/Boston

This work is licensed under the Creative Commons Attribution 4.0 International License.

Downloaded on 27.10.2025 from https://www.degruyterbrill.com/document/doi/10.1515/phon-2025-0001/html
Scroll to top button