Home Could L1 intonation patterns be applied in teaching Mandarin tones to atonal learners of Chinese? – An acoustic phonetic study
Article Open Access

Could L1 intonation patterns be applied in teaching Mandarin tones to atonal learners of Chinese? – An acoustic phonetic study

  • Kornélia Juhász

    Kornélia Juhász is a junior research fellow at the HUN-REN Hungarian Research Centre for Linguistics, who completed her doctoral studies in 2024 at the Eötvös Loránd University. The research topic of her dissertation focused on the acoustic analysis of Mandarin segments in the production of Hungarian learners of Chinese. However, her research area was extended to the acoustic analysis of the acquisition of Mandarin lexical tones and intonation recently, including in the production of atonal Hungarian speakers.

    EMAIL logo
    and Huba Bartos

    Huba Bartos is a senior research fellow at the HUN-REN Hungarian Research Centre for Linguistics, and an Associate Professor in the Institute of East Asian Studies at Eötvös Loránd University in Budapest. His main research fields are syntax and semantics, but has also recently turned his attention towards experimental phonetics and L2 acquisition studies. Besides these, he is editor-in-chief of a pair of new Chinese-Hungarian (2019) and Hungarian-Chinese (2024) dictionaries.

Published/Copyright: September 24, 2024
Become an author with De Gruyter Brill

Abstract

This study focuses on the acoustic comparison of Mandarin Chinese lexical tones and Hungarian monosyllabic intonation patterns. The aim of this experiment was to explore whether atonal L1 intonation patterns are applicable in teaching tones in L2 Mandarin Chinese, and also to investigate to what extent atonal and tonal monosyllabic contours differ. Mandarin Chinese rising Tone 2 is compared to two corresponding Hungarian rising contours: i) monosyllabic interrogative contour and ii) the rising phase of the f 0 pattern in alternative questions. Mandarin Chinese falling Tone 4 is likewise compared to Hungarian i) declarative and ii) imperative f 0 curves. We compared Mandarin Chinese and Hungarian native speakers’ production regarding five acoustic features. Results show that Mandarin Chinese Tone 2 is articulated with longer duration, and a more concave f 0 curve compared to Hungarian rising intonation patterns. Mandarin Chinese Tone 4, on the other hand, was found to differ from the Hungarian imperative contour only in terms of the f 0 curve shape, since Tone 4 featured a more domed curve. Hungarian declaratives are characterized by a more concave f 0 pattern, realized in a lower f 0 range compared to Tone 4. The results of this study may contribute to the proper application of atonal intonation patterns in L2 teaching of Mandarin Chinese through the differences and similarities between atonal and tonal monosyllabic f 0 patterns.

提要

本研究着重于汉语普通话声调与匈牙利语单音节语调模式的声学比较。 该实验的目的是探讨 “语调语言” (即不具有声调的语言) 的单音节语调模式可否适用于汉语普通话作为第二语言的声调教学, 并研究声调语言与语调语言单音节轮廓的异同程度。 汉语普通话的升调 (第二声) 与两个相应的匈牙利升调模式进行了比较: i) 单音节疑问语调模式与 ii) 选择疑问句中基频上升单音节阶段的模式。 汉语普通话的降调 (第四声) 则与匈牙利语的 i) 陈述句及 ii) 祈使句的基频轮廓进行了比较。 我们对比了汉语普通话与匈牙利母语者在五个声学特征上的发音情况。 结果显示, 汉语普通话的第二声相比于匈牙利升调模式具有更长的持续时间和更凹的基频轮廓。 另一方面, 发现汉语普通话的第四声仅在基频轮廓形状上与匈牙利语祈使句语调有所不同, 因为第四声具有更凸的轮廓。 匈牙利语陈述句的特点是基频模式更凹, 且基频范围较第四声更低。 本研究的结果可以有助于通过语调语言与声调语言单音节基频模式之间的差异与相似性, 将语调模式应用于对外汉语教学。

1 Introduction

In the fields of linguistics and language education, the analysis of suprasegmental differences between atonal and tonal languages is a significant research area, shedding light on the different utilization strategies of pitch variation in prosody. This acoustic phonetic study focuses on the comparison of Mandarin Chinese (MC) lexical tones and Hungarian monosyllabic intonation contours. The aim of the acoustic experiment is to explore whether atonal Hungarian (L1) intonation patterns are applicable in teaching tones in MC (as the L2), that is, to investigate whether the monosyllabic patterns in the two languages are similar to the extent that MC teaching instructions could involve the substitution of MC L2 tonal patterns with the monosyllabic realization of some Hungarian L1 intonation contours. Furthermore, the results of the acoustic analysis also provide a brief insight into the investigation of the following question: In what ways, and to what extent, do atonal and tonal monosyllabic contours differ? However, before turning to the acoustic analysis, a typological description is presented about the differences of pitch utilization between atonal and tonal languages, complemented by a short introduction of the corresponding physiological processes of speech production. In addition, the literature review is further extended to the description of the acoustic structure of the hereby analyzed Hungarian intonation patterns and MC lexical tones.

In atonal or monotone languages (such as Hungarian), pitch variation primarily alters the meaning on the sentence or discourse level, expressing emotions or intentions through intonation (Markó 2017). However, in tonal languages (such as MC), pitch variation does not only modify meaning at the sentence level, but also at the lexical level: lexical pitch variation, i.e., tone, is an argument of a word or, more precisely, a monosyllabic unit. Consequently, the argumentation of different pitch variation patterns (pitch contours) to the same monosyllabic sound sequence induces different lexical meanings (Chao 1948/1963). At the same time it must be mentioned that MC tonal realizations are highly influenced by tonal context, as well as the sentence type, that is, intonation (Shen 1989).

In speech production, both intonation and tone production are generated by the same physiological process, that is, the modulation of vocal cord vibration. The continuous opening and closing cycles of vocal cord vibration give rise to a quasi-periodic acoustic signal generally referred to as voicing. One of the fundamental characteristics of voicing is the fundamental frequency (f 0), which represents the number of vibration cycles occurring within one temporal unit, thus, the universal measuring unit of f 0 is Hertz (Hz). The pitch variation required for both intonation and lexical tone production arises from changes in the f 0, since the perceived pitch is closely related to the fundamental frequency: the higher fundamental frequency is expected to convey a higher perceived pitch (Gósy 2004; Hart et al. 1990). In this particular analysis both Hungarian intonation patterns and MC lexical tones are referred to as tunes, as an umbrella term incorporating both above-mentioned functions of pitch variation.

1.1 The perception of fundamental frequency and musical tones

Due to the fact that the production of both lexical tones and intonation are correlated to the modulation of f 0, a brief musical introduction is required on the relevant terms appearing in the acoustic characterization of the analyzed MC and Hungarian tunes. Correspondingly, the following description serves as a basis of the acoustic analysis, since it sheds light on how human perception processes and normalizes inter-speaker differences of f 0 variation. As regards human perception, the relationship between the perception of pitch variation and the actual f 0 frequency cannot be described by a mere linear relationship. This means that the difference between two frequencies is only perceived as the same pitch interval, if the ratio between the frequencies is the same. Thus, the relationship between the perceived pitch and the corresponding frequencies is rather described based on log2 function, as demonstrated below in Figure 1. The ratio between two musical notes is referred to as a musical interval, and an octave represents a distance of eight musical notes (see Figure 1). In this example, the musical note is chosen as a reference point, with a frequency value of 440 Hz (a conventional reference point commonly used in instrument tuning) (Bolla 1995; Kesztler 1952). The frequency of the musical note , (which is located an octave higher compared to ), is characterized by 880 Hz, while the A note (positioned one octave lower relative to ) features 220 Hz (Figure 1). In sum, if we compute f 0 values of A notes, beginning from 220 Hz, we can observe a pattern that the neighboring octave is always obtained by raising 2 to the power of x. This means that the 440 Hz () is twice (21) the A’s 220 Hz. Correspondingly, the next A note positioned at an octave higher (i.e., ) is four times (22) 220 Hz (equal to 880 Hz), and the subsequent A note located even higher (i.e., A‴) features a frequency value of 1,760 Hz, which is eight times (23) the original 220 Hz.

Figure 1: 
Frequency values of A notes and their positions on the octave scale (based on Bolla 1995; Kesztler 1952).
Figure 1:

Frequency values of A notes and their positions on the octave scale (based on Bolla 1995; Kesztler 1952).

Based on this example, it can be concluded that comparing absolute frequencies is not suitable for contrasting speech produced in different frequency ranges by different individuals. For instance, a 50 Hz frequency difference in a male speaker’s production, (e.g., f 0 rising from 100 to 150 Hz), can be equivalent in perception to an f 0 change of 90 Hz in a female speaker’s production (in which case f 0 rising from 180 to 270 Hz). However the frequency ratio in both cases is 1:1.5 (Hart et al. 1990: 24), thus, based on the same ratio of the two f 0 changes, the two tunes would be perceived as having the same magnitude. Therefore, f 0 values are often converted into logarithmic units, such as semitones, to normalize f 0 changes occurring in different frequency ranges (Nolan 2003). The difference of one semitone corresponds to one-twelfth of an octave, meaning an octave can be divided into 12 equally spaced semitones (Bolla 1995). In musical theory, the musical interval of one semitone (in perceived pitch difference) is referred to as a minor second (e.g., between E-F and B-C), while a major second equals to two semitones (which surfaces between all other adjacent musical notes, except for E-F and B-C). In the case of thirds, a minor third represents a three-semitone difference, while a major third spans four semitones. In later parts of this study, other musical intervals may arise, such as fourths (representing five musical notes), fifths (seven musical notes), and sixths (eight or nine musical notes) (Kesztler 1952).

Furthermore it is essential to note that apart from the f 0 range of the tonal contour, the shape of the f 0 curve is also an important factor in speech perception as well as production. For instance, perceptual experiments have shown that in American English the shape of the intonation curve and the timing of f 0 variation play a significant role in distinguishing different intonation melodies (Barnes et al. 2010). Thus, it can be concluded that the f 0 contour’s shape is also a significant acoustic feature, and so will be analyzed in this experiment. However, prior to the acoustic analysis, the analyzed tunes’ acoustic characteristics are presented in the next section.

1.2 The analyzed tunes and the motivation of the study

The present acoustic analysis is focusing on the comparison of MC tones and the corresponding Hungarian monosyllabic intonation contours, that is, MC rising Tone 2 is compared to two Hungarian rising contours: i) monosyllabic yes/no interrogative contour and ii) the rising phase of the f 0 pattern in alternative questions. MC falling Tone 4 is likewise compared to Hungarian i) declarative and ii) imperative f 0 curves. Thus, in both cases, one MC tone was compared to two Hungarian monosyllabic intonation contours, resulting in six different f 0 patterns being analyzed.

The motivation for this study arises from the fact that Hungarian teachers of MC often presume that the analyzed rising and falling Hungarian and Chinese tunes have an identical acoustic structure, thus instructing Hungarian MC learners to substitute MC L2 tonal patterns with the monosyllabic realization of a Hungarian L1 intonation contour. In particular, among Chinese teachers, it is a popular technique to regard the MC Tone 2 and the Hungarian monosyllabic yes/no interrogative intonation contour identical. Likewise the MC Tone 4 is considered equal to the Hungarian imperative intonation contour. However, the application of these teaching strategies is merely based on impressions, since up to this point there is no previous empirical analysis on the acoustic similarity of the above-mentioned MC and Hungarian tunes.

The validity of the above-mentioned analogies and teaching methodology must also be questioned from the viewpoint of L2 learning: if the acquisition of L2 segmental and suprasegmental levels are concerned, L1 influence is fundamentally present and assumed to bias both perception and production in L2, since the two are presumed to be intertwined, or to develop simultaneously (Flege and Bohn 2021). The perceptional (and therefore also the productional) bias exerted by the L1 often induces perceptual assimilation of the non-native L2 pattern to its closest L1 category. In this particular case (according to the Perceptual Assimilation Model for suprasegmentals – PAM-s) this means that native speakers of a non-tonal language are hypothesized to assimilate the foreign (L2) lexical tones into their native prosodic (i.e., intonational) categories (So and Best 2014). In particular, similar L2 lexical tones and L1 monosyllabic intonation patterns are expected to be perceived as identical, even though there may be observable and apparent dissimilarities between the two realizations. This means that these acoustic differences are expected to get filtered out from the perception of the L2 learner as perceptual assimilation takes place (labelled as ‘equivalence classification’ by Flege (1995) and Flege and Bohn (2021)), since the acoustic difference at hand is not considered contrastive in the L1. Therefore, to summarize, the investigation of acoustic similarities of MC lexical tones and Hungarian monosyllabic intonation patterns could possibly lay the foundation for discovering both teaching and learning difficulties among Hungarian teachers and learners of MC. To be more precise, a question arises regarding what kind of influence could be expected from the usage of L1 patterns in MC speech, if the hypothesis outlined above on perceptual assimilation surfaces in the case of Hungarian speakers of MC. This hypothesized perceptual bias could appear at two different levels of MC acquisition and teaching. Firstly, the above-mentioned analogies of MC lexical tones and Hungarian intonation patterns might be induced by a simple case of perceptual assimilation where teachers themselves are deceived by their own L1 (Hungarian) perception, identifying two similar, albeit acoustically distinct, realizations as identical, and as a result, encouraging students to apply the analogies between the two languages, which may then lead to foreign-accented speech among L2 learners. Secondly, even if L2 learners receive proper guidance on producing MC tones, their own perception may happen to get biased by the L1 patterns, so similar outcomes are expected in this case, too, leading to foreign accented speech.

Consequently, if the above-mentioned analogies between MC tones and Hungarian intonation patterns turn out to be false and misleading – and also if L1 effect is presupposed in the case of L2 learners and differences between L1 and L2 patterns are filtered out from learners’ perception (and as a result, production) – the results of the present paper could possibly predict how L1 intonation patterns influence and reshape MC tones in Hungarian learners’ production. In contrast, if the analyzed L1 intonation patterns are proven to be acoustically identical to MC lexical tones, then the substitution of L2 lexical tones by the L1 intonation patterns (i.e., the direct transfer of L1 patterns to L2 production) could be directly utilized in teaching MC tones for Hungarian speakers. This teaching methodology could provide a simple and genuine technique aiding the proper acquisition of MC tones, as opposed to previous methods, which often hold out prerequisites for previously acquired knowledge or skills, as they are mostly built upon using music as the primary tool when introducing MC tones to atonal speakers.

For instance, Chao (1948/1963) registers MC tones within a scale of 5 degrees (see Figure 4), while Yu (2007) proposes that the concept of music could help learners understand how tonal melodies change. The importance of music also surfaces in more recent papers – see the results of Han and colleagues (2019) who shed light on direct interaction between accuracy of Mandarin tone perception and musical training. In addition, teaching lexical tones often involves visual and/or hand movement techniques, however, according to Guan (2000), these techniques often do not achieve the desired effect, since they do not provide proper introduction to how these tonal curves are produced. In sum, all of the above listed examples require the perceptual skills to acoustically detect and identify dissimilarities between melodies, especially in those widely used imitation techniques where students need to learn lexical tones while barely relying on what they hear and perceive. This means that even though there are various techniques for teaching MC lexical tones, if the perceptual basis on detecting the acoustic keys of different tones is absent, then without proper acoustic feedback, the development of the native-like production of lexical tones may be delayed. As opposed to the methods listed above, the usage of L1 patterns transferred to L2 production might provide a more direct path to tonal acquisition of MC, since L1 patterns could be accessed by L2 learners directly.

To summarize the motivation of the present paper, the empirical results presented here could serve as an insight into how the application of L1 patterns influences Hungarian MC learners’ production and could confirm (or eventually disprove) the validity of the teaching techniques involving L1 intonation patterns. In other words, the results could contribute to and develop the teaching methodology concerning MC lexical tones by drawing attention to the (in)appropriate application of L1 intonation patterns. Furthermore, this analysis may serve as a basis for further acoustic comparisons in the future which could draw conclusions on how and what L1 patterns could be applied in MC teaching in languages other than Hungarian.

In the following subsections, the acoustic characteristics of the presumably similar MC and Hungarian tunes are presented. Moreover, as was mentioned above, the analysis is also complemented by two additional Hungarian tunes: a rising and a falling one – specifically, the rising melody of the first (monosyllabic) conjunct of an alternative question and the intonation contour of monosyllabic declaratives. These tunes are analyzed in order to provide more reference on the realization of MC lexical tones relative to Hungarian intonation patterns. It must be noted that this paper exclusively deals with MC tones produced in isolation, disregarding the problem of tonal realization in tone sequences, the effect of tonal coarticulation, and the influence of intonation exerted on tonal curves (Shen 1990; Xu 1997; for the interaction of tones and intonation in the production of Hungarian learners of MC, see Juhász and Bartos 2023). The reason for this decision is that the usage of L1 patterns in teaching MC tones is applied exclusively in the beginning phase of tonal acquisition, when lexical tones are most probably anchored to monosyllabic L1 patterns in the case of atonal speakers, since MC lexical tones are introduced to L2 learners in isolation.

1.2.1 The rising tunes analyzed

The two types of analyzed Hungarian rising tunes are two interrogative patterns: the monosyllabic yes/no question and the first (monosyllabic) unit of the alternative question. According to the description given by Fónagy and Magdics (1967), Hungarian monosyllabic yes/no questions do not exhibit the standard rising-falling interrogative pattern, instead, owing to the truncation of the underlying L*HL pattern (Varga 2002), the f 0 curve only features a rising phase in which the f 0 rises a minor third (Fónagy and Magdics 1967). Based on Markó’s (2007) measurements of spontaneous speech, the pitch change can also be equivalent to a major second, a minor third, or even a perfect fifth, and the f 0 contour of this tune is described as a convex pattern (see Figure 2).

Figure 2: 

F
0 contour of the Hungarian monosyllabic yes/no interrogative Én? [eːn?] ‘Me?’ (Markó 2007: 64, left, and Olaszy 2002: 90, right), (in the latter case, values are normalized to the f
0 value at the beginning of declarative statements (equals to 100 %)).
Figure 2:

F 0 contour of the Hungarian monosyllabic yes/no interrogative Én? [eːn?] ‘Me?’ (Markó 2007: 64, left, and Olaszy 2002: 90, right), (in the latter case, values are normalized to the f 0 value at the beginning of declarative statements (equals to 100 %)).

The second analyzed Hungarian rising intonation pattern is the first (monosyllabic) conjunct of an alternative question. For example, in the question Én vagy ő? [eːn vaɟ øː?] Me or her?’, the first syllable of Én ‘Me’ features a rising pattern, argued by both Olaszy (2002) and Markó (2017) (see Figure 3).

Figure 3: 

F
0 pattern of the alternative question Én vagy ő? [eːn vaɟ øː?] ‘Me or her?’ (Olaszy 2002: 95).
Figure 3:

F 0 pattern of the alternative question Én vagy ő? [eːn vaɟ øː?] ‘Me or her?’ (Olaszy 2002: 95).

According to Olaszy’s (2002) measurements, we can draw the following conclusions in the comparison of these two Hungarian rising intonation contours. Relative to the monosyllabic yes/no question, the rising phase of the alternative question is initiated from a higher f 0 register and its f 0 range is also compressed compared to the monosyllabic yes/no question pattern. However, concerning maximal f 0, the rising phase of alternative questions reaches a higher f 0 compared to the monosyllabic question.

As is well-known, in Mandarin Chinese, lexical tone is an argument of a syllable and defines the syllable’s meaning on the lexical level (as far as homophonous syllables are not considered). The standard variation of Mandarin Chinese distinguishes 4 (+1) tones (see Figure 4), however in this particular analysis, only the rising Tone 2 and the falling Tone 4 are addressed because only these contours have a monosyllabic counterpart in the Hungarian intonation system.

Figure 4: 
Introductory musical notes of MC tones (top row) and their schematic tone contours (middle row) (based on Chao 1948/1963: 85), along with the Chinese syllables realized with the respective tone (in Chinese characters and Pinyin transcription, and their meanings).
Figure 4:

Introductory musical notes of MC tones (top row) and their schematic tone contours (middle row) (based on Chao 1948/1963: 85), along with the Chinese syllables realized with the respective tone (in Chinese characters and Pinyin transcription, and their meanings).

According to Chao’s description (1948/1963), in the case of the rising MC Tone 2, the f 0 rises a major third, equivalent to 4 semitones, while measurements by Moore and Jongman (1997) index an approximate rising phase of about 3.5 semitones. Furthermore, MC Tone 2 is characterized by a convex f 0 contour, similar to the Hungarian monosyllabic yes/no question intonation pattern (see Figure 5).

Figure 5: 

F
0 contours of Mandarin Chinese tones (Moore and Jongman 1997: 1872).
Figure 5:

F 0 contours of Mandarin Chinese tones (Moore and Jongman 1997: 1872).

1.2.2 The falling tunes analyzed

In the acoustic analysis of falling tunes, we compare the Hungarian declarative and imperative monosyllabic intonation patterns with the f 0 contour of the falling Tone 4 in Mandarin Chinese. Regarding the Hungarian declarative pattern, the authors are not aware of any published measurements related to frequency or f 0 changes in monosyllabic utterances. However, it is clear that the Hungarian declarative melody exhibits a falling pattern (Fónagy and Magdics 1967). Similar to the declarative tune, Hungarian imperative melody also features a falling pattern, but according to Fónagy and Magdics’s (1967) measurement, imperatives are initiated from a higher f 0 level, that is, a second (two or three semitones) higher compared to the declarative intonation pattern. In comparison, in MC falling tones, according to Chao (1948/1963), f 0 falls by eight semitones (equivalent to a sixth) (see Figure 4), while according to Moore and Jongman’s (1997) measurements the f 0 range of the MC falling Tone approximates seven semitones, which is more akin to a perfect fifth. Note that since our analysis focuses on isolated tonal patterns, we take the wide f 0 range (51) as the standard for Tone 4, rather than its realization occurring in a position preceding another Tone 4, often labelled as “half Tone 4”, where the f 0 range is described as 53 (for the acoustic realization of Tone 4 + Tone 4 sequences, see Xu 1997).

1.3 Research questions

  1. The present study aims to empirically investigate whether the presumed similarities between the Hungarian intonation patterns and MC lexical tones (namely, the intonation pattern of Hungarian monosyllabic yes/no questions versus the rising MC Tone 2, on the one hand, and the Hungarian imperative intonation pattern versus the falling MC Tone 4, on the other) are confirmably strong, and whether their application in Mandarin Chinese L2 teaching is valid and promotes L2 learners’ tone production.

  2. Furthermore, in view of the fact that both lexical tone and intonation production is generated by the same physiological process and both arising by the modulation of vocal fold vibration rate, we explore what acoustic features differentiate lexical tones and monosyllabic intonation patterns in atonal and tonal languages.

2 Methodology

2.1 Participants

In the recording session the speech production of two speaker groups were analyzed: i) a group of Hungarian native speakers (mean age: 26.3 ± 2.81 years) and ii) a group of Mandarin Chinese native speakers (mean age: 25.8 ± 1.86 years), in each group seven female university students were recorded with 14 participants in total. The Hungarian speakers all spoke English daily and had no experience learning Mandarin Chinese or any other tonal language. The Chinese native speakers were all from Beijing and had been in Hungary for an average of 3.8 years. All seven Chinese native speakers spoke English weekly on average, and four of them reported speaking Hungarian on a daily basis. The speakers’ speech was recorded in 16-bit/44.1 kHz using an external sound card and an omnidirectional condenser head microphone in a sound attenuated room.

2.2 Recording material

The recorded material consisted of five Hungarian and five Chinese CV-structured, monosyllabic words that were read five times in a randomized order in the following tune types. For Hungarian utterances, the speakers produced four different sentence types: for rising tunes, monosyllabic yes/no questions and alternative questions were recorded, and for falling tunes, declarative and imperative monosyllabic utterances were produced. Chinese monosyllabic words featured either the MC rising Tone 2 or falling Tone 4. In sum, a total of six different tunes were examined in this experiment, encompassing four different Hungarian intonation patterns and two different MC lexical tones. Hungarian native speakers exclusively read Hungarian utterances, likewise the Chinese native speakers only pronounced the Chinese monosyllabic words. This means that (5 repetitions × 5 words × 7 speakers =) 175 tokens for each of the six tunes were recorded, that is, in sum, 1,050 realizations were analyzed in the present experiment. During the experiment, the analyzed Hungarian monosyllabic utterances were embedded into short texts to facilitate the standard production of the required intonation pattern. In the case of the Chinese words, context was not necessary, as the realization of the given MC tone was induced by the lexical meaning, thus, Chinese words were recorded in isolation and displayed both Chinese characters and pinyin transcription (for examples, see Table 1; for the entire material, see Tables A and B in the Appendix).

Table 1:

Examples of monosyllabic utterances for rising and falling tunes in both MC and Hungarian (the target words are in bold italics).

Rising tunes Falling tunes
Hungarian MC Hungarian MC
Alternative & unmarked monosyllabic interrogative:

Cső [tʃøː] vagy csá?

– Nem tudom. Cső? [tʃøː]

Uttered when offering the choice between two types of informal greetings: Tube (hi) or hey? I don’ know. Tube (hi)?)
Tone 2:



zhé

[d̥ʂɤ]

(to cut off)
Declarative:

Mi az a henger, amiben folyik a víz? – Cső. [tʃøː]

(What is that cylinder called in which the water flows? –Tube.)
Tone 4:



zhè

[d̥ʂɤ]

(this)
Imperative:

A vízvezetékszerelő tíz óra munka után így szólítja fel a csövet: Cső! [tʃøː] Nehogy kilyukadj nekem!

(After a long job the plumber orders the tube: Tube! Don’t you dare leak!)

The Chinese utterances were recorded in isolation and in randomized order. In the acoustic analysis, the vocalic phase of monosyllabic words was analyzed, which in each case was a central vowel, either a front unrounded [øː] for Hungarian words, or a central vowel, back unrounded [ɤ] for Chinese utterances. Despite the different articulatory features of these two vowels, they are acoustically very similar: both vowels are positioned in the middle of the acoustic vowel space, since the place of articulation (in this particular case: horizontal tongue position) and lip rounding have opposing effects on the acoustic structure of vowels. The labiality of the front [øː] in Hungarian lowers the second formant frequency (F 2) (compared to other illabial front vowels), while the illabiality of the back [ɤ] in MC raises the F 2 (compared to other labial back vowels). In this manner both vowels are positioned in the middle of the acoustic vowel space (Juhász 2020).

Additionally, the consonant context of each vowel was controlled. Syllable nuclei were always preceded by a voiceless unaspirated obstruent onset in the interest of avoiding distinct coarticulatory effects of consonant contexts on f 0 values. It must be noted that both MC and Hungarian are characterized by a two-way obstruent opposition: in Hungarian, voiced unaspirated (lenis) and voiceless unaspirated (fortis) segments are in contrast, while in MC, voiceless unaspirated (lenis) and voiceless aspirated (fortis) obstruents are in opposition (for detailed comparison, see Juhász 2021). To elaborate, Hungarian fortis and MC lenis obstruent categories overlap, that is, segments in both languages can be described by one single phonetic category concerning the acoustic feature of voicing: the voiceless unaspirated category. All MC and Hungarian obstruents recorded in this experiment belong to the above-mentioned voiceless unaspirated phonetic category. Note that in the descriptions of Ye and Bartos (2017), the voiceless unaspirated plosives are denoted by devoiced IPA characters, whereby denotation is also applied in the present paper (e.g., [d̥, g̥]).

It should further be noted that, due to the restricted syllable structure variability and monosyllabic nature of MC, there were limited options to find Hungarian words to stand as equivalents to the Chinese syllables in all of the key features, namely, syllable structure, meaning, and the formation of minimal pairs. Thus, we were forced to also record MC Tone 2 syllable “sé [sɤ]”, which is a well-formed sequence in terms of Chinese phonotactic rules but happens to represent an accidental gap – Tone 2 is not associated with this sound sequence in any MC lexeme. That is, this sequence does not exist in Mandarin Chinese and has no associated meaning or Chinese character. Since our options were limited, we chose to nevertheless include this syllable in the analysis, and we found that it was not problematic for MC speakers to produce it. Besides, although the recorded Hungarian dialogues might appear relatively rarely in spontaneous speech, the meaning of the recorded utterances were apparent, and so Hungarian speakers could easily produce the expected intonation contours.

2.3 Acoustic analyses

Audio recordings were labelled and analyzed using Praat software (Boersma and Weenink 2019). All analyses were performed exclusively on the vocalic phases featuring a quasi-periodic signal. This interval was also considered as the duration of the segment since this is the voiced phase carrying the tonal contour (owing to the fact that voiceless obstruent onsets are not voiced). The extracted f 0 values were converted to semitones using the hqmisc package (Quené 2014) in R (R Core Team 2019), with a reference value of 50 Hz in all cases. The vocalic phase of the Hungarian and Chinese utterances was analyzed using four static features (i. – iv.) and a dynamic one (v.), which are listed below:

  1. Maximal f 0 (f 0max, in semitones).

  2. Minimal f 0 (f 0min, in semitones).

  3. F 0 range (which was calculated as the difference between maximal and minimal f 0, in semitones).

  4. The duration of the vocalic phase.

  5. The dynamic analysis of the f 0 contours (in order to obtain f 0 contours, f 0 was extracted by each 5 ms interval automatically).

In the statistical analysis of static acoustic features, linear mixed models were applied in R (R Core Team 2019) using the lme4 package (Bates et al. 2015). Satterthwaite approximation was used to calculate p- and F-values, which is available in the lmerTest package (Kuznetsova et al. 2017). The levels of variables were compared pairwise using post-hoc tests with the emmeans package (Lenth 2020). The random variable was defined as the speaker in all mixed models (random intercept). Rising and falling tunes were analyzed separately, meaning that altogether, eight models were composed for the four static dependent variables, namely maximal f 0 , minimal f 0 , f 0 range, and duration. In these models, the independent variable was always the tune type, which had three levels for rising tunes (MC Tone 2, Hungarian monosyllabic yes/no question, and Hungarian alternative question) and three levels for falling tunes (MC Tone 4, the Hungarian imperative, and the Hungarian declarative question). The post hoc test was carried out by using the emmeans package (Lenth 2020), computing pairwise post-hoc comparisons adjusted to the Tukey test. The graphs were created using the ggplot2 package in R (Wickham 2016).

For dynamic analyses, generalized additive models (GAMMs) were applied for rising and falling tunes, respectively. GAMM is a statistic modelling used for analyzing non-linear data and calculates an estimated curve using combinations of basic functions and associated weights. In the basic GAMM models, the effect of the normalized duration was examined on the dependent variable f 0 (obtained every 5 ms and converted into semitones), that is, to observe how the f 0 changes dependent on the normalized duration of the vocalic phase. The GAMMs were further extended by the inclusion of a three-level ordered factor variable of tune type (as a parametric term), defining the reference curve and the difference curves measured from this reference. In the models of both rising and falling tunes, the reference curve was set to the MC Tone, and difference curves were computed for the Hungarian tunes. Random smooth function was applied by each f 0 trajectory. The results of GAMMs were calculated in R using the mgcv package (Wood 2017) and the estimated curves were visualized by using the itsadug package (van Rij et al. 2020).

3 Results

3.1 Rising tunes

3.1.1 Static analyses of rising tunes

The tune type independent variable significantly influenced the maximal and minimal f 0, as well as f 0 range of rising tunes (f 0max: F (2, 25) = 113, p < 0.001; f 0min: F (2, 25) = 163, p < 0.001; f 0 range: F (2, 25) = 426, p < 0.001). Based on pairwise post-hoc tests, the Hungarian monosyllabic yes/no question had a significantly higher f 0max mean compared to the alternative question (p < 0.001) and the Chinese Tone 2 (p < 0.05). In terms of minimal f 0, the Hungarian alternative question featured a significantly higher f 0min mean compared to the MC Tone 2 (p < 0.05) and the Hungarian monosyllabic yes/no question (p < 0.001) (see Figure 6, left). As regards f 0 range, MC Tone 2 was realized with a more compressed f 0 range compared to the Hungarian monosyllabic yes/no questions (p < 0.01), but featured a wider mean range compared to the alternative questions (p < 0.05). In the rising phase of MC, Tone 2 f 0 rose an average of 5.5 semitones (approximately a perfect fourth), while the Hungarian alternative question featured a rise of three semitones (approximately a minor third), and the f 0 of Hungarian monosyllabic yes/no questions increased by 8.9 semitones (a sixth) (see Figure 6, right).

Figure 6: 
Minimal and maximal f
0 (mean and SD, in semitones) of the three analyzed rising tunes (MC Tone 2 and the Hungarian monosyllabic and alternative questions) on the left, and f
0 range of the utterances (in semitones) on the right.
Figure 6:

Minimal and maximal f 0 (mean and SD, in semitones) of the three analyzed rising tunes (MC Tone 2 and the Hungarian monosyllabic and alternative questions) on the left, and f 0 range of the utterances (in semitones) on the right.

3.1.2 Dynamic analysis of rising tunes

In the basic GAMM model comparing rising tunes, the f 0 change was analyzed throughout the normalized duration of the vocalic section of the monosyllabic utterances. The model was complemented by the ordered three-level factor variable tune type (a parametric term), which allowed us to obtain a better fit than the basic model (χ 2 (8.00) = 2,632, p < 0.001). The model was further complemented by a random smooth function applied by each f 0 trajectory. As a result of incorporating the parametric term and random smooth functions, the model could explain 99.6 % of the variance in the data. The estimated curve of the MC Tone 2 was characterized by a concave f 0 curve (edf = 8.8; p < 0.001), featuring its lowest f 0 values positioned at the 20–40 % interval of normalized duration of the vocalic phase (see Figure 7). The f 0 curve of Hungarian monosyllabic questions was characterized by a significantly steeper rising pattern (edf = 8.6; p < 0.001) compared to the MC Tone 2, featuring an intersection of both the MC Tone 2 and the Hungarian alternative question (edf = 8.3; p < 0.001) (see Figure 7). MC Tone 2 featured exclusively one single negative excursion and showed a rising pattern beginning at approximately 40 % of the normalized duration. In contrast, the shape of both Hungarian question contours was realized with a more linear pattern containing two excursions: one negative excursion at 20 % and one positive excursion at 90 % of the normalized duration. Thus, in both Hungarian interrogative contours, the rising phase starts from 20 % of the normalized duration, as opposed to MC Tone 2, where it starts at approximately 40 %. This means that compared to the two Hungarian question intonation patterns, where the rising phase dominates 80 % of the vocalic section, the MC Tone 2 approximates a falling-rising pattern. The intervals of normalized duration in which the three f 0 curves significantly differ are shown in Table 2.

Figure 7: 
The f
0 curves of the analyzed rising tunes (red solid line = MC Tone 2, blue dashed line = Hungarian monosyllabic yes/no question, green dotted line = Hungarian alternative question).
Figure 7:

The f 0 curves of the analyzed rising tunes (red solid line = MC Tone 2, blue dashed line = Hungarian monosyllabic yes/no question, green dotted line = Hungarian alternative question).

Table 2:

The significantly different intervals of normalized duration of rising tones.

MC Tone 2 – Hungarian monosyllabic yes/no question 0–36 % 58–100 %
Tone 2 – Hungarian alternative question 0–78 % 80–100 %
Hungarian monosyllabic yes/no question – Hungarian alternative question 0–89 %

3.2 Falling tunes

3.2.1 Static analyses of falling tunes

As regards falling tunes, maximal f 0 and minimal f 0, as well as the f 0 range dependent variables were all influenced significantly by the tune type independent variable (f 0max F (2, 25) = 188, p < 0.001; f 0min F (2, 25) = 22, p < 0.0001; f 0 range: F (2, 25) = 34, p < 0.001). Post-hoc tests regarding the maximal f 0 did not show differences between MC Tone 4 and the Hungarian imperative intonation, but the Hungarian declarative tune was characterized by a significantly lower maximal f 0 mean compared to the other two tunes (p < 0.001, see Figure 8, left). Concerning minimal f 0, the Hungarian imperative tune featured a significantly higher mean compared to the Hungarian declarative pattern. On the other hand, MC Tone 4 did not differ from either of the Hungarian falling tunes, which can be attributed to the relatively high standard deviation in the data (see Figure 8). In terms of f 0 range, the Hungarian declarative tune surfaced with a significantly narrower, compressed f 0 range compared to MC Tone 4 (p < 0.05) and the Hungarian imperative tune (p < 0.001). It is important to note that MC Tone 4 featured a higher average pitch range, as a tendency, compared to the Hungarian imperative tune, and the data for the Chinese Tone 4 also showed more variability, possibly explaining the lack of significant difference between MC Tone 4 and the Hungarian declarative tune. F 0 range in MC Tone 4 descended by approximately 9.2 semitones (equivalent to a major sixth), while the Hungarian imperative tune featured a falling phase of approximately 8.3 semitones (corresponding to a minor sixth), and the Hungarian declarative tune descended approximately 5.9 semitones (a fifth, see Figure 8, right).

Figure 8: 
Minimal and maximal f
0 (mean and SD, in semitones) of the three analyzed falling tunes (MC Tone 4 and the Hungarian imperative and declarative tunes) on the left, and f
0 range of the utterances (in semitones) on the right.
Figure 8:

Minimal and maximal f 0 (mean and SD, in semitones) of the three analyzed falling tunes (MC Tone 4 and the Hungarian imperative and declarative tunes) on the left, and f 0 range of the utterances (in semitones) on the right.

3.2.2 Dynamic analyses of falling tunes

In the dynamic analysis of falling f 0 contours, the basic GAMM model (examining the f 0 change within the normalized duration) was extended with the parametric term of tune type (which was an ordered factor variable) to achieve better fitting to the data (χ 2 (8.00) = 7,789, p < 0.001). Similarly to rising tunes, a random smooth function was applied by each f 0 trajectory. In this manner, the final model explained 99.7 % of the variability found in the data. The results of the GAMM showed that the Hungarian imperative f 0 contour differed significantly from the MC Tone 4’s curvature (edf = 6.3; p < 0.001). Similarly, the f 0 pattern of the Hungarian declarative tune also featured significantly different realization compared to the MC Tone 4 (edf = 8.1; p < 0.001). These differences are also apparent in Figure 9, where curvatures of the three falling tunes are all well separated from each other (see also Table 3). MC Tone 4 was realized with a slightly convex f 0 curve, initiated from a relatively high f 0 level. In contrast, the Hungarian declarative f 0 curve featured a concave shape, positioned to a relatively lower f 0 range than the other two curves. The f 0 pattern of Hungarian imperatives was located between the other two descending patterns, approximating a quasi-linear pattern.

Figure 9: 
The f
0 curves of the analyzed falling tunes (red solid line = MC Tone 4, blue dashed line = Hungarian imperative tune, green dotted line = Hungarian declarative pattern).
Figure 9:

The f 0 curves of the analyzed falling tunes (red solid line = MC Tone 4, blue dashed line = Hungarian imperative tune, green dotted line = Hungarian declarative pattern).

Table 3:

The significantly different intervals of normalized duration of falling tones.

Tone 4 – Imperative 0–100 %
Tone 4 – Declarative 0–100 %
Imperative – Declarative 0–100 %

3.3 The duration of rising and falling tunes

Tune types also showed significant difference across their duration (rising tunes: F (2, 25) = 259, p < 0.001), falling tunes: F (2, 25) = 17, p < 0.001)). Among rising tunes, all three tune types differed significantly (p < 0.001), with the MC Tone 2 featuring the longest mean duration and the alternative questions being the shortest. Consequently, Hungarian monosyllabic questions had an intermediate duration compared to the two previously mentioned rising tunes. As for the analyzed falling tunes, only the realization of Hungarian imperative and declarative tunes differed significantly (p < 0.001), with the imperative utterance featuring a longer mean duration compared to the declarative realization. This also means that no significant difference was found in terms of duration between the MC Tone 4 and the two Hungarian falling intonation contours, presumably because the average duration of the MC Tone 4 fell between the mean durations of the two Hungarian falling tunes (see Figure 10). Furthermore, we should note that considering only mean durations, MC Tone 2 has a strikingly long realization compared to MC Tone 4 as well as to each of the Hungarian tunes, yielding a length of approximately 200 ms.

Figure 10: 
The duration (mean and SD) of the vocalic phases of the analyzed rising and falling tunes (dark grey bar plots = rising tunes, light grey bar plots = falling tunes).
Figure 10:

The duration (mean and SD) of the vocalic phases of the analyzed rising and falling tunes (dark grey bar plots = rising tunes, light grey bar plots = falling tunes).

4 Discussion

The presented acoustic analysis focused on the comparison of Mandarin Chinese lexical tones and Hungarian monosyllabic intonation contours. The main aim of the experiment was to explore whether atonal Hungarian (L1) intonation patterns are applicable in teaching lexical tones in MC (as the L2), and to investigate in what ways and to what extent atonal and tonal monosyllabic contours differ. The results were expected to shed light on the validity of teaching methodologies, applying analogies between L1 intonation patterns and L2 lexical tones. If L1 and L2 patterns were found to be identical, then the substitution of L2 lexical tones by the L1 intonation patterns could be directly utilized in teaching MC tones for Hungarian speakers, which could provide a simple and genuine technique aiding the proper acquisition of MC tones without requiring any previously acquired musical skills. In contrast, if L1 and L2 patterns were found to differ, and the teaching methodologies in question were proven to be false and misleading, then the results of the present analysis could serve as a basis for formulating hypotheses on difficulties Hungarian learners experience owing to L1 transfer and equivalence classification when learning MC lexical tones.

The investigation was focused exclusively on rising and falling contours: MC Tone 2 was compared to the Hungarian monosyllabic yes/no question intonation pattern, and to the first conjunct of an alternative question (characterized by a rising tune). In the ‘realm’ of falling tunes, MC Tone 4 was contrasted with the Hungarian declarative and imperative patterns. The rising and falling tunes were analyzed separately, examining four static and one dynamic acoustic aspect. The static analysis included the analysis of minimal and maximal f 0, f 0 range, and duration. The dynamic analysis focused on the realization of the shape of f 0 curvatures.

Concerning Hungarian tunes, we may find inconsistencies comparing the results of the present study with those introduced in the literature review, however, since these acoustic dissimilarities between the two measurements are not strictly relevant in the current analysis, in-depth explanations of the reasons behind these differences are not going to be presented here.[1] Thus, from this point on, we are focusing primarily and exclusively on the results regarding the comparison of MC lexical tones and Hungarian intonation patterns.

Rising MC Tone 2 and the two rising Hungarian question intonation patterns shared the following acoustic features:

  1. the minimal f 0 of MC Tone 2 was found to be equivalent to the minimal f 0 of Hungarian monosyllabic yes/no questions.

  2. in contrast, the maximal f 0 of MC Tone 2 was identical to that of Hungarian alternative questions.

This suggests that the MC Tone 2 had an intermediate realization compared to the two Hungarian question types investigated, while also possessing an intermediate f 0 pitch range. As regards the f 0 curves, MC Tone 2 featured a relatively concave pattern with a delayed rising phase initiated at approximately 40 % of the vocalic section. In contrast to this concave realization of MC Tone 2, both Hungarian question intonation patterns showed a rather linear-like shape, with both tunes featuring one negative and one positive excursion, at approximately the same position within the vocalic section. These results indicate certain acoustic dissimilarities between the analyzed rising f 0 patterns, but further perceptual measurements are required to determine to what extent these acoustic differences affect Mandarin Chinese speakers’ judgement on accuracy and correctness of the tonal realizations. It can be argued that MC Tone 2 and the two Hungarian question intonation patterns are not completely identical in their acoustic realizations, yet they do share some acoustic features, which could at least serve as an anchor when teaching MC Tone 2 production to Hungarian learners by determining the minimal and maximal threshold of the lexical tone’s realization. Furthermore, the acoustic dissimilarities between MC and Hungarian rising tunes might serve as a basis for formulating hypotheses on how Hungarian learners’ f 0 curves differ from native Chinese patterns, if the transfer of L1 patterns in L2 production is assumed to occur, owing to equivalence classification, i.e., Hungarian learners of MC are expected to produce MC Tone 2 with a steeper and more linear f 0 pattern compared to native MC speakers’ production.

It is important to note that the distinction between MC lexical tones, more particularly, between MC Tone 2 and 4, is not based solely on characteristics related to the change of f 0 (e.g., f 0 range or rising/descending curve), but also on the duration of the monosyllabic vocalic section. As opposed to MC Tone 2, with a relatively long average duration during which the f 0 rose 5.5 semitones, MC Tone 4 units were realized in roughly half the time of Tone 2 ones and descended approximately 9.2 semitones. Zhang and colleagues (2022) argued that in whispered MC speech, duration serves as a secondary acoustic cue for MC lexical tone perception and identification, since native Chinese speakers are inclined to emphasize these temporal differences in whispered speech to maintain the contrast primarily expressed by the f 0 contour (Zhang et al. 2022). Thus, in MC teaching, it would be crucial to emphasize these durational differences between MC lexical tones, and in this manner develop and encourage more native-like production. The emphasis made on durational differences of MC lexical tones would also be crucial because – based on the presented results – Hungarian intonation patterns are not characterized by enormous durational differences; thus, it is reasonable to suppose that Hungarian learners would not be attentive and sensitive to duration as an acoustic trait without careful instruction.

Regarding the analyzed falling tunes, MC Tone 4 was found to be identical in all static characteristics with the Hungarian imperative tune, meaning that the two tones did not show differences in terms of either minimal or maximal f 0, f 0 range, or in terms of the duration of the vocalic section. Thus, as far as these static acoustic aspects are concerned, these two above-mentioned falling tunes could be considered identical. The only difference between the falling MC Tone 4 and the Hungarian imperative tune was the shape of the f 0 curve: MC Tone 4 was initiated on a higher f 0 level, featuring a more domed and convex f 0 curve compared to the Hungarian imperative pattern. Consequently, it remains a question for a future perceptual analysis whether producing MC Tone 4 with the f 0 curve of the Hungarian imperative tune influences the perception of native MC speakers.

In sum, the conformity of rising tunes in Chinese and Hungarian was not confirmed by our measurements. Thus, applying the teaching technique in question, that is, the substitution of MC lexical tones with Hungarian rising interrogative intonation patterns, should only be applied with caution (if at all). On the other hand, in the domain of falling patterns, our analysis confirmed that MC Tone 4 and the Hungarian imperative tune indeed share almost all of their analyzed acoustic features; only the shape of their f 0 curvature differs somewhat. Thus, we conclude that applying the Hungarian monosyllabic imperative tune in MC Tone 4 teaching could be more useful in establishing native-like pronunciation than in the case of rising tunes. As a brief summary on the acoustic features of monosyllabic f 0 contours of lexical tones compared to intonation patterns, we conclude that although the f 0 patterns and f 0 features of atonal Hungarian and tonal MC showed differences, they also shared many identical features: the f 0 contours emerged in approximately the same f 0 register, and with similar f 0 ranges. These fundamental similarities between atonal and tonal monosyllabic patterns can be attributed to physiological reasons, i.e., the general features and limitations of vocal fold vibration. Last but not least, we have also established that duration (which directly affects the velocity of f 0 change) is also an acoustic feature to be taken into consideration in the comparison of tonal and atonal f 0 patterns.

The results of this acoustic study could directly be applied in teaching MC as a second language, as well as serving as a basis in explaining MC lexical tone production problems originating from L1 transfer, that is, when the Hungarian (L1) monosyllabic intonation pattern is applied in Mandarin Chinese production in the case of Hungarian learners of MC. In addition, the results of this research contribute to the deeper understanding of the differences between atonal and tonal tunes, which might also be applicable in the teaching and learning of Chinese as a foreign language.


Corresponding author: Kornélia Juhász, HUN-REN Hungarian Research Centre for Linguistics, 33. Benczúr utca, 1068 Budapest, Hungary; and ELTE Eötvös Loránd University, Budapest, Hungary, E-mail:

About the authors

Kornélia Juhász

Kornélia Juhász is a junior research fellow at the HUN-REN Hungarian Research Centre for Linguistics, who completed her doctoral studies in 2024 at the Eötvös Loránd University. The research topic of her dissertation focused on the acoustic analysis of Mandarin segments in the production of Hungarian learners of Chinese. However, her research area was extended to the acoustic analysis of the acquisition of Mandarin lexical tones and intonation recently, including in the production of atonal Hungarian speakers.

Huba Bartos

Huba Bartos is a senior research fellow at the HUN-REN Hungarian Research Centre for Linguistics, and an Associate Professor in the Institute of East Asian Studies at Eötvös Loránd University in Budapest. His main research fields are syntax and semantics, but has also recently turned his attention towards experimental phonetics and L2 acquisition studies. Besides these, he is editor-in-chief of a pair of new Chinese-Hungarian (2019) and Hungarian-Chinese (2024) dictionaries.

  1. Research funding: This paper reports results of the research carried out in the collaborative project under grant no. NKM2023-15 of the Hungarian Academy of Sciences and the Chinese Academy of Social Sciences and was supported by the EKÖP-24 University Excellence Scholarship Program.

Appendix

Table A:

The Hungarian utterances in dialogues (target words in bold).

Hungarian monosyllabic utterances (with Hungarian orthography and IPA)
Alternative and unmarked monosyllabic interrogative Imperative Declarative
[tøː] vagy tó?

–Nem tudom. Tő?

(Is that a vine or a lake? I don’t know. Vine?)
Egy távoli országban a következő jókívánsággal szokták bátorítani a tőkéket:

– Tő! Váljék belőled jó bor!

(In a distant country, people encourage the growth of the grapevines with the following phrase: Vine! May you turn into good wine!)
–Hogy hívják a szőlő szárát?

–Tő.

(What do they call the stem of a grape? Vine.)
sző [søː] –Sző vagy fon?

–Nem tudom. Sző?

(Is he weaving or braiding? I don’t know. Weaving?)
Egy távoli országban a halált jelentő szó hasonlít a magyar „sző” szóhoz. Ezért gyakran felszólítják a halált, hogy menjen el messzire: Sző! Távozz tőlünk!

(In a distant country, a word that means death resembles the Hungarian word ‘sző’. That’s why they often urge death to go far away: Sző! Depart from us!)
–Mit csinál Peti a szövőszéken?

–Sző.

(What is Peti doing on the weaving machine? He is weaving.)
[køː] –Kő vagy lő?

–Nem tudom. ?

(Stone or shooting? I don’t know. Stone?)
Pali legjobb barátja egy kavics, akit Kőnek hívnak. Gyakran így szól hozzá:

Kő! Gyere ide hozzám!

(Pali’s best friend is a pebble named Kő. He often says to it: Kő! Come here to me!)
–Mi az a kemény anyag?

–Kő.

(What is that hard substance? Stone.)
[høː] –Hő vagy hó?

–Nem tudom. Hő?

(Temperature or snow? I don’t know: Temperature?)
Amikor nagyon meleg van az emberek felszólítják az időjárást: Hő! Légy egy kicsit alacsonyabb!

(When it is very hot, people order the weather: Temperature! Be a bit lower!)
–Milyen mérővel szoktunk lázat mérni?

–Hő.

(What do we measure in case of fever? Temperature.)
cső [tʃøː] Cső vagy csá?

–Nem tudom. Cső?

(Uttered when offering the choice between two types of informal greetings: Tube (hi) or hey? I don’t know. Tube (hi)?)
A vízvezetékszerelő 10 óra munka után így szól a vízvezetékhez: Cső! Most már nehogy kilyukadj nekem!

(After a long work the plumber orders the tube: Tube! Don’t you dare to leak!)
–Mi az a henger, amiben folyik a víz?

–Cső.

(What is that cylinder called in which the water flows? –Tube.)
Table B:

The recorded MC monosyllabic units.

MC monosyllabic units by consonant context (with MC characters, pinyin, IPA and meaning)
Tone 2 Tone 4
de [d̥ɤ] 得2 [d̥ɤ] ‘get’ 嘚 4 [d̥ɤ] ‘cocky’
se [sɤ] 2 [sɤ] pseudoword 色4 [sɤ] ‘color’
ge [g̥ɤ] 格2 [g̥ɤ] ‘square formed by crossed lines’ 各4 [g̥ɤ] ‘all, every’
he [hɤ] 河2 [hɤ] ‘river’ 贺4 [hɤ] ‘congratulate’
zhe [d̥ʂɤ] 折2 zhé [d̥ʂɤ] ‘break’ 这4 zhè [d̥ʂɤ] ‘this’

References

Barnes, Jonathan, Nanette Veilleux, Alejna Brugos & Stefanie Shattuck-Hufnagel. 2010. The effect of global F0 contour shape on the perception of tonal timing contrasts in American English intonation. In Speech Prosody 2010, vol. 445(1–4). Chicago: Speech Prosody.10.21437/SpeechProsody.2010-150Search in Google Scholar

Bates, Douglas, Martin Mächler, Ben Bolker & Steve Walker. 2015. Fitting linear mixed-effects models using lme4. Journal of Statistical Software 67(1). 1–48. https://doi.org/10.18637/jss.v067.i01.Search in Google Scholar

Boersma, Paul & David Weenink. 2019. Praat: Doing phonetics by computer. [Computer program]. Version 6.1.05. Amsterdam: University of Amsterdam. http://www.praat.org/ (accessed 4 November 2019).Search in Google Scholar

Bolla, Kálmán. 1995. Magyar fonetikai atlasz [Hungarian atlas of phonetics]. Budapest: Nemzeti Tankönyvkiadó.Search in Google Scholar

Chao, Yuen Ren. 1948/1963. Mandarin primer. Cambridge: Harvard University Press.10.4159/harvard.9780674732889Search in Google Scholar

Flege, James Emil. 1995. Second language speech learning: Theory, findings, and problems. In Winifred Strange (ed.), Speech perception and linguistic experience: Theoretical and methodological issues in cross-language speech research, 233–277. Timonium: York Press.Search in Google Scholar

Flege, James Emil & Ocke Schwen Bohn. 2021. The revised speech learning model (SLM-r). In Ratree Wayland (ed.), Second language speech learning: Theoretical and empirical progress, 3–83. Cambridge: Cambridge University Press.10.1017/9781108886901.002Search in Google Scholar

Fónagy, Iván & Klára Magdics. 1967. A magyar beszéd dallama [Melodies of Hungarian speech]. Budapest: Akadémiai Kiadó.Search in Google Scholar

Gósy, Mária. 2004. Fonetika, a beszéd tudománya [Phonetics, The science of speech]. Budapest: Osiris.Search in Google Scholar

Guan, Jian [关 键]. 2000. 声 调 教 学 改 革 初 探 [Preliminary Exploration on Reformation of tone teaching]. 语言教学与研究 [Language Teaching and Linguistic Studies] 4. 51–54.Search in Google Scholar

Han, Yueqiao, Martijn Goudbeek, Maria Mos & Marc Swerts. 2019. Mandarin tone identification by tone-naïve musicians and non-musicians in auditory-visual and auditory-only conditions. Frontiers in Communication 4. https://doi.org/10.3389/fcomm.2019.00070.Search in Google Scholar

Hart, Joseph T., René Collier & Raymond Cohen. 1990. A perceptual study of intonation: An experimental phonetic approach to speech melody. Cambridge: Cambridge University Press.10.1017/CBO9780511627743Search in Google Scholar

Juhász, Kornélia. 2020. A mandarin illabiális veláris magánhangzó [ɤ], illetve az alveoláris [ɹ] és posztalveoláris [ɻ] approximánsok produkciója kínaiul tanuló magyarok körében [The Mandarin illabial velar [ɤ], the alveolar [ɹ] and post-alveolar [ɻ] approximants in the production of Hungarian learners of Chinese]. Alkalmazott nyelvtudomány 20(2). 1–16. https://doi.org/10.18460/ANY.2020.2.007.Search in Google Scholar

Juhász, Kornélia. 2021. A mandarin explozívák ejtése magyar anyanyelvűeknél [Mandarin Chinese plosives in the production of Hungarian learners of Mandarin]. In Tekla Etelka Gráczi & Zsófia Ludányi (eds.), Doktoranduszok tanulmányai az alkalmazott nyelvészet köréből 2021, 257–284. Budapest: Hungarian Research Centre of Linguistics.Search in Google Scholar

Juhász, Kornélia & Huba Bartos. 2023. Mandarin Chinese sentence intonation patterns in the production of Hungarian learners of Chinese. In 1st international conference on tone and intonation (TAI), 71–75. Sønderborg: International Speech Communication Association.10.21437/TAI.2021-15Search in Google Scholar

Kesztler, Lőrinc. 1952. Összhangzattan [Harmonics]. Budapest: Editio Musica Budapest Zeneműkiadó.Search in Google Scholar

Kuznetsova, Alexandra, Per B. Brockhoff & Rune Christensen. 2017. lmerTest package: Tests in linear mixed effects models. Journal of Statistical Software 82(13). 1–26. https://doi.org/10.18637/jss.v082.i13.Search in Google Scholar

Lenth, Russel. 2020. Emmeans: Estimated marginal means, aka least-squares means. R package version 1.7.1. https://cran.r-project.org/web/packages/emmeans/emmeans.pdf (accessed 13 March 2020).Search in Google Scholar

Markó, Alexandra. 2007. Kérdő funkciójú hanglejtésformák a spontán beszédben [Tunes with interrogative functions in spontaneous speech]. In Mária Gósy (ed.), Beszédkutatás 2007, 59–74. Budapest: MTA Nyelvtudományi Intézet.Search in Google Scholar

Markó, Alexandra. 2017. Hangtan [Phonetics]. In András Imrényi Nóra Kugler, Mária Ladányi, Alexandra Markó, Szilárd Tátrai & Gábor Tolcsvai Nagy (eds.), Nyelvtan [Hungarian grammar], 75–203. Budapest: Osiris.Search in Google Scholar

Moore, Corinne & Allard Jongman. 1997. Speaker normalization of Mandarin Chinese tones. The Journal of the Acoustical Society of America 102(3). 1864–1877. https://doi.org/10.1121/1.420092.Search in Google Scholar

Nolan, Francis. 2003. Intonational equivalence: An experimental evaluation of pitch scales. In Proceedings of the 15th international congress of phonetic sciences, 771–774. Bacelona: Universitat Autònoma de Barcelona.Search in Google Scholar

Olaszy, Gábor. 2002. A magyar kérdés dallamformáinak és intenzitásszerkezetének fonetikai vizsgálata [The phonetic analysis of Hungarian interrogative tunes and their intensity structure]. In Mária Gósy (ed.), Beszédkutatás 2002 – Kísérleti beszédkutatás, 83–99. Budapest: MTA Nyelvtudományi Intézet.Search in Google Scholar

Olaszy, Gábor. 2006/2016. Hangidőtartamok és időszerkezeti elemek a magyar beszédben [Duration of segments and other structural units in Hungarian speech]. Budapest: Akadémiai Kiadó.Search in Google Scholar

Quené, Hugo. 2014. hqmisc: Miscellaneous convenience functions and dataset. 0.1-1-es R package version. Available at: https://doi.org/10.32614/CRAN.package.hqmisc.Search in Google Scholar

R Core Team. 2019. R: A language and environment for statistical computing. Version 3.6.1. Vienna, Austria: R Foundation for Statistical Computing. https://www.R-project.org/ (accessed 4 November 2019).Search in Google Scholar

Rij, Jacolien van, Martijn Wieling, Rolf Harald Baayen & Hedderik van Rijn. 2020. itsadug: Interpreting time Series and autocorrelated data using GAMMs. R package version 2.4.Search in Google Scholar

Shen, Xiaonan S. 1989. Interplay of the four citation tones and intonation in Mandarin Chinese. Journal of Chinese Linguistics 17(1). 61–74.Search in Google Scholar

Shen, Xiaonan S. 1990. The prosody of Mandarin Chinese. California: University of California Press.Search in Google Scholar

So, Connie K. & Catherine Best. 2014. Phonetic influences on English and French listeners’ assimilation of Mandarin tones to native prosodic categories. Studies in Second Language Acquisition 36. 195–221. https://doi.org/10.1017/S0272263114000047.Search in Google Scholar

Varga, László. 2002. The intonation of monosyllabic Hungarian yes-no questions. Acta Linguistica Hungarica 49(3–4). 307–320. https://doi.org/10.1556/ALing.49.2002.3-4.4.Search in Google Scholar

Wickham, Hadley. 2016. ggplot2: Elegant graphics for data analysis. New York: Springer-Verlag.10.1007/978-3-319-24277-4_9Search in Google Scholar

Wood, Simon N. 2017. Generalized additive models: An introduction with R. New York: Chapman and Hall/CRC.10.1201/9781315370279Search in Google Scholar

Xu, Yi. 1997. Contextual tonal variations in Mandarin. Journal of Phonetics 25. 61–83. https://doi.org/10.1006/jpho.1996.0034.Search in Google Scholar

Ye, Qiuyue & Huba Bartos. 2017. The comparison of Hungarian and Chinese phonological systems: A pedagogical perspective. In Istvan Kecskes (ed.), Explorations into Chinese as a second language. Educational linguistics, 31–63. Dordrecht: Springer.10.1007/978-3-319-54027-6_2Search in Google Scholar

Yu, Jiang [喻江]. 2007. 声调教学新教案 [A New Teaching Plan for Chinese Tones]. 语言教学与研究 [Language Teaching and Linguistic Studies] 1. 77–81.Search in Google Scholar

Zhang, Hui, Seth Wiener & Lori L. Holt. 2022. Adjustment of cue weighting in speech by speakers and listeners: Evidence from amplitude and duration modifications of Mandarin Chinese tone. The Journal of the Acoustical Society of America 151. 992–1005. https://doi.org/10.1121/10.0009378.Search in Google Scholar

Published Online: 2024-09-24
Published in Print: 2024-10-28

© 2024 the author(s), published by De Gruyter, Berlin/Boston

This work is licensed under the Creative Commons Attribution 4.0 International License.

Downloaded on 29.9.2025 from https://www.degruyterbrill.com/document/doi/10.1515/caslar-2024-2001/html
Scroll to top button