Zum Hauptinhalt springen
Artikel Open Access

Tonal language experience facilitates pitch perception in L2/3 Japanese

  • EMAIL logo und
Veröffentlicht/Copyright: 13. April 2026
Veröffentlichen auch Sie bei De Gruyter Brill

Abstract

This study tested nine groups of listeners on their discrimination of three pitch accent patterns in Japanese-like nonwords. We performed three analyses on listeners’ pitch discrimination accuracy. The first compared two tonal language-speaking groups (Mandarin and Cantonese), the second involved L1 English groups with tonal or nontonal L2 classroom experience, and the third L2/3 Japanese learners at two proficiency levels. The main finding of the study revealed that speaking a tonal L1 or having learned a tonal language as an L2 facilitates the discrimination of Japanese pitch accent patterns. However, accuracy differences between the two tonal L1 groups were not detected on the discrimination task. In addition, Japanese proficiency only increased perceptual accuracy for L1 tonal speakers. These findings provide further evidence for facilitative transfer of tone experience to the prelexical processing of pitch in an L2/3.

1 Introduction

Listeners perceive fundamental frequency (F0) as pitch. Pitch is an integral part of speech that listeners use to identify word stress, word meaning, and sentence intonation (among other features; Cutler 2012). This central role of pitch in speech perception means that a listener’s first language (L1) exerts a powerful influence over how they perceive pitch in a new language (Chang 2018; Chen et al. 2023; Gandour 1983). For example, an L1 Mandarin speaker heavily weights pitch movement because Mandarin is a tonal language and pitch patterns convey lexical information (Chandrasekaran et al. 2010; Wong and Perrachione 2007). In contrast, an L1 English speaker weights pitch cues at the word level far less, as pitch primarily conveys post-lexical information (Francis et al. 2008; Jongman and Wade 2007). Japanese, the target language in the current study, is situated between these two languages in its lexical use of pitch to distinguish segmental homophones (e.g., [i.ɕi] ‘will’ vs. [i.ɕi] ‘stone’) (Shibata and Shibata 1990).

Recent work (e.g., Choi 2021; Qin et al. 2017; Tong et al. 2015; Wiener and Goss 2019) suggests that perception of F0 can be ‘better than native’ in the sense that a listener whose L1 is tonal, like Mandarin or Cantonese, can better discriminate pitch cues in a nonnative language (Lx) like English or Japanese than L1 listeners of those respective languages. Other recent evidence indicates that this advantage is not limited to tonal L1 speakers – learning a tonal language as an L2 or L3 can also enhance the perception of F0-cued contrasts in a new language (Qin and Jongman 2016; Qin et al. 2024). For instance, after one semester of classroom Mandarin training, Japanese-naïve L1 English speakers increase their sensitivity to Japanese pitch cues to a level comparable to L1 Japanese speakers (Wiener and Goss 2018). Here, we describe this benefit in pitch perception derived from speaking a tonal L1 or from learning a tonal Lx as facilitative transfer of pitch perception.

In the current study, we examine the limits of facilitative transfer using Japanese lexical pitch accent as a test case. Japanese serves as a midpoint between typologically tonal (e.g., Cantonese, Mandarin) and primarily intonational (e.g., English, German) languages in its lexical use of pitch (Beckman and Pierrehumbert 1986). This allows us to examine how listeners perceive a language in which pitch is either more informative or less informative than their L1 or Lx. We test a total of nine groups of listeners from three different L1 backgrounds, including six groups which are engaged in different classroom Lx learning, on their pitch accent discrimination behavior in Japanese-like nonwords. We include a range of learning experiences, which enables us to model three variables of primary interest – L1 tonal status, Lx tonal status, and target-language (Japanese) proficiency – each of which has generally been considered in isolation. We aim to better understand the facilitative transfer reported in prior studies by modelling the effects of these three variables on Japanese pitch accent discrimination. We frame our study with the following three research questions:

Research Question 1:

Do listeners with different tonal L1s show differences in Japanese pitch perception?

Research Question 2:

Can learning any additional (L2+) language improve pitch perception or must the language be tonal?

Research Question 3:

What role does L2/L3 Japanese learner proficiency play in pitch perception?

2 Background

2.1 Tonal advantage in Lx perception

Tonal L1 speakers often display an advantage in perceiving pitch cues over speakers of languages where pitch is primarily used for intonation. While the origin of this advantage is debated, it has been observed in crosslinguistic studies on various target languages (e.g., Kaan et al. 2008; Wu et al. 2012). Two prominent models of speech learning posit that L2 sounds are filtered through L1 phonological knowledge. The Speech Learning Model (SLM) focuses on how listeners categorize L2 speech segments on the basis of L1 categories (Flege 1995). Its revised form, the SLM-r, maintains the emphasis on sound categorization but shifts its focus to how cue weighting patterns develop between L1 and L2 (Flege et al. 2021).

The Perceptual Assimilation Model (PAM) also treats the L1 as a filter for L2 sounds but considers the goodness-of-fit of phonological categories between L1 and Lx as the primary driver of perceptual behavior (PAM: Best 1995; Best and Tyler 2007). That is, listeners are better at discriminating or categorizing Lx tones when they assimilate to existing tonal or intonational categories in the L1 in a one-to-one manner, and worse when the mapping is many-to-one. Of the two models, only PAM has been applied to suprasegmentals (PAM for Suprasegmentals (PAM-S); So and Best 2014). Prior studies broadly framed in terms of category assimilation of suprasegmentals have found cross-linguistic assimilation in the perception of Mandarin and Cantonese tones (So and Best 2010; Wu et al. 2014), as well as English stress (Chan 2008). Categorical assimilation has also been reported in the perception and production of Japanese pitch accent (e.g., Ayusawa 2003; Hirano-Cook 2011; Laméris and Graham 2020; Shport 2016a).

In contrast, other studies have approached the question of tonal advantage through the lens of functional load of tone in the L1 (e.g., Surendran and Niyogi 2006). In other words, the greater the role that lexical tones play in word selection in the L1, the more weight that listeners tend to give these prosodic cues when perceiving tonal contrasts in a new language. While never formalized as a single model, functional load-driven advantage has been shown when the target language is tonal (Mandarin, Cantonese (Chang et al. 2017; Choi and Chiu 2023); Thai (Chan and Leung 2020; Wayland and Guion 2004)), pitch accent (Wiener and Goss 2019), or nontonal (i.e., stress or intonation perception; Choi et al. 2019). To be certain, the acoustic properties of tones themselves in the target language play a role in perceptual accuracy (Tong et al. 2015; Zhu et al. 2023), and the tonal advantage is not universal (Cooper and Wang 2012; Francis et al. 2008), but a substantial body of evidence points toward the facilitative role of tonal L1.

Behavioral studies have operationalized the functional load of F0 by grouping listeners (e.g., Laméris et al. 2023; Schaefer and Darcy 2014) by tonal status in their L1 (e.g., high, medium, and low), or by calculating informativeness of F0 based on corpus estimates of pitch, stress, and tone minimal pairs (Shibata and Shibata 1990; Wiener and Goss 2019). Here, we also assume the functional view that experience with L1/Lx cues is a primary driver of facilitative transfer of pitch in a new language.

In the current study, we address our first research question by comparing listeners from two tonal languages – Mandarin and Cantonese – on their perception of Japanese pitch accent. To our knowledge, no prior studies have compared two groups of tonal (Sinitic) language speakers on their perception of Japanese pitch accent. To refine the notion of facilitative transfer – that tonal language speakers can perceive unfamiliar pitch cues as well as or better than target language L1 speakers – we compare L1 speakers of these two languages, where pitch is highly informative, but nonetheless differs in tone inventory (i.e., six Cantonese tones vs. four Mandarin tones) and functional load. Specifically, Cantonese listeners may be better at Japanese pitch discrimination because of the slightly higher functional load of tone than in Mandarin (Oh et al. 2013) or its larger tone inventory (for evidence on tone inventory, see Hu et al. 2020). Here, we treat functional load and tone inventory as indexing the same construct. That is, we assume that the larger the lexical tone inventory a language possesses, the greater the functional load that tone carries in the language. This aligns with Wiener and Goss’ (2019) minimal pair-based index of functional load, which they termed informativeness of F0, as well as other information-theory approaches to phonological contrasts (e.g., Oh et al. 2013; Surendran and Niyogi 2006).

Conversely, from the vantage of feature assimilation (i.e., PAM-S), Mandarin tone 2 (rising F0) would assimilate to the Japanese LHH (unaccented) pitch pattern over a three-mora unit, while Mandarin tone 4 (falling F0) would correspond to the H*LL (initial accent) pattern. Cantonese features two rising tones (tones 2 and 5) and a falling tone (tone 4), in addition to three level tones (Yip 2002). However, neither language contains a tone category that resembles the Japanese LH*L (medial accent) pattern. Thus, given the presence of rising and falling tone categories in both languages, the two groups would perform similarly under an assimilation account.

2.2 Lx learning-derived tonal advantage

Tonal advantage in F0 perception can also be an outcome of Lx learning. Few studies have attempted to depict the complex, yet very real, picture of how learning additional languages influences L3 perception (for cumulative enhancement in morphosyntax, see Flynn et al. 2004). For nontonal L1 speakers, learning a tonal Lx can enhance perceptual accuracy for F0 cues in an unfamiliar language. For instance, Qin and Jongman (2016) found that L1 English learners of L2 Mandarin were more sensitive to certain Cantonese tone contrasts than L1 English monolinguals. Likewise, a recent replication of this study with L1 Korean speakers found that those who had learned L2 Mandarin could discriminate Cantonese tones on par with L1 Mandarin speakers (Qin et al. 2024).

For Japanese pitch perception, evidence exists for both category-specific assimilation, which would align with PAM’s predictions, as well as general enhancement resulting from Lx classroom learning of a tonal language. For example, L1 English-L2 Mandarin learners may assimilate Japanese pitch patterns (particularly the falling, H*-L pattern) to a Mandarin tonal category (tone 4, falling F0) after only one semester of Mandarin study, leading to higher accuracy on the discrimination of patterns that resemble Mandarin tones (Wiener and Goss 2018). On the other hand, a follow-up study also involving classroom learners and using the same Japanese stimuli found that L1 English listeners’ sensitivity to Japanese pitch accent was best predicted by informativeness of F0 in their L2 (Mandarin), rather than Japanese-specific experience (Wiener and Goss 2019). That is, these L1 English-L2 Mandarin learners discriminated all Japanese pitch patterns in the stimuli (HLL, LHL, LHH) equally well, while outperforming L2 Japanese learners who had classroom exposure to these patterns. This result suggests that category assimilation may not be the primary driver of perceptual behavior, mirroring recent findings that show extralinguistic factors such as non-linguistic tone processing ability (Laméris et al. 2023), or acoustic properties of tone contours themselves (Shport 2016b), contributing more to cross-linguistic tone perception than category assimilation.

However, the question remains as to whether the differences exhibited by tonal L2 learners in these studies were the result of facilitative tonal L2 to Lx transfer, or whether classroom input in any L2 might have triggered the cue reweighting (Wang et al. 1999), and thus greater sensitivity to Japanese pitch patterns. Two pieces of evidence motivate this question. First, L1 speakers of languages with rich intonational systems, but limited lexical prosody, such as German, show greater sensitivity to Mandarin tone mismatches than both L1 Japanese and French speakers, languages with fewer intonational shapes than German (Braun et al. 2014). This suggests that L2 German classroom learners may be similarly advantaged in their perception of Japanese pitch. German prosodically resembles English in that both are stress-accented languages with a tendency toward initial syllable stress. Both languages also display a similar range of pitch shapes at the utterance level. For example, both languages use falling pitch in statements and rising pitch in yes/no questions. Statement intonation may approximate the Japanese HLL accent pattern, and question intonation the rising LHH pattern. Yet in laboratory tasks German speakers are more sensitive to rising intonation than English speakers, perhaps because of their greater attention to F0 movement before a tonal accent (Kember et al. 2017). This heightened attention to rising intonation could potentially transfer to language-like tasks as well, giving German learners an advantage on the LHH (unaccented) pitch pattern in Japanese.

Second, classroom input does not necessarily reflect normal distributional patterns of language (Ellis 2002), including the “frequency dimension” of intonation (Mennen 2015). This allows for the possibility that classroom input on intonational features in a language like German could enhance learners’ perception of pitch or tone in a new language (but see Thomson and Derwing (2015) for discussion of L2 pronunciation instruction).

Because the previous design (in Wiener and Goss 2019) did not account for the possibility that classroom practice with the intonational features of any Lx may increase sensitivity to F0, we include a group of nontonal L1 English-L2 German learners, who were undergoing classroom German training at the time of the study. Thus, our second research question asks whether learning any additional language results in enhanced pitch discrimination ability or if the Lx learning must be tonal. To test this, we compare L1 English speakers currently studying three L2s – Mandarin, Japanese and German.

2.3 Proficiency and pitch perception

The final research question pertains to the role of Japanese-specific experience with pitch cues. We ask – do learners increase their sensitivity to pitch cues as their Japanese ability increases, and does this experience with Japanese pitch cues bring them into parity with L1 tonal speakers? The question of target-language proficiency is relevant to the current study because the notion of facilitative transfer predicts that tonal L1 status would continue to outweigh Japanese-specific experience with pitch, at least in pre-lexical pitch (i.e., nonword) discrimination. If cue-specific experience enables Japanese learners to close the gap with L1 tonal speakers, then L1 tonal status may not have such a privileged position in pitch perception.

Findings are mixed as to whether Japanese learners from nontonal backgrounds improve their pitch perception ability in step with overall proficiency gains. Some studies report gains in accuracy (Wu et al. 2017), yet others do not, even among learners above the beginner level (Shport 2008). Wiener and Goss (2019) found tonal language experience to be the best predictor of sensitivity to Japanese pitch patterns, but did not include L1 English speakers who had attained advanced L2 Japanese proficiency. Here, we compare L2/3 Japanese learners from both tonal and non-tonal backgrounds at two proficiency levels (beginner and advanced) on their pitch discrimination ability. This comparison enables us to see whether Japanese learner proficiency modulates pitch discrimination ability irrespective of the L1.

3 Methods

3.1 Participants

Nine participant groups (15 participants per group; n = 135) took part in the experiment (Table 1). Of the nine groups, four (n = 60) were newly recruited for this study. Data from the remaining five groups (n = 75) were incorporated from Wiener and Goss (2019), which followed identical procedures to the current study. All stimuli, data, and R code are available on the Open Science Framework (https://osf.io/9328s/).

Table 1:

Participant demographics and estimates of pitch cue informativeness.

Group Sex Age (yrs) Length of L2/3 study (yrs or mos) Time in US (yrs)
L1-Mandarin

(L2 English)
7M; 8F 23.4 10.2 4.5
L1-Mandarin +

(L2 English)

L3-Japanese (BEG)
9M; 6F 19.5 8.5

3.5 (mos)
2.4
L1-Mandarin +

(L2 English)

L3-Japanese (ADV)
8M; 7F 20.8 8.3

2.8
3.2
L1-Cantonese +

(L2 English)

L3-Japanese (BEG)
6M; 9F 19.5 11.7

3.2 (mos)
L1-English 7M; 8F 24.5
L1-English +

L2-German (BEG)
10M; 5F 20.3 4.1 (mos)
L1-English +

L2-Japanese (BEG)
9M; 6F 19.3 1.7
L1-English +

L2-Japanese (ADV)
7M; 8F 21.5 3.6
L1-English +

L2-Mandarin (BEG)
8M; 7F 21.2 1.4

In this study, we define L1 as the first language a speaker acquired as an infant. L2 and L3 are defined as languages learned after the L1 was already in place, with the L3 being learned after L2 was acquired (De Angelis 2007). “Native speaker” status was not asked as recent studies have shown it can be ambiguous and mean different things for different populations of speakers (Brown et al. 2023). Non-L1-English participants were matched on their L2 English proficiency by self-reported abilities and standardized English proficiency test scores (Test of English as a Foreign Language (TOEFL) or International English Language Testing System (IELTS). L1-Mandarin participants reported only speaking standard Mandarin and no other tonal dialect. L1-Cantonese speakers were from Hong Kong, were Cantonese-dominant, and reported basic Mandarin ability. The L1-English group not learning or exposed to an L2 were functionally monolingual and had 2 years or less of secondary school instruction in Spanish, French, Latin, or German.

For Lx learners, L1-English groups engaged in L2-Mandarin, L2-Japanese, or L2-German classroom learning were controlled for their respective level of L2 proficiency based on self-reported abilities, length of L2 study, and placement in comparable university courses. No participant had studied abroad in the target country. L1-Mandarin + L3-Japanese learners were recruited from an advanced Japanese class in the US and screened by placement test scores, which mainly measured grammatical and lexical knowledge, but also included a short aural comprehension task. L1-Cantonese + L3 Japanese learners were enrolled in a 1st-year Japanese class at a university in Hong Kong.

Participants were screened for their nonlinguistic pitch perception using an adaptive pitch test (Mandell 2018) or frequency discrimination test (Lutman 2004). In both tasks, listeners heard a 500 Hz reference tone followed by a tone that differed by hertz intervals. Listeners judged by button press whether the second tone was of higher or lower pitch than the first tone. The between-stimulus frequency difference either increased or decreased depending on the accuracy of participants’ responses. All 135 participants were able to reliably differentiate two tones at a 20 Hz interval or lower (range: 1.4–20 Hz). Whereas any cutoff is arbitrary, 20 Hz provided us with sufficient power and screened for potential congenital amusia (e.g., Zhu et al. 2023).

3.2 Materials

We employed an ABX discrimination task with Japanese-like nonwords. Ten nonwords with a segmental structure of CV.CV.CV were used as stimuli (see our Open Science Framework repository for all items and acoustic measurements). Each trimoraic stimulus was recorded in a carrier sentence by a female native speaker of Tokyo Japanese with three existent pitch accent patterns: HLL, LHL, and LHH. Nonwords were then extracted in Praat (Boersma and Weenink 2020). The nonword tazana, for example, was produced as TAzana (HLL), taZAna (LHL), and taZANA (LHH). Acoustic parameters for each category aligned with accent categories used in previous studies (e.g., Cutler and Otake 1999; Shport 2015, 2016a) and, besides volume normalization, were presented to listeners without phonetic manipulation. The ABX stimuli consisted of three consecutive trimoraic nonwords with a 250 ms interstimulus interval. The first two sounds, A and B, only differed in pitch accent pattern (i.e., HLL, LHL, or LHH). Participants were asked to decide whether the third sound, X, matched the pitch pattern of the first (A) or second (B) sound. Participants completed a total of 120 trials (12 ABX patterns per target × 10 targets). See Appendix for a list of the nonword stimuli.

3.3 Procedures

Participants were instructed to indicate by button press whether the final sound (X) was similar to the first (A) or second (B) sound as quickly and accurately as possible. If participants did not respond within 2.5 s from the offset of X, the next trial would proceed automatically. AB ordering was counterbalanced across all trials with a 1 s intertrial interval. Two practice trials were presented using a practice stimulus of similar moraic structure and accent pattern as the stimuli. Stimuli were presented to listeners via headphones using either Superlab 5 or Eprime 2.0 software.

3.4 Statistical analysis

All analyses and visualizations were carried out in R (version 4.4.0; R Core Team 2024). Trials in which participants timed out (n = 263; <2 %) were first removed. Accuracy of the ABX trials (coded as 1 correct; 0 incorrect) was modeled using the lme4 package (Bates et al. 2015). A generalized linear mixed effects model (with a logit link function and the “bobyqa” optimizer) was built with items and participants as random intercepts. The independent variables included pitch accent pattern type as a three-level categorical variable (with LHH as the reference) and listener group, which changed given the research question with pairwise comparisons and adjusted p-values obtained using estimated marginal means (emmeans package; Lenth 2021). For RQ1, the L1 Cantonese and L1 Mandarin groups were compared to one another with the L1 Cantonese group as the reference level. For RQ2, the monolingual L1 English group was coded as the reference level and the three beginner L2 groups (L1 English-L2 Mandarin (Beg), L1 English-L2 German (Beg), L1 English-L2 Japanese (Beg)) were compared. For RQ3, four groups studying Japanese as an L2/L3 were examined. These groups were Beginner/Advanced pairs. The L1 English-L2 Japanese (Beg) was coded as the reference level and the L1 English-L2 Japanese (Adv), L1 Mandarin-L3 Japanese (Beg), and L1 Mandarin-L3 Japanese (Adv) were compared.

4 Results

4.1 Research question 1: listeners with different tonal L1s do not show differences in Japanese pitch perception

Figure 1 shows the results from the two tonal L1 groups. Both groups performed similarly as indicated by their respective 95 % confidence intervals: L1 Cantonese [0.90, 0.95], L1 Mandarin [0.90, 0.94]. The logistic regression model, which had moderate explanatory power (conditional R 2 = 0.22), revealed no difference between the two groups (β = −0.01, Z = −0.01, SD = 0.29, p = 0.99). Neither a main effect of pitch accent type nor a two-way interaction with accent type and listener group was found with an alpha-level of 0.05 indicating the two groups performed similarly across the three pitch accent types.

Figure 1: 
ABX discrimination accuracy for L1 Cantonese and L1 Mandarin participants. The figure shows each participant’s mean (points), and group box plots and density plots.
Figure 1:

ABX discrimination accuracy for L1 Cantonese and L1 Mandarin participants. The figure shows each participant’s mean (points), and group box plots and density plots.

4.2 Research question 2: only learning an additional tonal language improves Japanese pitch perception relative to the non-learning control group

Figure 2 shows the results from the four L1 English groups. Groups performed differently as indicated by their respective 95 % confidence intervals: L1 English [0.76, 0.79], L1 English-L2 German (Beg) [0.77, 0.81], L1 English-L2 Japanese (Beg) [0.83, 0.86], L1 English-L2 Mandarin (Beg) [0.85, 0.88]. The logistic regression model, which had moderate explanatory power (conditional R 2 = 0.22), revealed that relative to the L1 English group as the reference level, the L1 English-L2 German (Beg) performed similarly (β = 0.08, Z = 0.32, SD = 0.25, p = 0.98), the L1 English-L2 Japanese (Beg) performed similarly (β = 0.49, Z = 1.92, SD = 0.25, p = 0.22) and the L1 English-L2 Mandarin (Beg) performed significantly better (β = 0.72, Z = 2.81, SD = 0.25, p = 0.03). Estimated marginal means indicated that all three groups learning an L2 performed similarly (ps > 0.05). Overall, no difference in pitch accent types was found (ps > 0.05), however, a two-way interaction was found between pitch accent types and listener group, which indicated that for the HLL stimuli the L1 English group and L1 English-L2 Mandarin (Beg) group differed (β = 1.14, Z = 3.85, SD = 0.30, p = 0.007). In other words, the difference observed between these two groups was driven by performance on the HLL words only; LHH and LHL behavior did not differ (ps > 0.05).

Figure 2: 
ABX discrimination accuracy for four different L1 English groups. The figure shows each participant’s mean (points), and group box plots and density plots.
Figure 2:

ABX discrimination accuracy for four different L1 English groups. The figure shows each participant’s mean (points), and group box plots and density plots.

4.3 Research question 3: improved learner proficiency only helps tonal L1 listeners’ Japanese pitch perception

Figure 3 shows the results from the four L2/L3 Japanese learner groups. Groups performed differently as indicated by their respective 95 % confidence intervals: L1 English-L2 Japanese (Beg) [0.83, 0.86], L1 Mandarin-L3 Japanese (Beg) [0.88, 0.91], L1 English-L2 Japanese (Adv) [0.84, 0.87], L1 Mandarin-L3 Japanese (Adv) [0.95, 0.96]. The logistic regression model, which had substantial explanatory power (conditional R 2 = 0.28), revealed that relative to the L1 English-L2 Japanese (Beg) group as the reference level, the L1 English-L2 Japanese (Adv) performed similarly (β = 0.15, Z = 0.57, SD = 0.26, p = 0.94). The L1 Mandarin-L3 Japanese (Beg) performed similarly to the L1 English-L2 Japanese (Beg) group (β = 0.52, Z = 1.98, SD = 0.26, p = 0.19). The L1 Mandarin-L3 Japanese (Adv) outperformed the L1 English-L2 Japanese (Beg) group (β = 1.46, Z = 5.31, SD = 0.27, p < 0.001). Estimated marginal means indicated that the L1 Mandarin-L3 Japanese (Adv) outperformed the L1 Mandarin-L3 Japanese (Beg) (β = 0.94, Z = 3.38, SD = 0.28, p = 0.004). The L1 Mandarin-L3 Japanese (Adv) outperformed the L1 English-L2 Japanese (Adv) group (β = 1.31, Z = 4.77, SD = 0.28, p < 0.001). The L1 Mandarin-L3 Japanese (Beg) performed similarly to the L1 English-L2 Japanese (Adv) group (β = 0.37, Z = 1.42, SD = 0.26, p = 0.49). Overall, no difference in pitch accent types was found (ps > 0.05). A two-way interaction was found between pitch accent types and listener group for all three pitch accent types for differences found between L1 English-L2 Japanese (Beg) and L1 Mandarin-L3 Japanese (Adv) and for L1 English-L2 Japanese (Adv) and L1 Mandarin-L3 Japanese (Adv) (ps < 0.05). In other words, the differences were consistent across all three pitch accent types. Estimated marginal means of the interactions revealed that the L1 Mandarin-L3 Japanese (Adv) and L1 Mandarin-L3 Japanese (Beg) differences were null (ps > 0.05) when examined individually by pitch accent type. That is, the difference between the two L1 Mandarin groups with different L3 Japanese proficiencies was the result of an aggregate effect rather than a specific pitch accent type. In contrast, the differences between the L1 Mandarin-L3 Japanese (Adv) group and the two L1 English groups with different L2 Japanese proficiencies were consistent differences across all three pitch accent types.

Figure 3: 
ABX discrimination accuracy for four different Lx Japanese learner groups. The figure shows each participant’s mean (points), and group box plots and density plots.
Figure 3:

ABX discrimination accuracy for four different Lx Japanese learner groups. The figure shows each participant’s mean (points), and group box plots and density plots.

5 Discussion and conclusions

The current study explored facilitative transfer in low-level pitch perception derived from L1 knowledge, L2/3 learning, and target-language proficiency level. We tested nine listener groups on their discrimination of Japanese-like nonwords that carried three pitch accent patterns. For RQ1, we asked if there were differences in listeners from two tonal L1s. In both Mandarin and Cantonese, pitch movement is vital in word candidate selection, but they differ in tone inventory (four vs. six tones, respectively) and, by extension, presumably functional load of tone as well. Our assumption was based on corpus studies of tonal distributions that show that languages with greater numbers of tones tend to have larger inventories of tonal minimal pairs (Oh et al. 2013; Shibata and Shibata 1990). We predicted these differences might advantage Cantonese speakers in their discrimination of Japanese pitch patterns. Contrary to our prediction, we neither found a difference in accuracy between these two groups in their perception of the target pitch patterns nor an effect of accent type (including the LHL pattern, absent in both languages) or its interaction with group. Both groups discriminated the nonword stimuli with high accuracy (95 % CI for Cantonese: [0.90, 0.95], Mandarin: [0.90, 0.94]), which aligns with previous research showing a general perceptual advantage in pitch perception by tonal language speakers (Chang et al. 2017; Choi and Chiu 2023).

However, this finding differs from a prior study that compared tonal language groups on their behavioral pitch discrimination (Hu et al. 2020). Specifically, speakers of the 9-tone Sinitic language Dong outperformed 4-tone Mandarin speakers on two pitch discrimination measures, which the researchers attributed to differences in the complexity of tonal shapes between the two languages. We did not replicate in our comparison of Cantonese and Mandarin speakers the finding that differences in complexity or functional load between two tonal systems transfer to higher accuracy by the more-complex tonal group. In other words, comparatively small differences in the tonal systems (i.e., tone inventory and/or functional load) between the two languages did not manifest as perceptual differences between the groups, at least to the extent detectable in the ABX discrimination task.

We note that in Hu et al. (2020), the stimuli were “pitch-evoking” tones based on pure and harmonic tones, and thus differed from the comparatively more language-like, naturally produced nonword stimuli used in the current study. In contrast, the high mean accuracy by tonal L1 speakers in the current study (>90 % correct) suggests that the discrimination task may not have been sensitive enough to detect between-group differences in how listeners processed the stimuli. An additional consideration, given that the Cantonese listeners were in their 20s (as were all participants), is the full and partial merger of some tones in perception and production among younger Cantonese speakers (Fung and Lee 2019). For example, the falling tones (2 and 5) have merged in perception and production for many Cantonese speakers. This reduction in complexity of the tonal system may limit the number of tonal cues available for transfer, resulting in the negligible difference we found between Mandarin and Cantonese speakers. A more rigorous analysis of the cross-linguistic differences in functional load carried by tone would strengthen future perception research.

For RQ2, we filled a gap in the literature by testing if L2 classroom input in a nontonal language transferred to heightened pitch perception relative to other L1 English speakers. We prefaced this assumption with the fact that German is rich in intonational shapes (Vennemann 1990), despite pitch having low informativeness at the word level. Prior experimental evidence on German speakers’ transfer of L1 utterance-level intonation cues to the learning of Mandarin tonal contrasts led us to extrapolate that learning L2 German might convey a similar advantage on Japanese pitch cue discrimination (Braun et al. 2014). However, we found that the beginner-level L2 German learners performed the same as L1 English monolinguals (95 % CI for L1 English: [0.76, 0.79], L1 English-L2 German: [0.77, 0.81]) as did the beginner L2 Japanese learners (95 % CI [0.83, 0.86]). By comparison, beginner L1 English-L2 Mandarin learners performed significantly better on the task (95 % CI [0.85, 0.88]). This difference, however, was driven by the HLL words only; LHH and LHL behavior did not differ. It appears that relative to a non-learning monolingual group, only the group learning an additional tonal language improved low-level pitch perception as measured by the ABX discrimination task. The previous finding on classroom L2 input appears to stand (Wiener and Goss 2018). That is, the L2 must be tonal for pitch discrimination ability to develop beyond that of monolingual English speakers. But importantly, this development by L2 tonal language learners was limited to the HLL (falling) pitch pattern, suggesting that the effect was driven by experience with the acoustically similar Mandarin tone 4 (So and Best 2014; Wiener and Goss 2018). We cannot determine from the current design whether L1 English-L2 Mandarin learners’ greater discrimination accuracy was driven by the functional load of pitch cues (i.e., the greater number of minimal tone pairs L2 Mandarin learners acquire in comparison to other L2s tested) or the greater pedagogical focus on tone in the L2 Mandarin classroom – relative to L2 Japanese – required to learn the ways a given syllable can contrast by tone (e.g., Liu et al. 2011). Yet transfer does appear to occur from a tonal L2 to an unfamiliar L3, across the “functional pitch domains” of the syllable in Mandarin and the mora in Japanese (Schaefer and Darcy 2014).

Lastly, for RQ3 we asked if L2/3 Japanese proficiency gains led to increased discrimination accuracy. We compared L2/3 Japanese learners from L1 English and Mandarin backgrounds at the beginner and advanced levels of Japanese proficiency. Considering the comparatively low functional load of pitch in word selection in Japanese, we predicted that proficiency would not play a role in discrimination accuracy in the L1 English group. In addition to the expected higher overall accuracy by L1 Mandarin speakers, we found that higher Japanese proficiency only benefitted tonal L1 Mandarin listeners’ Japanese pitch perception. Specifically, the advanced L1 Mandarin group outperformed the L1 English-L2 Japanese beginner group and advanced group. This was consistent across all three pitch accent patterns indicating a strong perceptual advantage irrespective of the pitch accent type and L1 English-L2 Japanese learners’ proficiency. It appears that L1 Mandarin speakers not only had an advantage over L1 English speakers in the early stages of L2 Japanese learning, but that higher proficiency only benefitted the L1 Mandarin speakers. From the vantage of the L1 English listeners, they did not close the gap in accuracy after approximately three years of classroom learning. In addition, the L1 Mandarin-L3 Japanese advanced group outperformed the L1 Mandarin-L3 Japanese beginner group, suggesting an L3 Japanese proficiency advantage for Mandarin speakers. This difference between the two L3 Japanese learner groups was found to be an overall additive effect not driven by any one pitch accent type, which reflects a true proficiency account because listeners’ accuracy gains were aggregate.

Given the reported differences in rate of acquisition of Japanese when compared with L1 Mandarin speakers (Tamaoka 2015), the time scale needed for L1 English speakers to improve pitch perception could just be longer. We note that none of the Japanese learners in this study received targeted training on pitch accent. Learners reported their exposure to pitch accent as consisting of a general introduction to the pitch accent system by their instructor, accents notated in word lists in their introductory-level textbook, and occasionally having production errors corrected in the classroom. However, studies involving laboratory training, such as high variability phonetic training (HVPT), indicate that non-tonal L1 listeners can improve their pitch perception ability (e.g., Naito 2023; Shport 2016a; Wiener et al. 2025). This suggests that L1 English speakers can change how they weight pitch cues in Japanese under targeted training that varies by talker over multiple acoustic dimensions (e.g., Holt and Lotto 2006).

Before concluding, we remind the reader of the main limitation of this study. To what degree these “low-level” discrimination abilities transfer to “higher-level” processing, such as word and sentence-level prosody, remains an open question. In fact, limited research suggests these abilities do not transfer. For example, Hwang et al. (2022) used a visual search task to measure L1 English and L1 tonal listeners behavior while listening to English contrastive prosody. The authors showed that the two groups performed similarly even under conditions that enforced a heavy reliance on pitch contours. Although it is plausible to assume that L1 Mandarin/Sinitic language speakers would continue to benefit from L1 tonal knowledge in their L2/3 acquisition of word and sentence-level prosody (see Tamaoka (2015) for an overview of L1 Mandarin speakers’ acquisition of Japanese), we caution against generalizing the current findings very far beyond what they say about discrimination behavior of the phonetic cues to Japanese lexical accent when they are “disembodied” from lexical information. We also assumed that L2 and L3 acquisition share the same underlying mechanisms, and thus that speaking Japanese as an L2 versus speaking it as an L3 would influence the perception of pitch cues in a similar way. More work is needed to clarify the status of L2 as compared to L3 knowledge of the target language in prosodic perception.

To conclude, the main finding of this study revealed that speaking a tonal L1 or having learned a tonal language as an L2 conveys a perceptual advantage to the discrimination of Japanese pitch accent patterns. However, differences in perception between two groups of tonal speakers (Mandarin and Cantonese) were not detected on the discrimination task we used. In addition, Japanese proficiency, as indexed primarily by length of study, only increased perceptual accuracy for L1 tonal speakers. While the study did not address lexical perception, it provides more evidence for the advantageous position of tonal language knowledge in the prelexical processing of pitch in an L2 or L3.


Corresponding author: Seth Goss, Department of Russian and East Asian Languages and Cultures, Emory University, Modern Language Building, 532 Kilgo Circle, Atlanta, GA 30322, USA, E-mail:

Acknowledgments

We thank the guest editor and two anonymous reviewers for their helpful feedback on earlier versions of the manuscript. We also thank Katsuo Tamaoka, Yumi Inoue, and Aya Nakanishi McDaniel for their assistance with participant recruitment in Japan, Hong Kong, and the US, respectively.

  1. Research funding: This research was supported in part by the University Research Committee of Emory University.

Appendix: Nonword stimuli for the ABX task

makana

bamana

natana

tazana

yasana

sehena

regena

tesena

nemena

kerena

References

Ayusawa, Takako. 2003. Gaikokuzin gakusyūsya no nihongo akusento-intonēsyon gakusyū [Acquisition of Japanese accent and intonation by foreign learners]. Journal of the Phonetic Society of Japan 7(2). 47–58.Suche in Google Scholar

Bates, Douglas, Martin Mächler, Ben Bolker & Steve Walker. 2015. Fitting linear mixed-effects models using lme4. Journal of Statistical Software 67(1). 1–48.10.18637/jss.v067.i01Suche in Google Scholar

Beckman, Mary E. & Janet B. Pierrehumbert. 1986. Intonational structure in Japanese and English. Phonology 3. 255–309. https://doi.org/10.1017/s095267570000066x.Suche in Google Scholar

Best, Catherine T. 1995. A direct realist view of cross-language speech perception. In Winifred Strange (ed.), Speech perception and linguistic experience: Issues in cross-language research, 167–200. York, UK: York Press.Suche in Google Scholar

Best, Catherine T. & Michael D. Tyler. 2007. Nonnative and second-language speech perception. In Murrary J. Munro & Ocke-Schwen Bohn (eds.), Second language speech learning: The role of language experience in speech and production, 13–34. Amsterdam: John Benjamins.10.1075/lllt.17.07besSuche in Google Scholar

Boersma, Paul & David Weenink. 2020. Praat: Doing phonetics by computer [computer software]. Version 6.1.16. Retrieved from http://www.praat.org/.Suche in Google Scholar

Braun, Bettina, Tobias Galts & Baris Kabak. 2014. Lexical encoding of L2 tones: The role of L1 stress, pitch accent and intonation. Second Language Research 30(3). 323–350. https://doi.org/10.1177/0267658313510926.Suche in Google Scholar

Brown, Bianca, Botagoz Tusmagambet, Valentino Rahming, Chun-Ying Tu, Michael B. DeSalvo & Seth Wiener. 2023. Searching for the “native” speaker: A preregistered conceptual replication and extension of Reid, Trofimovich, and O’Brien. 2019. Applied PsychoLinguistics 44(4). 475–494. https://doi.org/10.1017/s0142716423000127.Suche in Google Scholar

Chan, Ming-Kei. 2008. The perception and production of lexical stress by Cantonese speakers of English. Hong Kong: The University of Hong Kong dissertation.Suche in Google Scholar

Chan, Ricky K. & Janny H. Leung. 2020. Why are lexical tones difficult to learn? Insights from the incidental learning of tone-segment connections. Studies in Second Language Acquisition 42(1). 33–59. https://doi.org/10.1017/s0272263119000482.Suche in Google Scholar

Chandrasekaran, Bharath, Padma D. Sampath & Patrick C. M. Wong. 2010. Individual variability in cue-weighting and lexical tone learning. Journal of the Acoustical Society of America 128(1). 456–465. https://doi.org/10.1121/1.3445785.Suche in Google Scholar

Chang, Charles B. 2018. Perceptual attention as the locus of transfer to nonnative speech perception. Journal of Phonetics 68. 85–102. https://doi.org/10.1016/j.wocn.2018.03.003.Suche in Google Scholar

Chang, Yung-Hsiang S., Yao Yao & Becky H. Huang. 2017. Effects of linguistic experience on the perception of high-variability non-native tones. Journal of the Acoustical Society of America 141(2). EL120–EL126. https://doi.org/10.1121/1.4976037.Suche in Google Scholar

Chen, Juqiang, Mark Antoniou & Catherine T. Best. 2023. Phonological and phonetic contributions to perception of non-native lexical tones by tone language listeners: Effects of memory load and stimulus variability. Journal of Phonetics 96. 101199. https://doi.org/10.1016/j.wocn.2022.101199.Suche in Google Scholar

Choi, William. 2021. Cantonese advantage on English stress perception: Constraints and neural underpinnings. Neuropsychologia 158. 107888. https://doi.org/10.1016/j.neuropsychologia.2021.107888.Suche in Google Scholar

Choi, William & Ming Ming Chiu. 2023. Why aren’t all Cantonese tones equally confusing to English listeners? Language and Speech 66(4). 870–895. https://doi.org/10.1177/00238309221139789.Suche in Google Scholar

Choi, William, Xiuli Tong & Arthur G. Samuel. 2019. Better than native: Tone language experience enhances English lexical stress discrimination in Cantonese-English bilingual listeners. Cognition 189. 188–192. https://doi.org/10.1016/j.cognition.2019.04.004.Suche in Google Scholar

Cooper, Angela & Yue Wang. 2012. The influence of linguistic and musical experience on Cantonese word learning. Journal of the Acoustical Society of America 131(6). 4756–4769. https://doi.org/10.1121/1.4714355.Suche in Google Scholar

Cutler, Anne. 2012. Native listening: Language experience and the recognition of spoken words. Cambridge, MA: MIT Press.10.7551/mitpress/9012.001.0001Suche in Google Scholar

Cutler, Anne & Takashi Otake. 1999. Pitch accent in spoken-word recognition in Japanese. Journal of the Acoustical Society of America 105. 1877–1888. https://doi.org/10.1121/1.426724.Suche in Google Scholar

De Angelis, Gessica. 2007. Third or additional language acquisition, vol. 24. Clevedon: Multilingual Matters.10.21832/9781847690050Suche in Google Scholar

Ellis, Nick C. 2002. Frequency effects in language processing: A review with implications for theories of implicit and explicit language acquisition. Studies in Second Language Acquisition 24(2). 143–188. https://doi.org/10.1017/s0272263102002024.Suche in Google Scholar

Flege, James E. 1995. Second language speech learning: Theory, findings, and problems. In Winifred Strange (ed.), Speech perception and linguistic experience: Issues in cross-language research, 233–277. Timonium, MD: York Press.Suche in Google Scholar

Flege, James E., Katsura Aoyama & Ocke-Schwen Bohn. 2021. The revised speech learning model (SLM-r) applied. In Ratree Wayland (ed.), Second language speech learning: Theoretical and empirical progress, 84–118. Cambridge, UK: Cambridge University Press.10.1017/9781108886901.003Suche in Google Scholar

Flynn, Suzanne, Claire Foley & Vinnitskaya Inna. 2004. The cumulative-enhancement model for language acquisition: Comparing adults’ and children’s patterns of development in first, second and third language acquisition of relative clauses. International Journal of Multilingualism 1(1). 3–16. https://doi.org/10.1080/14790710408668175.Suche in Google Scholar

Francis, Alexander L., Valter Ciocca, Lian Ma & Kimberly Fenn. 2008. Perceptual learning of Cantonese lexical tones by tone and non-tone language speakers. Journal of Phonetics 36(2). 268–294. https://doi.org/10.1016/j.wocn.2007.06.005.Suche in Google Scholar

Fung, Roxana & Chris Lee. 2019. Tone mergers in Hong Kong Cantonese: An asymmetry of production and perception. Journal of the Acoustical Society of America 146(5). EL424–EL430. https://doi.org/10.1121/1.5133661.Suche in Google Scholar

Gandour, Jack. 1983. Tone perception in Far Eastern languages. Journal of Phonetics 11(2). 149–175. https://doi.org/10.1016/s0095-4470(19)30813-7.Suche in Google Scholar

Hirano-Cook, Erika. 2011. Japanese pitch accent acquisition by learners of Japanese: Effects of training on Japanese accent instruction, perception, and production. Lawrence, KS: University of Kansas dissertation.Suche in Google Scholar

Holt, Lori L. & Andrew J. Lotto. 2006. Cue weighting in auditory categorization: Implications for first and second language acquisition. Journal of the Acoustical Society of America 119(5). 3059–3071. https://doi.org/10.1121/1.2188377.Suche in Google Scholar

Hu, Axu, Menghuan Wang, Yonghong Li, Qiaojuan Tang & Feng Gu. 2020. Dong speakers outperform Mandarin speakers in behavioral pitch discrimination. Journal of the Acoustical Society of America 147(1). EL62–EL65. https://doi.org/10.1121/10.0000604.Suche in Google Scholar

Hwang, Jiwon, Chikako Takahashi, Hyunah Baek, Alex Yeung & Broselow Ellen. 2022. Do L1 tone language speakers enjoy a perceptual advantage in processing English contrastive prosody? Bilingualism: Language and Cognition 25(5). 816–826. https://doi.org/10.1017/s1366728921001176.Suche in Google Scholar

Jongman, Allard & Travis Wade. 2007. Acoustic variability and perceptual learning. In Murray J. Munro & Ocke-Schwen Bohn (eds.), Second language speech learning: The role of language experience in speech and production, 135–150. Amsterdam: John Benjamins.10.1075/lllt.17.14jonSuche in Google Scholar

Kaan, Edith, Christopher M. Barkley, Mingzhen Bao & Ratree Wayland. 2008. Thai lexical tone perception in native speakers of Thai, English and Mandarin Chinese: An event-related potentials training study. BMC Neuroscience 9. 1–17. https://doi.org/10.1186/1471-2202-9-53.Suche in Google Scholar

Kember, Heather, A.-K. Grohe, Katharina Zahner, Bettina Braun, Andrea Weber & Anne Cutler. 2017. Similar prosodic structure perceived differently in German and English. Proceedings of interspeech 2017, 1388–1392.10.21437/Interspeech.2017-544Suche in Google Scholar

Laméris, Tim J. & Calbert Graham. 2020. L2 perception and production of Japanese lexical pitch: A suprasegmental similarity account. Journal of Monolingual and Bilingual Speech 2(1). 106–136.10.1558/jmbs.14948Suche in Google Scholar

Laméris, Tim J., Miquel Llompart & Brechtje Post. 2023. Non-native tone categorization and word learning across a spectrum of L1 tonal statuses. Bilingualism: Language and Cognition 27(4). 729–743.10.1017/S1366728923000871Suche in Google Scholar

Lenth, Russell V. 2021. Estimated marginal means, aka least-squares means [R Package Emmeans Version 1.6. 0]. Comprehensive R Archive Network (CRAN).Suche in Google Scholar

Liu, Ying, Min Wang, Charles A. Perfetti, Brian Brubaker, Sumei Wu & Brian MacWhinney. 2011. Learning a tonal language by attending to the tone: An in vivo experiment. Language Learning 61(4). 1119–1141. https://doi.org/10.1111/j.1467-9922.2011.00673.x.Suche in Google Scholar

Lutman, Mark. 2004. Frequency discrimination test [computer software]. Retrieved from https://resource.isvr.soton.ac.uk/audiology/Software/FD_test.htm.Suche in Google Scholar

Mandell, Joshua. 2018. Tonometric [computer software]. Retrieved from https://tonometric.com.Suche in Google Scholar

Mennen, Ineke. 2015. Beyond segments: Towards a L2 intonation learning theory. In Elisabeth Delais-Roussarie, Mathieu Avanzi & Sophie Herment (eds.), Prosody and language in contact, 171–189. Berlin: Springer.10.1007/978-3-662-45168-7_9Suche in Google Scholar

Naito, Yuka. 2023. Does musical training influence perceptual learning of Japanese pitch accent? The case of native Italian speakers. In Georgios P. Georgiou, Aretousa Giannakou & Christine Savvidou (eds.), Advances in second/foreign language acquisition, 1–18. Cham, Switzerland: Palgrave Macmillan.10.1007/978-3-031-38522-3_1Suche in Google Scholar

Oh, Yoon Mi, Francois Pellegrino, Christophe Coupé & Egidio Marsico. 2013. Cross-language comparison of functional load for vowels, consonants, and tones. Proceedings of interspeech 2013, 3032–3036.10.21437/Interspeech.2013-662Suche in Google Scholar

Qin, Zhen, Yu-Fu Chien & Annie Tremblay. 2017. Processing of word-level stress by Mandarin-speaking second language learners of English. Applied Psycholinguistics 38(3). 541–570. https://doi.org/10.1017/s0142716416000321.Suche in Google Scholar

Qin, Zhen & Allard Jongman. 2016. Does second language experience modulate perception of tones in a third language? Language and Speech 59(3). 318–338. https://doi.org/10.1177/0023830915590191.Suche in Google Scholar

Qin, Zhen, Sang-Im Lee-Kim & Haifeng Qi. 2024. The effect of second-language learning experience on Korean listeners’ use of pitch cues in the perception of Cantonese tones. Second Language Research [Advance online publication]. https://doi.org/10.1177/02676583241244604.Suche in Google Scholar

Schaefer, Vance & Isabelle Darcy. 2014. Lexical function of pitch in the first language shapes cross-linguistic perception of Thai tones. Laboratory Phonology 5(4). 489–522. https://doi.org/10.1515/lp-2014-0016.Suche in Google Scholar

Shibata, Takeshi & Ritei Shibata. 1990. Akusento wa dō’ongo o dono teido benbetu siuruka: Nihongo, eigo tyūgokugo no ba’ai [Is word-accent significant in differentiating homonyms in Japanese, English, and Chinese?]. Keiryō Kokugogaku [Mathematical Linguistics] 17. 317–323.Suche in Google Scholar

Shport, Irina A. 2008. Acquisition of Japanese pitch accent by American learners. In Patrick Heinrich & Yuko Sugita (eds.), Japanese as a foreign language in the age of globalization, 165–187. Munich, Germany: Iudicium Publishing.Suche in Google Scholar

Shport, Irina A. 2015. Perception of acoustic cues to Tokyo Japanese pitch-accent contrasts in native Japanese and native English listeners. Journal of the Acoustical Society of America 138. 307–318. https://doi.org/10.1121/1.4922468.Suche in Google Scholar

Shport, Irina A. 2016a. Training English listeners to identify pitch-accent patterns in Tokyo Japanese. Studies in Second Language Acquisition 38(4). 739–769. https://doi.org/10.1017/s027226311500039x.Suche in Google Scholar

Shport, Irina A. 2016b. Perceptual assimilation and discrimination of falling, level, and rising lexical tones by native English speakers. Proceedings of speech prosody 2016, 533–537.10.21437/SpeechProsody.2016-109Suche in Google Scholar

So, Connie K. & Catherine T. Best. 2010. Cross-language perception of non-native tonal contrasts: Effects of native phonological and phonetic influences. Language and Speech 53(2). 273–293. https://doi.org/10.1177/0023830909357156.Suche in Google Scholar

So, Connie K. & Catherine T. Best. 2014. Phonetic influences on English and French listeners’ assimilation of Mandarin tones to native prosodic categories. Studies in Second Language Acquisition 36(2). 195–221. https://doi.org/10.1017/s0272263114000047.Suche in Google Scholar

Surendran, Dinoj & Partha Niyogi. 2006. Quantifying the functional load of phonemic oppositions, distinctive features, and suprasegmentals. In Ole N. Thomsen (ed.), Competing models of linguistic change: Evolution and beyond, 43–58. Amsterdam: John Benjamins.10.1075/cilt.279.05surSuche in Google Scholar

Tamaoka, Katsuo. 2015. Processing of the Japanese language by native Chinese speakers. In Mineharu Nakayama (ed.), Handbook of Japanese psycholinguistics, 583–632. Berlin: De Gruyter Mouton.10.1515/9781614511212-025Suche in Google Scholar

Thomson, Ron I. & Tracey M. Derwing. 2015. The effectiveness of L2 pronunciation instruction: A narrative review. Applied Linguistics 36(3). 326–344. https://doi.org/10.1093/applin/amu076.Suche in Google Scholar

Tong, Xiuli, Stephen M. Lee, Meng M. Lee & Denis Burnham. 2015. A tale of two features: Perception of Cantonese lexical tone and English lexical stress in Cantonese–English bilinguals. PLoS One 10. e0142896. https://doi.org/10.1371/journal.pone.0142896.Suche in Google Scholar

Vennemann, Theo. 1990. Syllable structure and simplex accent in modern standard German. In Michael Ziolkowski, Manuela Noske & Karen Deaton (eds.), Papers from the regional meeting of the Chicago Linguistic Society 26: Volume II: The parasession, 399–412. Chicago, IL: Chicago Linguistic Society.Suche in Google Scholar

Wang, Yue, Michelle M. Spence, Allard Jongman & Joan A. Sereno. 1999. Training American listeners to perceive Mandarin tones. Journal of the Acoustical Society of America 106(6). 3649–3658. https://doi.org/10.1121/1.428217.Suche in Google Scholar

Wayland, Ratree P. & Susan G. Guion. 2004. Training English and Chinese listeners to perceive Thai tones: A preliminary report. Language Learning 54. 681–712. https://doi.org/10.1111/j.1467-9922.2004.00283.x.Suche in Google Scholar

Wiener, Seth & Seth Goss. 2018. Perceptual assimilation of non-native prosodic cues: Cross-linguistic effects of lexical F0 learning. Proceedings of speech prosody 2018, 947–951.10.21437/SpeechProsody.2018-191Suche in Google Scholar

Wiener, Seth & Seth Goss. 2019. Second and third language learners’ sensitivity to Japanese pitch accent is additive: An information-based model of pitch perception. Studies in Second Language Acquisition 41(4). 897–910. https://doi.org/10.1017/s0272263119000068.Suche in Google Scholar

Wiener, Seth, Timothy K. Murphy & Lori L. Holt. 2025. Incidental nonspeech auditory learning scaffolds phonetic, category, and word learning in a foreign language classroom. Language Learning 75(4). 1091–1121.10.1111/lang.12700Suche in Google Scholar

Wong, Patrick C. & Tyler K. Perrachione. 2007. Learning pitch patterns in lexical identification by native English-speaking adults. Applied Psycholinguistics 28(4). 565–585. https://doi.org/10.1017/s0142716407070312.Suche in Google Scholar

Wu, Xianghua, Saya Kawase & Yue Wang. 2017. Effects of acoustic and linguistic experience on Japanese pitch accent processing. Bilingualism: Language and Cognition 20. 931–946. https://doi.org/10.1017/s1366728916000559.Suche in Google Scholar

Wu, Xianghua, J. Munro Murray & Yue Wang. 2014. Tone assimilation by Mandarin and Thai listeners with and without L2 experience. Journal of Phonetics 46. 86–100. https://doi.org/10.1016/j.wocn.2014.06.005.Suche in Google Scholar

Wu, Xianghua, Jung-Yueh Tu & Yue Wang. 2012. Native and nonnative processing of Japanese pitch accent. Applied Psycholinguistics 33. 623–641. https://doi.org/10.1017/s0142716411000506.Suche in Google Scholar

Yip, Moira Jean Winsland. 2002. Tone. Cambridge, UK: Cambridge University Press.Suche in Google Scholar

Zhu, Jiaqiang, Xiaoxiang Chen, Fei Chen, Caicai Zhang, Jing Shao & Seth Wiener. 2023. Distributional learning of musical pitch despite tone deafness in individuals with congenital amusia. Journal of the Acoustical Society of America 153(5). 3117–3129. https://doi.org/10.1121/10.0019472.Suche in Google Scholar

Received: 2024-11-04
Accepted: 2025-05-29
Published Online: 2026-04-13
Published in Print: 2026-05-26

© 2026 the author(s), published by De Gruyter, Berlin/Boston

This work is licensed under the Creative Commons Attribution 4.0 International License.

Heruntergeladen am 11.5.2026 von https://www.degruyterbrill.com/document/doi/10.1515/jjl-2026-2003/html?lang=de
Button zum nach oben scrollen