Abstract
This study adopts a system perspective to explore how inventory, process, and constraint function to shape the complexity of syllable onsets in Southern Min. Only 15 onsets are phonemic, but they present several typologically uncommon features and are rich in allophonic variations. About 57 variants are identified that are triggered by only three distinct features from subsequent segments, and involve processes of dentalization, palatalization, labialization, lenition, lamination, glottalization, nasalization, and prenasalization. The usage frequency of onsets and their combinability with other syllable components are severely constrained, leading to more than 71% of theoretically possible syllables failing to be attested. The nasality feature from nuclei, labial feature within syllables, complexity of final types, diachronic requirements for assigning tones with respect to syllable onsets and codas, and the nature of onsets can all evoke constraints. This exploration has significant implications for our knowledge of segmental phonetics, segmental phonology, and segmental phonotactics in this language. It is hoped that this study will contribute to the understanding of the importance of inventory, process, and constraint in the phonological research of specific languages, and shed light on adopting the perspective of system to investigate other phonological categories and stretch our cognition of natural languages.
1 Introduction
Phonology concerns the sound system of a language. It deals with inventories within a system that engage with how many contrastive consonants, vowels, and suprasegments (tones, stresses etc.) are used to make lexical distinctions (e.g. Bickford and Floyd 2006; Gimson 1980; Ladefoged and Disner 2012; Ladefoged and Maddieson 1996; Maddieson 2013a; Maddieson 2013b; Szigetvári 2010; Yule 2010). It cares about processes within a system that engage with how the system is dynamic in real-world utterances, such as how assimilatory adjustments of phonemes are made to adapt to the nature of surrounding surrounds, and how reductions may occur in a non-prominent position (cf. Anderson 1978; Cohn 2007; Gurevich 2011; Hyman 1975; Kochetov 2011; Ohala 1993; Pike 1947). It also addresses constraints within a system that engage with how the distribution of phonemes, their relative frequency, and interrelationships with other elements are constrained (e.g. Algeo 1975; Celata and Calderone 2015; Clements 1988; Duanmu 2003; Kager 1999; Kirby and Yu 2007; Pearce 2007; Zec 1995; Zhang 2006). As indicated, inventories, processes, and constraints are essential portions of a system that form the core of phonology. A sound knowledge of the phonology of specific languages or language families is expected to provide a clear understanding of how inventories, processes, and constraints function as parts of a system. However, in conventional models of phonological descriptions of individual languages, attention is predominantly given to encoding inventories and less time is spent on processes and constraints. This is noted by Maddieson (2023) that, ‘information on inventories is more uniformly present in language descriptions… whether they come in the form of a chapter or so in a descriptive grammar, as a dedicated paper or book on the phonology of language X or the introductory pages to a bilingual dictionary.’ Likewise, in a considerable number of phonological databases, such as PBase (Mielke 2008), UCLA Phonological Segment Inventory Database (UPSID, Maddieson and Precoda 1990), Lyon-Albuquerque Phonological Systems Database (LapSyd, Maddieson et al. 2013), the Database of Eurasian Phonological Inventories (EURPhon, Nikolaev 2018), PHOIBLE 2.0 (Moran and McCloy 2019), SegBo (Grossman et al. 2020), BDPROTO 1.1 (Moran et al. 2021), and Phonotacticon 1.0 (Joo and Hsu 2025), they exclusively document inventories for a large and diverse sample of languages, and rarely archive processes and constraints. A similar situation can also be seen in studies of phonological typology, in which the focus is dominantly on the areal and genealogical distribution of inventories. As can be seen, our understanding and research efforts in phonology tend to be imbalanced. Information on processes and constraints is relatively limited.
This study is grounded in the perspective of system to undertake an in-depth exploration of inventories, processes, and constraints of speech sounds in languages. It will specifically focus on the phonological category of consonants that function as syllable onsets, typically before nuclei (cf. Blevins 1995; Davis 1982; Fudge 1969; Selkirk 1982). Cross-linguistically, onsets can be characterized using diverse dimensions, from the manner of articulation, the place of articulation, laterality, nasality, phonation, aspiration, airstream mechanism, to vocal fold vibration (e.g. Bickford and Floyd 2006; Ladefoged and Disner 2012; Ladefoged and Maddieson 1996; Maddieson 2013a, 2013b; Szigetvári 2010; Yule 2010). For example, depending on where the airstream is initiated and in which direction it flows within a vocal tract, onsets can be divided into pulmonic egressive, glottalic egressive (ejective), glottalic ingressive (implosive), and velaric ingressive (click) (Bickford and Floyd 2006). Likewise, depending on whether the vocal folds vibrate over production, they can be classified as either voiced or voiceless. Languages vary in the consonants that can occur as onsets. For example, consonant clusters are permitted in English, as in the words strong, spring, and splash (Yule 2010), but are banned in Sinitic and Fijian languages (Zec 1995). In most conventional studies, the descriptions are restricted to providing a list of onset phonemes and showing how they are distinguished. They pay less systematic attention to exploring how the continuous motion of human vocal apparatus induces diverse surface outputs of onset phonemes, the phenomenon of which can be captured as allophonic variations (e.g. Bickford and Floyd 2006; Ladefoged and Disner 2012; Ladefoged and Maddieson 1996; Maddieson 2013a, 2013b; Szigetvári 2010; Yule 2010) and that belongs to the concern of processes within the perspective of system. They also pay less systematic attention to examining whether onset phonemes can randomly combine with other syllable components to generate linguistic entities, such as syllables and morphemes, for the purpose of communication, a phenomenon that can be captured as phonotactics (e.g. Algeo 1975; Celata and Calderone 2015; Clements 1988; Duanmu 2003; Kirby and Yu 2007; Pearce 2007; Zec 1995; Zhang 2006), which belongs to the concern of constraints from the perspective of the system.
This study aims to uncover how inventories, processes, and constraints work to encode onset consonants as a system and to manifest their complexity and dynamics in a specific language. The exploration is based on the Sinitic language of Southern Min, commonly referred to as Hokkien or Min Nan, which has significant implications for expanding our phonological knowledge. Diachronically, this language is asserted to have split off from mainstream Sinitic languages in the transition period between Western and Eastern Han (roughly 50 BCE–50 CE) and has been regarded as a living fossil by historical linguists to construct the proto-language of Old Chinese around 1200 BC (cf. Baxter and Sagart 2014; Matisoff 1973; Michaud and Sands 2020; Norman 1991). For example, no labial-dental fricative onset phoneme is postulated in Old Chinese, and this can be justified in Southern Min because no words are attested beginning with onsets such as /f/ and /v/. On the contrary, such onsets are widely reported in other Sinitic languages, such as the words /fa55/ ‘deliver’, and /fo35/ ‘buddha’ in Mandarin; /vu3/ ‘father’ and /fu1/ ‘husband’ in Shanghainese (Chen and Gussenhoven 2015), and /fa1/ ‘flower’, /fat6/ ‘method’ in Cantonese (Bauer and Benedict 1997). Synchronically, this language encodes syllable onsets in a highly complex, dynamic, and constrained manner. It has an inventory of only 15 onset phonemes, which is smaller than that of a vast variety of languages worldwide. However, it has several typologically uncommon characteristics. For example, the aspiration and voicing features are distinct in formulating a three-way contrast among supraglottal stops (/p, t, k/ vs. /pʰ, tʰ, kʰ/ vs. /ɓ, ɗ, ɠ/); the ingressive airstream mechanism is phonemic in constructing implosive onsets of three places of articulation (/ɓ, ɗ, ɠ/). No nasal consonants are phonemic at the onset position while nasal vowels, syllabic nasals and nasal codas are extensively used by native speakers. Moreover, the processes of allophonic variations are extraordinarily dynamic among onset consonants. A single onset phoneme can surface several variants, and over 50 variants can be derived at the output. For example, the glottal stop /ʔ/ has six variants: [ʔ], [ʔj], [ʔw], [mʔ], [nʔ], and [ŋʔ]. Three implosive onsets /ɓ, ɗ, ɠ/ can surface eleven variants: [ɓ], [β], [m], [ɗ̪], [lʷ], [n], [ɗ], [ɠʲ], [ɣʷ], [ŋ], and [ɠ]. The processes of variation are triggered by only three distinctive features (palatal, nasality, and labial) from subsequent sounds, but they can induce a variety of changes on target onsets, which include altering airstream mechanism, manner of articulation, active articulator, location of primary constriction, and a secondary articulation. In addition, the usage frequency of individual onsets and their combinability with other syllable components (finals and tones) are severely constrained, leading to more than 71% of syllables, which are theoretically possible to occur, failing to be attested in the field. The constraints are dominantly invoked by the nasality feature of nuclei, the co-occurrence restriction of the labial feature, the complexity of internal structures, the diachronic restrictions of assigning tones with respect to syllable onsets and codas, and the nature of onset phonemes. For example, voiced onsets are banned from occurring before syllabic nasals, and labial onsets are not allowed to co-occur with codas of the same labial feature. As illustrated above, the system of syllable onsets in Southern Min is far more complex than expected, but it provides an important basis to stretch our knowledge of how inventories, processes, and constraints work to shape a complex system in specific languages.
This study incorporates field linguistics, phonetic and phonological theories, and Sinitic dialectology to conduct an in-depth and comprehensive exploration of the onset system in Southern Min. It addresses three main research questions that correspond to three main aspects (inventories, processes, and constraints) of a system.
How many onset phonemes are contrastive, and what is their typological significance?
How do onset phonemes change their forms to react to the featural properties of subsequent segments, and how can their allophonic processes be interpreted using rules within the paradigm of generative phonology?
How can the usage frequency of onset phonemes and their combinability with other syllable components be constrained to generate the linguistic entities of monosyllabic morphemes, and what factors may trigger phonotactic constraints?
This research is data-driven, but each research task will particularly focus on its phonological and typological background and significance. In the inventory section, both typologically common and uncommon characteristics of onset phonemes in this language will be discussed. In the section of processes, attention will focus on exploring how varying triggers from vocalic segments impact the surface forms of onset phonemes, and how onset phonemes react differently to their occurring environments. It will examine all possible processes and discuss the mechanisms behind the asymmetrical reactions of onsets and the asymmetrical distributions of allophonic processes. In the section of phonotactics, it will build upon an exhaustive examination of all possible combinations of onsets, finals (a term that is created in Sinitic convention to refer to all syllable components except onset), and tones. It will focus on how the usage frequency of onset phonemes is constrained at the syllable level and what factors constrain the onset-final and onset-tonal combinations over the generation of syllables for communication. It is hoped that this in-depth examination of Southern Min onsets will contribute to an understanding of the importance of inventories, processes, and constraints in the phonological research of natural languages, shed light on adopting the perspective of system to investigate phonological categories other than consonants, and expand our knowledge of languages.
2 Southern Min and Zhangzhou speech
Southern Min (ISO 639-3 [nan]) originates from two cities of Quanzhou and Zhangzhou in Fujian Province along the southeast coast of China; however, it is transnational in nature. This is because, as early as the 6th century, the Southern Min ethnic group has been migrating to a wide range of regions in the world, particularly in Southeast Asia, such as Singapore, Malaysia, Indonesia, Myanmar, the Philippines, and Thailand (cf. Ding 2016; Jones 2009; Kwok 2019; Sew 2020). The language of Southern Min, known as Hokkien for the colloquial pronunciation of its birthplace of Fujian, had long served as a lingua franca among overseas Chinese populations in history. Thus, Southern Min, in a broader sense, should incorporate the varieties spoken outside Fujian in China. This study is based on the speech sounds of Zhangzhou, one of the two birthplaces of Southern Min. Zhangzhou is a prefecture-level city situated in the southern seaward part of Fujian Province in China, with a registered population of approximately 5.05 million and an area of approximately 12,600 square kilometers in the 2020 census (Huang 2021). The colloquial language in Zhangzhou is Southern Min, which is mutually intelligible with other Southern Min varieties in Quanzhou, Xiamen, Taiwan, and overseas, but is uncommunicable with other Sinitic languages such as Hakka, Wu, Cantonese, and Jin. However, Mandarin has replaced Southern Min as the mother tongue among younger generations because of its dominant status as the only official language in China.
The Zhangzhou dialect is a variety of Southern Min. The City of Zhangzhou governs eleven administrative areas, and certain regional variations can be observed in their sound systems. For example, Yangru tone (Tone 7 in this study), which is diachronically related to syllables with voiced onsets and obstruent codas, has been transcribed as a short high contour [4] in Longhai District, a mid-level contour [33] in Changtai District, a low-rising contour [13] in Dongshan and Zhao’an Counties, and as a convex contour of either [121] or [131] elsewhere (Yang 2008). Because this study is not designed to examine the socio-phonetic variation of onsets across different regions in this city, the research locality is strictly restricted to the urban area of Xiangcheng and Longwen Districts, which is conventionally considered to be historically, socially, culturally, and linguistically representative of Zhangzhou and has received the most attention in the literature (e.g. FJG 1998; Gao 1999; Huang 2018, 2019, 2021, 2023a, 2023b, 2024; Ma 1994; Schlegel 1886; Xie 1818; Yang 2008; ZZG 1999). The data were mainly from two sources: (a) field data collected by the author in Xiangcheng and Longwen in 2015 and 2025, and (b) rhyme tables that Huang (2021) compiled to tabulate all possible combinations of syllable onsets, finals, and tones in the generation of monosyllabic morphemes.
Lexical morphemes in this language are dominantly monosyllabic, such as /tĩ35/ ‘sweet’, /tŋ̩22/ ‘long’, /ke35/ ‘chicken’, and /ɗɔ33/ ‘road’, while multisyllabic morphemes are also used by communities, such as /ɗɐj32.tsi35/ ‘litchi’, /ɗiŋ22.kɐn/ ‘longan’, /pi33.pɛ22/ ‘loquat’, and /ɓɐ̃35.ɗiŋ33.tsi22/ ‘potato’. Syllables can be generalized as having a C(G)V(X) structure in which the onset consonant (C) and nucleus (V) are compulsory, whereas the prevocalic glide (G) and coda (X) are optional (Huang 2021). An inventory of 15 onsets, 2 prevocalic glides, 13 nuclei, and 8 codas can be posited as phonemically distinctive, as summarized in Table 1. Segments that can function as nuclei are diverse, including oral vowels, nasal vowels, and syllabic nasals. This reflects the distinctive function of the nasality feature in the vocalic system, which is typologically rare in spoken languages. Likewise, segments that can serve as codas are diverse, including glides, nasal consonants, and obstruent stops. Consonantal clusters are banned from occurring in both onset and coda positions.
Segmental inventory of Zhangzhou Southern Min.
| Component | Phoneme | |
|---|---|---|
| C | onset | p, pʰ, ɓ, t, tʰ, ɗ, k, kʰ, ɠ, ts, tsʰ, s, z, ħ, ʔ |
| G | glide | j, w |
| V | nucleus | i, e, ɛ, ɐ, ɔ, ɵ, u, ĩ, ɛ̃, ɐ̃, ɔ̃, m̩ ŋ̩ |
| X | coda | j, w, m, n, ŋ, p, t, k |
As a typical Sinitic language, tones in Zhangzhou are contrastive to deliver lexical meanings. All conventional documents posited seven tones but with inconsistent transcriptions in pitch (FJG 1998; Gao 1999; Guo 2014; Lin 1992a; Lin 1992b; Lin 1992c; Ma 1994; Medhurst 1832; Schlegel 1886; Xie 1818; Yang 2008; Zhou 2006; ZZG 1999). Huang (2018) proposed a system of eight tones based on a multidimensional investigation of tonal properties in different contexts. The eighth tone emerges from those syllables that are conventionally transcribed with a glottal stop coda in Yangru tone (Tone 7 in this study), a Middle Chinese (MC) tonal category, but the glottal stop coda is found to have undergone deletion, rendering its related syllables open and have different tonal manifestations. Table 2 summarizes the eight tones in this language, along with their names in the MC tonal category, to make them diachronically traceable and synchronically comparable with other Sinitic languages. This tonal inventory serves as a foundation for exploring how onset-tone alignments are constrained at the syllable level. It should be noted that in this language, tones that share a similar F0 contour can differ considerably in other parameters (Huang 2018). For example, Tone 4 (/ti41/ ‘drop’) and Tone 6 (/tit41/ ‘bamboo’) both show a mid-high falling contour in citation, but tone 6 is shorter, and its related high vowels are diphthongized. Likewise, Tone 2 (/pɛ22/ ‘crawl’) and Tone 8 (/pɛ22/ ‘white’) both present a low-level contour in citation, but in a non-rightmost (sandhi) position, Tone 2 shows a mid-level contour (/pɛ33.ħiŋ22/ ‘crawl forward’) while Tone 8 (/pɛ32.sik41/ ‘white color’) shows a mid-falling contour.
Examples of Zhangzhou tones.
| Tone | Pitch citation | Duration citation | Pitch sandhi | Example 1 | Example 2 | |
|---|---|---|---|---|---|---|
| 1 | Yinping | [35] | extra-long | [33] | /tɐŋ35/ ‘east’ | /kɔ35/ ‘mushroom’ |
| 2 | Yangping | [22] | extra-long | [33] | /tɐŋ22/ ‘copper’ | /kɔ22/ ‘glue’ |
| 3 | Shang | [51] | medium | [35] | /tɐŋ51/ ‘to wait’ | /kɔ51/ ‘drum’ |
| 4 | Yinqu | [41] | medium | [63] | /tɐŋ41/ ‘frozen’ | /kɔ41/ ‘look after’ |
| 5 | Yangqu | [33] | extra-long | [32] | /tɐŋ33/ ‘heavy’ | /ħɔ33/ ‘rain’ |
| 6 | Yinru | [41] | short | [65] | /tɐp41/ ‘answer’ | /kɔk41/ ‘country’ |
| 7 | Yangru | [221] | long | [32] | /tsɐp221/ ‘ten’ | /tɔk221/ ‘poison’ |
| 8 | Yangru | [22] | extra-long | [32] | /tsi22/ ‘tongue’ | /kɔ̃22/ ‘snore’ |
3 Inventory of onset phonemes in Zhangzhou
3.1 The size of onset inventory
In this language, only 15 onsets are phonemically contrastive to make lexical distinctions. They are summarized in Table 3 and illustrated in Table 4 with (near-) minimal pairs. The size of this onset inventory is smaller than that of most languages and/or dialects in China and many others worldwide. For example, among a sample of 70 Chinese dialects (Zee and Zee 2014), the number of onset phonemes ranges from 14 in Putian of Middle Min and Ningde of Northern Min to 29 in Xinhua of the Xiang language. Thus, it seems that only a few languages in China have a smaller onset inventory than this language. Likewise, in a cross-linguistic survey of 317 languages worldwide (Maddieson 1984), the East Papuan language Rotokas has the lowest number of 6, whereas the Khoisan language! Xũ has the greatest number of 46, which is primarily due to the large number of clicks and laryngeal contrasts exploited by both click and non-click consonants (Gordon 2016; Maddieson 1984). As can be predicted, the number of 15 onset consonants in Zhangzhou is supposed to be lower than many of the world’s languages. The reason why the onset inventory is small in this language is unclear, but it might be ascribed to a compensatory relationship between the number of vowels and onset consonants. For example, Zee and Zee (2016) report that among 70 dialects in China, the number of vowels ranges from a low of 3 in Yongding of the Hakka language to a high of 13 in Ningbo of the Wu language. Maddieson (1984) reports that the vast majority of languages have between five and seven vowels (64.7 %), with the modal number being five (30.9 %). In Zhangzhou, a total of 11 vowels (7 oral vowels and 4 nasal vowels) is predictable to be larger than in most of the world’s languages, and this might potentially reduce the number of consonants to function as onset phonemes in this language.
Onset system of Zhangzhou Southern Min.
| Manner | Bilabial | Alveolar | Velar | Pharyngeal | Glottal | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Plosive | p | pʰ | ɓ | t | tʰ | ɗ | k | kʰ | ɠ | ʔ | |
| Fricative | s | z | ħ | ||||||||
| Affricate | ts | tsʰ | |||||||||
Examples of Zhangzhou onset phonemes.
| Onset | Example 1 | Example 2 | Example 3 |
|---|---|---|---|
| /p/ | /pi35/ ‘sad’ | /pɐŋ35/ ‘help’ | /pu33/ ‘hatch’ |
| /pʰ/ | /pʰi35/ ‘drape’ | /pʰɐŋ35/ ‘fragrant’ | /pʰu33/ ‘boil and spill out’ |
| /ɓ/ | /ɓi35/ ‘squint’ | /ɓɐŋ35 / ‘do not’ | /ɓu33/ ‘frog’ |
| /t/ | /ti35/ ‘pig’ | /tɐŋ35/ ‘east’ | /tu22/ ‘kitchen’ |
| /tʰ/ | /tʰi35/ ‘sticky’ | /tʰɐŋ35/ ‘window’ | /tʰu41/ ‘to shovel’ |
| /ɗ/ | /ɗi41/ ‘tear’ | /ɗɐŋ35/ ‘barracuda’ | /ɗu33/ ‘slip’ |
| /k/ | /ki35/ ‘base’ | /kɐŋ35/ ‘river’ | /ku33/ ‘old; worn’ |
| /kʰ/ | /kʰi35/ ‘deceive’ | /kʰɐŋ35/ ‘empty’ | /kʰu33/ ‘mortar’ |
| /ɠ/ | /ɠi35/ ‘childish’ | /ɠɐŋ22/ ‘raise’ | /ɠu33/ ‘giggle (infant)’ |
| /ts/ | /tsi35/ ‘grease’ | /tsɐŋ35/ ‘brown’ | /tsu33/ ‘self; from’ |
| /tsʰ/ | /tsʰi35/ ‘idiotic’ | /tsʰɐŋ35/ ‘spring onion’ | /tsʰu33/ ‘slippery’ |
| /s/ | /si35/ ‘poetry’ | /sɐŋ35/ ‘relax’ | /su33/ ‘thing; issue’ |
| /z/ | /zi35/ ‘money’ | /zin22/ ‘people’ | /zu33/ ‘affluent’ |
| /ħ/ | /ħi35/ ‘weak’ | /ħɐŋ35/ ‘dry’ | /ħu33/ ‘father’ |
| /ʔ/ | /ʔi35/ ‘he/she’ | /ʔɐŋ35/ ‘husband’ | /ʔu33/ ‘have’ |
It should also be noted that conventional work all posited 15 onset phonemes for this language (FJG 1998; Gao 1999; Guo 2014; Lin 1992a, 1992b, 1992c; Ma 1994; Medhurst 1832; Schlegel 1886; Xie 1818; Yang 2008; Zhou 2006; ZZG 1999), with four voiced onsets transcribed differently from this study. They all documented a voiced alveolar affricate /dz/, but this onset was found to be a fricative /z/ in the empirical data across speakers. This can be seen in Figure 1, which shows the spectrograms of morphemes /zi33/ ‘character’ and /zu22/ ‘such as.’ No voiced bars or burst releases were identified before frication for either morpheme. This indicates that no oral constriction is created during their production. In other words, they are not articulated with an affricate manner of articulation, as transcribed as /dz/ in previous work. Instead, they are produced with a fricative manner of articulation, and this onset should thus be posited as /z/ to reflect the reality in the field.

Spectrograms of /zi33/ ‘character’ (left) and /zu22/ ‘such as’ (right).
Prior work documented voiced onsets /b/, /l/, /g/ in their segmental inventory (e.g., FJG 1998; Gao 1999; Guo 2014; Lin 1992a; Lin 1992b; Lin 1992c; Ma 1994; Medhurst 1832; Schlegel 1886; Xie 1818; Yang 2008; Zhou 2006; ZZG 1999); however, these onsets are separately produced as implosives /ɓ/, /ɗ/, /ɠ/ in unmarked environments by native speakers regardless of genders and ages, and have several allophonic variants which will be discussed in the next section. Their implosive articulation can be demonstrated in the spectrograms. As shown in Figure 2, a voiced bar can be clearly identified before the formant patterns across all morphemes. This signifies a voiced manner of articulation and indicates that over the production of related onsets, a complete oral constriction is formed at a certain point in the speaker’s oral cavity, depending on their place of articulation, and the vocal folds vibrate in a regular fashion. In addition, the amplitudes of the air pressure fluctuations in the waveforms were maintained without significant change over the entire portion of the onsets, rather than dropping to zero before the release of oral constriction. This indicates a special ingressive glottalic airstream for implosive sounds. It occurs because speakers block off their glottis during articulation but continually suck in airflow from outside into their mouth until they achieve a maximum amount of air pressure, after which they open their glottis and release the constriction to let the air pressure from both lungs and mouth flow out of the vocal tract (Clements and Osu 2002; Cun 2010; Ladefoged and Maddieson 1996; Mori 2023). As such, it is empirically justifiable to propose implosives /ɓ, ɗ, ɠ/ as onset phonemes that falsify prior transcriptions but advance our knowledge of this language.

Spectrograms and waveforms of Zhangzhou implosives.
3.2 Typologically common and uncommon characteristics
The onset system of this language can be characterized in several ways. It shows several universals in phonemic distribution but also presents several typologically uncommon features that contribute to our understanding of human language diversity. These onsets present a five-way contrast in terms of the place of articulation, which includes bilabial /p, pʰ, ɓ/, alveolar /t, tʰ, ɗ, s, z, ts, tsʰ/, velar /k, kʰ, ɠ/, pharyngeal /ħ/, and glottal /ʔ/. Seven of the 15 onsets were alveolars, accounting for 47 % of the total onsets. This indicates the high contribution of the front of the tongue and the alveolar place in speech production in this language. This reflects a universal tendency; for example, 97.5 % of the 317 languages in Maddieson ’s (1984) survey have alveolar sounds, followed by velars in 89.3 % and bilabials in 82.9 %. Likewise, these onsets present a three-way contrast in terms of the manner of articulation, including plosive /p, pʰ, ɓ, t, tʰ, ɗ, k, kʰ, ɠ, ʔ/, fricative /s, z, ħ/, and affricate /ts, tsʰ/. It can be seen that plosive sounds occupy as many as 67 % of the total onsets, implying that complete constriction is the dominant manner of articulation in speech distinction in this language. It also reflects the universality that languages most commonly contrast unaffricated plosives, particularly at three places of articulation (labial, alveolar, and velar) (e.g. Gordon 2016; Maddieson 1984; Nikolaev 2022).
Supraglottal plosives can be further classified into three subcategories in terms of aspiration and voicing: voiceless unaspirated /p, t, k/, voiceless aspirated /pʰ, tʰ, kʰ/, and voiced /ɓ, ɗ, ɠ/. This is typologically uncommon. For example, according to Maddieson’s (1984) survey of 317 languages, less than 24.0 % of languages present a three-way laryngeal contrast on unaffricated oral stops, while over 51.1 % of languages present a two-way contrast between voiceless and voiced stops (Gordon 2016). In addition, this language is typologically uncommon because it employs a special ingressive glottalic airstream to construct implosive onsets of three places of articulation. This is because only 10 % (Ladefoged and Maddieson 1996) to 20 % (Clements and Osu 2002; Cun 2010) of the world’s languages are reported to have an implosive articulation at an underlying level. The three-way contrast among supraglottal plosives can be demonstrated by the acoustic parameter of voice onset time (VOT), which is the time between the release of an oral constriction for plosive production and the onset of vocal fold vibration (Abramson and Whalen 2017). This is shown in Figure 3. The VOT value is slightly above zero across voiceless unaspirated plosives /p, t, k/, reflecting that speakers vibrate their vocal folds immediately after the release of oral constriction. The VOT value is steeply positive from 0.067 to 0.083 s across voiceless aspirated plosives /pʰ, tʰ, kʰ/, reflecting that after an oral constriction is released, there is a period of articulation for aspiration, which could induce a delay in vocal fold vibration. In contrast, the VOT value is steeply negative between −0.077 and −0.064 s for voiced sounds /ɓ, ɗ, ɠ/, reflecting that speakers vibrate their vocal folds before releasing an oral constriction.

VOT of Zhangzhou occlusive onsets.
An additional typologically uncommon feature of this onset system is that nasal consonants are not phonemic in this onset position. This is cross-linguistically unusual. In Maddieson (1984)’s survey, only seven of 317 languages (2.2 %) lacked phonemic nasal consonants. In Zee and Zee (2014)’s survey of 70 Chinese dialects, only one dialect lacked a labial nasal onset, 12 dialects lacked a velar nasal onset, and 14 dialects lacked an alveolar nasal onset. This language is among the very few to lack any of nasal onsets. It should be noted that while nasal onsets are not attested, the nasality feature is extensively used to construct nasal vowels, syllabic nasals, and nasal codas in this language. For example, four of seven oral vowels can be nasalized and used phonemically, such as /ti35/ ‘pig’ versus /tĩ35/ ‘sweet’, /kɛ35/ ‘home’ versus /kɛ̃35/ ‘net’, /kɐ/35/ ‘glue’ versus /kɐ̃35/ ‘prison’, /kɔ22/ ‘past’ versus /kɔ̃22/ ‘snore’. The bilabial syllabic nasal /m̩/ can carry five different pitch contours to differentiate five different words, such as /ʔm̩24/ ‘drink’, /ʔm̩22/ ‘flower bud’, /ʔm̩51/ ‘aunt’, /ʔm̩41/ ‘affirmative’, and /ʔm̩33/ ‘negative’. The velar syllabic nasal /ŋ̩/ can combine with twelve out of fifteen onsets to create monosyllabic morphemes, such as /pŋ̩51/ ‘list of names’, /pʰŋ̩51/ ‘ebb tide’, /tŋ̩35/ ‘wait’, /tʰŋ̩41/ ‘hot’, /ɗŋ̩22/ ‘small taro’, /sŋ̩51/ ‘play’, /tsŋ̩35/ ‘village’, /tsʰŋ̩22/ ‘bed’, /kŋ̩41/ ‘steel’, /kʰŋ̩41/ ‘hind’, /ħŋ̩35/ ‘prescription’, and /ʔŋ̩35/ ‘centre’. In the meanwhile, nasal codas of a three-way contrast in the place of articulation can be identified, such as /sim35/ ‘heart’, /sin35/ ‘new’, and /siŋ35/ ‘rise’.
Such an asymmetrical distribution of the nasality feature is also typologically rare enough. Because it is common for languages to possess nasal onsets only and lack nasal vowels and syllabic nasals in their phonemic inventory. In contrary, it is uncommon for languages to possess nasal vowels, syllabic nasals and nasal codas and do not have any nasal onset at the underlying level. However, this does not mean that the nasality feature does not perform any function on syllable onsets in this language, instead, it can affect their phonetic realizations and induce diverse outputs at the surface level. This will be discussed in detail in next section.
3.3 Distinctive features
As introduced above, this language possesses an onset system smaller than many of languages in the world. It presents several universals, such as the preference of the alveolar sounds and plosive manner of articulation, reflecting cross-linguistic tendencies in the construction of onset consonants in natural languages. It reveals several typologically unusual characteristics, such as the three-way contrast of supraglottal stops, the phonemic usage of implosive sounds, and the lack of nasal onsets, contributing to the typology of speech sounds and our understanding of the diversity of human languages. However, in spoken languages, speech sounds do not always act independently, instead, multiple sounds often participate in the same sound patterns (Mielke 2008). In other words, a group of sounds in an inventory may behave similarly because of specific properties, rather than because of the sounds themselves (Kennedy 2016; Mielke 2008). To capture the way that speech sounds function as a system, the concept of natural class is proposed in literature to refer to a group of sounds within a language that may either trigger or undergo the same phonological process, and the concept of distinctive features is proposed to group them into natural classes (cf. Arthur 1983; Chomsky and Halle 1968; Clements 2003; Hyman 1975; Keating 1991; Kennedy 2016; Lisker and Abbramson 1971; McCarthy 1988; Mielke 2008). Distinctive features have been widely assumed to be part of Universal Grammar since the mid-twentieth century and have been considered as the building blocks of phonological patterns (Kennedy 2016; Mielke 2008). They decode components of individual phonemes that help distinguish them from others, but also express phonemes formally as groups because they share one or more features.
In this language, a set of ten distinctive features are posited, as presented in Table 5. They cover three major categories, including features of the place of articulation ([labial], [coronal], [dorsal], [pharyngeal]), features of the manner of articulation ([voice], [spread glottis], [constricted glottis]), and laryngeal features ([continuous], [sibilant/strident], [delay release]). The binary features are used to specify onset phonemes, with a positive value [+] denoting the presence of a specific feature, and a negative value [−] indicating the absence of a feature.
Distinctive features for Zhangzhou syllable onsets.
| Class | Feature | p | pʰ | ɓ | t | tʰ | ɗ | k | kʰ | ɠ | ts | tsʰ | s | z | ħ | ʔ |
| Place | labial | + | + | + | − | − | − | − | − | − | − | − | − | − | − | − |
| coronal | − | − | − | + | + | + | − | − | − | + | + | + | + | − | − | |
| dorsal | − | − | − | − | − | − | + | + | + | − | − | − | − | − | − | |
| pharyngeal | − | − | − | − | − | − | − | − | − | − | − | − | − | + | − | |
| glottal | − | − | − | − | − | − | − | − | − | − | − | − | − | + | ||
| Laryngeal | voice | − | − | + | − | − | + | − | − | + | − | − | − | + | − | − |
| spread glottis | − | + | − | − | + | − | − | + | − | − | + | − | − | − | − | |
| constricted glottis | − | − | + | − | − | + | − | − | + | − | − | − | − | − | − | |
| Manner | continuant | − | − | − | − | − | − | − | − | − | − | − | + | + | + | − |
| sibilant/strident | − | − | − | − | − | − | − | − | − | + | + | + | + | − | − | |
| delay release | − | − | − | − | − | − | − | − | − | + | + | − | − | − | − |
As indicated in the table, these ten distinctive features form a geometry to characterize the components of individual onset phonemes in this language. For example, the bilabial implosive /ɓ/ can be denoted as [+labial, +voice, +continuant]. They can also effectively group a set of onsets as a natural class because of sharing one or more distinctive features. For example, the onsets /ɓ, ɗ, ɠ, z/ form a natural class because of the common distinctive feature [+voicing]. This feature geometry is important to serve as a base to explore how onset phonemes change their quality at the surface level; what specific features trigger their alternations, and how their alternations can be expressed using rules within the paradigm of generative phonology.
4 Allophonic variation of Zhangzhou onsets
Phonology is the study of the organization of sounds in human languages. Each language has a fixed set of sounds (known as phonemes) that are contrastive and combinable into longer sequences in patterned ways, to which meanings and function are in turn associated (Gordon 2016; Kennedy 2016; Mielke 2008). For example, in this language, fifteen onsets are tested to be phonemic. However, phonemes are symbols. They are part of the knowledge that represents the mental activity of speakers of a language and that exist within the lexicon of underlying forms in a language. In real-world utterances, phonemes may change their forms because of the nature of adjacent sounds, and the term phonological process is used to express the phenomenon in which the phonetic form of a phoneme changes based on context, and the term allophones are used to refer to different surface forms of a common phoneme if they never occur in equivalent environments. This reflects the manifestations of phonemes adapting to their environments (cf. Arthur 1983; Gordon 2016; Hyman 1975; Kennedy 2016; Mielke 2008). Likewise, the term alternation is used to refer to the phonological process in which a phoneme takes one form in one context, and a different form in some other context. It reflects the reaction of phonemes to the presence of adjacent or proximate segments by changing their forms. The phonological process and alternation are universals that are widely observed in the phonology of the world’s languages. Their existence may induce multiple consequences at the level of segments, such as alternate and constrain the featural properties of sounds (e.g. assimilation and dissimilation), change in the number of sounds (deletion, insertion), and even alternate in the ordering of adjacent or nearby sounds (metathesis) (Gordon 2016). As such, the goal of this phonological analysis of Southern Min onset system should not be restricted to uncover evidence of phonemic contrasts but should include uncovering evidence of allophonic variations and distributions of those contrastive onsets.
Zhangzhou has an onset inventory smaller than many other languages, but this does not mean that its onset system is simple. Phonological processes are incredibly active in this language. Individual onset phonemes can have several variants that are complementary to occur in different environments, for example, the three implosive onsets /ɓ/, /ɗ/, and /ɠ/ surface eleven variants that include [ɓ], [β], [m], [ɗ̪], [lʷ], [n], [ɗ], [ɠʲ], [ɣʷ], [ŋ], [ɠ] (Huang and Hyslop 2022). As a whole, over fifty variants can be derived from its fifteen onsets, which are summarized in Table 6. The symbol * indicates no empirical data are attested in the field. The allophonic alternations of onset phonemes do not randomly occur, instead, they are dominantly triggered by three features from their immediately subsequent sounds, which incorporate the palatal feature from the high vowel /i/ and palatal glide /j/, the labial feature from the round vowel /u/ and labial glide /w/, and the nasality feature from nasal vowels and syllabic nasals. In other words, the phonological processes of onset alternation are local, in which the target (onset consonant) and trigger (subsequent vocalic segment) are adjacent. Moreover, the phonological processes are diverse that can be captured as dentalization, palatalization, labialization, lenition, lamination, glottalization, and nasalization among others. The phenomenon of allophonic variation from a phonetic perspective reflects continuous motions of articulatory apparatus of human beings, leading phonemes to be manifested differently in sequencing. From a phonological perspective, it reflects a universal of assimilation, involving alternating featural properties of sounds because of the nature of adjacent sounds. This section will provide an in-depth exploration into the distributional constraints and alternations of onset phonemes in Zhangzhou Southern Min.
Allophonic variation of Zhangzhou onsets.
| Manner | Place | Onset | Allophonic Variant | ||||
|---|---|---|---|---|---|---|---|
| _/ [i; j] | _/ [u; w] | _/[Ṽ] | _/[N̩] | Elsewhere | |||
| Plosive | bilabial | /p/ | p | p | mp | mp | p |
| /pʰ/ | pʰ | pʰ | mpʰ | mpʰ | pʰ | ||
| /ɓ/ | ɓ | β | m | * | ɓ | ||
| alveolar | /t/ | t̪ | t̻ʷ | nt | nt | t | |
| /tʰ/ | t̪ʰ | t̻ʰʷ | ntʰ | ntʰ | tʰ | ||
| /ɗ/ | ɗ̪ | lʷ | n | * | ɗ | ||
| velar | /k/ | kʲ | kʷ | ŋk | ŋk | k | |
| /kʰ/ | kʰʲ | kʰʷ | ŋkʰ | ŋkʰ | kʰ | ||
| /ɠ/ | ɠʲ | ɣʷ | ŋ | * | ɠ | ||
| glottal | /ʔ/ | ʔʲ | ʔʷ | nʔ | mʔ, ŋʔ | ʔ | |
| Fricative | alveolar | /s/ | ɕ | ʃʷ | ns | ns | s |
| /z/ | ʑ | ʒʷ | * | * | * | ||
| pharyngeal | /ħ/ | ħʲ | hʷ | nɦ | mɦ, ŋɦ | ħ | |
| Affricate | alveolar | /ts/ | tɕ | tʃʷ | nts | nts | ts |
| /tsʰ/ | tɕʰ | tʃʰ | ntsʰ | ntsʰ | tsʰ | ||
4.1 Palatal-conditioned variation
The high front vowel /i/ and palatal glide /j/ are active to trigger allophonic alternation on syllable onsets of this language. The trigger can be further specified as the [+palatal] feature that is shared by these two front vocoids, and it is further observed affecting the surface of onset phonemes in three different ways, depending on their place of articulation at an underlying level. This is generalized in Table 7.
palatal feature-induced variation on Zhangzhou onsets.
| Process | Onset | Before [i] | Before [j] |
|---|---|---|---|
| (a) dentalization | /t/ | [t̪i35] ‘pig’ | [t̪jɐ41] ‘pluck’ |
| /th/ | [t̪ʰi51] ‘tear’ | [t̪ʰjɐ41] ‘crack’ | |
| /ɗ/ | [ɗ̪i33] ‘filter’ | [ɗ̪jɐ41] ‘pry open’ | |
| (b) alveolar-palatalization | /s/ | [ɕi22] ‘time’ | [ɕju22] ‘swim’ |
| /z/ | [ʑi22] ‘press’ | [ʑju22] ‘rub’ | |
| /ts/ | [tɕi22] ‘potato’ | [tɕjɵ22] ‘stone’ | |
| /tsh/ | [tɕhi22] ‘support’ | [tɕhjɵ22] ‘bamboo mat’ | |
| (c) palatalization | /k/ | [kʲi22] ‘flag’ | [kʲjɐ33] ‘steep’ |
| /kʰ/ | [kʰʲi22] ‘leech’ | [kʰʲjɐ33] ‘stand’ | |
| /ɠ/ | [ɠʲi22] ‘doubt’ | [ɠʲjɐ22] ‘carry’ | |
| /ħ/ | [ħji22] ‘fish’ | [ħjjɐ33] ‘tile’ | |
| /ʔ/ | [ʔji22] ‘mother’s sister’ | [ʔjjɐ33] ‘night’ |
(a) Alveolar plosives /t/, /tʰ/ and /ɗ/, regardless of their manners of articulation, are found to be dentalized and become [t̪], [t̪ʰ] and [ɗ̪], respectively, in this palatal environment. This is posited because over the articulation, native speakers do not raise their tongue to the alveolar ridge, but rather raising their tongue against the back of upper incisor. For example, the morpheme /ti35/ ‘pig’ is produced as [t̪i35] by native speakers, and the morpheme /tʰi51/ ‘tear’ is uttered as [t̪ʰi51] in real-world utterances. As can be seen, the alternation between /t, tʰ, ɗ/ and [t̪, t̪ʰ, ɗ̪] involves shifting the location of primary constriction from the alveolar ridge to the back of upper incisor. As such, this allophonic process can be expressed as dentalization, which can be characterized using the distinctive feature [-distributed] (Kochetov 2011). Likewise, the three alveolar plosives that participate in this dentalization process can be grouped as a natural class using the distinctive features [+alveolar; -strident]. Correspondingly, this palatal feature-induced dentalization can be formally expressed using Rule (1) as shown below.
| Dentalization of alveolar plosives |
| C [+alveolar; −strident] → C [−distributed]/_[+palatal] |
(b) Alveolar affricates /ts/ and /tsʰ/ and fricatives /s/ and /z/ are found to surface as alveolar-palatal sounds [tɕ], [tɕʰ], [ɕ], and [ʝ], respectively, in this palatal context. For example, the morpheme /tsi22/ ‘potato’ is pronounced [tɕi22], and the morpheme /zi/ ‘press’ is produced as [ʑi22] in utterances. This can be justified by speakers’ articulatory gesture. They are observed moving their tongue backward to the hard palate to create constriction, rather than maintaining their tongue around the alveolar ridge. As can be seen this alternation between /ts, tsʰ, s, z/ and [tɕ, tɕʰ, ɕ, ʝ] involves shifting the primary place of constriction from the alveolar ridge to the hard palate. As such, this process can be considered as alveolar palatalization and can be interpreted using the feature [-anterior]. The four alveolar affricates and fricatives can also be grouped as a natural class using the distinctive features [+alveolar; +continuous; +anterior]. Correspondingly, the whole alternation can be expressed using Rule (2), as shown below.
| Alveolar palatalization of alveolar affricates and fricatives |
| C [+alveolar; +continuous; +anterior] → C [−anterior]/_[+palatal] |
(c) Onset phonemes that have a place of articulation after the alveolar ridge, like /k/, /kʰ/, /ɠ/, /ħ/, and /ʔ/, are found to undergo palatalization and surface as [kj], [kʰj], [ɠj], [ħj], and [ʔj], respectively, before the high front vowel /i/, and palatal glide /j/. Because native speakers move their tongue forward to the hard palate, but do not arrive the hard palate. For example, the morpheme /ki22/ ‘flag’ is pronounced as [kʲi22], and the morpheme /ħi22/ ‘fish’ is uttered as [ħji22] ‘fish’ by native speakers. As can be seen, this alternation between /k, kʰ, ɠ, ħ, ʔ/ and [kj, kʰj, ɠj, ħj, ʔj] does not involve shifting the primary place of articulation of related onset consonants, as found in other processes on alveolar onsets, but rather, they acquire a secondary articulation of palatalization because of the [+palatal] feature from their subsequent sounds. As such, the alternation can be characterized as palatalization using the feature [+palatal], and the five phonemes can be grouped as a natural class using the features [−bilabial; −alveolar]. Corresponding, the whole alternation can be expressed using Rule (3), as shown below.
| Palatalization of non-labial and non-alveolar onsets |
| C [−bilabial; −alveolar] → C [+palatal]/_[+palatal] |
As discussed above, all onset phonemes, except labial plosives /p, pʰ, ɓ/, are easily affected by the palatal feature from their subsequent sounds and change their forms at the surface level. Labial onsets are exceptional to an alternation in this context, and this may be ascribed to their different articulator at lips, while the palatal-inducing process dominantly affects the tongue, as generalized by Gordon (2016), based on the 100-language WALS sample. In addition, it can be observed that, the alternations with alveolar onsets exclusively involve shifting the location of the primary constriction to either before the alveolar ridge as in dentalization for alveolar plosives /t, tʰ, ɗ / or after the alveolar ridge as in the alveolar palatalization for alveolar affricates and fricatives /ts, tsʰ, s, z/. In contrary, the alternations with onsets that are non-labial and non-alveolar exclusively involve superimpositions of a secondary gesture of palatalization. As such, it can be generalized that, the impact of the palatal feature from vocalic sounds on syllable onsets is asymmetrical in this language, and this is dominantly determined by the place of articulation of target onsets at the underlying level. There appears to be a tendency that alveolar sounds are more likely to be affected by the palatal nature of surrounding sounds and are more likely to shift their primary place of constriction because of the impact.
4.2 Labial-conditioned variation
The high back round vowel /u/ and the labial-velar glide /w/ appear to be the most active to trigger allophonic variations on onsets of Zhangzhou. The trigger can be further specified as [+labial], which is discovered impacting syllable onsets of this language in six different ways, depending on the nature of target onsets. This is generalized in Table 8.
labial feature-induced variation on Zhangzhou onsets.
| Process | Onset | Before [u] | Before [w] |
|---|---|---|---|
| (a) lenition, labialization | /ɓ/ | [βu33] ‘frost’ | [βwi35] ‘smile’ |
| /ɠ/ | [ɣʷu22] ‘cow’ | [ɣʷwɐ51] ‘I’ | |
| (b) lenition, laminalization, labialization | /ɗ/ | [lʷu51] ‘female’ | [lʷwɐ22] ‘spicy’ |
| (c) laminalization, labialization | /t/ | [t̻ʷu35] ‘pile, heap’ | [t̻ʷwɐ41] ‘belt’ |
| /tʰ/ | [t̻ʰʷu41] ‘to shovel’ | [t̻ʰʷwɐ35] ‘drag’ | |
| (d) alveolar-palatalization, labialization | /s/ | [ʃʷu22] ‘word’ | [ʃʷwɐ35] ‘sand’ |
| /z/ | [ʒʷu22] ‘such as’ | [ʒʷwɐ22] ‘hot’ | |
| /ts/ | [tʃʷu22] ‘merciful’ | [tʃʷwɐ51] ‘paper’ | |
| /tsʰ/ | [tʃʰʷu33] ‘slippery’ | [tʃʰʷwi41] ‘mouth’ | |
| (e) glottalization, labialization | /ħ/ | [hʷu22] ‘help’ | [hʷwɐ35] ‘flower’ |
| (f) labialization | /k/ | [kʷu35] ‘beetle’ | [kʷwɐ35] ‘song’ |
| /kʰ/ | [kʰʷu35] ‘human body’ | [kʰʷwɐ35] ‘boast’ | |
| /ʔ/ | [ʔʷu33] ‘have’ | [ʔʷwɐ35] ‘a branch of plant’ |
(a) All onset phonemes surface a labial feature before the vowel /u/ or the labial glide /w/. This is because native speakers are found rounding their lips over their articulation of any onset consonant in this labial environment, such as the morpheme /ku35/ ‘beetle’ is pronounced as [kʷu35], the morpheme /kʰwɐ35/ ‘boast’ is produced as [kʰʷwɐ35], and the morpheme /ʔu33/ ‘have’ is uttered as [ʔʷu33] by speakers. This phenomenon is articulatorily understandable and is phonologically predictable, because it is a natural process for onset consonants to acquire a secondary articulation of labialization due to the labial nature of their immediately subsequent sounds. As such, the alternation can be characterized as labialization and expressed using the Rule (4), as shown below. It should be noted that, the three labial plosives are already characteristic of the [+labial] feature themselves. It is thus redundant to impose a labialization effect on their surface form.
| Labialization of onset phonemes |
| C → C [+labial]/_[+labial] |
(b) The labial implosive /ɓ/ and the velar implosive /ɠ/ are found to separately become a labial voiced fricative [β] and a labialized velar voiced fricative [ɣʷ] in this labial context. Because no complete constriction can be observed from speakers’ articulatory gesture, and the airflow from their lungs can be continuously perceived over the production of related tokens. Such as the morpheme /ɓu33/ ‘frost’ is pronounced as [βu33], and the morpheme /ɠu22/ ‘cow’ is produced as [ɣʷu22] by speakers. As can be seen, the alternation between /ɓ, ɠ/ and voiced [β, ɣʷ] involves shifting the airstream mechanism from glottalic ingressive to pulmonic egressive and changing the manner of articulation from implosive to fricative. As such, this process can be considered as lenition, which is also known as spirantization or fricativization in literature (Gordon 2016; Kennedy 2016). In addition, the voiced velar onset acquires the labial feature as a secondary feature from its immediately subsequent sound. In other words, there are two different processes of lenition and labialization occurring on non-alveolar implosive onsets. As such, the lenition process can be characterized using the distinctive features [+continuous; +strident], and the labial and velar implosives /ɓ, ɠ/ can be grouped as a natural class using the features [−alveolar; −strident; +voiced], while the whole alternation can be formally expressed using Rule (5), shown below.
| Lenition and labialization of non-alveolar implosives |
| C [−alveolar; −strident; +voiced] |
| → C [+labial; +continuous; +strident]/_[+labial] |
(c) The alveolar implosive /ɗ/ is observed to surface as a labialized approximant [lʷ] before the labial segments /u/ or /w/. For example, the morpheme /ɗu51/ ‘female’ is produced as [lʷu5], and the morpheme /ɗwɐ22/ ‘spicy’ is uttered as [lʷwɐ22] by speakers. This occurs because speakers move the central part of their tongue, rather than the tongue tip, nearly approaching but not touching the alveolar ridge to form a constriction that is slightly wider than the constriction needed for fricative sounds. Meanwhile, the airflow can be continually perceived passing through the constriction, while speakers’ lips are rounded over the production. As can be seen, the alternation between /ɗ/ and [lʷ] involves shifting the airstream mechanism from glottalic ingressive to pulmonic egressive, altering the manner of articulation from plosive to approximant, changing the active articulator from the tongue tip to the central part of the tongue, and acquiring a secondary articulation of labialization from subsequent sounds. The allophonic alternation of this alveolar implosive in this labial context can thus be considered as a compounding effect of lenition, laminalization, and labialization, and can be formally expressed using Rule (6), as shown below.
| Lenition, laminalization, and labialization of alveolar implosive |
| /ɗ/→[lʷ]/_[+labial] |
(d) The voiceless alveolar plosives /t/ and /tʰ/ are observed to separately surface as [t̻ʷ] and [t̻ʰʷ] in this labial context. This is because native speakers raise up their tongue blade, rather than their tongue tip, to form an oral obstruction at the alveolar ridge, while their lips are rounded over the production. For example, the morpheme /tu35/ ‘pile, heap’ is pronounced as [t̻ʷu35], while the morpheme [tʰu41] ‘to shovel’ is produced as [t̻ʰʷu41] by speakers. As can be seen, the alternation between /t, tʰ/ and [t̻ʷ, t̻ʰʷ] mainly involves shifting the active articulator from the tongue tip to the tongue blade and acquiring a secondary articulation of labialization. The alternation can thus be considered as a compounding effect of laminalization and labialization, which can be characterized using the features [+laminal; +labial]. The two alveolar plosives /t/ and /tʰ/ can also be grouped as a natural class of the [+alveolar; −voiced; −continuous] features. Correspondingly, their allophonic alternation in this labial context can be formally expressed using Rule (7), as shown below.
| Laminalization and labialization of alveolar voiceless plosives |
| C [+alveolar; −voiced; −continuous] |
| → C [+labial; +laminal]/_[+labial] |
(e) The alveolar fricatives /s/ and /z/ and affricates /ts/ and /tsʰ/ are found to surface as palatal-alveolar sounds [ʃʷ], [ʒʷ], [tʃʷ], and [tʃʰʷ], respectively, in this labial setting. This is because speakers are observed to move their tongue body from the alveolar ridge towards the hard palate to form a constriction, while rounding their lips. For example, they produce the morpheme /su22/ ‘word’ as [ʃʷu22] and the morpheme /zu22/ ‘such as’ as [ʒʷu22]. As can be seen, the alternation between /s, z, ts, tsʰ/ and [ʃʷ, ʒʷ, tʃʷ, tʃʰʷ] involves shifting the location of primary constriction from the alveolar ridge to palatal-alveolar and acquiring a secondary articulation of labialization. The alternation can thus be considered as a compounding effect of alveolar-palatalization and labialization and can be interpreted as [−anterior; +labial]. The four alveolar onsets /s, z, ts, tsʰ / can be grouped as a natural class for sharing distinctive features [+alveolar; +distributed; +strident]. As such, their allophonic alternation can be expressed using Rule (8), as shown below.
| Alveolar-palatalization and labialization of alveolar non-plosive onsets |
| C [+alveolar; +distributed; +strident] |
| → C [−anterior]/_[+labial] |
(f) The pharyngeal fricative /ħ/ is found to surface as a labialized glottal fricative [hʷ] before the labial segment /u/ or /w/. This is because speakers reported they made a partial constriction at the glottis, rather than in the pharyngeal cavity, while rounding their lips over the production of related tokens. For example, they produce the morpheme /ħu22/ ‘help’ as [hʷu22], and the morpheme /ħwɐ35/ ‘flower’ as [hʷwɐ35]. As can be seen, the alternation between /ħ/ and [hʷ] involves shifting the location of primary constriction from the pharyngeal cavity to the glottis and acquiring a secondary articulation of labialization. The allophonic alternation of this onset can thus be considered as a compounding effect of glottalization and labialization, and can be expressed using Rule (9), as shown below.
| Glotalization and labialization of pharyngeal fricative |
| /ħ/→[hʷ]/_[+labial]. |
As discussed above, the allophonic alternations of syllable onsets in this language are much more complex in the labial context than in the palatal context. It can be observed that the labial-inducing impacts on onset phonemes are diverse, which include (a) changing the airstream mechanism, such as from the glottalic ingressive to pulmonic egressive as found in implosive onsets; (b) changing the manner of articulation, such as from implosive to fricative as found in /ɓ, ɠ/, or from implosive to approximant as found in /ɗ/; (c) shifting the active articulator, such as from the tongue tip to the central part of the tongue as found in /ɗ/, and from the tongue tip to the tongue blade as found in /t, tʰ/; (d) shifting the location of primary constriction, such as from alveolar ridge to palatal-alveolar as found in /s, z, ts, tsʰ/, and from the pharyngeal cavity to the glottis as found in /ħ/; and (e) acquiring a secondary articulation of labialization as found in all onsets except the labial plosives, because they have already had the labial feature on their own. Likewise, the types of allophonic processes are also diverse, which include labialization, lenition, laminalization, alveolar-palatalization, and glottalization. In addition, the allophonic processes mostly do not operate on their own, instead, they compound with other process (es) to impact the surface forms of target onsets. For example, the alternation between the alveolar implosive /ɗ/ and the labialized approximant [lʷ] involves lenition, laminalization, and labialization. In general, except the voiceless labial plosives, the two voiceless velar plosives /k, kʰ/ are the least affected in the labial context, because their alternation only involves acquiring a secondary articulation of labialization.
4.3 Nasality-conditioned variation
The nasal vowels /Ṽ/and syllabic nasals /N̩/ can also induce allophonic alternations on onset phonemes of this language. The trigger can be specified as the [+nasal] feature as shared by these two types of nuclei. The alternations mainly manifest in three different ways, depending on the nature of target onsets. This is generalized in Table 9, in which the symbol * indicates no empirical data were tested in the field.
Nasal feature-induced variation on Zhangzhou onsets.
| Process | Onset | Before [Ṽ] | Before [ N̩] |
|---|---|---|---|
| (a) nasalization | /ɓ/ | [mɛ̃33] ‘scold’ | * |
| /ɗ/ | [nɔ̃33] ‘two’ | * | |
| /ɠ/ | [ŋɛ̃33] ‘hard’ | * | |
| (b) glottalization, voicing, pre-nasalization | /ħ/ | [nɦɛ̃41] ‘rest’ | [mɦm̩41] ‘hit with a stick’ |
| [nɦɐ̃41] ‘scorching’ | [ŋɦŋ̩35] ‘recipe’ | ||
| (c) pre-nasalization | /p/ | [mpĩ33] ‘weave’ | [mpŋ̩51] ‘board, list’ |
| /pʰ/ | [mpʰĩ41] ‘slide’ | [mpʰŋ̩41] ‘high tide’ | |
| /t/ | [ntɛ̃33] ‘press’ | [n tŋ̩22] ‘long’ | |
| /tʰ/ | [ntʰɛ̃41] ‘lift up’ | [ntʰŋ̩22] ‘sugar’ | |
| /k/ | [ŋkɐ̃51] ‘dare’ | [ŋkŋ̩35] ‘vat’ | |
| /kʰ/ | [ŋkʰɛ̃35] ‘hollow’ | [ŋkʰŋ̩41] ‘hid’ | |
| /ts/ | [ntsɛ̃35] ‘compete’ | [ntsŋ̩35] ‘dress’ | |
| /tsʰ/ | [ntsʰɛ̃35] ‘star’ | [ntsʰŋ̩35] ‘warehouse’ | |
| /s/ | [nsɐ̃35] ‘three’ | [nsŋ̩35] ‘frost’ | |
| /z/ | * | * | |
| /ʔ/ | [nʔĩ35] ‘sleep’ | [mʔm22] ‘berry’ | |
| [nʔɛ̃35] ‘infant’ | [ŋʔŋ̩35] ‘central’ |
(a) Implosive onsets /ɓ/, /ɗ/, and /ɠ/ are observed to surface as their corresponding homorganic nasal plosives [m], [n], and [ŋ], respectively, in a nasal context. Because speakers are found to release the airflow through their nose, rather than through their mouth, when they produce these onsets before nasal vowels. For example, they pronounce the morpheme /ɓĩ33/ ‘noodle’ as [mĩ33], and the morpheme /ɗɔ̃33/ ‘two’ as [nɔ̃33]. It should be also noted that implosive onsets are not attested to occur before syllabic nasals in this language. As can be seen, the alternation between /ɓ, ɗ, ɠ/ and [m, n, ŋ] involves shifting the manner of articulation from oral to nasal. It thus can be considered as an effect of nasalization. The three implosive onsets can be grouped as a natural class, using the features [+voiced; -strident] and the nasalization process can be formally expressed using Rule (10).
| Nasalization of implosive onsets |
| C [+voiced; −strident] → [+nasal]/_[+nasal] |
(b) The pharyngeal fricative /ħ/ is observed to surface as a prenasalized breathy onset with three different forms that include [mɦ], [nɦ], [ŋɦ], depending on the nature of its subsequent nasal segments. This occurs because speakers reported they moved a constriction from the pharynx to the larynx where they continuously generated airflow from lungs and let them freely pass through their vocal tract. They also reported before the articulation of this onset, their velum had already been lowered as that the airflow could also rush out through their nose. For example, they produce the morpheme /ħɛ̃41/ ‘rest’ as [nɦɛ̃41], the morpheme /ħm̩41/ ‘beat’ as [mɦm41], and the morpheme /ɦŋ̩35/ ‘recipe’ as [ŋɦŋ̩35]. As can be seen, the alternation between the pharyngeal fricative /ħ/ to three forms of prenasalized breathy onset [mɦ, nɦ, ŋɦ] involves shifting the location of primary constriction from pharynx to larynx, creating voicing for the airflow, and acquiring an articulatory gesture of nasalization before onsets. The alternation can thus be considered as a compounding effect of glottalization, voicing, and pre-nasalization. The three surface forms differ in the place where a constriction is created in the oral cavity for the articulation of subsequent nasal sounds. Specifically, the form [mɦ] occurs before the labial syllabic nasal, the form [nɦ] occurs before nasal vowel, while the form [ŋɦ] occurs before the velar syllabic nasal. As such, the allophonic alternations of this onset can be formally expressed using Rule (11).
| Glottalization, voicing, pre-nasalization of pharyngal fricative |
| /ħ/ → N[ɦ]/_[+nasal] |
(c) All other onsets that are non-voiced and non-pharyngal are found to surface a nasal feature in this nasal context. This is because before the articulation of onset consonants, speakers are observed to lower their velum and get an articulatory gesture ready for a nasal production. For example, they produce the morpheme /tɛ̃33/ ‘press’ as [ntɛ̃33], the morpheme /pĩ 35/ ‘weave’ as [mpĩ33], and the morpheme /kɐ̃51/ ‘dare’ as [ŋkɐ̃51]. As can be seen, the alternation involves acquiring an additional articulatory gesture of nasalization before onset articulation and assimilating the place of nasal articulation to that of the target onset. For example, the alveolar plosive /t/ surfaces as [nt], the labial plosive /p/ surfaces as [mp], and the velar plosive /k/ surfaces as [ŋk]. As such, the alternation can be considered as a prenasalization as an impact of the nasal feature from subsequent nasal vowels and syllabic nasals. It can be expressed formally using Rule (12).
| Pre-nasalization of non-voiced and non-pharyngal onsets |
| C [−voiced; −pharyngal] → NC/_[+nasal] |
As discussed in this section, the allophonic alternations of syllable onsets are incredibly active in this language. Twelve different types of alternation processes are generalized from empirical data and are formally expressed using rules within the paradigm of generative phonology. They manifest varying impacts of featural properties of triggers from the vocalic segments, but also reflect varying reactions of onset phonemes in different phonological environments. In general, the labial-inducing alternations are much more complex and diverse that account for half of the total alternation types and involve changing the airstream mechanism, manner of articulation, active articulator, the location of primary constriction, and acquiring a secondary articulation. In most cases, the individual process compounds with other process(es) to impact the surface of onsets. The nasal-inducing alternations are less complex that involve changing the manner of articulation from oral to nasal, shifting the location of primary constriction, and generating pre-nasalization. However, the nasal-inducing alternations can cause certain onsets to have more than one surface form, for example, the pharyngal fricative onset /ħ/ has three surface forms [mɦ], [nɦ], and [ŋɦ], so does the glottal stop with three variants [mʔ], [nʔ], and [ŋʔ]. The palatal-inducing alternations seem the least complex that mainly involve shifting the location of primary constriction on alveolar onsets or acquiring a secondary articulation of palatalization on velar onsets, while labial onsets are not affected in the palatal context.
As for the onset phonemes, they suffer varying impacts from their immediately subsequent sounds. Alveolar and velar onsets, except the voiced fricative /z/, have four surface forms; the pharyngeal fricative /ħ/ and the glottal stop /ʔ/ both have as many as six variants at the output; the labial implosive /ɓ/ has three forms while the other two voiceless labial plosives only have two surface forms. This suggests that the voiceless labial stops are the least affected by their subsequent sounds, while onsets with a place of articulation outside the oral cavity are the most valuable to the featural properties of their subsequent sounds. In addition, voiced onsets are more likely to change the manner of articulation as an impact of their following segments, while voiceless onsets have different reactions. In general, voiceless alveolar onsets are more likely to shift the location of primary constriction, change an active articulator, and/or acquire an additional secondary feature, while voiceless velar onsets largely involve acquiring a secondary feature of articulation. Voiceless labial onsets are the least easy to be affected but if yes, they only involve acquire an additional secondary feature. As indicated, the featural impacts of vocalic segments appear to be most severe on alveolar sounds that involve the engagement of the front of the tongue in this language. This tends to be a cross-linguistic tendency, as Gordon (2016) observed among 100-language WALS sample. It may be ascribed to the fact that the front of tongue executes faster gestures than both dorsum and lips, as such it is more prone to be overlapped by surrounding sounds (Gordon 2016).
5 Phonotactic constraints of Zhangzhou onsets
Each language has an inventory of phonemes that undoubtedly forms an essential part of its phonology. Several phonological databases have been developed, as introduced in Section 1, to compile commensuration data on phoneme inventories of consonants and vowels and their phonological traits for a relatively large and diverse sample of languages, while some may also cover tonemic inventory, and possible forms of syllable onset, nucleus, and coda as shown in the Phonotacticon 1.0 (Joo and Hsu 2025) for 516 lects spoken in the Eurasian macro area. However, those established phonological databases and most descriptive phonological work of natural languages are limited in providing systematic information on encoding the frequency behaviours and distributional characteristics of individual phonemes, and the constraints on their occurrence and combinability with other phonemes within specific entities, although few quantitative research can be found in literature. For example, Hayes and Wilson (2008) advocate using a maximum entropy model, also known as the UCLA Phonotactic Learner, to characterize the grammatical knowledge that permits native speakers to make phototactically well-formed judgements. In this model, the well-formedness is interpreted as probability, and an infinite set Ω consisting of all universally possible Ω phonological surface forms are supposed to be stated in the chosen representative vocabulary. As a cross-linguistic phenomenon, constraints are supposed to occur on the onset system of Zhangzhou Southern Min. This is because phonemes of a given language are not equally active in the construction of linguistic entities, like syllables, morphemes and words. Likewise, phonemes of any language cannot randomly combine with each other to make lexical distinctions. As such, within a wider set of phonological patterns, there are constraints, known as phonotactics, governing their distribution, relative frequency, and inter-relationship with other phonemes within the same constituents (cf. Algeo 1975; Celata and Calderone 2015; Duanmu 2003; Gordon 2016; Joo and Hsu 2024; Kennedy 2016; Kirby and Yu 2007; Maddieson 2023; Pearce 2007; Zec 1995; Zhang 2006).
This section will systematically explore this issue by examining the usage frequency of individual syllable onsets in this language, and the phonotactic constraints on their co-occurrences with other syllable components, tones and finals in particular, in the formation of monosyllabic morphemes. It will fill the research gap in conducting an in-depth analysis of phonotactics in a specific language and will provide new insight and methodology to understand how phonemes are utilized for constructing linguistic entities, like syllables and morphemes; how phonemes are distributed in relation to one other; how phonemes can be combined with other phonemes in a system, and how the occurrence of phonemes may be constrained. Differing from Hayes and Wilson’s model, the phonotactic constraints in this study are not assigned with probabilities and are not calculated from their constraint violations and the weights, instead, their analyses are based on the gap between the theoretically assumed number and the exact number that is obtained in the real world. In addition, this study does not use marked constraints to assess forms for their phonotactic legality, instead, it exhaustively examines all possible combinations of individual onset phonemes and other syllable components, finals and tones in particular, to assess their capability to generate linguistic entities of syllables and monosyllabic morphemes that are empirically used in the speech community. It is hopped the descriptive analysis can stretch our knowledge of the phonological patterns of the onset system in this language and contribute to the understanding of phonotactics as an important phenomenon in human languages. It is also hopped that it can provide a well-established dataset to examine the phonotactic grammar of Southern Min onsets using other quantitative methods, such as Hayes and Wilson’s model.
5.1 Usage frequency of Zhangzhou onsets
Tracing back to the Eastern Han Dynasty (25–220 AD), Chinese scholars had devised the method of Fanqian to document the pronunciation of Chinese Characters, which were monosyllabic, through using two other characters (Dong 2023). The first character represents the initial of the character being referred, which corresponds to a syllable onset in modern linguistics, while the second character represents its final, which refers to the constituent that covers all syllable components except onset. In other words, within the Sinitic convention, individual syllables have long been considered to incorporate two parts of Initial and Final (Dong 2023; Duanmu 1990, 1999; Huang 2021; Li and Yao 2008; Třísková 2011; Zhang 2006). This fanqie approach has thereafter been prevalent as an effective tool to indicate speech sounds of Chinese languages over history. During the Six Dynasties period (220–589 AD), a series of rhyme dictionaries were compiled using this method, such as Lu Fayan’s Qieyun, which are regarded as the earliest and most direct reference to the study of the phonological system of the Early Middle Chinese and above (Dong 2023). In these dictionaries, each rhyme can be seen as a range of rhyming characters for poetry and prose compositions, and characters sharing the same pronunciation are listed together. As such, rhyme dictionaries contain all possible characters being practically used. For example, there are about 12,000 characters in Lu Fayan’s rhyme book Qieyun. They also contain the maximum types of initials and finals, as well as their combinations to formulate monosyllabic characters.
This study adopts the Sinitic convention of syllabification and rhyme dictionary to examine the phonotactics of syllable onsets in Southern Min. Each syllable is divided into two parts of Initial that corresponds to onset and Final that covers all other syllable components. This is hierarchically represented in Figure 4. In total, 15 syllable onsets (initials) are contrastive, as described in Section 3. Likewise, 61 finals can be generalized from the empirical data, as summarized in Table 10 with examples. They can be four types: V, GV, VX, and GVX, in which V covers oral vowels, nasal vowels, and syllabic nasals, while X covers nasal codas, obstruent codas and glide codas. The 15 onsets and 61 finals will serve as an important base to explore how the onset phonemes of this language are distributed in relation to different finals (or final types), and how their combinations may be constrained at the syllable level.

Syllable modelling in this study.
Examples of finals in Zhangzhou.
| Types | Onset | Examples | Types | Examples | Examples |
|---|---|---|---|---|---|
| V | /e/ | ke41‘calculate; plan’ | VX | /iŋ/ | kiŋ41‘respect’ |
| /i/ | ki41‘remember’ | /ɔŋ/ | kɔŋ41‘tribute’ | ||
| /u/ | ku41‘sentence’ | /ɐw/ | kɐw41‘enough’ | ||
| /ɔ/ | kɔ41‘look after’ | /ɐ̃w/ | ɠɐ̃w33‘root of lotus’ | ||
| /ɛ/ | kɛ41‘frame; shelf’ | /ɐj/ | kɐj41‘boundary’ | ||
| /ɵ/ | kɵ41‘tell; sue’ | /ɐ̃j/ | kɐ̃j41‘how about’ | ||
| /ɐ̃/ | kɐ̃41‘yeast’ | /ɐp/ | kɐp41‘pigeon’ | ||
| /ĩ/ | kĩ41‘see; meet’ | /ip/ | kip41‘anxious; urgent’ | ||
| /ɔ̃/ | ɓɔ̃41‘emit; pop up’ | /ɔp/ | ħɔp41‘catch with a cover’ | ||
| /ɛ̃/ | kɛ̃41‘quantifier for machine’ | /ɐt/ | kɐt41‘tie; knot’ | ||
| /m̩/ | ʔm̩41‘oh; all right’ | /it/ | kit41‘orange’ | ||
| /ŋ̩/ | kŋ̩41‘steel’ | /ut/ | kut41‘bone’ | ||
| GV | /jɐ/ | kjɐ41‘post’ | /ɐk/ | kɐk41‘horn; angle’ | |
| /ju/ | kju41‘save; rescue’ | /ik/ | kik41‘leather; transform’ | ||
| /jɔ/ | ħjɔ41‘yes’ | GVX | /ɔk/ | kɔk41‘country; nation’ | |
| /jɵ/ | kjɵ41‘call; order’ | /jɐw/ | kjɐw41‘seize; hand over’ | ||
| /jɐ̃/ | kjɐ̃41‘mirror; glass’ | /jɐ̃w/ | ɠjɐ̃w41‘stingy’ | ||
| /jɔ̃/ | tsjɔ̃41‘dipping sauce’ | /wɐj/ | kwɐj41‘strange; to blame’ | ||
| /jũ/ | ɗjũ51‘turn; tweak’ | /wɐ̃j/ | ʔwɐ̃j51‘sprain; wrench’ | ||
| /wɐ/ | kwɐ41‘hang’ | /jɐm/ | kjɐm41‘sword’ | ||
| /we/ | kwe41‘pass through’ | /jɐn/ | kjɐn41‘build; found’ | ||
| /wi/ | kwi41‘expensive’ | /jɐŋ/ | kʰjɐŋ41‘capable; competent’ | ||
| /wɐ̃/ | kʰwɐ̃41‘look; see’ | /jɔŋ/ | kjɔŋ51‘arch’ | ||
| /wĩ/ | kwĩ41‘volume’ | /wɐn/ | kwɐn41‘be used to’ | ||
| VX | /ɐm/ | kɐm41‘supervise’ | /jɐp/ | kjɐp41‘take by force’ | |
| /im/ | kim41‘prohibit’ | /jɐt/ | kjɐt41‘bear fruit; connect’ | ||
| /ɔm/ | ɗɔm41‘sloshy; muddy’ | /jɐk/ | kjɐk41‘screech’ | ||
| /ɐn/ | kɐn41‘separate’ | /jɔk/ | kjɔk41 ‘chrysanthemum’ | ||
| /in/ | kin41‘strength’ | /wɐt/ | kwɐt41 ‘determine’ | ||
| /un/ | kun41‘stick’ |
The phonotactic exploration in this study is data driven. It is built upon 61 rhyme tables that Huang (2019) adopts the conventional notion of rhyme dictionary to tabulate combinations of individual onsets, finals, and tones to generate syllables in this language. Table 11 illustrates the rhyme table under the final /ɐ/, while Table 12 illustrate the rhyme table under the final /ɐ̃w/. The top row indicates the final being addressed; the second rows list the eight tones in sequence while the leftmost column lists the 15 onset phonemes. Each slot in the table represents a combination of an onset, a final, and a tone to generate a syllable in this language. If a syllable is empirically attested, a represented character is put in that correspondingly represents one or more monosyllabic morpheme(s). For example, the syllable /kɐ35/ in Table 11 is attested because it can generate lexical morphemes like ‘glue’ in Chinese character 胶 ( jiāo ). Likewise, the syllable /ɓɐ̃w33/ in Table 12 can generate a lexical morpheme like ‘appearance’ in Chinese character 貌 ( mào ). In contrast, if a slot is left empty, indicating that its related syllable is theoretically possible to occur, but it is not attested in the field. For example, the syllable /kɐ22/ is not attested, because it is not related to any of morphemes in this language.
Rhyme table of the final /ɐ/ in Zhangzhou.
| Onset | T1 [35] | T2 [22] | T3 [51] | T4 [41] | T5 [33] | T6 [41] | T7 [221] | T8 [22] |
|---|---|---|---|---|---|---|---|---|
| p | 巴 ‘tailor’ | 饱2 ‘full; not hungry’ | 霸 ‘overlord; bully’ | |||||
| pʰ | 脬 ‘bladder; sponge’ | 拍 ‘beat; hit’ | 泡2肿水泡 ‘blister’ | |||||
| ɓ | 疤2 ‘quantifier for bump’ | 麻2 ‘numb’ | 肉2 ‘meat’ | 密2紧紧 ‘close’ | 觅2 ‘seek; look for’ | |||
| t | 焦2 ‘withered; dry; burnt; fragile’ | ◯婴儿吃 ‘(infant) eat’ | 搭2 ‘put up; travel by’ | 踏2 ‘step on; trample’ | ||||
| tʰ | 塔 ‘tower; pagoda’ | 叠2 ‘pile up’ | ||||||
| ɗ | 拉 ‘pull’ | 蜊 ‘clam’ | 邋 ‘sloppy; dowdy’ | ◯搅动 ‘stir’ | 蜡 ‘wax’ | |||
| k | 胶 ‘glue’ | 绞 ‘twist; wring’ | 教2 ‘teach’ | 咬 ‘bite’ | ||||
| kʰ | 脚 ‘foot’ | 卡2 ‘card’ | 敲2 ‘knock’ | |||||
| ɠ | 牙2败牙; 螺丝牙 ‘screw thread’ | |||||||
| ts | 查2查甫; 查某 ‘bound morpheme for man and woman’ | 早 ‘early; long ago’ | 扎2携带; 提 ‘bring along; pull out; tie’ | 闸 ‘brake; gear; penstock’ | ||||
| tsʰ | 差2 ‘difference; weak’ | 柴 ‘firewood’ | 吵 ‘noisy; wrangle’ | 插2 ‘insert’ | ||||
| s | 捎 ‘grasp; take along’ | 傻 ‘foolish’ | 洒 ‘sprinkle’ | 煠沸水快煮 ‘cook in boiled water’ | ||||
| z | ||||||||
| ħ | 哈 ‘sound of laughter’ | 孝2 ‘filial piety’ | ||||||
| ʔ | 鸦 ‘crow’ | 仔 ‘nominative and diminutive suffix’ | 鸭 ‘duck’ | 盒 ‘small box or base’ |
Rhyme table of the final /ɐ̃w/ in Zhangzhou.
| Onset | T1 [35] | T2 [22] | T3 [51] | T4 [41] | T5 [33] | T6 [41] | T7 [221] | T8 [22] |
|---|---|---|---|---|---|---|---|---|
| p | ||||||||
| pʰ | ||||||||
| ɓ | 茅 ‘reeds’ |
◯牙齿脱落; 说话跑气 ‘speak indistinctly for missing teeth’ | 貌 ‘appearance’ | |||||
| t | ||||||||
| tʰ | ||||||||
| ɗ | 脑 ‘brain’ | 闹2 ‘noisy’ | ||||||
| k | ||||||||
| kʰ | ◯啃东西的声音 ‘gnawing sound’ | |||||||
| ɠ | 熬2 ‘stew’ | 藕 ‘root of lotus’ | ◯吠叫 ‘bark’ | |||||
| ts | ||||||||
| tsʰ | ||||||||
| s | ||||||||
| z | ||||||||
| ħ | ◯芋头没煮熟; 口感不佳 ‘flavour of undercooked taro’ | |||||||
| ʔ | ◯说话支支吾吾 ‘hum and haw’ |
As can be seen in these two tables, the arrangement of rhyme table is an effective way to present what syllables are attested within a particular final, and what syllables are not attested but they are theoretically possible to occur. A whole set of 61 rhyme tables thus delivers a full inventory of syllables that are empirically used by the speech community of this language. It is also an effective way to present the occurrence of individual phonemes at the syllable level and show their combinability with other syllable components. If a syllable fails to obtain any lexical morpheme, as shown in an empty slot, it indicates the combination of a specific onset, final and tone is subject to phonotactic constraint(s). In contrast, if a syllable is able to obtain lexical morphemes, it indicates this onset is combinable with the related final and tone to make lexical items, and there is no constraint governing their sequencing. As such, the usage frequency of individual onset phonemes can be quantified by calculating the number of syllables that can be obtained under the target onsets. Likewise, the phonotactic constraints on syllable onsets can also be interpreted by calculating the number of syllables that are theoretically permissible but are not obtained in the field.
Assuming this language has an inventory of 15 onsets, 61 finals, and 8 tones, each onset is theoretically possible to generate 488 syllables (=61 finals × 8 tones), and a total of 7,320 syllables (=15 onsets × 61 finals × 8 tones) are logically possible to be used by the speech community. However, the usage frequency of onset phonemes in real-world situations is significantly lower than the theoretically expected. This can be seen in Table 13. It shows the number of empirically attested syllables under individual onset phonemes, as shown in the column of syllable, and the discrepancy between empirically obtained syllable number and the theoretically expected number, as shown in the column of gap. They are calculated based on the 61 rhyme tables that Huang (2019) constructed. In addition, Figure 5 visualises the results.
Usage frequency of syllable onsets in Zhangzhou.
| Onset | Syllable | Gap | ||
|---|---|---|---|---|
| Labial | stop | p | 127 | 361 |
| pʰ | 99 | 389 | ||
| ɓ | 124 | 364 | ||
| Alveolar | stop | t | 180 | 308 |
| tʰ | 120 | 368 | ||
| ɗ | 174 | 314 | ||
| affricate | ts | 189 | 299 | |
| tsʰ | 142 | 346 | ||
| fricative | s | 174 | 314 | |
| z | 33 | 455 | ||
| Velar | stop | k | 175 | 313 |
| kʰ | 129 | 359 | ||
| ɠ | 79 | 409 | ||
| Pharyngal | fricative | ħ | 172 | 316 |
| Glottal | stop | ʔ | 188 | 300 |

Usage frequency of onset phonemes in Zhangzhou.
As can be seen, individual onset phonemes show great variations in their usage frequency in this language. The alveolar unaspirated affricate /ts/ appears to be the most frequently used with 189 syllables obtained, followed by the glottal stop with 188 syllables obtained in the field. The voiced alveolar fricative /z/ has the lowest usage frequency that only generates 33 syllables. However, in comparison with the number of 488 that each onset is theoretically expected to obtain, there is a great discrepancy. For example, the number of syllables that the onset /ts/ generates is 299 fewer than the expected number, while the number of syllables that the onset /z/ generates is exactly 455 fewer than the theoretical assumption. The great divergency between the expectation and the reality implies that the usage frequency of onset phonemes and their combinability with finals and tones have been substantially constrained at the syllable level. As a matter of fact, only 2105 syllables are attested in the field, and as many as 5215 syllables show gaps in the rhyme tables. This signifies that more than 71% of theoretically possible syllables fail to be employed by native speakers in communication. The rest of this section will be devoted to exploring how the combinability of onsets with other syllable components are constrained and what factors trigger the constraints.
5.2 Onset-final occurrence constraints
Phonotactic constraints occur on the alignments of onset phonemes and finals in Southern Min. Table 14 calculates the number of syllables that are obtained in relation to individual onset-final combinations. For example, in the pɐ combination, 3 syllables are obtained which bear three different tones, while in the zɐ combination, no tokens are tested in the field. Based on the data shown in this table, several generalizations can be made as to how the onset-final occurrence is constrained to formulate syllables in this language.
The number of attested syllables with respect to onset-final combinations.
| Final type | p | pʰ | ɓ | t | tʰ | ɗ | k | kʰ | ɠ | ts | tsʰ | s | z | ħ | ʔ | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| V | ɐ | 3 | 3 | 5 | 4 | 2 | 5 | 4 | 3 | 1 | 4 | 4 | 4 | 0 | 2 | 4 |
| e | 2 | 3 | 5 | 5 | 5 | 6 | 4 | 4 | 2 | 6 | 4 | 4 | 0 | 5 | 5 | |
| i | 5 | 4 | 6 | 5 | 4 | 5 | 5 | 4 | 5 | 6 | 6 | 6 | 5 | 5 | 5 | |
| u | 3 | 4 | 3 | 5 | 1 | 4 | 4 | 4 | 2 | 5 | 4 | 5 | 2 | 5 | 3 | |
| ɔ | 5 | 5 | 4 | 5 | 4 | 4 | 4 | 4 | 2 | 4 | 3 | 3 | 0 | 4 | 4 | |
| ɛ | 6 | 2 | 2 | 2 | 3 | 2 | 6 | 3 | 2 | 4 | 5 | 4 | 0 | 3 | 1 | |
| ɵ | 5 | 5 | 4 | 6 | 5 | 6 | 5 | 3 | 2 | 6 | 3 | 6 | 0 | 3 | 5 | |
| Ṽ | ĩ | 4 | 3 | 5 | 4 | 2 | 4 | 3 | 1 | 1 | 4 | 2 | 3 | 0 | 3 | 5 |
| ɐ̃ | 0 | 1 | 4 | 3 | 1 | 4 | 5 | 1 | 0 | 4 | 1 | 2 | 0 | 4 | 2 | |
| ɛ̃ | 4 | 2 | 4 | 3 | 3 | 3 | 3 | 2 | 4 | 5 | 4 | 3 | 0 | 1 | 3 | |
| ɔ̃ | 0 | 0 | 5 | 0 | 0 | 3 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 3 | 1 | |
| N | m | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 5 |
| ŋ | 2 | 2 | 0 | 5 | 4 | 1 | 3 | 2 | 0 | 2 | 4 | 4 | 0 | 3 | 3 | |
| VN | ɐm | 0 | 0 | 0 | 5 | 3 | 5 | 4 | 4 | 1 | 4 | 4 | 3 | 0 | 5 | 5 |
| ɐn | 4 | 3 | 3 | 5 | 4 | 4 | 3 | 3 | 4 | 4 | 4 | 3 | 0 | 5 | 4 | |
| ɐŋ | 5 | 5 | 4 | 5 | 4 | 5 | 4 | 2 | 1 | 3 | 3 | 2 | 0 | 4 | 3 | |
| im | 0 | 0 | 0 | 4 | 1 | 4 | 4 | 2 | 1 | 4 | 3 | 5 | 4 | 2 | 4 | |
| in | 5 | 4 | 3 | 5 | 1 | 3 | 5 | 4 | 3 | 5 | 2 | 4 | 2 | 3 | 5 | |
| iŋ | 5 | 3 | 3 | 5 | 5 | 4 | 5 | 5 | 1 | 5 | 5 | 5 | 0 | 4 | 4 | |
| ɔm | 0 | 0 | 0 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 3 | 0 | 0 | 2 | |
| un | 5 | 4 | 3 | 5 | 3 | 4 | 5 | 4 | 1 | 5 | 4 | 5 | 0 | 5 | 5 | |
| ɔŋ | 4 | 5 | 5 | 5 | 4 | 5 | 5 | 4 | 1 | 5 | 2 | 3 | 0 | 5 | 5 | |
| GV | jɐ | 1 | 2 | 0 | 3 | 1 | 2 | 4 | 3 | 3 | 5 | 4 | 5 | 1 | 5 | 6 |
| ju | 1 | 1 | 2 | 4 | 2 | 5 | 5 | 5 | 2 | 5 | 5 | 5 | 1 | 4 | 5 | |
| jɵ | 3 | 2 | 2 | 4 | 2 | 2 | 3 | 1 | 1 | 4 | 3 | 4 | 1 | 2 | 4 | |
| jɔ | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | |
| wi | 3 | 2 | 1 | 4 | 4 | 5 | 4 | 4 | 0 | 4 | 2 | 5 | 0 | 5 | 5 | |
| wɐ | 2 | 2 | 3 | 2 | 3 | 3 | 2 | 3 | 2 | 6 | 2 | 3 | 1 | 5 | 4 | |
| GṼ | jɐ̃ | 3 | 3 | 2 | 3 | 2 | 2 | 5 | 0 | 1 | 5 | 4 | 3 | 0 | 5 | 4 |
| jũ | 0 | 0 | 0 | 1 | 1 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | |
| jɔ̃ | 0 | 0 | 0 | 5 | 0 | 3 | 1 | 2 | 0 | 5 | 5 | 5 | 0 | 1 | 4 | |
| wĩ | 2 | 0 | 5 | 4 | 3 | 5 | 3 | 1 | 1 | 3 | 2 | 3 | 0 | 3 | 4 | |
| wɐ̃ | 5 | 3 | 3 | 4 | 3 | 3 | 4 | 2 | 0 | 4 | 1 | 2 | 0 | 3 | 4 | |
| VG | ɐw | 6 | 5 | 3 | 5 | 5 | 6 | 6 | 3 | 1 | 5 | 4 | 2 | 0 | 4 | 5 |
| ɐj | 4 | 1 | 3 | 5 | 5 | 3 | 3 | 3 | 2 | 5 | 4 | 4 | 0 | 4 | 3 | |
| ṼG | ɐ̃w | 0 | 0 | 3 | 0 | 0 | 2 | 0 | 1 | 3 | 0 | 0 | 0 | 0 | 1 | 1 |
| ɐ̃j | 0 | 2 | 5 | 2 | 0 | 2 | 1 | 1 | 0 | 2 | 1 | 0 | 0 | 1 | 1 | |
| GVG | jɐw | 2 | 2 | 3 | 4 | 4 | 4 | 5 | 5 | 1 | 5 | 3 | 5 | 3 | 4 | 5 |
| wɐj | 0 | 0 | 0 | 0 | 0 | 0 | 3 | 1 | 0 | 0 | 0 | 0 | 0 | 2 | 1 | |
| we | 5 | 4 | 5 | 1 | 2 | 2 | 4 | 3 | 2 | 2 | 3 | 4 | 2 | 5 | 4 | |
| GṼG | jɐ̃w | 0 | 0 | 1 | 0 | 0 | 2 | 0 | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 |
| wɐ̃j | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 0 | 1 | 2 | |
| GVN | jɐm | 0 | 0 | 0 | 5 | 3 | 5 | 4 | 5 | 3 | 4 | 3 | 5 | 1 | 4 | 5 |
| jɐn | 5 | 2 | 3 | 4 | 3 | 5 | 3 | 4 | 4 | 5 | 4 | 5 | 1 | 5 | 4 | |
| jɐŋ | 1 | 0 | 0 | 4 | 0 | 3 | 3 | 2 | 1 | 3 | 5 | 5 | 0 | 3 | 4 | |
| jɔŋ | 0 | 0 | 0 | 4 | 3 | 4 | 4 | 1 | 0 | 4 | 1 | 4 | 2 | 2 | 5 | |
| wɐn | 1 | 0 | 1 | 4 | 2 | 3 | 5 | 3 | 2 | 5 | 3 | 5 | 0 | 5 | 5 | |
| VP | ɐp | 0 | 0 | 0 | 2 | 1 | 2 | 2 | 2 | 0 | 2 | 1 | 1 | 0 | 2 | 1 |
| ɐt | 2 | 0 | 2 | 1 | 1 | 1 | 1 | 1 | 0 | 2 | 2 | 1 | 0 | 2 | 1 | |
| ɐk | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 1 | 1 | 1 | 1 | 1 | 0 | 2 | 1 | |
| ip | 0 | 0 | 0 | 0 | 0 | 1 | 2 | 2 | 1 | 2 | 1 | 2 | 1 | 1 | 1 | |
| it | 2 | 1 | 1 | 2 | 0 | 1 | 1 | 1 | 1 | 2 | 1 | 2 | 1 | 1 | 2 | |
| ik | 2 | 1 | 1 | 2 | 1 | 1 | 2 | 1 | 1 | 2 | 2 | 2 | 1 | 2 | 2 | |
| ut | 2 | 1 | 1 | 2 | 2 | 2 | 2 | 2 | 0 | 2 | 1 | 2 | 0 | 2 | 1 | |
| ɔp | 0 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 1 | 1 | 1 | 0 | 1 | 0 | |
| ɔk | 2 | 2 | 1 | 2 | 1 | 2 | 2 | 2 | 1 | 2 | 2 | 1 | 0 | 2 | 1 | |
| GVP | jɐp | 0 | 0 | 0 | 2 | 2 | 2 | 1 | 1 | 2 | 2 | 1 | 2 | 1 | 1 | 2 |
| jɐt | 1 | 2 | 1 | 2 | 1 | 1 | 2 | 2 | 1 | 2 | 1 | 2 | 1 | 2 | 2 | |
| jɐk | 1 | 0 | 0 | 0 | 0 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 0 | 2 | |
| jɔk | 0 | 0 | 0 | 2 | 0 | 1 | 2 | 1 | 1 | 1 | 1 | 2 | 1 | 1 | 1 | |
| wɐt | 2 | 1 | 2 | 2 | 1 | 0 | 1 | 1 | 1 | 1 | 0 | 1 | 0 | 2 | 2 | |
One of the most significant factors that affect the usage frequency and combinability of onsets can be ascribed to the nasality feature from nuclei in Southern Min. As discussed in previous sections, the nasality feature is not employed phonemically to construct nasal onsets in this language, but the existence of nasal vowels and syllabic nasals can induce diverse phonological processes and change the surface forms of onset phonemes at the output. Similarly, the nasality feature from the nucleus system can also affect the usage frequency of onset phonemes and their combinability with other components at the syllable level. This can be seen in Table 14, no tokens are found to contain syllables like *ɓN, *ɠN, and *zN in the field, and only one lexical item has a voiced onset before a syllabic nasal, which is /ɗŋ22/ ‘small taro’, but it is not extensively used in the community. As such, it can be generalized that there is a phonotactic constraint on the alignment of the voicing feature of onset consonants with the nasality feature from syllabic nasals. This is because voiced onsets dominantly cannot occur before syllabic nasals, whereas voiceless onsets can occur before syllabic nasals. For example, all voiceless onsets can combine with the syllabic velar nasal /ŋ̩/ to formulate monosyllabic morphemes, such as /pŋ̩51/ ‘list of names’, /pʰŋ̩51/ ‘ebb tide’, /tŋ̩35/ ‘wait’, /tʰŋ̩41/ ‘hot’, /sŋ̩51/ ‘play’, /tsŋ̩35/ ‘village’, /tsʰŋ̩22/ ‘bed’, /kŋ̩41/ ‘steel’, /kʰŋ̩41/ ‘hind’, /ħŋ̩35/ ‘prescription’, and /ʔŋ̩35/ ‘centre’. The labial syllabic nasal /m̩/ can only combine with two onsets /ħ/ and /ʔ/ which are both voiceless, such as /ħm̩41/ ‘hit with a stick’ and /ʔm̩35/ ‘drink’.
There shows less restriction on the alignment of onset phonemes with nasal vowels. Because implosive onsets can occur before nasal vowels at an underlying level, such as syllables like ɓṼ (/ɓĩ33/ ‘noodle’), ɗṼ (/ɗɔ̃33/ ‘two’), and ɠṼ (/ɠɔ̃51/ ‘five’) are all attested in the field, but it should also be noted that, these implosive onsets surface as their homorganic nasal stops /m, n, ŋ/ at the output. The voiced fricative /z/ cannot occur before nasal vowels, as no lexical items contain syllables like *zṼ, *zGṼ, *zṼG, *zGṼG. However, although implosives are allowed to combine with nasal vowels, the number of syllables that they can generate is not significant. As can be seen in the table, only 32 syllables are obtained before the Ṽ type, 26 before the GṼ type, 10 before the ṼG type, while only 1 syllable is attested before the GṼG type.
There seems no restriction on the alignment of onset phonemes with the nasality feature form syllable codas. This can be seen in the Table 14. Not only voiceless onsets, but also woiced onsets can occur before finals with nasal codas, for examples, syllables like ɓVN (/ɓɐn51/ ‘mosquito’), ɓGVN (/ɓjɐn51/ ‘waive’), ɗVN (/ɗɐm22/ ‘male’), ɗGVN (ɗjɐm22/ ‘sticky’), ɠVN (/ɠɐn51/ ‘eye’), ɠGVN (/ɠjɐn41/ ‘attractive’), zVN (/zim51/ ‘to tolerate’), and zGVN (/zjɐm51/ ‘infect’) are all attested to construct lexical items in this language. As such, it can be generalized that the phonotactic constraint on onset phonemes with the nasality feature from other syllable components dominantly operates on their alignments with nasal nuclei and does not affect their alignments with nasal codas.
Another significant factor that affects the usage frequency of onset phonemes in this language can be ascribed to the phonotactic constraint on the occurrence of the labial feature within syllables. As can be seen in the table, all labial onsets /p, pʰ, ɓ/, regardless of their manner of articulation, cannot combine with finals of a labial coda. This is because no lexical items can be attested having syllables like *Pɐm; *Pim; *Pɔm; *Pjɐm; *Pɐp; *Pip; *Pɔp; *Pjɐp, which involve as many as 24 combinations (=3 labial onsets × 8 finals). In contrast, these labial onsets can occur before finals with a non-labial coda, as can be seen in morphemes like /pɐn22/ ‘bottle’, /pin22/ ‘side’, /pɐŋ22/ ‘bedroom’, /pjɐn41/ ‘build’, /pɐt221/ ‘another’, /pik41/ ‘force’, and /pjɐt221/ ‘shoot’. Likewise, non-labial onsets can occur before finals with a labial coda, as can be seen in morphemes like /sim35/ ‘heart’, /sɐm35/ ‘shirt’, /sɔm35/ ‘frost’, /sjɐm22/ ‘hug’, /sɔp42/ ‘tie up’, /sip41/ ‘humid’, and /tsjɐp41/ ‘connect’. It thus can be generalized that the labial feature is subject to an OCP (Obligatory Contour Principle) constraint that disallows it to co-occur at both onset and coda positions. Because of this restriction, a considerable number of syllables that are theoretically possible to contain labial onsets and labial codas are not able to be attested in the field. Such a labial restriction is also reported in other Sinitic languages, such as Wu (Zhang 2006) and Cantonese (Kirby and Yu 2007).
Additionally significant factors that can create phonotactic constraints on onset phonemes in this language come from the final type and the complexity of finals. The leftmost column in Table 14 lists the types of finals in accordance with the nature of their nuclei and coda, and the occurrence of prevocalic glides. As can be seen, onset phonemes tend to generate less syllables with a nasal final type (Ṽ and N) than with an oral final type (V). This can be seen that a total of 402 syllables are obtained under the V type, while only134 syllables are obtained under the Ṽ type and 42 syllables under the N type. Likewise, onset phonemes generate less syllables with finals ending in glide and obstruent codas (VG and VP) than with finals ending in nasal codas (VN). This can also be seen that a total of 421 syllables are obtained under the VN type, while only 162 syllables are obtained under the VP type and 109 under the VG type.
In addition, the complexity of finals can also impact the usage frequency of onset phonemes in this language. Voiceless onsets largely cannot combine with the GṼG final to form syllables. This is because no lexical items are found to contain syllables like *pGṼG, *pʰGṼG, *tGṼG, *tʰGṼG, *kGṼG, *kʰGṼG, *tsGṼG, and *tsʰGṼG. Only the voiceless alveolar fricative /s/, pharyngeal fricative /ħ/ and the glottal stop /ʔ/ can occur before this final, which separately generate 2, 1, and 2 syllables, such as /swɐ̃j22/ ‘mango’, /swɐ̃j22/ ‘sound of spinning movement’, /ħwɐ̃j22/ ‘chubby status’, and /ʔwɐ̃j41/ ‘wrench’. Implosive onsets are able to occur before the GṼG final, such as ɓGṼG (/ɓjɐ̃w22/ ‘copy; describe’), ɗGṼG (/ɗjɐ̃w22/ ‘cat’), and ɠGṼG (/ɠjɐ̃w35/ ‘itchy’), but only 7 syllables can be obtained. As a total, only 12 syllables are able to contain this final. As can be seen onset phonemes have the lowest combinability with this final and this can be ascribed to the complexity of this final that contain nasal vowels and prevocalic and postvocalic glides.
5.3 Onset-tone occurrence constraints
Phonotactic constraints also occur on the alignment of onset and tone in this language. Table 15 calculates the number of syllables that are obtained with respect to individual onset-tone combinations. For example, in the p-T1 slot, it presents 22 syllables that are attested to construct morphemes in this language. The letters I, II, III, and IV separately correspond to Ping, Shang, Qu, and Ru tone in terms of the Middle Chinese tonal category; while the letters a and b represent the Yin and Yang registers, respectively, which are determined by the voicing status of syllable onsets in the historical time. As can be seen, several significant characteristics can be generalized to account for the onset-tone occurrence constraints.
The number of attested syllables with respect to onset-tone combination.
| Onset | T1 | T2 | T3 | T4 | T5 | T6 | T7 | T8 | Total |
|---|---|---|---|---|---|---|---|---|---|
| [35] | [22] | [51] | [41] | [33] | [41] | [221] | [22] | ||
| Ia | Ib | II | IIIa | IIIb | IVa | IVb | IVb | ||
| p | 22 | 18 | 22 | 22 | 23 | 7 | 9 | 4 | 127 |
| pʰ | 20 | 16 | 14 | 23 | 11 | 7 | 3 | 5 | 99 |
| ɓ | 14 | 30 | 25 | 10 | 26 | 3 | 8 | 8 | 124 |
| t | 34 | 30 | 25 | 33 | 31 | 10 | 11 | 6 | 180 |
| tʰ | 26 | 20 | 17 | 24 | 16 | 9 | 3 | 5 | 120 |
| ɗ | 23 | 32 | 35 | 23 | 32 | 6 | 13 | 10 | 174 |
| k | 36 | 23 | 31 | 33 | 24 | 13 | 8 | 7 | 175 |
| kʰ | 32 | 17 | 22 | 26 | 9 | 11 | 6 | 6 | 129 |
| ɠ | 3 | 24 | 12 | 7 | 17 | 1 | 10 | 5 | 79 |
| ts | 36 | 27 | 31 | 34 | 28 | 12 | 11 | 10 | 189 |
| tsʰ | 35 | 15 | 25 | 30 | 14 | 11 | 5 | 7 | 142 |
| s | 37 | 24 | 29 | 34 | 22 | 13 | 8 | 7 | 174 |
| z | 2 | 9 | 6 | 1 | 6 | 1 | 6 | 2 | 33 |
| ħ | 33 | 28 | 25 | 29 | 28 | 12 | 9 | 8 | 172 |
| ʔ | 39 | 30 | 33 | 31 | 28 | 12 | 7 | 8 | 188 |
| Total | 392 | 343 | 352 | 360 | 315 | 128 | 117 | 98 | 2105 |
The type of syllable codas that tones are assigned to can substantially constrain the usage frequency of onset phonemes in this language. It can be seen in the table that onsets are consistently less frequently used in Tones 6, 7, and 8. For example, onsets in Tone 1 can generate as many as 392 syllables, whereas in Tone 6 can only generate 128 syllables. In Tone 3, onsets can generate 352 syllables, but in Tone 7, only 117 syllables are obtained. Likewise, in Tone 4, onsets can generate 360 syllables, but the number is reduced to 98 in Tone 8. As has been shown, the number of empirically attested syllables is consistently above 300 across Tone 1 to Tone 5, but it is dropped to around 100 among Tones 6, 7, and 8. This reduction can be ascribed to the diachronic constraint on assigning syllable codas in relation to tones. Tone 6 and Tone 7 both belong to Ru tones in the MC tonal categories, known as stopped tones, which diachronically request their syllables to end in obstruent codas (Huang 2018, 2021). Tone 8 is a new tone that emerges from those syllables that are conventionally transcribed with a glottal stop coda in the MC Yangru tone, but the glottal stop coda is found to have undergone deletion in the synchronic speech, because of this, its related syllables have become open, and a new tone is evoked (Huang 2018). On the contrary, all other tones (1–5) belong to non-Ru categories, known as unstopped tones, which request their syllables to end in sonorant segments, including zero codas, glides, and nasal stops. As can be generalized, the strict restriction on syllable coda type in relation to tones can negatively impact the usage frequency of onset phonemes and their combinability with tones to generate syllables in conversations.
The type of tonal registers can substantially impact the usage frequency of onset phonemes and the onset-tonal combinability in this language. It can be seen in the table that the number of syllables that are obtained in Yin registered tones is consistently greater than in Yang-registered tones. For example, a total of 392 syllables are obtained in the Yinping Tone 1, higher than that of 342 in the Yangping Tone 2. Likewise, a total of 360 syllables are attested in the Yinqu Tone 4, higher than that of 315 syllables in the Yangqu Tone 5. Similarly, in the Yinru Tone 6, 128 syllables are obtained, higher than that of 118 syllables in the Yangru Tone 7. The asymmetrical distributions of onset-tonal combinability in the generation of syllables can be ascribed to the diachronic constraint of assigning tonal registers with respect to the voicing status of syllable onsets. Yin-registered tones are historically assigned to syllables with voiceless onsets, while Yang-registered tones are assigned to syllables with voiced onsets. From a synchronic perspective of phonology, voiced onsets are more marked than voiceless onsets because they request an additional articulation of vocal fold vibration. As such, it is reasonable to obtain less syllables in Yang-registered tones than in Yin-registered tones. Also, it can be generalized that the type of tonal registers can impact the onset-tonal combinability in the generation of syllables for communication purposes.
The nature of onset phonemes themselves can also constrain the onset-tonal combinability and impact their usage frequency. As can be seen in the table, a total of 474 syllables are obtained with alveolar stops /t, tʰ, ɗ / across all tones, which is significantly higher than that of 383 with velar stops /k, kʰ, ɠ/ and that of 350 with labial stops /p, pʰ, ɓ/. This reflects that the feature of the place of articulation can affect the usage frequency of onset phonemes in generating linguistic entities. It also reflects the higher usage frequency of the alveolar place of articulation in speech communications. In addition, it can be observed that, a total of 482 syllables are obtained with voiceless unaspirated stops /p, t, k/ across all tones, higher than that of 377 with voiced stops /ɓ, ɗ, ɠ/ and that of 348 with voiceless aspirated stops /pʰ, tʰ, kʰ/. This reflects that the manner of articulation can also impact the onset-tonal combinations to generate syllables in this language. It also reflects that the higher usage frequency of voiceless unaspirated stops over their aspirated and voiced counterparts. In addition, it shows the feature of aspiration appears to be more marked than the feature of voicing because the former generate less syllables.
As discussed in this section, the usage frequency of onset phonemes and their combinability with other syllable components are subject to severe constraints in this language. This leads to more than 71% of theoretically permissible syllables to fail to be attested and used by native speakers. Only 2105 syllables are obtained in the field. Built upon a large corpus of 61 rhyme tables, this study quantifies the usage frequency of individual onset phonemes with respect to finals and tones and explores what factors have constrained their combinability in the generation of linguistic entities for communication purposes. It shows that the asymmetrical distribution of the nasality feature, the strict restriction of the occurrence of the labial feature, the type of finals and their complexity can all pose phonotactic constraints on the onset-final combinations. For example, voiced onsets dominantly cannot occur before syllabic nasals. Only a small number of syllables can have voiced onsets before nasal vowels. Labial onsets cannot co-occur with labial codas within the same syllables. Onsets are less frequent to combine with finals with marked features, such as nasal vowels, syllabic nasals, obstruent codas, and glide codas than combining with finals with ordinary features, such as oral vowels or nasal codas. Onset phonemes can also reduce their usage frequency when combining with finals with complex internal structures. For example, only 7 syllables are attested to contain the GṼG final type.
This exploration also reveals phonotactic constraints on onset-tonal combinations to generate syllables. It shows that the diachronic requirements on assigning tones with respect to syllable codas and onsets can negatively impact the usage frequency of onset phonemes and the number of empirically attested syllables. Specifically, onsets are less frequently used in tones that are assigned to syllables ending in obstruent codas, and in tones that are assigned to syllables with voiced onsets in the historical time. This is directly reflected in the number of syllables that is attested to be around 100 in stopped Tones 6 and 7 relating to obstruent-ending syllables, and in Tone 8 emerging from the synchronic deletion of glottal stop codas, while the number is above 300 in other tones relating to sonorant-ending syllables. Likewise, the number of syllables in Yin registered tones that are historically related to voiceless onsets is consistently greater than in Yang-registered tones that are related to voiced onsets. The nature of onset phonemes themselves can also influence their combinability with tones. In general, onsets with an alveolar place of articulation are more frequently used than onsets with velar and labial places of articulation. Similarly, onsets with a voiceless and unaspirated manner of articulation are more often used than onsets with other manners of articulation. Onsets with the feature of aspiration are less frequently used than onsets with a voiced feature.
6 Discussion and conclusion
This study undertakes an in-depth exploration into how inventory, process, and constraint work as a system to shape the complexity of syllable onsets in a specific language of Southern Min. It presents an inventory of 15 onset phonemes that is smaller than a vast number of languages in the world. This can be considered as a compensatory effect to its large number of vowels that reaches as many as eleven and is greater than most of the world’s languages. It shows cross-linguistic tendencies on the preference of alveolar sounds over onsets with other places of articulation, and the preference of plosive sounds over onsets with other manners of articulation. It also presents several typologically uncommon features, such as having a three-way contrast among supraglottal stops in terms of aspiration and voicing, the phonemic usage of the ingressive airstream mechanism and no nasal consonants as syllable onsets.
In spite of having a small onset inventory, this language is typologically significant to possess as many as 57 variants at the surface level, and this is triggered by only three distinct features from subsequent sounds. It on the one hand shows the complexity of the onset system of Southern Min, while on the other hand reflecting a universal phenomenon of assimilation that all sounds have slightly different allophones occurring in different contexts, and that a sound becomes more like a nearby sound with respect to one or more featural properties (cf. Arthur 1983; Gordon 2016; Hyman 1975; Kennedy 2016; Mielke 2008). For example, the three implosives in this language surface as their corresponding homorganic nasal stops before nasal vowels. The occurrence of assimilation is typically motivated by consideration of articulatory ease, this is because sharing one or more articulatory properties can minimize transitions required that are required of speech articulators (Gordon 2016). For example, producing implosive sounds requires speakers to block off their nasal cavity, pull down the larynx to allow the airstream from outside to rush into the mouth before they flow out to release an oral constriction, while producing nasal vowels requires speakers to lower their velum to let the air flow to escape through the nose. It is thus articulatorily economic for speakers to surface implosive onsets as nasal stops. Likewise, the reason why the glottal stop onsets surface three forms [mʔ], [nʔ], and [ŋʔ] may also be ascribed to the articulatory ease in assimilating the place but also the manner of articulation to that of their subsequent nasal segments.
This phenomenon of assimilation reflects that humans do not move their organs in isolation and do not utter segments one by one. Instead, their vocal apparatus functions in a continuous motion that can easily lead to overlapping gestures and easily cause adjacent sounds to share assimilating properties (cf. Arthur 1983; Gordon 2016; Hyman 1975; Kennedy 2016; Mielke 2008). Correspondingly, the adjacent sounds become perceptually less distinct from each other in a specific context. It is thus plausible for individual phonemes to surface different forms in according to the nature of their surrounding sounds.
In addition, syllable onsets in this language present varying usage frequency and are severely constrained over their generation of linguistic entities of syllables and morphemes. On onset-final combinations, the nasality feature from nuclei, the labial feature within syllables, and the complexity of final types can all negatively impact their usage frequency. Voiced onsets are banned to occur before syllabic nasals. Labial onsets are disallowed to co-occur with labial codas. Onsets are less frequently used with finals that have a marked feature and/or a complex internal structure. They produce less syllables with nasal final types Ṽ and N than with an oral one V, and with glide and obstruent-ending finals VG and VP than with their nasal counterpart VN, while showing the lowest usage frequency with the GṼG final. On onset-tonal alignments, the types of syllable onset and coda that tones are diachronically requested to assign to, the emergence of a new tone, and the nature of onset themselves can all influence their usage frequency. They show the lowest usage frequency in a new tone that emerges because of the deletion of glottal stop coda in the synchronic speech and are less frequently used in tones that are assigned to syllables ending in obstruent codas, and in Yang-registered tones that are historically related to voiced onsets. Moreover, they are more often used with alveolar onsets than with onsets of velar and labial places of articulation. Likewise, they are more frequently used with voiceless and unaspirated onsets than with onsets of other manners of articulation.
The phenomenon of phonotactics reflects that humans cannot randomly combine speech sounds to formulate entities that are of linguistic significance and that are empirically attested. Instead, the formation of linguistic entities is subject to constraints, and this appears to be universal, although it is relatively less studied in literature. For example, the segmental string *lbick cannot be attested in natural languages because the *lb onset cluster violates the sonority sequencing principle that prefers a rising sonority from syllable edge to nucleus (cf. Clements 1988; Giegerich 1992; Zec 1995). In Cantonese, only 1900 syllables can be attested, whereas there theoretically should have about 5130 syllables (with tones) (Kirby and Yu 2007). In Mandarin Chinese, only about 400 syllables can be attested in the field, whereas the number is theoretically expected to be 1900 (without tones) (Duanmu 2003). Likewise, in Lanzhou Chinese, about 50 % of theoretically possible syllables (without tones) are reported to be missing (Yi and Duanmu 2015).
The exploration of this study shows how inventory, process, and constraint function as parts of a system, and demonstrates the importance of adopting the perspective of a system for a deeper understanding of the phonology of a language. The systematic discussions of typologically common and uncommon characteristics of onset phonemes, a variety of allophonic variation, and different types of phonotactic constraint in Southern Min have a significant implication to contribute to an advanced model in phonological descriptions, which are conventionally restricted to the aspect of inventories, with less attention on processes and constraints. They contribute well-established empirical patterns to the databases of phonological typology, in which processes and constraints have received significantly fewer attention. This study has practical implication to serve as a standing point to investigate syllable onsets in other Sinitic languages and beyond, and to investigate the system of other phonological category, such as syllable nuclei and codas, using the perspective of system. In addition, this study lays out an objective dataset to further examine the nature and mechanism of relative variation processes and phonotactic constraints using other quantitative methods. For example, Yu (2016) adopts seven acoustic parameters (centroid frequency, standard deviation, kurtosis, skewness, peak frequency, ampRatio, and sibilant duration) to examine the degree of palatalization in Cantonese, the method of which can also be applied to this study. Gong and Zhang (2021) employ the UCLA Phonotactics Learner to model Mandarin speakers’ phonotactic knowledge. The results suggest that multiple grammatical principles, such as the OCP, allophonic restrictions, and segmental–suprasegmental co-occurrence constraints, can all influence native speakers’ phonotactic judgement. The method can also be applied to examine the phonotactic constraints on onset phonemes in this language.
-
Research ethics: Not applicable.
-
Informed consent: Not applicable.
-
Conflict of interest: The author declares that there is no conflict of interest.
-
Data availability: The author confirms that the data supporting the findings of this study are available within the article.
References
Abramson, Arthur S. & Douglas H. Whalen. 2017. Voice onset time (VOT) at 50: Theoretical and practical issues in measuring voicing distinctions. Journal of Phonetics 63. 75–86. https://doi.org/10.1016/j.wocn.2017.05.002.Search in Google Scholar
Algeo, John. 1975. Syncope and the phonotactics of English. General Linguistics 15. 71–78.Search in Google Scholar
Anderson, Stephen. 1978. Tone features. In Victoria Fromkin (ed.), Tone: A linguistic survey, 133–175. New York: Academic Press.10.1016/B978-0-12-267350-4.50010-8Search in Google Scholar
Arthur, Brakel. 1983. Phonological markedness and distinctive features. Bloomington: Indiana University Press.Search in Google Scholar
Baxter, William H. & Laurent Sagart. 2014. Old Chinese: A new reconstruction. Oxford: Oxford University Press.10.1093/acprof:oso/9780199945375.001.0001Search in Google Scholar
Bauer, Robert S. & Paul K. Benedict. 1997. Modern Cantonese phonology. Berlin: De Gruyter Mouton.10.1515/9783110823707Search in Google Scholar
Bickford, Anita C. & Rick Floyd. 2006. Articulatory phonetics: Tools for analyzing the world’s languages, 4th edn. Dellas, TX: SIL International.Search in Google Scholar
Blevins, Juliette. 1995. The syllable in phonological theory. In John Goldsmith (ed.), Handbook of phonological theory, 206–255. Oxford: Basil Blackwell.Search in Google Scholar
Celata, Chiara & Basilio Calderone. 2015. Introduction to phonotactics: Cross-linguistic perspectives from acquisition, speech production and corpus studies. Italian Journal of Linguistics 27(1). 3–8.Search in Google Scholar
Chen, Yiya & Carlos Gussenhoven. 2015. Shanghai Chinese. Journal of the International Phonetic Association 45(3). 321–337. https://doi.org/10.1017/s0025100315000043.Search in Google Scholar
Chomsky, Noam & Morris Halle. 1968. The sound pattern of English. New York: Harper & Row.Search in Google Scholar
Clements, George N. 1988. The role of the sonority cycle in core syllabification. Working Papers at the Cornell Phonetics Laboratory 2. 1–68.Search in Google Scholar
Clements, George N. 2003. Feature economy in sound systems. Phonology 20. 287–333. https://doi.org/10.1017/s095267570400003x.Search in Google Scholar
Clements, George N. & Sylvester Osu. 2002. Explosives, implosives, and nonexplosives: The linguistic function of air pressure differences in stops. In Natasha Warne & Carlos Gussenhoven (eds.), Papers in Laboratory phonology, 299–350. Berlin: De Gruyter Mouton.10.1515/9783110197105.2.299Search in Google Scholar
Cohn, Abigail C. 2007. Phonetics in phonology and phonology in phonetics. Working Papers of the Cornell Phonetics Laboratory 16. 1–31.10.3366/edinburgh/9780748623167.003.0015Search in Google Scholar
Cun, Xi. 2010. The phonetic cause of sound change from voiceless stops to implosives. Explorations in Renaissance Culture 4(1). 33–64. https://doi.org/10.1163/23526963-90000519.Search in Google Scholar
Davis, Stuart. 1982. On the internal organization of syllable constituents. Coyote Papers 3. 3–22.Search in Google Scholar
Ding, Picus Sizhi. 2016. Southern Min (Hokkien) as a migrating language: A comparative study of language shift and maintenance across national borders. Singapore: Springer.Search in Google Scholar
Dong, Tonghe. 2023. Historical phonology of Chinese. London: Routledge.10.4324/9781003411444Search in Google Scholar
Duanmu, San. 1990. A formal study of syllable, tone, stress and domain in Chinese languages. Cambridge, MA: Massachusetts Institute of Technology PhD thesis.Search in Google Scholar
Duanmu, San. 1999. Syllable structure in Chinese. In Harry van der Hulst & Nancy A. Ritter (eds.), The syllable: Views and facts, 477–499. Berlin: De Gruyter Mouton.Search in Google Scholar
Duanmu, San. 2003. The syllable phonology of Mandarin and Shanghai. In Proceedings of the fifteenth North American Conference on Chinese Linguistics, 86–102. Los Angeles: University of Southern California Press.Search in Google Scholar
FJG. 1998. Fujian shengzhi – fangyan juan [Fujian province gazette – dialect volume]. Beijing: Local Chronicles Publishing House.Search in Google Scholar
Fudge, Eric C. 1969. Syllables. Journal of Linguistics 5(2). 253–286. https://doi.org/10.1017/s0022226700002267.Search in Google Scholar
Gao, Ran. 1999. Zhangzhou yinxi jianjie [Introduction to the sound system of Zhangzhou]. In Gao Ran (ed.), Minnan fangyan-Zhangzhouhua yanjiu [Southern Min dialect – studies of Zhangzhou variety], 109–116. Beijing: China Federation of Literary and Art Circles Publishing House.Search in Google Scholar
Giegerich, Heinz J. 1992. English phonology: An introduction. Cambridge: Cambridge University Press.10.1017/CBO9781139166126Search in Google Scholar
Gimson, Alfred C. 1980. An introduction to the pronunciation of English, 3rd edn. London: Edward Arnold.Search in Google Scholar
Gong, Shuxiao & Jie Zhang. 2021. Modelling Mandarin speakers’ phonotactic knowledge. Phonology 38(2). 241–275. https://doi.org/10.1017/s0952675721000166.Search in Google Scholar
Gordon, Matthew K. 2016. Phoneme inventories. In Matthew K. Gordon (ed.), Phonological Typology, 43–82.10.1093/acprof:oso/9780199669004.003.0003Search in Google Scholar
Grossman, Eitan, Elad Eisen, Dmitry Nikolaev & Stevan Moran. 2020. SegBo: A database of borrowed sounds in the world’s languages. In Proceedings of the 12th language resources and evaluation conference, 5316–5322. Marseille, France: European Language Resources Association.Search in Google Scholar
Guo, Jinfu. 2014. Zhangzhou minnanhua [The Southern Min dialect of Zhangzhou]. Zhangzhou: Zhangzhou Library.Search in Google Scholar
Gurevich, Naomi. 2011. Lenition. In Marc van Oostendorp, Colin J. Ewen, Elizabeth Hume & Keren Rice (eds.), The Blackwell companion to phonology, vol. 3, phonological process, 1559–1575. Hoboken, NJ: John Wiley & Sons.Search in Google Scholar
Hayes, Bruce & Colin Wilson. 2008. A maximum entropy model of phonotactics and phonotactic learning. Linguistic Inquiry 39(3). 379–440. https://doi.org/10.1162/ling.2008.39.3.379.Search in Google Scholar
Huang, Yishan. 2018. Tones in Zhangzhou: Pitch and beyond. Canberra: The Australian National University PhD thesis.Search in Google Scholar
Huang, Yishan. 2019. Zhangzhou Southern Min: Rhyme tables, homonyms, heteronyms. München: Lincom Europa.Search in Google Scholar
Huang, Yishan. 2021. Zhangzhou Southern Min: Syllables and phonotactics. München: Lincom Europa.Search in Google Scholar
Huang, Yishan. 2023. Encoding syllables in Zhangzhou Southern Min. In Proceedings of the 37th pacific Asia conference on language, information and computation (PACLIC 37), 164–173.Search in Google Scholar
Huang, Yishan. 2023a. Phonotactic constraints on Zhangzhou onsets. In Proceedings of the 35th annual conference on computational linguistics and speech processing (ROCLING 2023), 179–187.Search in Google Scholar
Huang, Yishan. 2024. Tonal phonotactics in Southern Min. Studia Linguistica 78(3). 431–455. https://doi.org/10.1111/stul.12235.Search in Google Scholar
Huang, Yishan & Gwendolyn Hyslop. 2022. Zhangzhou implosives and their variations. In Proceedings of The 20th Annual Workshop of the Australasian Language Technology Association (ALTA 2022), 122–129.Search in Google Scholar
Hyman, Larry M. 1975. Phonology: Theory and analysis. New York: Holt, Rinehart & Winston.Search in Google Scholar
Jones, Russell. 2009. Chiangchew Hokkiens, the true pioneers in the Nanyang. Journal of the Malaysian Branch of the Royal Asiatic Society 82(2). 39–66.Search in Google Scholar
Joo, Ian & Yu-Yin Hsu. 2024. 2025. Phonotacticon: A cross-linguistic phonotactic database. Linguistic Typology 29(2). 405–431. https://doi.org/10.1515/lingty-2023-0094.Search in Google Scholar
Kager, René. 1999. Optimality theory. Cambridge: Cambridge University Press.10.1017/CBO9780511812408Search in Google Scholar
Keating, Patricia A. 1991. Coronal places of articulation. In Carole Paradis & F. Prunet Jean (eds.), Phonetics and phonology: The special status of conorals, internal and external evidence, 29–48. San Diego: Academic Press.10.1016/B978-0-12-544966-3.50008-7Search in Google Scholar
Kennedy, Robert. 2016. Phonology: A coursebook. Cambridge: Cambridge University Press.10.1017/CBO9781107110793Search in Google Scholar
Kirby, James & Alan Yu. 2007. Lexical and phonotactic effects on wordlikeness judgments in Cantonese. Proceedings of the 16th International Congress of Phonetic Sciences. XVI, 1389–1392.Search in Google Scholar
Kochetov, Alexei. 2011. Palatalization. In Marc van Oostendorp, Colin J. Ewen, Elizabeth Hume & Keren Rice (eds.), The Blackwell companion to phonology, vol. 3, phonological process, 1666–1690. Hoboken, NJ: John Wiley & Sons.Search in Google Scholar
Kwok, Bit Chee. 2019. Southern Min: Comparative phonology and subgrouping. London: Routledge.Search in Google Scholar
Ladefoged, Peter & Ferrari Disner. 2012. Vowels and consonants, 3rd edn. Hoboken, NJ: Wiley-Blackwell.Search in Google Scholar
Ladefoged, Peter & Ivan Maddieson. 1996. The sounds of the world’s languages. Oxford: Blackwell.Search in Google Scholar
Li, Rulong & Rongsong Yao. 2008. Minnan fangyan [Southern Min dialects]. Fuzhou: Fujian People’s Publishing House.Search in Google Scholar
Lin, Baoqin. 1992a. Zhangzhou fangyan cihui (yi) [Zhangzhou vocabularies, part I]. Fangyan [Dialect](2). 151–160.Search in Google Scholar
Lin, Baoqin. 1992b. Zhangzhou fangyan cihui (er) [Zhangzhou vocabularies, part II]. Fangyan [Dialect](3). 230–240.Search in Google Scholar
Lin, Baoqin. 1992c. Zhangzhou fangyan cihui (san) [Zhangzhou vocabularies, part III]. Fangyan [Dialect](4). 310–312.Search in Google Scholar
Lisker, Leigh & Arthur S. Abbramson. 1971. Distinctive features and laryngeal control. Language 47. 767–785. https://doi.org/10.2307/412155.Search in Google Scholar
Ma, Chongqi. 1994. Zhangzhou fangyan yanjiu [Studies on Zhangzhou dialect]. Hongkong, China: Zongheng Publishers.Search in Google Scholar
Maddieson, Ian. 1984. Patterns of sounds. Cambridge: Cambridge University Press.10.1017/CBO9780511753459Search in Google Scholar
Maddieson, Ian. 2013a. Syllable structure. In Matthew S. Dryer & Martin Haspelmath (eds.), The world atlas of language structures online. Leipzig: Max Planck Institute for Evolutionary Anthropology. https://www.isfas.uni-kiel.de/de/linguistik-und-phonetik/studium-und-pruefung/Master/materialien-fuer-auswaertige/sprachtypologie/typologie_04_silbe.pdf (accessed 11 September 2025).Search in Google Scholar
Maddieson, Ian. 2013b. Consonant inventories. In Matthew S. Dryer & Martin Haspelmath (eds.). The world atlas of language structures online. Leipzig: Max Planck Institute for Evolutionary Anthropology. https://wals.info/chapter/1 (accessed 11 September 2025).Search in Google Scholar
Maddieson, Ian. 2023. Investigating the ‘what’, ‘where’ and ‘why’ of global phonological typology. Linguistic Typology 27(2). 245–266. https://doi.org/10.1515/lingty-2022-0076.Search in Google Scholar
Maddieson, Ian, Sébastien Flavier, Egidio Marsico, Christophe Coupé & François Pellegrino. 2013. LAPSyd: Lyon-albuquerque phonological systems database. Interspeech 2013. 3022–3026. International Speech Communication Association (ISCA), 25–29 August 2013, Lyon, France.10.21437/Interspeech.2013-660Search in Google Scholar
Maddieson, Ian & Kristin Precoda. 1990. Updating UPSID. UCLA Working Papers in Phonetics 74. 104–111.Search in Google Scholar
Matisoff, James. 1973. Tonogenesis in Southeast Asia. In Larry Hyman (ed.), Consonant types and tone. Southern California Occasional Papers in Linguistics, 72–95. Los Angeles: University of Southern California Linguistics Program.Search in Google Scholar
McCarthy, John. 1988. Feature geometry and dependency: A review. Phonetica 45. 84–108. https://doi.org/10.1159/000261820.Search in Google Scholar
Medhurst, Walter H. 1832. A dictionary of the Hok-këèn dialect of the Chinese language, according to the reading and colloquial idioms: Containing about 12,000 characters. Macao, China: The Honorable East India Company’s Press.Search in Google Scholar
Michaud, Aronoff & Bonny Sands. 2020. Tonogenesis. In Michaud Aronoff (ed.), Oxford research encyclopedia of linguistics. Oxford, United Kingdom: Oxford University Press. Available at: https://shs.hal.science/halshs-02519305v1.10.1093/acrefore/9780199384655.013.748Search in Google Scholar
Mielke, Jeff. 2008. The emergence of distinctive features. Oxford: Oxford University Press.10.1093/oso/9780199207916.001.0001Search in Google Scholar
Moran, Steven, Eitan Grossman & Annemarie Verkerk. 2021. Investigating diachronic trends in phonological inventories using BDPROTO. Language Resources & Evaluation 55(1). 79–103.10.1007/s10579-019-09483-3Search in Google Scholar
Moran, Steven & Daniel McCloy (eds.). 2019. Phoible 2.0. Jena: Max planck institute for the science of human history. https://phoible.org/ (accessed 11 September 2025).Search in Google Scholar
Mori, Miki. 2023. The acoustic characteristics of implosives and plosive bilabials in Shimaore. Journal of the International Phonetic Association 53(3). 950–976. https://doi.org/10.1017/s0025100322000184.Search in Google Scholar
Nikolaev, Dmitry. 2022. A typology of consonant-inventory gaps. Linguistic Typology 26(1). 161–186. https://doi.org/10.1515/lingty-2020-0128.Search in Google Scholar
Norman, Jerry L. 1991. The Mĭn dialects in historical perspective. In Shiyuan Wang (ed.), Languages and dialects of China, 325–358. Berkeley: Project on Linguistic Analysis, University of California.Search in Google Scholar
Ohala, John. 1993. Coarticulation and phonology. Language & Speech 36. 155–170. https://doi.org/10.1177/002383099303600303.Search in Google Scholar
Pearce, Nick. 2007. Rethinking fairness. Public Policy Research 14(1). 11–22. https://doi.org/10.1111/j.1744-540x.2007.00458.x.Search in Google Scholar
Pike, Kenneth L. 1947. Phonetics: A critical analysis of phonetic theory and a technic for the practical description of sounds. Ann Arbor: University of Michigan Press.Search in Google Scholar
Schlegel, Gustaaf. 1886. Nederlandsch-Chineesch woordenboek met de transcriptie der Chineesche karakters in het Tsiang-tsiu dialekt. Leiden: E.J. Brill.10.1163/9789004599413Search in Google Scholar
Selkirk, Elizabeth. 1982. The syllable. In Harry van der Hulst & Noval Smith (eds.), The structure of phonological representations, Part 2, 337–384. Dordrecht: Foris Publications.Search in Google Scholar
Sew, Jyh W. 2020. Hokkien as a heritage language of citizenry in Singapore. Issues in Language Studies 9(2). 18–32. https://doi.org/10.33736/ils.2393.2020.Search in Google Scholar
Szigetvári, Péter. 2010. Syllables. In Nancy C. Kula, Bert Botma & Kuniya Nasukawa (eds.), The Bloomsbury companion to phonology, 64–94. London: Bloomsbury.Search in Google Scholar
Třísková, Hana. 2011. The Structure of the Mandarin syllable: why, when, and how to reach it. Archiv Orientální 79(1). 99–134.Search in Google Scholar
Xie, Xiulan. 1818. Huiji yasu tong shiwu yin [The fifteen initials accessible to both the refined and the vulgar in one collection]. Shanghai: Shanghai Classics Publishing House.Search in Google Scholar
Yang, Xiuming. 2008. Zhangzhou fangyan shengdiao yu diyu wenhua yanjiu [Studies of tones and regional cultures of Zhangzhou dialects]. Beijing: China Social Sciences Press.Search in Google Scholar
Yi, Li & San Duanmu. 2015. Phonemes, features, and syllables: Converting onset and rime inventories to consonants and vowels. Language and Linguistics 16(6). 819–842. https://doi.org/10.1177/1606822x15602610.Search in Google Scholar
Yu, Alan C. L. 2016. Vowel-dependent variation in Cantonese /s/ from an individual-difference perspective. The Journal of the Acoustical Society of America 139(4). 1672–1690. https://doi.org/10.1121/1.4944992.Search in Google Scholar
Yule, George. 2010. The Study of Language. Cambridge: Cambridge University Press.Search in Google Scholar
Zec, Draga. 1995. Sonority constraints on syllable structure. Phonology 12(1). 85–129. https://doi.org/10.1017/s0952675700002396.Search in Google Scholar
Zee, Wai-sum & Eric Zee. 2014. Chinese phonetics. In C.-T. James Huang, Y.-H. Audrey Li & Andrew Simpson (eds.), The handbook of Chinese linguistics, 318–434. Chichester: Wiley Blackwell.Search in Google Scholar
Zhang, Jisheng. 2006. The phonology of Shaoxing Chinese. Leiden: Leiden University PhD thesis.Search in Google Scholar
Zhou, Changji. 2006. Minnan fangyan da cidian [A dictionary of Southern Min dialects]. Fuzhou: Fujian People’s Publishing House.Search in Google Scholar
ZZG. 1999. Zhangzhou shizhi – fangyan [Zhangzhou city gazette – dialect]. Beijing: China Social Sciences Press.Search in Google Scholar
© 2025 the author(s), published by De Gruyter and FLTRP on behalf of BFSU
This work is licensed under the Creative Commons Attribution 4.0 International License.