Abstract
In this study, we look at the distribution of silent pauses within existing multi-language corpora to see whether their location and duration correlate with clause boundaries. Our study is based on data of seven languages from Multi-CAST. We supplemented the original clause boundary annotations with information about silent pauses in order to investigate the alignment of clause boundaries and pausing. We find a gradient association between clause boundary strength and the probability of a pause and a two-way distinction for pause duration within clauses and at clause boundaries.
The right word may be effective, but no word was ever as effective as a rightly timed pause.
– Mark Twain
1 Introduction
Linguists primarily conceptualize language as a continuous flow of speech, broken up by pauses. These pauses can be thought of as temporary “breaks” in which the transmission of content between interlocutors is cessated. They are typically understood as either filled or silent, depending on whether interlocutors perceive the break to contain a voiced section (e.g. um or uh in English) or silence (Belz and Trouvain 2019; Clark and Fox Tree 2002; Zellner 1994; see also Goh et al. 2023). This paper focuses on silent pauses, which go beyond the pure physiological need to breathe; rather, pausing “operates in happy synchrony with some basic functional segmentations of discourse” (Chafe 1994: 57).
The use of silent pauses has been studied from various perspectives in linguistics. From a psycholinguistic angle, silent pauses have been examined for their function of marking prosodic structures which support syntactic parsing and disambiguation, especially in early L1 acquisition (e.g. Christophe et al. 2008; de Carvalho et al. 2017; Hawthorne and Gerken 2014; Speer and Ito 2009). Much theoretical work on the prosody-syntax interface has similarly focused on the association of silent pauses with syntactic structure, notably the co-occurrence of silent pauses with syntactic boundaries (e.g. Goldman-Eisler 1972; Nespor and Vogel 2007; Selkirk 1984, 2011; Truckenbrodt 1995, 2007; Watson and Gibson 2004). Conversation analysis and related disciplines, on the other hand, have examined the use of silent pauses as markers of turn-taking in conversational discourse (e.g. Sacks et al. 1974; Taboada 2006; Weilhammer and Rabold 2003; Zellner 1994). The role of pausing in the perception of fluency has also been investigated (e.g. Belz et al. 2017; Bosker et al. 2013; de Jong 2016; Kahng 2018), typically comparing differences in pausing between native and non-native speakers. Silent pauses have also been studied for their contribution to rhetorical style, as well as to sociolinguistic and cross-cultural variation (e.g. Duez 1982; Kendall 2013; Schleef 2019; Šturm and Volín 2023; Tannen 1985; Walker 1985).
In this paper, we concentrate on the association between syntactic boundaries and pause location as well as pause duration. Using spontaneous speech data from seven typologically distinct languages, we explore the distribution of silent pauses to see whether their location and duration correlate with higher-level syntactic groupings, namely main and dependent clause boundaries. Our results point to a tendency for main clause boundaries to be accompanied by silent pauses across languages, while dependent clause boundaries are less likely to co-occur with pauses. Our results confirm previous findings in that pauses at clause boundaries tend to be longer than pauses within clauses across all languages in our dataset.
The paper is structured as follows. In Section 2 we provide an overview of previous research into the relation of pausing and clause boundaries. We then introduce our data and methods in Section 3. Section 4 presents the results of the present study; we discuss and contextualize them within the wider debate around pausing in Section 5. Section 6 concludes this study.
2 Silent pauses and syntactic structure
2.1 Pauses and the syntax-prosody interface
Silent pauses have often been recognized as one of the most important signals of a prosodic boundary (Du Bois et al. 1992: 17, Himmelmann et al. 2018, Himmelmann 2022: 718, Ladd 2008: 288). They have also long been observed to pattern with syntactic boundaries, especially with clause boundaries (e.g. Cooper and Paccia-Cooper 1980; Goldman-Eisler 1968; Nespor and Vogel 2007). However, pauses do not necessarily co-occur with major syntactic boundaries, that is, they can be observed outside of syntactically determined positions, including word-internally (Bundgaard-Nielsen and Baker 2020; Paschen 2023). Such pauses are usually attributed to performance; for example, for hesitation, processing, style (cf. Nespor and Vogel 2007: 189, 219), and breathing, especially in faster speech (cf. Grosjean and Collins 1979). Syntactic structures are not always sufficient to predict the prosodic structure of an utterance, and several prosodic realizations of the same syntactic unit are possible.
There are thus various reasons to assume a certain degree of independence between syntactic and prosodic structures, while allowing for a high degree of interdependence at the same time. Such a position on the relation between syntax and prosody is captured by approaches which assume an intermediary prosodic structure to which morphosyntax is mapped (see e.g. Nespor and Vogel 2007; Selkirk 2011, 1984). Our study is compatible with the theoretical assumptions of this line of research, but we take an empirical approach by using cross-linguistic corpus data of spontaneous speech to examine to what extent pauses are associated with syntactic boundary positions.
2.2 Pause types and functions
There are different ways in which silent pauses can be distinguished, such as according to where they occur. Previous studies have distinguished between pauses at sentence, clause, and phrase boundaries as well as pauses that do not coincide with any major syntactic boundaries. Example 1 illustrates different possible locations of silent pauses in English in an excerpt from a spontaneous spoken text from the DoReCo corpus (Schiborr 2022). It contains four consecutive sentences split up for convenience, so that each line corresponds to a chunk of speech delimited by a silent pause whose length is indicated in parentheses.
Silent pauses in spontaneous spoken English[1] |
(0298-0300_DoReCo_doreco_sout3282_mc_english_kent02_b) |
So I messed the skin up. (1430 ms) |
It wadn’t no good then, (670 ms) |
’cause I was close to him, you see, blowed [false start] (70 ms) |
a great hole in him. (2503 ms) |
Catched a deer in a (645 ms) |
snare one day. I went down there, as I told you, about how I always […] |
The pauses following (1a) and (1d) coincide with main clause boundaries. The pause at the end of (1b) occurs at a dependent clause boundary. The three remaining rows include pauses that do not coincide with clausal or phrasal boundaries. In (1c), a pause occurs between a transitive verb and its object, where it follows a false start. A similar situation can be seen in the transition between (1e) and (1f), where a pause separates a determiner from its head noun.
Silent pauses such as those in (1a), (1b), and (1d) are typically understood within psycholinguistics as helping to facilitate the parsing of speech into syntactic units. This has been shown for both adult L1 language use (e.g. Frazier et al. 2006; Kjelgaard and Speer 1999; Petrone et al. 2017; Schafer et al. 2000) and for L1 and L2 language acquisition (e.g. Christophe et al. 2008; Goad et al. 2021; Hawthorne and Gerken 2014; Speer and Ito 2009). At the same time, these pauses also help the speaker to prepare and plan further utterances (e.g. Cooper and Paccia-Cooper 1980; Ferreira 1991; Fuchs et al. 2013; Goldman-Eisler 1968; Krivokapić 2007; Krivokapić et al. 2020). They can also be used for interactional purposes, such as yielding the floor during a conversation (Levelt 1993; Maclay and Osgood 1959; Wennerstrom and Siegel 2003).
Other pauses, such as the clause-internal ones in (1c), (1e), and (1f), are argued to result from high processing demands in situations in which speech planning is comparatively difficult. Planning difficulties can be caused by various factors, such as a long or complex following syntactic or prosodic unit, or difficulties in lexical access with low-probability items (see e.g. Beattie and Butterworth 1979; Belz et al. 2017; Goldman-Eisler 1961a; Hartsuiker and Notebaert 2010). Clause-internal pauses may also be associated with attempts to hold the floor during an interaction (Himmelmann 2014; Local and Kelly 1986; Wennerstrom and Siegel 2003).
As we work with corpus data in the present study, we cannot test for specific cognitive processes that would explain pausing in a given context. As such, we do not distinguish between pauses that occur for hesitation, planning, or stylistic reasons. We instead distinguish between three types of pauses based on their location in relation to syntactic boundaries: pauses within clauses, pauses at main clause boundaries, and pauses at dependent clause boundaries (see Section 3 for more details and examples). This distinction allows us to measure the association of different clausal contexts with pauses, and to assess how robust the patterns found are across languages (see Section 4.2).
2.3 Pause duration
Studies into the distribution of pause duration often show a distinction between brief pauses and longer pauses, with longer pauses typically found at sentence boundaries and shorter pauses at dependent clause boundaries and within clauses.[2] In an early study, Goldman-Eisler (1972) reports for spontaneous spoken English data that pauses between sentences mostly have durations above 500 ms, while pauses between clauses (within sentences) tend to be shorter than 500 ms. Grosjean and Deschamps (1972) report comparable pause durations for other French and English datasets. Fletcher et al. (2004) similarly find that pauses within and between words in Dalabon (Gunwinyguan, Australia) fall into two groups in terms of duration: medium pauses (around 500 ms) and long pauses (around 1,000 ms or more).
Pause distribution and duration in spontaneous French data show similar trends, even when more detailed distinctions of pause type are made. Duez (1982: 24) finds that pauses within phrases (∅ = 401 ms) and within clauses (∅ = 632 ms) tend to be shorter than pauses between clauses (∅ = 802 ms) in casual interviews. In a more recent study, Candea (2000) examines French spontaneous speech data and reports that pauses at sentence boundaries have average durations of 900 ms, while pauses at clause boundaries within sentences average 600 ms (Candea 2000: 168). In contrast, hesitation pauses within clauses are reported to have an average duration of 560 ms (Candea 2000: 181).[3] Campione and Véronis (2002) focus on pause duration in French spontaneous speech data and read speech from five European languages. They find a trimodal distribution of pauses in terms of duration and distinguish between brief (<200 ms), medium (200–1,000 ms) and long (>1,000 ms) pauses. Read speech of all five languages only featured brief and medium pauses, with long pauses occurring in addition to short and medium pauses in spontaneous speech.
While the exact details differ across studies, the picture emerges that we can distinguish between different pause types in terms of their duration, and that pauses at clause boundaries tend to be longer than pauses within clauses, at least for English and French. We assess to what extent this association of pause location and duration is reflected in our data, how it relates to previous findings, and how robust the patterns are across languages (see Section 4.3).
3 Data and annotation
3.1 Dataset
In our study, we investigated how pauses were distributed across seven typologically distinct languages in the Multi-CAST corpora (Haig and Schnell 2021). The languages are listed in Table 1 together with information on the number of different speakers in the corpus, the number of utterances (as defined by Multi-CAST), and the genres of texts used. We chose a subset of data which consisted of monological data, specifically mostly autobiographical narratives (AN) and traditional narratives (TN), with the addition of one stimulus-based narrative (SN) in Tondano. As can be seen in Table 1, each corpus ranges from 160 to 1,300 utterances from two to six different speakers. We used the WAV files along with the accompanying annotations in EAF format provided in Multi-CAST (Haig and Schnell 2021).
Overview of the dataset (AN = autobiographical narrative; SN = stimulus-based narrative; TN = traditional narrative).
Language | No. of speakers | No. of utterances | Genre |
---|---|---|---|
Arta (Austronesian) | 3 | 227 | AN, TN |
Kimoto (2019) | |||
Nafsan (Austronesian) | 3 | 163 | TN |
Thieberger and Brickell (2019) | |||
Teop (Austronesian) | 4 | 1,019 | TN |
Mosel and Schnell (2015) | |||
Tondano (Austronesian) | 6 | 1,254 | SN, AN |
Brickell (2016) | |||
Mandarin (Sino-Tibetan) | 3 | 845 | TN |
Vollmer (2020) | |||
Tabasaran (Nakh-Daghestanian) | 2 | 629 | TN, AN |
Bogomolova et al. (2021) | |||
Northern Kurdish (Indo-European) | 2 | 555 | TN |
Haig et al. (2015) |
3.2 Clause annotation
The majority of spontaneous speech corpora are created for language documentation purposes, and as such, differ as to how elaborate their morphosyntactic annotation is. Information on clausal boundaries below the utterance level is usually not explicitly annotated for, Multi-CAST being an important exception.
We took the annotation of clausal boundaries from GRAID (Grammatical Relations and Animacy in Discourse) annotations in Multi-CAST (see Haig and Schnell 2014). In GRAID, a clause is generally defined as a predicate with its arguments, with more language-specific criteria for how to deal with, for example, multi-verb predicates (Haig and Schnell 2014: 45). Main clauses are signalled with an annotation of “##” at their left edge, while dependent clauses are annotated with “#” at the left edge. The end of a dependent clause is generally not marked explicitly, as it usually coincides with the beginning of a new main clause. However, the end of a centre-embedded dependent clause that does not correspond to the end of a main clause is additionally signalled by “%” in GRAID.
Examples (2)–(4) show how clause boundaries are annotated in GRAID (see Haig and Schnell (2014: §2.6, §4.1) for more details).[4] Example (2) consists of a main clause in Mandarin, marked as such by the initial “##”.[5]
Mandarin (mandarin_lzh_0007-0008) | |||||
## ránhòu | yě | jiù | shì | súchēng=de | zhù yuánwài ## |
## then | also | adv | cop | popular_name=mod | Zhu landlord ## |
‘He was called Zhu landlord by people.’ |
In example (3), we see an utterance from the Northern Kurdish corpus, consisting of a main clause with a dependent relative clause. In this case, the end of the relative clause corresponds to the end of the main clause and does not receive an explicit annotation.
Northern Kurdish (n_kurd_muserz01_003-004) | ||||
## kur-ek-î | wî | hebû-ye | # nav-ê kur-ê | Mihemed |
## son-indf-ez | 3sg.obl | exist.pst-pfv.3sg | # name-ez son-ez 3sg.obl | Mihemed |
bû-ye | ## | |||
cop.pst-pfv.3sg | ## | |||
‘He had a son, whose name was Mihemed.’ |
The Nafsan example in (4) shows a somewhat more complex utterance. It is made up of three main clauses, the first of which contains an additional dependent clause. Note that the end of the dependent clause does not coincide with the end of the main clause, which is why it is additionally marked with the annotation by “%” in (4a).
Nafsan (nafsan_tafra_0001) |
## # | selwan | tu=paakor | nametp̃g | ntau | % ra=to | tu teesa |
## # | while | 1pl.in.rs=arrive | end | year | % 1du.ex.rs=habit | give child |
tete | nanromien | |||||
some | present | |||||
‘When we got to the end of the year we would give the children a present … ’ |
## ru=to | ni | apu | go | atien | negar-wes |
##3pl.rs=stay | with | grandfather | and | grandmother | 3pl.poss-3sg.obl |
‘… for them to take to their Apu and Ati … ’ |
## nanromien | sees | pan | tu-e-r | ki-n | Ertap ## |
## present | small | go | give-ts-3pl.o | prep-dst | Eratap ## |
‘… a small present they could give to them at Eratap.’ |
We used this clause information from the original GRAID annotation to distinguish between main clause boundaries (cb_main), dependent clause boundaries (cb_dep), and clause-internal positions (no_cb). Main and dependent clause boundaries as shown in (2)–(4) were directly taken from the annotation. The remaining annotations were classified as clause-internal.
3.3 Pause annotation
Spontaneous speech corpora often do not include annotations for phonological or prosodic properties.[6] As such, the re-use of the Multi-CAST corpora for this paper required us to additionally annotate silent pauses in order to explore any potential associations between pauses and clause boundaries. The pause annotation process we followed is described in detail in the supplementary document “data-extraction-processing.pdf”; we provide a brief summary here and some illustrative examples for the reader’s orientation.[7]
We firstly annotated silent pauses using a silence recognizer and followed this with a round of manual checks and corrections. This resulted in an annotation of pause and speech segments. We then classified both pause and speech segments into one of three types (cb_main, cb_dep, no_cb) according to whether or not a main or dependent clause boundary fell into their duration. This was done by automatically comparing the time stamps of the pre-existing clause boundary annotations and those of the newly annotated silent pause and speech segments. Since the Multi-CAST data is only aligned with the acoustic signal on the utterance level and not on the clausal level, we could not automatically specify the location of a pause occurring within the utterance further. As described in more detail in Section 4 of the supplementary materials, we performed selected manual checks of the position of clausal boundaries in relation to pauses, which suggested that the automatic extraction works sufficiently well for our purposes.
Tables 2–6 show examples of the resulting pause annotation for examples (2)–(4) from Section 3.2. The first row shows the utterance including the original clause boundary annotation from Multi-CAST. The second “pause” row then shows whether or not a silent pause was detected. If a pause was detected, the “type” row indicates which type of pause we are dealing with, and the “dur” row shows its duration. In the Mandarin example in Table 2, we see that both the initial and final clause boundaries coincide with a pause, which we annotated as cb_main. In addition, we find another silent pause occurring within the clause, which was marked as no_cb.
Pause annotation for Mandarin example (2).
## | ránhòu yě jiù shì súchēng=de zhù yuánwài | ## | |
---|---|---|---|
Pause | ✓ | ✓ | ✓ |
Type | cb_main | no_cb | cb_main |
dur | 124 ms | 235 ms | 226 ms |
Pause annotation for Northern Kurdish example (3).
## | kur-ek-î wî hebû-ye | # | nav-ê kur-ê wî Mihemed bû-ye | ## | |
---|---|---|---|---|---|
Pause | ✓ | ✗ | ✗ | ✗ | ✓ |
Type | cb_main | cb_main | |||
dur | 2,343 ms | 948 ms |
Pause annotation for Nafsan example (4a).
## # | selwan tu=paakor nametp̃ag ntau | % | ra=to tu teesa tete nanromien | |
---|---|---|---|---|
Pause | ✗ | ✗ | ✓ | ✓ |
Type | cb_dep | no_cb | ||
dur | 1790 ms | 482 ms |
Pause annotation for Nafsan example (4b).
## | ru=to ni apu go atien negar-wes | |
---|---|---|
Pause | ✓ | ✗ |
Type | cb_main | |
dur | 1,125 ms |
Pause annotation for Nafsan example (4c).
## | nanromien sees pan tu-e-r ki-n Ertap | ## | |
---|---|---|---|
Pause | ✓ | ✓ | ✓ |
Type | cb_main | no_cb | cb_main |
dur | 1,692 ms | 533 ms | 1,237 ms |
In the Northern Kurdish utterance in Table 3, the main clause boundaries also coincide with silent pauses. The dependent clause boundary between the two main clauses, however, is not accompanied by a silent pause. In addition, no pauses occur within the two clauses shown in Table 3. The Nafsan example from (4) is shown in Tables 4 to 6. Table 4 contains the first main clause together with its dependent clause. Here, silent pauses only occur at the end of the dependent clause (cb_dep) and within the second part of the main clause (no_cb). Table 5 shows that another silent pause coincides with the boundary between the two main clauses in Tables 4 and 5. Finally, we see in Table 6 that both clausal boundaries co-occur with a pause (cb_main), and that the main clause itself also contains a silent pause (no_cb).
4 Results
4.1 Overall tendencies
We first inspect the distribution of clause boundaries. Table 7 shows how the three clausal contexts are distributed across speech and pause segments (“No. of segments”) in the seven languages. By segments we refer to contiguous chunks of speech or silent pause in the speech signal. Pause/speech segments with no clause boundaries (no_cb) make up around 55–70 % of annotations in all languages, with an average of 61 % across the dataset. The proportions of cb_main and especially cb_dep contexts are more language-dependent. For example, dependent clause boundaries make up 23 % of all boundaries in Northern Kurdish, while they only amount to 4 % in Mandarin.
Proportion of the three clausal contexts in speech and pause segments.
Language | no_cb | cb_main | cb_dep | No. of segments |
---|---|---|---|---|
Arta | 0.66 | 0.20 | 0.14 | 1,205 |
Mandarin | 0.62 | 0.33 | 0.04 | 3,259 |
Nafsan | 0.61 | 0.27 | 0.11 | 1,716 |
Northern Kurdish | 0.56 | 0.21 | 0.23 | 4,112 |
Tabasaran | 0.57 | 0.22 | 0.21 | 3,798 |
Teop | 0.59 | 0.33 | 0.08 | 3,156 |
Tondano | 0.70 | 0.24 | 0.06 | 3,717 |
Average | 0.61 | 0.26 | 0.13 |
Figure 1 builds on the distribution of clausal contexts shown in Table 7 by including information about how they relate to pause and speech segments in the seven languages. Comparing the distribution of the three clausal contexts in pause and speech segments (left vs. right bars), we see clearly that pause segments contain a higher proportion of clause boundaries than speech segments in all languages. The exact proportions differ across languages, with clausal boundaries co-occurring with 75 % of all pauses in Northern Kurdish but with only around 40 % of all pauses in Tondano.

The raw distribution of clausal contexts across speech and pause segments.
The overall distribution of clausal contexts across our dataset suggests that clause boundaries more often co-occur with a pause than with speech. However, in Tondano and Arta, pauses are more likely to occur within clauses than in either of the clause boundary conditions. The probability of pauses co-occurring with dependent clause boundaries is also substantially lower in Tondano, Mandarin, and Teop than in other languages, likely due to the overall low proportion of dependent clause boundaries in these languages to begin with (see Table 7). This suggests that the presence of a clausal boundary does not necessarily entail the occurrence of a silent pause and that the association between clause boundaries and pauses is indeed more complex than a simple one-to-one relationship.
4.2 The association of clause boundaries with pauses
The first question that this paper addresses is to what extent the different clausal contexts are associated with silent pauses. To do this, we fitted a Bayesian logistic regression model to assess the probability of pauses across the three types of clausal contexts. We added varying intercepts as well as varying slopes over clausal contexts for individual speakers.[8] The model was fitted using Bayesian methods with Stan (Carpenter et al. 2017) and brms (Bürkner 2017) in R (R Core Team 2021).
Figure 2 shows the conditional effects of clausal contexts on the probabilities of pauses. The points in Figure 2 correspond to the mean of the probability density of the predictions; the whiskers correspond to the 95 % uncertainty intervals. This means that we can be 95 % confident that the actual probability will lie within this interval based on the data and the model.

Predicted probability of pauses across clausal contexts.
While we see a certain degree of variation across the seven languages, the overall pattern is robust. The probability of pauses occurring gradually increases from environments within clauses (no_cb) to dependent clause boundaries (cb_dep) and main clause boundaries (cb_main). For main clause boundaries, the probability of co-occurrence with pauses is between 0.5 and just over 0.75 in all languages, the lowest prediction being 0.53 for Tondano and the highest being 0.76 for Arta. For dependent clause boundaries, the mean posterior co-occurrence probabilities for all languages are slightly lower, ranging from 0.29 in Teop to 0.57 in Nafsan. For the no clause boundary context, we find the overall lowest probability of pause co-occurrence, from 0.14 in Teop to 0.28 in Tondano. We furthermore generally find a higher level of cross-linguistic variation in the cb_main and cb_dep contexts than in the no_cb condition.
The robust patterning of pause co-occurrence probabilities across our dataset illustrates a clear trend. As the level of the syntactic boundary juncture increases, so too does the overall predicted probability of a pause co-occurring. Looking at the uncertainty intervals, we find that they overlap between “neighbouring” categories, that is, within clauses and dependent clause boundaries, as well as dependent and main clause boundaries. Only Nafsan differs in that both types of clause boundaries have a similarly higher probability of pauses compared to boundaries within clauses. Still, the overall results suggest that there is no cross-linguistically robust, binary distinction either between main clause boundaries and other contexts or between non-clausal boundaries and clausal boundaries. Instead, we found a gradual increase in pause probability from no_cb to cb_dep and cb_main.
4.3 Pause durations
The second question that this paper aims to address concerns pause durations in relation to their location. Figure 3 shows the raw distribution of pause durations across the seven languages of the dataset. The general trends are fairly robust. The majority of silent pauses in the dataset have a duration up to 750 ms, few pauses are longer than 1,000 ms, and pauses longer than 2,500 ms are rare. However, Arta, Nafsan, and Northern Kurdish have a somewhat larger portion of silent pauses longer than 750 ms.

Observed pause duration by language.
Furthermore, the average pause duration is more or less comparable across languages. As can be seen in Figure 3, the median pause duration (the solid vertical line) is between 500 ms and 1,000 ms for all languages, apart from Mandarin. However, we see more variation of the mean pause duration (dotted black line) across languages. Arta, Nafsan, and Tondano show a larger difference between median and mean pause duration, as these three languages have more longer pauses than the other languages of the dataset. Still, we can say that the distribution of pause length, albeit not identical, is comparable across the seven languages.
Figure 4 shows the distribution of pause duration together with median and mean values across the three clausal contexts with data from all seven languages pooled together. These distributions along with the median and mean suggest a difference in pause duration associated with the absence or presence of a clause boundary. Most pauses within clauses have a length of below 750 ms, with only 15 % of all pauses being longer than 1,000 ms. In contrast, we find quite a substantial proportion of pauses with durations above 1,000 ms up to 3,000 ms for both types of clause boundaries, namely 40 % for cb_dep and 38 % for cb_main, respectively. This binary difference between pauses within clauses and pauses at clausal boundaries is further reflected in their median pause durations: 537 ms for no_cb versus 796 ms and 788 ms for cb_dep and cb_main, respectively.

Observed pause duration by clausal context.
To test the robustness of the observations from the raw distributions, we fitted a Bayesian regression model to predict the duration of pauses from clausal contexts and languages while controlling for the effects of single speakers.[9] Figure 5 shows the conditional effects of clausal contexts and languages on pause durations. The model predictions support the previous observation of a two-way distinction of pauses based on their duration. Pause durations are predicted to be between 524 ms and 781 ms very consistently across languages when no clause boundary is present. At main and dependent clause boundaries, the average predicted pause duration varies to a greater extent within and across languages. Pauses are predicted to be between 1,000 ms and 1,300 ms for both types of clausal boundaries in all languages except for Arta. Arta exhibits shorter pause durations in all three contexts compared to the other languages. However, the relative difference between the boundaries within clauses (525 ms) and clausal boundaries (802 ms and 814 ms) is maintained. We should be careful before concluding that this indeed reflects a real cross-linguistic difference, as the Arta subcorpus is comparatively small (see Table 1).

Estimated pause duration by clausal context.
Larger datasets with data from more speakers are needed to be more certain about what seems to be a language-specific preference towards shorter pauses in our data.
5 Discussion
Our results evidence a complex interaction between pausing and clause boundaries. Pauses occur both within clauses and at clause boundaries in all seven languages examined in this study. However, the probability of a pause occurring increases as the syntactic boundary increases in strength. In Section 4.2, we found a gradient increase for all languages except Nafsan in the probability of a pause occurring within clauses to pauses at dependent clause boundaries and at main clause boundaries. Only Nafsan showed evidence for a potential binary opposition between no clausal boundaries and the two clausal boundary contexts. The main trend in our data mirrors previous findings for English and French. For instance, Goldman-Eisler (1972: 105) found that temporal integration decreases as clause boundary independence increases in spontaneous spoken English, with 77.9 % of all main clauses separated by pauses greater than 500 ms, but less than a third of dependent clause boundaries being this long. Grosjean and Deschamps (1972: 146) report that in their French data, over 60 % of all silent pauses occur at main clause boundaries, with approximately 14 % of the remaining pauses occurring at lower-level syntactic boundaries; Candea (2000: 166) similarly found that most of the pauses occur in her data at main clause boundaries with fewer pauses occurring at dependent clause boundaries and within clauses.
In line with previous findings, our study suggests that there is a cross-linguistically robust increase of the probability of pauses to occur within clauses, at dependent clause boundaries, and at main clause boundaries. This finding is particularly remarkable given the inherent variation present in the dataset and that the languages included in this study feature a great deal of variability in what kinds of dependent clauses are present and how they are used by speakers.
Our second investigation in Section 4.3 found a two-way opposition in the duration of pauses occurring within clauses and those at clause boundaries. The attested lengths in this study match those found for Dalabon (Fletcher et al. 2004), and those of medium and long pauses in French (Campione and Véronis 2002). However, these comparisons should be taken with a grain of salt, as the methodological definitions of pauses and locations, the choice of pause threshold, and the genres of texts involved all differ to different extents across the studies (see also Rochester 1973). Our results, however, fit in with the general trend reported in the literature on pausing whereby silent pauses at clause boundaries tend to be longer than pauses that do not align with major syntactic boundaries such as main clause boundaries (Candea 2000; Duez 1982; Goldman-Eisler 1972; Grosjean and Deschamps 1972).
In addition, we find a robust pattern across the languages surveyed here that there is no substantial difference between pause duration at main and dependent clause boundaries. This fits in with the findings reported by Candea (2000: 166–167), who notes that pauses occurring both at dependent clause boundaries (“inter-constituant”) and at boundaries between two coordinated or juxtaposed clauses (“fin de proposition”) are significantly shorter than pauses at main clause boundaries which serve to separate two discourse segments (“fin d’énoncé”). This lack of difference between certain types of main and dependent clause pause duration suggests that pauses do not simply reflect syntactic structures but also help to structure discourse more generally. Our study reports the first evidence of this type for silent pauses in non-European languages, joining other recent work examining the relationship between filled pauses and discourse structure in typologically diverse languages (e.g. Kaland and Bardají 2023).
6 Conclusions
Using naturalistic and typologically diverse spontaneous speech data, we found that clause boundaries were associated with pauses, with stronger boundaries more likely to co-occur with a pause. Pause duration showed a two-way distinction between pauses at clause boundaries and those which occurred within a clause. Furthermore, this study showed how existing grammatically annotated corpora can be used in the investigation of other research questions. With minimal extra annotation, we were able to conduct a first investigation into the role of silent pausing at the prosody-syntax interface in a range of languages not commonly represented in work on pause distribution and duration.
Funding source: Deutsche Forschungsgemeinschaft
Award Identifier / Grant number: 406074683
Acknowledgments
We would like to thank both Wifek Bouaziz and Sharon Kromer for their work in helping us manually align the data. A further thanks to those who gave enthusiastic feedback about this project at our departmental reading group at Freiburg, the Language Documentation and Theory 6 workshop on intonation, and the 14th International Conference of the Association for Linguistic Typology, as well as for the detailed suggestions from two anonymous reviewers which helped improve this paper substantially.
-
Research funding: This work was supported by the Deutsche Forschungsgemeinschaft (406074683) and the Open Access fees were covered based on an agreement between the University of Freiburg and De Gruyter.
References
Beattie, Geoffrey W. & Brian L. Butterworth. 1979. Contextual probability and word frequency as determinants of pauses and errors in spontaneous speech. Language and Speech 22(3). 201–211. https://doi.org/10.1177/002383097902200301.Search in Google Scholar
Belz, Malte, Simon Sauer, Anke Lüdeling & Christine Mooshammer. 2017. Fluently disfluent? Pauses and repairs of advanced learners and native speakers of German. International Journal of Learner Corpus Research 3(2). 118–148. https://doi.org/10.1075/ijlcr.3.2.02bel.Search in Google Scholar
Belz, Malte & Jürgen Trouvain. 2019. Are “silent” pauses always silent? In Sasha Calhoun, Paola Escudero, Marija Tabain & Paul Warren (eds.), 19th International Congress of Phonetic Sciences (ICPhS), 2744–2748. Canberra: Australasian Speech Science and Technology Association Inc.Search in Google Scholar
Bogomolova, Natalia, Dmitry Ganenkov & Nils N. Schiborr. 2021. Multi-CAST Tabasaran. In Geoffrey Haig & Stefan Schnell (eds.), Multi-CAST: Multilingual corpus of annotated spoken texts. Version 2108. Bamberg: University of Bamberg.Search in Google Scholar
Bosker, Hans Rutger, Anne-France Pinget, Hugo Quené, Ted Sanders & Nivja H. de Jong. 2013. What makes speech sound fluent? The contributions of pauses, speed and repairs. Language Testing 30(2). 159–175. https://doi.org/10.1177/0265532212455394.Search in Google Scholar
Brickell, Timothy. 2016. Multi-CAST Tondano. In Geoffrey Haig & Stefan Schnell (eds.), Multi-CAST: Multilingual corpus of annotated spoken texts. Version 2108. Bamberg: University of Bamberg.Search in Google Scholar
Bundgaard-Nielsen, Rikke L. & Brett J. Baker. 2020. Pause acceptability indicates word-internal structure in Wubuy. Cognition 198. 104167. https://doi.org/10.1016/j.cognition.2019.104167.Search in Google Scholar
Bürkner, Paul-Christian. 2017. brms: An r package for bayesian multilevel models using stan. Journal of Statistical Software 80(1). 1–28. https://doi.org/10.18637/jss.v080.i01.Search in Google Scholar
Campione, Estelle & Jean Véronis. 2002. A large-scale multilingual study of silent pause duration. In Bernard Bel & Isabelle Marlien (eds.), Proceedings of Eurospeech 2002, 199–202. Aix-en-Provence: Laboratoire Parole et Langage.10.21437/SpeechProsody.2002-35Search in Google Scholar
Candea, Maria. 2000. Contribution a` l’e´tude des pauses silencieuses et des phe´nome`nes dits “d’he´sitation” en franc¸ais oral spontane´: Etude sur un corpus de re´cits en classe de franc¸ais. Paris: Université Paris III dissertation.Search in Google Scholar
Carpenter, Bob, Andrew Gelman, Matthew Hoffman, Daniel Lee, Ben Goodrich, Michael Betancourt, Marcus Brubaker, Jiqiang Guo, Peter Li & Allen Riddell. 2017. Stan: A probabilistic programming language. Journal of Statistical Software 76(1). 1–32. https://doi.org/10.18637/jss.v076.i01.Search in Google Scholar
Chafe, Wallace L. 1994. Discourse, consciousness, and time: The flow and displacement of conscious experience in speaking and writing. Chicago: University of Chicago Press.Search in Google Scholar
Christophe, Anne, Séverine Millotte, Savita Bernal & Jeffrey Lidz. 2008. Bootstrapping lexical and syntactic acquisition. Language and Speech 51(1-2). 61–75. https://doi.org/10.1177/00238309080510010501.Search in Google Scholar
Clark, Herbert H. & Jean E. Fox Tree. 2002. Using uh and um in spontaneous speaking. Cognition 84(1). 73–111. https://doi.org/10.1016/S0010-0277(02)00017-3.Search in Google Scholar
Cooper, William E. & Jeanne Paccia-Cooper. 1980. Syntax and speech. Cambridge, MA: Harvard University Press.10.4159/harvard.9780674283947Search in Google Scholar
de Carvalho, Alex, Isabelle Dautriche, Isabelle Lin & Anne Christophe. 2017. Phrasal prosody constrains syntactic analysis in toddlers. Cognition 163. 67–79. https://doi.org/10.1016/j.cognition.2017.02.018.Search in Google Scholar
de Jong, Nivja H. 2016. Predicting pauses in L1 and L2 speech: The effects of utterance boundaries and word frequency. International Review of Applied Linguistics in Language Teaching 54(2). 113–132. https://doi.org/10.1515/iral-2016-9993.Search in Google Scholar
Du Bois, John, Stephan Schuetze-Coburn, Danae Paolino & Susanna Cumming. 1992. Discourse transcription (Santa Barbara Papers in Linguistics 4). Santa Barbara, CA: University of California Santa Barbara, Department of Linguistics.Search in Google Scholar
Duez, Danielle. 1982. Silent and non-silent pauses in three speech styles. Language and Speech 25(1). 11–28. https://doi.org/10.1177/002383098202500102.Search in Google Scholar
Ferreira, Fernanda. 1991. Effects of length and syntactic complexity on initiation times for prepared utterances. Journal of Memory and Language 30(2). 210–233. https://doi.org/10.1016/0749-596X(91)90004-4.Search in Google Scholar
Ferreira, Fernanda. 1993. Creation of prosody during sentence production. Psychological Review 100(2). 233–253. https://doi.org/10.1037/0033-295X.100.2.233.Search in Google Scholar
Fletcher, Janet, Nicholas Evans & Belinda Ross. 2004. Pausing strategies and prosodic boundaries in Dalabon. In Proceedings of the 10th Australian International Conference on Speech Science & Technology, 436–439. Sydney: Macquarie University.Search in Google Scholar
Frazier, Lyn, Katy Carlson & Charles Clifton. 2006. Prosodic phrasing is central to language comprehension. Trends in Cognitive Sciences 10(6). 244–249. https://doi.org/10.1016/j.tics.2006.04.002.Search in Google Scholar
Fuchs, Susanne, Caterina Petrone, Jelena Krivokapić & Philip Hoole. 2013. Acoustic and respiratory evidence for utterance planning in German. Journal of Phonetics 41(1). 29–47. https://doi.org/10.1016/j.wocn.2012.08.007.Search in Google Scholar
Gee, James Paul & François Grosjean. 1983. Performance structures: A psycholinguistic and linguistic appraisal. Cognitive Psychology 15(4). 411–458. https://doi.org/10.1016/0010-0285(83)90014-2.Search in Google Scholar
Goad, Heather, Natália Brambatti Guzzo & Lydia White. 2021. Parsing ambiguous relative clauses in L2 English: Learner sensitivity to prosodic cues. Studies in Second Language Acquisition 43(1). 83–108. https://doi.org/10.1017/S0272263120000285.Search in Google Scholar
Goh, Rui Zhe, Ian B. Phillips & Chaz Firestone. 2023. The perception of silence. Proceedings of the National Academy of Sciences 120(29). e2301463120. https://doi.org/10.1073/pnas.2301463120.Search in Google Scholar
Goldman-Eisler, Frieda. 1961a. A comparative study of two hesitation phenomena. Language and Speech 4(1). 18–26. https://doi.org/10.1177/002383096100400102.Search in Google Scholar
Goldman-Eisler, Frieda. 1961b. The distribution of pause durations in speech. Language and Speech 4(4). 232–237. https://doi.org/10.1177/002383096100400405.Search in Google Scholar
Goldman-Eisler, Frieda. 1968. Psycholinguistics: Experiments in spontaneous speech. London: Academic Press.Search in Google Scholar
Goldman-Eisler, Frieda. 1972. Pauses, clauses, sentences. Language and Speech 15(2). 103–113. https://doi.org/10.1177/002383097201500201.Search in Google Scholar
Grosjean, François & Maryann Collins. 1979. Breathing, pausing and reading. Phonetica 36(2). 98–114. https://doi.org/10.1159/000259950.Search in Google Scholar
Grosjean, François & Alain Deschamp. 1972. Analyse des variables temporelles du franc,ais spontane´. Phonetica 26(3). 129–156. https://doi.org/10.1159/000259407.Search in Google Scholar
Grosjean, François, Lysiane Grosjean & Harlan Lane. 1979. The patterns of silence: Performance structures in sentence production. Cognitive Psychology 11(1). 58–81. https://doi.org/10.1016/0010-0285(79)90004-5.Search in Google Scholar
Haig, Geoffrey & Stefan Schnell. 2014. Annotations using GRAID (Grammatical Relations and Animacy in Discourse). Manual version 7.0. University of Bamberg. https://multicast.aspra.uni-bamberg.de/data/pubs/graid/Haig+Schnell2014_GRAID-manual_v7.0.pdf (accessed 20 February 2024).Search in Google Scholar
Haig, Geoffrey & Stefan Schnell. 2021. Multi-CAST: Multilingual corpus of annotated spoken texts. Version 2108. Bamberg: University of Bamberg.Search in Google Scholar
Haig, Geoffrey, Maria Vollmer & Hanna Thiele. 2015. Multi-CAST Northern Kurdish. In Geoffrey Haig & Stefan Schnell (eds.), Multi-CAST: Multilingual corpus of annotated spoken texts. Version 2108. Bamberg: University of Bamberg.Search in Google Scholar
Hartsuiker, Robert J. & Lies Notebaert. 2010. Lexical access problems lead to disfluencies in speech. Experimental Psychology 57(3). 169–177. https://doi.org/10.1027/1618-3169/a000021.Search in Google Scholar
Hawthorne, Kara & LouAnn Gerken. 2014. From pauses to clauses: Prosody facilitates learning of syntactic constituency. Cognition 133(2). 420–428. https://doi.org/10.1016/j.cognition.2014.07.013.Search in Google Scholar
Himmelmann, Nikolaus P. 2014. Asymmetries in the prosodic phrasing of function words: Another look at the suffixing preference. Language 90(4). 927–960. https://doi.org/10.1353/lan.2014.0105.Search in Google Scholar
Himmelmann, Nikolaus P. 2022. Prosodic phrasing and the emergence of phrase structure. Linguistics 60(3). 715–743. https://doi.org/10.1515/ling-2020-0135.Search in Google Scholar
Himmelmann, Nikolaus P., Meytal Sandler, Jan Strunk & Volker Unterladstetter. 2018. On the universality of intonational phrases: A cross-linguistic interrater study. Phonology 35(2). 207–245. https://doi.org/10.1017/S0952675718000039.Search in Google Scholar
Kahng, Jimin. 2018. The effect of pause location on perceived fluency. Applied Psycholinguistics 39(3). 569–591. https://doi.org/10.1017/S0142716417000534.Search in Google Scholar
Kaland, Constantijn & Maria Bardají. 2023. Phonetic description of filled pauses as discourse markers in Totoli. In Radek Skarnitzl & Jan Volín (eds.), 20th International Congress of Phonetic Sciences (ICPhS 2023), 3241–3245. Prague: Guarant International.Search in Google Scholar
Kendall, Tyler. 2013. Speech rate, pause and sociolinguistic variation. London: Palgrave Macmillan UK.10.1057/9781137291448Search in Google Scholar
Kimoto, Yukinori. 2019. Multi-CAST Arta. In Geoffrey Haig & Stefan Schnell (eds.). Multi-cast: Multilingual corpus of annotated spoken texts. Version 2108. Bamberg: University of Bamberg.Search in Google Scholar
Kjelgaard, Margaret M. & Shari R. Speer. 1999. Prosodic facilitation and interference in the resolution of temporary syntactic closure ambiguity. Journal of Memory and Language 40(2). 153–194. https://doi.org/10.1006/jmla.1998.2620.Search in Google Scholar
Klatt, Dennis H. 1976. Linguistic uses of segmental duration in English: Acoustic and perceptual evidence. The Journal of the Acoustical Society of America 59(5). 1208–1221. https://doi.org/10.1121/1.380986.Search in Google Scholar
Krivokapić, Jelena. 2007. Prosodic planning: Effects of phrasal length and complexity on pause duration. Journal of Phonetics 35(2). 162–179. https://doi.org/10.1016/j.wocn.2006.04.001.Search in Google Scholar
Krivokapić, Jelena, Will Styler & Parrell Benjamin. 2020. Pause postures: The relationship between articulation and cognitive processes during pauses. Journal of Phonetics 79. 100953. https://doi.org/10.1016/j.wocn.2019.100953.Search in Google Scholar
Ladd, Robert. 2008. Intonational phonology. Cambridge: Cambridge University Press.10.1017/CBO9780511808814Search in Google Scholar
Levelt, Willem J. M. 1993. Speaking: From intention to articulation. Cambridge, MA: MIT Press.Search in Google Scholar
Local, John & John Kelly. 1986. Projection and ‘silences’: Notes on phonetic and conversational structure. Human Studies 9(2/3). 185–204. https://doi.org/10.1007/bf00148126.Search in Google Scholar
Maclay, Howard & Charles Osgood. 1959. Hesitation phenomena in spontaneous English speech. Word 15(1). 19–44. https://doi.org/10.1080/00437956.1959.11659682.Search in Google Scholar
Mosel, Ulrike & Stefan Schnell. 2015. Multi-CAST Teop. In Geoffrey Haig & Stefan Schnell (eds.). Multi-CAST: Multilingual corpus of annotated spoken texts. Version 2108. Bamberg: University of Bamberg.Search in Google Scholar
Nespor, Marina & Irene Vogel. 2007. Prosodic phonology: With a new foreword. Berlin: De Gruyter.10.1515/9783110977790Search in Google Scholar
Paschen, Ludger. 2023. Two apparent inefficiencies of silent and filled pauses. Talk presented at the workshop “Efficiency in grammar”. Freiburg Institute of Advanced Studies (FRIAS), University of Freiburg. https://laurabecker.gitlab.io/efficiency-workshop/slides_paschen.pdf (accessed 5 February 2024).Search in Google Scholar
Petrone, Caterina, Hubert Truckenbrodt, Caroline Wellmann, Julia Holzgrefe-Lang, Isabell Wartenburger & Barbara Höhle. 2017. Prosodic boundary cues in German: Evidence from the production and perception of bracketed lists. Journal of Phonetics 61. 71–92. https://doi.org/10.1016/j.wocn.2017.01.002.Search in Google Scholar
R Core Team. 2021. R: A language and environment for statistical computing. Vienna: R Foundation for Statistical Computing.Search in Google Scholar
Rochester, Sherry R. 1973. The significance of pauses in spontaneous speech. Journal of Psycholinguistic Research 2(1). 51–81. https://doi.org/10.1007/BF01067111.Search in Google Scholar
Sacks, Harvey, Emanuel A. Schegloff & Gail Jefferson. 1974. A simplest systematics for the organization of turn-taking for conversation. Language 50(4). 696–735. https://doi.org/10.2307/412243.Search in Google Scholar
Schafer, Amy J., Shari R. Speer, Paul Warren & S. David White. 2000. Intonational disambiguation in sentence production and comprehension. Journal of Psycholinguistic Research 29(2). 169–182. https://doi.org/10.1023/A:1005192911512.10.1023/A:1005192911512Search in Google Scholar
Schiborr, Nils Norman. 2022. English (Southern England) DoReCo dataset. In Frank Seifart, Ludger Paschen & Matthew Stave (eds.). Language documentation reference corpus (DoReCo) 1.2. Berlin & Lyon: Leibniz-Zentrum Allgemeine Sprachwissenschaft & laboratoire Dynamique Du Langage (UMR5596, CNRS & Université Lyon 2).Search in Google Scholar
Schleef, Erik. 2019. The evaluation of unfilled pauses: Limits of the prestige, solidarity and dynamism dimensions. Lingua 228. 102707. https://doi.org/10.1016/j.lingua.2019.06.008.Search in Google Scholar
Seifart, Frank, Ludger Paschen & Matthew Stave (eds.). 2022. Language Documentation Reference Corpus (DoReCo) 1.2. Berlin & Lyon: Leibniz-Zentrum Allgemeine Sprachwissenschaft & laboratoire Dynamique Du Langage (UMR5596, CNRS & Université Lyon 2).Search in Google Scholar
Selkirk, Elisabeth. 1984. Phonology and syntax: The relation between sound and structure. Cambridge: MIT Press.Search in Google Scholar
Selkirk, Elisabeth. 2011. The syntax-phonology interface. In John Goldsmith, Jason Riggle & Alan Yu (eds.), The handbook of phonological theory, 2nd edn., 435–484. Oxford: Blackwell.10.1002/9781444343069.ch14Search in Google Scholar
Speer, Shari R. & Kiwako Ito. 2009. Prosody in first language acquisition – acquiring intonation as a tool to organize information in conversation: Prosody in first language acquisition. Language and Linguistics Compass 3(1). 90–110. https://doi.org/10.1111/j.1749-818X.2008.00103.x.Search in Google Scholar
Šturm, Pavel & Jan Volín. 2023. Occurrence and duration of pauses in relation to speech tempo and structural organization in two speech genres. Languages 8(1). 23. https://doi.org/10.3390/languages8010023.Search in Google Scholar
Taboada, Maite. 2006. Spontaneous and non-spontaneous turn-taking. Pragmatics 16(2-3). 329–360. https://doi.org/10.1075/prag.16.2-3.04tab.Search in Google Scholar
Tannen, Deborah. 1985. Silence: Anything but. In Deborah Tannen & Muriel Saville-Troike (eds.), Perspectives on silence, 93–111. Norwood, NJ: Ablex.Search in Google Scholar
Thieberger, Nick & Timothy Brickell. 2019. Multi-CAST Nafsan. In Geoffrey Haig & Stefan Schnell (eds.), Multi-CAST: Multilingual corpus of annotated spoken texts. Version 2108. Bamberg: University of Bamberg.Search in Google Scholar
Truckenbrodt, Hubert. 1995. Phonological phrases: Their relation to syntax, focus, and prominence. Cambridge, MA: MIT dissertation.Search in Google Scholar
Truckenbrodt, Hubert. 2007. The syntax-phonology interface. In Paul de Lacy (ed.), The Cambridge handbook of phonology, 435–456. Cambridge: Cambridge University Press.10.1017/CBO9780511486371.019Search in Google Scholar
Vollmer, Maria. 2020. Multi-CAST Mandarin. In Geoffrey Haig & Stefan Schnell (eds.), Multi-CAST: Multilingual corpus of annotated spoken texts. Version 2108. Bamberg: University of Bamberg.Search in Google Scholar
Walker, Anne. 1985. The two faces of silence: The effect of witness hesitancy on lawyers’ impressions. In Deborah Tannen & Muriel Saville-Troike (eds.), Perspectives on silence, 55–75. Norwood, NJ: Ablex.Search in Google Scholar
Watson, Duane & Edward Gibson. 2004. The relationship between intonational phrasing and syntactic structure in language production. Language and Cognitive Processes 19(6). 713–755. https://doi.org/10.1080/01690960444000070.Search in Google Scholar
Weilhammer, Karl & Susen Rabold. 2003. Durational aspects in turn taking. In Maria-Josep Solé, Daniel Recasens & Joaquín Romero (eds.), Proceedings of the International Conference of Phonetic Sciences, 2145–2148. Barcelona: International Phonetic Association.Search in Google Scholar
Wennerstrom, Ann & Andrew F. Siegel. 2003. Keeping the floor in multiparty conversations: Intonation, syntax, and pause. Discourse Processes 36(2). 77–107. https://doi.org/10.1207/S15326950DP3602_1.Search in Google Scholar
Yang, Li-chiung. 2004. Duration and pauses as cues to discourse boundaries in speech. In Proceedings of speech prosody 2004, 267–270. Nara, Japan: ISCA.10.21437/SpeechProsody.2004-62Search in Google Scholar
Zellner, Brigitte. 1994. Pauses and the temporal structure of speech. In Eric Keller (ed.), Fundamentals of speech synthesis and speech recognition, 41–62. Chichester: John Wiley.Search in Google Scholar
© 2024 the author(s), published by De Gruyter, Berlin/Boston
This work is licensed under the Creative Commons Attribution 4.0 International License.
Articles in the same Issue
- Frontmatter
- Editorial
- Editorial 2024
- Phonetics & Phonology
- The role of recoverability in the implementation of non-phonemic glottalization in Hawaiian
- Epenthetic vowel quality crosslinguistically, with focus on Modern Hebrew
- Japanese speakers can infer specific sub-lexicons using phonotactic cues
- Articulatory phonetics in the market: combining public engagement with ultrasound data collection
- Investigating the acoustic fidelity of vowels across remote recording methods
- The role of coarticulatory tonal information in Cantonese spoken word recognition: an eye-tracking study
- Tracking phonological regularities: exploring the influence of learning mode and regularity locus in adult phonological learning
- Morphology & Syntax
- #AreHashtagsWords? Structure, position, and syntactic integration of hashtags in (English) tweets
- The meaning of morphomes: distributional semantics of Spanish stem alternations
- A refinement of the analysis of the resultative V-de construction in Mandarin Chinese
- L2 cognitive construal and morphosyntactic acquisition of pseudo-passive constructions
- Semantics & Pragmatics
- “All women are like that”: an overview of linguistic deindividualization and dehumanization of women in the incelosphere
- Counterfactual language, emotion, and perspective: a sentence completion study during the COVID-19 pandemic
- Constructing elderly patients’ agency through conversational storytelling
- Language Documentation & Typology
- Conative animal calls in Macha Oromo: function and form
- The syntax of African American English borrowings in the Louisiana Creole tense-mood-aspect system
- Syntactic pausing? Re-examining the associations
- Bibliographic bias and information-density sampling
- Historical & Comparative Linguistics
- Revisiting the hypothesis of ideophones as windows to language evolution
- Verifying the morpho-semantics of aspect via typological homogeneity
- Psycholinguistics & Neurolinguistics
- Sign recognition: the effect of parameters and features in sign mispronunciations
- Influence of translation on perceived metaphor features: quality, aptness, metaphoricity, and familiarity
- Effects of grammatical gender on gender inferences: Evidence from French hybrid nouns
- Processing reflexives in adjunct control: an exploration of attraction effects
- Language Acquisition & Language Learning
- How do L1 glosses affect EFL learners’ reading comprehension performance? An eye-tracking study
- Modeling L2 motivation change and its predictive effects on learning behaviors in the extramural digital context: a quantitative investigation in China
- Ongoing exposure to an ambient language continues to build implicit knowledge across the lifespan
- On the relationship between complexity of primary occupation and L2 varietal behavior in adult migrants in Austria
- The acquisition of speaking fundamental frequency (F0) features in Cantonese and English by simultaneous bilingual children
- Sociolinguistics & Anthropological Linguistics
- A computational approach to detecting the envelope of variation
- Attitudes toward code-switching among bilingual Jordanians: a comparative study
- “Let’s ride this out together”: unpacking multilingual top-down and bottom-up pandemic communication evidenced in Singapore’s coronavirus-related linguistic and semiotic landscape
- Across time, space, and genres: measuring probabilistic grammar distances between varieties of Mandarin
- Navigating linguistic ideologies and market dynamics within China’s English language teaching landscape
- Streetscapes and memories of real socialist anti-fascism in south-eastern Europe: between dystopianism and utopianism
- What can NLP do for linguistics? Towards using grammatical error analysis to document non-standard English features
- From sociolinguistic perception to strategic action in the study of social meaning
- Minority genders in quantitative survey research: a data-driven approach to clear, inclusive, and accurate gender questions
- Variation is the way to perfection: imperfect rhyming in Chinese hip hop
- Shifts in digital media usage before and after the pandemic by Rusyns in Ukraine
- Computational & Corpus Linguistics
- Revisiting the automatic prediction of lexical errors in Mandarin
- Finding continuers in Swedish Sign Language
- Conversational priming in repetitional responses as a mechanism in language change: evidence from agent-based modelling
- Construction grammar and procedural semantics for human-interpretable grounded language processing
- Through the compression glass: language complexity and the linguistic structure of compressed strings
- Could this be next for corpus linguistics? Methods of semi-automatic data annotation with contextualized word embeddings
- The Red Hen Audio Tagger
- Code-switching in computer-mediated communication by Gen Z Japanese Americans
- Supervised prediction of production patterns using machine learning algorithms
- Introducing Bed Word: a new automated speech recognition tool for sociolinguistic interview transcription
- Decoding French equivalents of the English present perfect: evidence from parallel corpora of parliamentary documents
- Enhancing automated essay scoring with GCNs and multi-level features for robust multidimensional assessments
- Sociolinguistic auto-coding has fairness problems too: measuring and mitigating bias
- The role of syntax in hashtag popularity
- Language practices of Chinese doctoral students studying abroad on social media: a translanguaging perspective
- Cognitive Linguistics
- Metaphor and gender: are words associated with source domains perceived in a gendered way?
- Crossmodal correspondence between lexical tones and visual motions: a forced-choice mapping task on Mandarin Chinese
Articles in the same Issue
- Frontmatter
- Editorial
- Editorial 2024
- Phonetics & Phonology
- The role of recoverability in the implementation of non-phonemic glottalization in Hawaiian
- Epenthetic vowel quality crosslinguistically, with focus on Modern Hebrew
- Japanese speakers can infer specific sub-lexicons using phonotactic cues
- Articulatory phonetics in the market: combining public engagement with ultrasound data collection
- Investigating the acoustic fidelity of vowels across remote recording methods
- The role of coarticulatory tonal information in Cantonese spoken word recognition: an eye-tracking study
- Tracking phonological regularities: exploring the influence of learning mode and regularity locus in adult phonological learning
- Morphology & Syntax
- #AreHashtagsWords? Structure, position, and syntactic integration of hashtags in (English) tweets
- The meaning of morphomes: distributional semantics of Spanish stem alternations
- A refinement of the analysis of the resultative V-de construction in Mandarin Chinese
- L2 cognitive construal and morphosyntactic acquisition of pseudo-passive constructions
- Semantics & Pragmatics
- “All women are like that”: an overview of linguistic deindividualization and dehumanization of women in the incelosphere
- Counterfactual language, emotion, and perspective: a sentence completion study during the COVID-19 pandemic
- Constructing elderly patients’ agency through conversational storytelling
- Language Documentation & Typology
- Conative animal calls in Macha Oromo: function and form
- The syntax of African American English borrowings in the Louisiana Creole tense-mood-aspect system
- Syntactic pausing? Re-examining the associations
- Bibliographic bias and information-density sampling
- Historical & Comparative Linguistics
- Revisiting the hypothesis of ideophones as windows to language evolution
- Verifying the morpho-semantics of aspect via typological homogeneity
- Psycholinguistics & Neurolinguistics
- Sign recognition: the effect of parameters and features in sign mispronunciations
- Influence of translation on perceived metaphor features: quality, aptness, metaphoricity, and familiarity
- Effects of grammatical gender on gender inferences: Evidence from French hybrid nouns
- Processing reflexives in adjunct control: an exploration of attraction effects
- Language Acquisition & Language Learning
- How do L1 glosses affect EFL learners’ reading comprehension performance? An eye-tracking study
- Modeling L2 motivation change and its predictive effects on learning behaviors in the extramural digital context: a quantitative investigation in China
- Ongoing exposure to an ambient language continues to build implicit knowledge across the lifespan
- On the relationship between complexity of primary occupation and L2 varietal behavior in adult migrants in Austria
- The acquisition of speaking fundamental frequency (F0) features in Cantonese and English by simultaneous bilingual children
- Sociolinguistics & Anthropological Linguistics
- A computational approach to detecting the envelope of variation
- Attitudes toward code-switching among bilingual Jordanians: a comparative study
- “Let’s ride this out together”: unpacking multilingual top-down and bottom-up pandemic communication evidenced in Singapore’s coronavirus-related linguistic and semiotic landscape
- Across time, space, and genres: measuring probabilistic grammar distances between varieties of Mandarin
- Navigating linguistic ideologies and market dynamics within China’s English language teaching landscape
- Streetscapes and memories of real socialist anti-fascism in south-eastern Europe: between dystopianism and utopianism
- What can NLP do for linguistics? Towards using grammatical error analysis to document non-standard English features
- From sociolinguistic perception to strategic action in the study of social meaning
- Minority genders in quantitative survey research: a data-driven approach to clear, inclusive, and accurate gender questions
- Variation is the way to perfection: imperfect rhyming in Chinese hip hop
- Shifts in digital media usage before and after the pandemic by Rusyns in Ukraine
- Computational & Corpus Linguistics
- Revisiting the automatic prediction of lexical errors in Mandarin
- Finding continuers in Swedish Sign Language
- Conversational priming in repetitional responses as a mechanism in language change: evidence from agent-based modelling
- Construction grammar and procedural semantics for human-interpretable grounded language processing
- Through the compression glass: language complexity and the linguistic structure of compressed strings
- Could this be next for corpus linguistics? Methods of semi-automatic data annotation with contextualized word embeddings
- The Red Hen Audio Tagger
- Code-switching in computer-mediated communication by Gen Z Japanese Americans
- Supervised prediction of production patterns using machine learning algorithms
- Introducing Bed Word: a new automated speech recognition tool for sociolinguistic interview transcription
- Decoding French equivalents of the English present perfect: evidence from parallel corpora of parliamentary documents
- Enhancing automated essay scoring with GCNs and multi-level features for robust multidimensional assessments
- Sociolinguistic auto-coding has fairness problems too: measuring and mitigating bias
- The role of syntax in hashtag popularity
- Language practices of Chinese doctoral students studying abroad on social media: a translanguaging perspective
- Cognitive Linguistics
- Metaphor and gender: are words associated with source domains perceived in a gendered way?
- Crossmodal correspondence between lexical tones and visual motions: a forced-choice mapping task on Mandarin Chinese