Home Syntactic pausing? Re-examining the associations
Article Open Access

Syntactic pausing? Re-examining the associations

  • Naomi Peck ORCID logo EMAIL logo and Laura Becker ORCID logo
Published/Copyright: August 20, 2024

Abstract

In this study, we look at the distribution of silent pauses within existing multi-language corpora to see whether their location and duration correlate with clause boundaries. Our study is based on data of seven languages from Multi-CAST. We supplemented the original clause boundary annotations with information about silent pauses in order to investigate the alignment of clause boundaries and pausing. We find a gradient association between clause boundary strength and the probability of a pause and a two-way distinction for pause duration within clauses and at clause boundaries.

The right word may be effective, but no word was ever as effective as a rightly timed pause.

– Mark Twain

1 Introduction

Linguists primarily conceptualize language as a continuous flow of speech, broken up by pauses. These pauses can be thought of as temporary “breaks” in which the transmission of content between interlocutors is cessated. They are typically understood as either filled or silent, depending on whether interlocutors perceive the break to contain a voiced section (e.g. um or uh in English) or silence (Belz and Trouvain 2019; Clark and Fox Tree 2002; Zellner 1994; see also Goh et al. 2023). This paper focuses on silent pauses, which go beyond the pure physiological need to breathe; rather, pausing “operates in happy synchrony with some basic functional segmentations of discourse” (Chafe 1994: 57).

The use of silent pauses has been studied from various perspectives in linguistics. From a psycholinguistic angle, silent pauses have been examined for their function of marking prosodic structures which support syntactic parsing and disambiguation, especially in early L1 acquisition (e.g. Christophe et al. 2008; de Carvalho et al. 2017; Hawthorne and Gerken 2014; Speer and Ito 2009). Much theoretical work on the prosody-syntax interface has similarly focused on the association of silent pauses with syntactic structure, notably the co-occurrence of silent pauses with syntactic boundaries (e.g. Goldman-Eisler 1972; Nespor and Vogel 2007; Selkirk 1984, 2011; Truckenbrodt 1995, 2007; Watson and Gibson 2004). Conversation analysis and related disciplines, on the other hand, have examined the use of silent pauses as markers of turn-taking in conversational discourse (e.g. Sacks et al. 1974; Taboada 2006; Weilhammer and Rabold 2003; Zellner 1994). The role of pausing in the perception of fluency has also been investigated (e.g. Belz et al. 2017; Bosker et al. 2013; de Jong 2016; Kahng 2018), typically comparing differences in pausing between native and non-native speakers. Silent pauses have also been studied for their contribution to rhetorical style, as well as to sociolinguistic and cross-cultural variation (e.g. Duez 1982; Kendall 2013; Schleef 2019; Šturm and Volín 2023; Tannen 1985; Walker 1985).

In this paper, we concentrate on the association between syntactic boundaries and pause location as well as pause duration. Using spontaneous speech data from seven typologically distinct languages, we explore the distribution of silent pauses to see whether their location and duration correlate with higher-level syntactic groupings, namely main and dependent clause boundaries. Our results point to a tendency for main clause boundaries to be accompanied by silent pauses across languages, while dependent clause boundaries are less likely to co-occur with pauses. Our results confirm previous findings in that pauses at clause boundaries tend to be longer than pauses within clauses across all languages in our dataset.

The paper is structured as follows. In Section 2 we provide an overview of previous research into the relation of pausing and clause boundaries. We then introduce our data and methods in Section 3. Section 4 presents the results of the present study; we discuss and contextualize them within the wider debate around pausing in Section 5. Section 6 concludes this study.

2 Silent pauses and syntactic structure

2.1 Pauses and the syntax-prosody interface

Silent pauses have often been recognized as one of the most important signals of a prosodic boundary (Du Bois et al. 1992: 17, Himmelmann et al. 2018, Himmelmann 2022: 718, Ladd 2008: 288). They have also long been observed to pattern with syntactic boundaries, especially with clause boundaries (e.g. Cooper and Paccia-Cooper 1980; Goldman-Eisler 1968; Nespor and Vogel 2007). However, pauses do not necessarily co-occur with major syntactic boundaries, that is, they can be observed outside of syntactically determined positions, including word-internally (Bundgaard-Nielsen and Baker 2020; Paschen 2023). Such pauses are usually attributed to performance; for example, for hesitation, processing, style (cf. Nespor and Vogel 2007: 189, 219), and breathing, especially in faster speech (cf. Grosjean and Collins 1979). Syntactic structures are not always sufficient to predict the prosodic structure of an utterance, and several prosodic realizations of the same syntactic unit are possible.

There are thus various reasons to assume a certain degree of independence between syntactic and prosodic structures, while allowing for a high degree of interdependence at the same time. Such a position on the relation between syntax and prosody is captured by approaches which assume an intermediary prosodic structure to which morphosyntax is mapped (see e.g. Nespor and Vogel 2007; Selkirk 2011, 1984). Our study is compatible with the theoretical assumptions of this line of research, but we take an empirical approach by using cross-linguistic corpus data of spontaneous speech to examine to what extent pauses are associated with syntactic boundary positions.

2.2 Pause types and functions

There are different ways in which silent pauses can be distinguished, such as according to where they occur. Previous studies have distinguished between pauses at sentence, clause, and phrase boundaries as well as pauses that do not coincide with any major syntactic boundaries. Example 1 illustrates different possible locations of silent pauses in English in an excerpt from a spontaneous spoken text from the DoReCo corpus (Schiborr 2022). It contains four consecutive sentences split up for convenience, so that each line corresponds to a chunk of speech delimited by a silent pause whose length is indicated in parentheses.

(1)
Silent pauses in spontaneous spoken English[1]
(0298-0300_DoReCo_doreco_sout3282_mc_english_kent02_b)
a.
So I messed the skin up. (1430 ms)
b.
It wadn’t no good then, (670 ms)
c.
’cause I was close to him, you see, blowed [false start] (70 ms)
d.
a great hole in him. (2503 ms)
e.
Catched a deer in a (645 ms)
f.
snare one day. I went down there, as I told you, about how I always […]

The pauses following (1a) and (1d) coincide with main clause boundaries. The pause at the end of (1b) occurs at a dependent clause boundary. The three remaining rows include pauses that do not coincide with clausal or phrasal boundaries. In (1c), a pause occurs between a transitive verb and its object, where it follows a false start. A similar situation can be seen in the transition between (1e) and (1f), where a pause separates a determiner from its head noun.

Silent pauses such as those in (1a), (1b), and (1d) are typically understood within psycholinguistics as helping to facilitate the parsing of speech into syntactic units. This has been shown for both adult L1 language use (e.g. Frazier et al. 2006; Kjelgaard and Speer 1999; Petrone et al. 2017; Schafer et al. 2000) and for L1 and L2 language acquisition (e.g. Christophe et al. 2008; Goad et al. 2021; Hawthorne and Gerken 2014; Speer and Ito 2009). At the same time, these pauses also help the speaker to prepare and plan further utterances (e.g. Cooper and Paccia-Cooper 1980; Ferreira 1991; Fuchs et al. 2013; Goldman-Eisler 1968; Krivokapić 2007; Krivokapić et al. 2020). They can also be used for interactional purposes, such as yielding the floor during a conversation (Levelt 1993; Maclay and Osgood 1959; Wennerstrom and Siegel 2003).

Other pauses, such as the clause-internal ones in (1c), (1e), and (1f), are argued to result from high processing demands in situations in which speech planning is comparatively difficult. Planning difficulties can be caused by various factors, such as a long or complex following syntactic or prosodic unit, or difficulties in lexical access with low-probability items (see e.g. Beattie and Butterworth 1979; Belz et al. 2017; Goldman-Eisler 1961a; Hartsuiker and Notebaert 2010). Clause-internal pauses may also be associated with attempts to hold the floor during an interaction (Himmelmann 2014; Local and Kelly 1986; Wennerstrom and Siegel 2003).

As we work with corpus data in the present study, we cannot test for specific cognitive processes that would explain pausing in a given context. As such, we do not distinguish between pauses that occur for hesitation, planning, or stylistic reasons. We instead distinguish between three types of pauses based on their location in relation to syntactic boundaries: pauses within clauses, pauses at main clause boundaries, and pauses at dependent clause boundaries (see Section 3 for more details and examples). This distinction allows us to measure the association of different clausal contexts with pauses, and to assess how robust the patterns found are across languages (see Section 4.2).

2.3 Pause duration

Studies into the distribution of pause duration often show a distinction between brief pauses and longer pauses, with longer pauses typically found at sentence boundaries and shorter pauses at dependent clause boundaries and within clauses.[2] In an early study, Goldman-Eisler (1972) reports for spontaneous spoken English data that pauses between sentences mostly have durations above 500 ms, while pauses between clauses (within sentences) tend to be shorter than 500 ms. Grosjean and Deschamps (1972) report comparable pause durations for other French and English datasets. Fletcher et al. (2004) similarly find that pauses within and between words in Dalabon (Gunwinyguan, Australia) fall into two groups in terms of duration: medium pauses (around 500 ms) and long pauses (around 1,000 ms or more).

Pause distribution and duration in spontaneous French data show similar trends, even when more detailed distinctions of pause type are made. Duez (1982: 24) finds that pauses within phrases ( = 401 ms) and within clauses ( = 632 ms) tend to be shorter than pauses between clauses ( = 802 ms) in casual interviews. In a more recent study, Candea (2000) examines French spontaneous speech data and reports that pauses at sentence boundaries have average durations of 900 ms, while pauses at clause boundaries within sentences average 600 ms (Candea 2000: 168). In contrast, hesitation pauses within clauses are reported to have an average duration of 560 ms (Candea 2000: 181).[3] Campione and Véronis (2002) focus on pause duration in French spontaneous speech data and read speech from five European languages. They find a trimodal distribution of pauses in terms of duration and distinguish between brief (<200 ms), medium (200–1,000 ms) and long (>1,000 ms) pauses. Read speech of all five languages only featured brief and medium pauses, with long pauses occurring in addition to short and medium pauses in spontaneous speech.

While the exact details differ across studies, the picture emerges that we can distinguish between different pause types in terms of their duration, and that pauses at clause boundaries tend to be longer than pauses within clauses, at least for English and French. We assess to what extent this association of pause location and duration is reflected in our data, how it relates to previous findings, and how robust the patterns are across languages (see Section 4.3).

3 Data and annotation

3.1 Dataset

In our study, we investigated how pauses were distributed across seven typologically distinct languages in the Multi-CAST corpora (Haig and Schnell 2021). The languages are listed in Table 1 together with information on the number of different speakers in the corpus, the number of utterances (as defined by Multi-CAST), and the genres of texts used. We chose a subset of data which consisted of monological data, specifically mostly autobiographical narratives (AN) and traditional narratives (TN), with the addition of one stimulus-based narrative (SN) in Tondano. As can be seen in Table 1, each corpus ranges from 160 to 1,300 utterances from two to six different speakers. We used the WAV files along with the accompanying annotations in EAF format provided in Multi-CAST (Haig and Schnell 2021).

Table 1:

Overview of the dataset (AN = autobiographical narrative; SN = stimulus-based narrative; TN = traditional narrative).

Language No. of speakers No. of utterances Genre
Arta (Austronesian) 3 227 AN, TN
Kimoto (2019)
Nafsan (Austronesian) 3 163 TN
Thieberger and Brickell (2019)
Teop (Austronesian) 4 1,019 TN
Mosel and Schnell (2015)
Tondano (Austronesian) 6 1,254 SN, AN
Brickell (2016)
Mandarin (Sino-Tibetan) 3 845 TN
Vollmer (2020)
Tabasaran (Nakh-Daghestanian) 2 629 TN, AN
Bogomolova et al. (2021)
Northern Kurdish (Indo-European) 2 555 TN
Haig et al. (2015)

3.2 Clause annotation

The majority of spontaneous speech corpora are created for language documentation purposes, and as such, differ as to how elaborate their morphosyntactic annotation is. Information on clausal boundaries below the utterance level is usually not explicitly annotated for, Multi-CAST being an important exception.

We took the annotation of clausal boundaries from GRAID (Grammatical Relations and Animacy in Discourse) annotations in Multi-CAST (see Haig and Schnell 2014). In GRAID, a clause is generally defined as a predicate with its arguments, with more language-specific criteria for how to deal with, for example, multi-verb predicates (Haig and Schnell 2014: 45). Main clauses are signalled with an annotation of “##” at their left edge, while dependent clauses are annotated with “#” at the left edge. The end of a dependent clause is generally not marked explicitly, as it usually coincides with the beginning of a new main clause. However, the end of a centre-embedded dependent clause that does not correspond to the end of a main clause is additionally signalled by “%” in GRAID.

Examples (2)–(4) show how clause boundaries are annotated in GRAID (see Haig and Schnell (2014: §2.6, §4.1) for more details).[4] Example (2) consists of a main clause in Mandarin, marked as such by the initial “##”.[5]

(2)
Mandarin (mandarin_lzh_0007-0008)
## ránhòu jiù shì súchēng=de zhù yuánwài ##
## then also adv cop popular_name=mod Zhu landlord ##
‘He was called Zhu landlord by people.’

In example (3), we see an utterance from the Northern Kurdish corpus, consisting of a main clause with a dependent relative clause. In this case, the end of the relative clause corresponds to the end of the main clause and does not receive an explicit annotation.

(3)
Northern Kurdish (n_kurd_muserz01_003-004)
## kur-ek-î hebû-ye # nav-ê kur-ê Mihemed
## son-indf-ez 3sg.obl exist.pst-pfv.3sg # name-ez son-ez 3sg.obl Mihemed
bû-ye ##
cop.pst-pfv.3sg ##
‘He had a son, whose name was Mihemed.’

The Nafsan example in (4) shows a somewhat more complex utterance. It is made up of three main clauses, the first of which contains an additional dependent clause. Note that the end of the dependent clause does not coincide with the end of the main clause, which is why it is additionally marked with the annotation by “%” in (4a).

(4)
Nafsan (nafsan_tafra_0001)
a.
## # selwan tu=paakor nametp̃g ntau % ra=to tu teesa
## # while 1pl.in.rs=arrive end year % 1du.ex.rs=habit give child
tete nanromien
some present
‘When we got to the end of the year we would give the children a present … ’
b.
## ru=to ni apu go atien negar-wes
##3pl.rs=stay with grandfather and grandmother 3pl.poss-3sg.obl
‘… for them to take to their Apu and Ati … ’
c.
## nanromien sees pan tu-e-r ki-n Ertap ##
## present small go give-ts-3pl.o prep-dst Eratap ##
‘… a small present they could give to them at Eratap.’

We used this clause information from the original GRAID annotation to distinguish between main clause boundaries (cb_main), dependent clause boundaries (cb_dep), and clause-internal positions (no_cb). Main and dependent clause boundaries as shown in (2)–(4) were directly taken from the annotation. The remaining annotations were classified as clause-internal.

3.3 Pause annotation

Spontaneous speech corpora often do not include annotations for phonological or prosodic properties.[6] As such, the re-use of the Multi-CAST corpora for this paper required us to additionally annotate silent pauses in order to explore any potential associations between pauses and clause boundaries. The pause annotation process we followed is described in detail in the supplementary document “data-extraction-processing.pdf”; we provide a brief summary here and some illustrative examples for the reader’s orientation.[7]

We firstly annotated silent pauses using a silence recognizer and followed this with a round of manual checks and corrections. This resulted in an annotation of pause and speech segments. We then classified both pause and speech segments into one of three types (cb_main, cb_dep, no_cb) according to whether or not a main or dependent clause boundary fell into their duration. This was done by automatically comparing the time stamps of the pre-existing clause boundary annotations and those of the newly annotated silent pause and speech segments. Since the Multi-CAST data is only aligned with the acoustic signal on the utterance level and not on the clausal level, we could not automatically specify the location of a pause occurring within the utterance further. As described in more detail in Section 4 of the supplementary materials, we performed selected manual checks of the position of clausal boundaries in relation to pauses, which suggested that the automatic extraction works sufficiently well for our purposes.

Tables 26 show examples of the resulting pause annotation for examples (2)–(4) from Section 3.2. The first row shows the utterance including the original clause boundary annotation from Multi-CAST. The second “pause” row then shows whether or not a silent pause was detected. If a pause was detected, the “type” row indicates which type of pause we are dealing with, and the “dur” row shows its duration. In the Mandarin example in Table 2, we see that both the initial and final clause boundaries coincide with a pause, which we annotated as cb_main. In addition, we find another silent pause occurring within the clause, which was marked as no_cb.

Table 2:

Pause annotation for Mandarin example (2).

## ránhòu yě jiù shì súchēng=de zhù yuánwài ##
Pause
Type cb_main no_cb cb_main
dur 124 ms 235 ms 226 ms
Table 3:

Pause annotation for Northern Kurdish example (3).

## kur-ek-î wî hebû-ye # nav-ê kur-ê wî Mihemed bû-ye ##
Pause
Type cb_main cb_main
dur 2,343 ms 948 ms
Table 4:

Pause annotation for Nafsan example (4a).

## # selwan tu=paakor nametp̃ag ntau % ra=to tu teesa tete nanromien
Pause
Type cb_dep no_cb
dur 1790 ms 482 ms
Table 5:

Pause annotation for Nafsan example (4b).

## ru=to ni apu go atien negar-wes
Pause
Type cb_main
dur 1,125 ms
Table 6:

Pause annotation for Nafsan example (4c).

## nanromien sees pan tu-e-r ki-n Ertap ##
Pause
Type cb_main no_cb cb_main
dur 1,692 ms 533 ms 1,237 ms

In the Northern Kurdish utterance in Table 3, the main clause boundaries also coincide with silent pauses. The dependent clause boundary between the two main clauses, however, is not accompanied by a silent pause. In addition, no pauses occur within the two clauses shown in Table 3. The Nafsan example from (4) is shown in Tables 4 to 6. Table 4 contains the first main clause together with its dependent clause. Here, silent pauses only occur at the end of the dependent clause (cb_dep) and within the second part of the main clause (no_cb). Table 5 shows that another silent pause coincides with the boundary between the two main clauses in Tables 4 and 5. Finally, we see in Table 6 that both clausal boundaries co-occur with a pause (cb_main), and that the main clause itself also contains a silent pause (no_cb).

4 Results

4.1 Overall tendencies

We first inspect the distribution of clause boundaries. Table 7 shows how the three clausal contexts are distributed across speech and pause segments (“No. of segments”) in the seven languages. By segments we refer to contiguous chunks of speech or silent pause in the speech signal. Pause/speech segments with no clause boundaries (no_cb) make up around 55–70 % of annotations in all languages, with an average of 61 % across the dataset. The proportions of cb_main and especially cb_dep contexts are more language-dependent. For example, dependent clause boundaries make up 23 % of all boundaries in Northern Kurdish, while they only amount to 4 % in Mandarin.

Table 7:

Proportion of the three clausal contexts in speech and pause segments.

Language no_cb cb_main cb_dep No. of segments
Arta 0.66 0.20 0.14 1,205
Mandarin 0.62 0.33 0.04 3,259
Nafsan 0.61 0.27 0.11 1,716
Northern Kurdish 0.56 0.21 0.23 4,112
Tabasaran 0.57 0.22 0.21 3,798
Teop 0.59 0.33 0.08 3,156
Tondano 0.70 0.24 0.06 3,717
Average 0.61 0.26 0.13

Figure 1 builds on the distribution of clausal contexts shown in Table 7 by including information about how they relate to pause and speech segments in the seven languages. Comparing the distribution of the three clausal contexts in pause and speech segments (left vs. right bars), we see clearly that pause segments contain a higher proportion of clause boundaries than speech segments in all languages. The exact proportions differ across languages, with clausal boundaries co-occurring with 75 % of all pauses in Northern Kurdish but with only around 40 % of all pauses in Tondano.

Figure 1: 
The raw distribution of clausal contexts across speech and pause segments.
Figure 1:

The raw distribution of clausal contexts across speech and pause segments.

The overall distribution of clausal contexts across our dataset suggests that clause boundaries more often co-occur with a pause than with speech. However, in Tondano and Arta, pauses are more likely to occur within clauses than in either of the clause boundary conditions. The probability of pauses co-occurring with dependent clause boundaries is also substantially lower in Tondano, Mandarin, and Teop than in other languages, likely due to the overall low proportion of dependent clause boundaries in these languages to begin with (see Table 7). This suggests that the presence of a clausal boundary does not necessarily entail the occurrence of a silent pause and that the association between clause boundaries and pauses is indeed more complex than a simple one-to-one relationship.

4.2 The association of clause boundaries with pauses

The first question that this paper addresses is to what extent the different clausal contexts are associated with silent pauses. To do this, we fitted a Bayesian logistic regression model to assess the probability of pauses across the three types of clausal contexts. We added varying intercepts as well as varying slopes over clausal contexts for individual speakers.[8] The model was fitted using Bayesian methods with Stan (Carpenter et al. 2017) and brms (Bürkner 2017) in R (R Core Team 2021).

Figure 2 shows the conditional effects of clausal contexts on the probabilities of pauses. The points in Figure 2 correspond to the mean of the probability density of the predictions; the whiskers correspond to the 95 % uncertainty intervals. This means that we can be 95 % confident that the actual probability will lie within this interval based on the data and the model.

Figure 2: 
Predicted probability of pauses across clausal contexts.
Figure 2:

Predicted probability of pauses across clausal contexts.

While we see a certain degree of variation across the seven languages, the overall pattern is robust. The probability of pauses occurring gradually increases from environments within clauses (no_cb) to dependent clause boundaries (cb_dep) and main clause boundaries (cb_main). For main clause boundaries, the probability of co-occurrence with pauses is between 0.5 and just over 0.75 in all languages, the lowest prediction being 0.53 for Tondano and the highest being 0.76 for Arta. For dependent clause boundaries, the mean posterior co-occurrence probabilities for all languages are slightly lower, ranging from 0.29 in Teop to 0.57 in Nafsan. For the no clause boundary context, we find the overall lowest probability of pause co-occurrence, from 0.14 in Teop to 0.28 in Tondano. We furthermore generally find a higher level of cross-linguistic variation in the cb_main and cb_dep contexts than in the no_cb condition.

The robust patterning of pause co-occurrence probabilities across our dataset illustrates a clear trend. As the level of the syntactic boundary juncture increases, so too does the overall predicted probability of a pause co-occurring. Looking at the uncertainty intervals, we find that they overlap between “neighbouring” categories, that is, within clauses and dependent clause boundaries, as well as dependent and main clause boundaries. Only Nafsan differs in that both types of clause boundaries have a similarly higher probability of pauses compared to boundaries within clauses. Still, the overall results suggest that there is no cross-linguistically robust, binary distinction either between main clause boundaries and other contexts or between non-clausal boundaries and clausal boundaries. Instead, we found a gradual increase in pause probability from no_cb to cb_dep and cb_main.

4.3 Pause durations

The second question that this paper aims to address concerns pause durations in relation to their location. Figure 3 shows the raw distribution of pause durations across the seven languages of the dataset. The general trends are fairly robust. The majority of silent pauses in the dataset have a duration up to 750 ms, few pauses are longer than 1,000 ms, and pauses longer than 2,500 ms are rare. However, Arta, Nafsan, and Northern Kurdish have a somewhat larger portion of silent pauses longer than 750 ms.

Figure 3: 
Observed pause duration by language.
Figure 3:

Observed pause duration by language.

Furthermore, the average pause duration is more or less comparable across languages. As can be seen in Figure 3, the median pause duration (the solid vertical line) is between 500 ms and 1,000 ms for all languages, apart from Mandarin. However, we see more variation of the mean pause duration (dotted black line) across languages. Arta, Nafsan, and Tondano show a larger difference between median and mean pause duration, as these three languages have more longer pauses than the other languages of the dataset. Still, we can say that the distribution of pause length, albeit not identical, is comparable across the seven languages.

Figure 4 shows the distribution of pause duration together with median and mean values across the three clausal contexts with data from all seven languages pooled together. These distributions along with the median and mean suggest a difference in pause duration associated with the absence or presence of a clause boundary. Most pauses within clauses have a length of below 750 ms, with only 15 % of all pauses being longer than 1,000 ms. In contrast, we find quite a substantial proportion of pauses with durations above 1,000 ms up to 3,000 ms for both types of clause boundaries, namely 40 % for cb_dep and 38 % for cb_main, respectively. This binary difference between pauses within clauses and pauses at clausal boundaries is further reflected in their median pause durations: 537 ms for no_cb versus 796 ms and 788 ms for cb_dep and cb_main, respectively.

Figure 4: 
Observed pause duration by clausal context.
Figure 4:

Observed pause duration by clausal context.

To test the robustness of the observations from the raw distributions, we fitted a Bayesian regression model to predict the duration of pauses from clausal contexts and languages while controlling for the effects of single speakers.[9] Figure 5 shows the conditional effects of clausal contexts and languages on pause durations. The model predictions support the previous observation of a two-way distinction of pauses based on their duration. Pause durations are predicted to be between 524 ms and 781 ms very consistently across languages when no clause boundary is present. At main and dependent clause boundaries, the average predicted pause duration varies to a greater extent within and across languages. Pauses are predicted to be between 1,000 ms and 1,300 ms for both types of clausal boundaries in all languages except for Arta. Arta exhibits shorter pause durations in all three contexts compared to the other languages. However, the relative difference between the boundaries within clauses (525 ms) and clausal boundaries (802 ms and 814 ms) is maintained. We should be careful before concluding that this indeed reflects a real cross-linguistic difference, as the Arta subcorpus is comparatively small (see Table 1).

Figure 5: 
Estimated pause duration by clausal context.
Figure 5:

Estimated pause duration by clausal context.

Larger datasets with data from more speakers are needed to be more certain about what seems to be a language-specific preference towards shorter pauses in our data.

5 Discussion

Our results evidence a complex interaction between pausing and clause boundaries. Pauses occur both within clauses and at clause boundaries in all seven languages examined in this study. However, the probability of a pause occurring increases as the syntactic boundary increases in strength. In Section 4.2, we found a gradient increase for all languages except Nafsan in the probability of a pause occurring within clauses to pauses at dependent clause boundaries and at main clause boundaries. Only Nafsan showed evidence for a potential binary opposition between no clausal boundaries and the two clausal boundary contexts. The main trend in our data mirrors previous findings for English and French. For instance, Goldman-Eisler (1972: 105) found that temporal integration decreases as clause boundary independence increases in spontaneous spoken English, with 77.9 % of all main clauses separated by pauses greater than 500 ms, but less than a third of dependent clause boundaries being this long. Grosjean and Deschamps (1972: 146) report that in their French data, over 60 % of all silent pauses occur at main clause boundaries, with approximately 14 % of the remaining pauses occurring at lower-level syntactic boundaries; Candea (2000: 166) similarly found that most of the pauses occur in her data at main clause boundaries with fewer pauses occurring at dependent clause boundaries and within clauses.

In line with previous findings, our study suggests that there is a cross-linguistically robust increase of the probability of pauses to occur within clauses, at dependent clause boundaries, and at main clause boundaries. This finding is particularly remarkable given the inherent variation present in the dataset and that the languages included in this study feature a great deal of variability in what kinds of dependent clauses are present and how they are used by speakers.

Our second investigation in Section 4.3 found a two-way opposition in the duration of pauses occurring within clauses and those at clause boundaries. The attested lengths in this study match those found for Dalabon (Fletcher et al. 2004), and those of medium and long pauses in French (Campione and Véronis 2002). However, these comparisons should be taken with a grain of salt, as the methodological definitions of pauses and locations, the choice of pause threshold, and the genres of texts involved all differ to different extents across the studies (see also Rochester 1973). Our results, however, fit in with the general trend reported in the literature on pausing whereby silent pauses at clause boundaries tend to be longer than pauses that do not align with major syntactic boundaries such as main clause boundaries (Candea 2000; Duez 1982; Goldman-Eisler 1972; Grosjean and Deschamps 1972).

In addition, we find a robust pattern across the languages surveyed here that there is no substantial difference between pause duration at main and dependent clause boundaries. This fits in with the findings reported by Candea (2000: 166–167), who notes that pauses occurring both at dependent clause boundaries (“inter-constituant”) and at boundaries between two coordinated or juxtaposed clauses (“fin de proposition”) are significantly shorter than pauses at main clause boundaries which serve to separate two discourse segments (“fin d’énoncé”). This lack of difference between certain types of main and dependent clause pause duration suggests that pauses do not simply reflect syntactic structures but also help to structure discourse more generally. Our study reports the first evidence of this type for silent pauses in non-European languages, joining other recent work examining the relationship between filled pauses and discourse structure in typologically diverse languages (e.g. Kaland and Bardají 2023).

6 Conclusions

Using naturalistic and typologically diverse spontaneous speech data, we found that clause boundaries were associated with pauses, with stronger boundaries more likely to co-occur with a pause. Pause duration showed a two-way distinction between pauses at clause boundaries and those which occurred within a clause. Furthermore, this study showed how existing grammatically annotated corpora can be used in the investigation of other research questions. With minimal extra annotation, we were able to conduct a first investigation into the role of silent pausing at the prosody-syntax interface in a range of languages not commonly represented in work on pause distribution and duration.


Corresponding author: Naomi Peck, Department of General Linguistics, 9174 University of Freiburg , Freiburg im Breisgau, Germany, E-mail:

Award Identifier / Grant number: 406074683

Acknowledgments

We would like to thank both Wifek Bouaziz and Sharon Kromer for their work in helping us manually align the data. A further thanks to those who gave enthusiastic feedback about this project at our departmental reading group at Freiburg, the Language Documentation and Theory 6 workshop on intonation, and the 14th International Conference of the Association for Linguistic Typology, as well as for the detailed suggestions from two anonymous reviewers which helped improve this paper substantially.

  1. Research funding: This work was supported by the Deutsche Forschungsgemeinschaft (406074683) and the Open Access fees were covered based on an agreement between the University of Freiburg and De Gruyter.

References

Beattie, Geoffrey W. & Brian L. Butterworth. 1979. Contextual probability and word frequency as determinants of pauses and errors in spontaneous speech. Language and Speech 22(3). 201–211. https://doi.org/10.1177/002383097902200301.Search in Google Scholar

Belz, Malte, Simon Sauer, Anke Lüdeling & Christine Mooshammer. 2017. Fluently disfluent? Pauses and repairs of advanced learners and native speakers of German. International Journal of Learner Corpus Research 3(2). 118–148. https://doi.org/10.1075/ijlcr.3.2.02bel.Search in Google Scholar

Belz, Malte & Jürgen Trouvain. 2019. Are “silent” pauses always silent? In Sasha Calhoun, Paola Escudero, Marija Tabain & Paul Warren (eds.), 19th International Congress of Phonetic Sciences (ICPhS), 2744–2748. Canberra: Australasian Speech Science and Technology Association Inc.Search in Google Scholar

Bogomolova, Natalia, Dmitry Ganenkov & Nils N. Schiborr. 2021. Multi-CAST Tabasaran. In Geoffrey Haig & Stefan Schnell (eds.), Multi-CAST: Multilingual corpus of annotated spoken texts. Version 2108. Bamberg: University of Bamberg.Search in Google Scholar

Bosker, Hans Rutger, Anne-France Pinget, Hugo Quené, Ted Sanders & Nivja H. de Jong. 2013. What makes speech sound fluent? The contributions of pauses, speed and repairs. Language Testing 30(2). 159–175. https://doi.org/10.1177/0265532212455394.Search in Google Scholar

Brickell, Timothy. 2016. Multi-CAST Tondano. In Geoffrey Haig & Stefan Schnell (eds.), Multi-CAST: Multilingual corpus of annotated spoken texts. Version 2108. Bamberg: University of Bamberg.Search in Google Scholar

Bundgaard-Nielsen, Rikke L. & Brett J. Baker. 2020. Pause acceptability indicates word-internal structure in Wubuy. Cognition 198. 104167. https://doi.org/10.1016/j.cognition.2019.104167.Search in Google Scholar

Bürkner, Paul-Christian. 2017. brms: An r package for bayesian multilevel models using stan. Journal of Statistical Software 80(1). 1–28. https://doi.org/10.18637/jss.v080.i01.Search in Google Scholar

Campione, Estelle & Jean Véronis. 2002. A large-scale multilingual study of silent pause duration. In Bernard Bel & Isabelle Marlien (eds.), Proceedings of Eurospeech 2002, 199–202. Aix-en-Provence: Laboratoire Parole et Langage.10.21437/SpeechProsody.2002-35Search in Google Scholar

Candea, Maria. 2000. Contribution a` l’e´tude des pauses silencieuses et des phe´nome`nes dits “d’he´sitation” en franc¸ais oral spontane´: Etude sur un corpus de re´cits en classe de franc¸ais. Paris: Université Paris III dissertation.Search in Google Scholar

Carpenter, Bob, Andrew Gelman, Matthew Hoffman, Daniel Lee, Ben Goodrich, Michael Betancourt, Marcus Brubaker, Jiqiang Guo, Peter Li & Allen Riddell. 2017. Stan: A probabilistic programming language. Journal of Statistical Software 76(1). 1–32. https://doi.org/10.18637/jss.v076.i01.Search in Google Scholar

Chafe, Wallace L. 1994. Discourse, consciousness, and time: The flow and displacement of conscious experience in speaking and writing. Chicago: University of Chicago Press.Search in Google Scholar

Christophe, Anne, Séverine Millotte, Savita Bernal & Jeffrey Lidz. 2008. Bootstrapping lexical and syntactic acquisition. Language and Speech 51(1-2). 61–75. https://doi.org/10.1177/00238309080510010501.Search in Google Scholar

Clark, Herbert H. & Jean E. Fox Tree. 2002. Using uh and um in spontaneous speaking. Cognition 84(1). 73–111. https://doi.org/10.1016/S0010-0277(02)00017-3.Search in Google Scholar

Cooper, William E. & Jeanne Paccia-Cooper. 1980. Syntax and speech. Cambridge, MA: Harvard University Press.10.4159/harvard.9780674283947Search in Google Scholar

de Carvalho, Alex, Isabelle Dautriche, Isabelle Lin & Anne Christophe. 2017. Phrasal prosody constrains syntactic analysis in toddlers. Cognition 163. 67–79. https://doi.org/10.1016/j.cognition.2017.02.018.Search in Google Scholar

de Jong, Nivja H. 2016. Predicting pauses in L1 and L2 speech: The effects of utterance boundaries and word frequency. International Review of Applied Linguistics in Language Teaching 54(2). 113–132. https://doi.org/10.1515/iral-2016-9993.Search in Google Scholar

Du Bois, John, Stephan Schuetze-Coburn, Danae Paolino & Susanna Cumming. 1992. Discourse transcription (Santa Barbara Papers in Linguistics 4). Santa Barbara, CA: University of California Santa Barbara, Department of Linguistics.Search in Google Scholar

Duez, Danielle. 1982. Silent and non-silent pauses in three speech styles. Language and Speech 25(1). 11–28. https://doi.org/10.1177/002383098202500102.Search in Google Scholar

Ferreira, Fernanda. 1991. Effects of length and syntactic complexity on initiation times for prepared utterances. Journal of Memory and Language 30(2). 210–233. https://doi.org/10.1016/0749-596X(91)90004-4.Search in Google Scholar

Ferreira, Fernanda. 1993. Creation of prosody during sentence production. Psychological Review 100(2). 233–253. https://doi.org/10.1037/0033-295X.100.2.233.Search in Google Scholar

Fletcher, Janet, Nicholas Evans & Belinda Ross. 2004. Pausing strategies and prosodic boundaries in Dalabon. In Proceedings of the 10th Australian International Conference on Speech Science & Technology, 436–439. Sydney: Macquarie University.Search in Google Scholar

Frazier, Lyn, Katy Carlson & Charles Clifton. 2006. Prosodic phrasing is central to language comprehension. Trends in Cognitive Sciences 10(6). 244–249. https://doi.org/10.1016/j.tics.2006.04.002.Search in Google Scholar

Fuchs, Susanne, Caterina Petrone, Jelena Krivokapić & Philip Hoole. 2013. Acoustic and respiratory evidence for utterance planning in German. Journal of Phonetics 41(1). 29–47. https://doi.org/10.1016/j.wocn.2012.08.007.Search in Google Scholar

Gee, James Paul & François Grosjean. 1983. Performance structures: A psycholinguistic and linguistic appraisal. Cognitive Psychology 15(4). 411–458. https://doi.org/10.1016/0010-0285(83)90014-2.Search in Google Scholar

Goad, Heather, Natália Brambatti Guzzo & Lydia White. 2021. Parsing ambiguous relative clauses in L2 English: Learner sensitivity to prosodic cues. Studies in Second Language Acquisition 43(1). 83–108. https://doi.org/10.1017/S0272263120000285.Search in Google Scholar

Goh, Rui Zhe, Ian B. Phillips & Chaz Firestone. 2023. The perception of silence. Proceedings of the National Academy of Sciences 120(29). e2301463120. https://doi.org/10.1073/pnas.2301463120.Search in Google Scholar

Goldman-Eisler, Frieda. 1961a. A comparative study of two hesitation phenomena. Language and Speech 4(1). 18–26. https://doi.org/10.1177/002383096100400102.Search in Google Scholar

Goldman-Eisler, Frieda. 1961b. The distribution of pause durations in speech. Language and Speech 4(4). 232–237. https://doi.org/10.1177/002383096100400405.Search in Google Scholar

Goldman-Eisler, Frieda. 1968. Psycholinguistics: Experiments in spontaneous speech. London: Academic Press.Search in Google Scholar

Goldman-Eisler, Frieda. 1972. Pauses, clauses, sentences. Language and Speech 15(2). 103–113. https://doi.org/10.1177/002383097201500201.Search in Google Scholar

Grosjean, François & Maryann Collins. 1979. Breathing, pausing and reading. Phonetica 36(2). 98–114. https://doi.org/10.1159/000259950.Search in Google Scholar

Grosjean, François & Alain Deschamp. 1972. Analyse des variables temporelles du franc,ais spontane´. Phonetica 26(3). 129–156. https://doi.org/10.1159/000259407.Search in Google Scholar

Grosjean, François, Lysiane Grosjean & Harlan Lane. 1979. The patterns of silence: Performance structures in sentence production. Cognitive Psychology 11(1). 58–81. https://doi.org/10.1016/0010-0285(79)90004-5.Search in Google Scholar

Haig, Geoffrey & Stefan Schnell. 2014. Annotations using GRAID (Grammatical Relations and Animacy in Discourse). Manual version 7.0. University of Bamberg. https://multicast.aspra.uni-bamberg.de/data/pubs/graid/Haig+Schnell2014_GRAID-manual_v7.0.pdf (accessed 20 February 2024).Search in Google Scholar

Haig, Geoffrey & Stefan Schnell. 2021. Multi-CAST: Multilingual corpus of annotated spoken texts. Version 2108. Bamberg: University of Bamberg.Search in Google Scholar

Haig, Geoffrey, Maria Vollmer & Hanna Thiele. 2015. Multi-CAST Northern Kurdish. In Geoffrey Haig & Stefan Schnell (eds.), Multi-CAST: Multilingual corpus of annotated spoken texts. Version 2108. Bamberg: University of Bamberg.Search in Google Scholar

Hartsuiker, Robert J. & Lies Notebaert. 2010. Lexical access problems lead to disfluencies in speech. Experimental Psychology 57(3). 169–177. https://doi.org/10.1027/1618-3169/a000021.Search in Google Scholar

Hawthorne, Kara & LouAnn Gerken. 2014. From pauses to clauses: Prosody facilitates learning of syntactic constituency. Cognition 133(2). 420–428. https://doi.org/10.1016/j.cognition.2014.07.013.Search in Google Scholar

Himmelmann, Nikolaus P. 2014. Asymmetries in the prosodic phrasing of function words: Another look at the suffixing preference. Language 90(4). 927–960. https://doi.org/10.1353/lan.2014.0105.Search in Google Scholar

Himmelmann, Nikolaus P. 2022. Prosodic phrasing and the emergence of phrase structure. Linguistics 60(3). 715–743. https://doi.org/10.1515/ling-2020-0135.Search in Google Scholar

Himmelmann, Nikolaus P., Meytal Sandler, Jan Strunk & Volker Unterladstetter. 2018. On the universality of intonational phrases: A cross-linguistic interrater study. Phonology 35(2). 207–245. https://doi.org/10.1017/S0952675718000039.Search in Google Scholar

Kahng, Jimin. 2018. The effect of pause location on perceived fluency. Applied Psycholinguistics 39(3). 569–591. https://doi.org/10.1017/S0142716417000534.Search in Google Scholar

Kaland, Constantijn & Maria Bardají. 2023. Phonetic description of filled pauses as discourse markers in Totoli. In Radek Skarnitzl & Jan Volín (eds.), 20th International Congress of Phonetic Sciences (ICPhS 2023), 3241–3245. Prague: Guarant International.Search in Google Scholar

Kendall, Tyler. 2013. Speech rate, pause and sociolinguistic variation. London: Palgrave Macmillan UK.10.1057/9781137291448Search in Google Scholar

Kimoto, Yukinori. 2019. Multi-CAST Arta. In Geoffrey Haig & Stefan Schnell (eds.). Multi-cast: Multilingual corpus of annotated spoken texts. Version 2108. Bamberg: University of Bamberg.Search in Google Scholar

Kjelgaard, Margaret M. & Shari R. Speer. 1999. Prosodic facilitation and interference in the resolution of temporary syntactic closure ambiguity. Journal of Memory and Language 40(2). 153–194. https://doi.org/10.1006/jmla.1998.2620.Search in Google Scholar

Klatt, Dennis H. 1976. Linguistic uses of segmental duration in English: Acoustic and perceptual evidence. The Journal of the Acoustical Society of America 59(5). 1208–1221. https://doi.org/10.1121/1.380986.Search in Google Scholar

Krivokapić, Jelena. 2007. Prosodic planning: Effects of phrasal length and complexity on pause duration. Journal of Phonetics 35(2). 162–179. https://doi.org/10.1016/j.wocn.2006.04.001.Search in Google Scholar

Krivokapić, Jelena, Will Styler & Parrell Benjamin. 2020. Pause postures: The relationship between articulation and cognitive processes during pauses. Journal of Phonetics 79. 100953. https://doi.org/10.1016/j.wocn.2019.100953.Search in Google Scholar

Ladd, Robert. 2008. Intonational phonology. Cambridge: Cambridge University Press.10.1017/CBO9780511808814Search in Google Scholar

Levelt, Willem J. M. 1993. Speaking: From intention to articulation. Cambridge, MA: MIT Press.Search in Google Scholar

Local, John & John Kelly. 1986. Projection and ‘silences’: Notes on phonetic and conversational structure. Human Studies 9(2/3). 185–204. https://doi.org/10.1007/bf00148126.Search in Google Scholar

Maclay, Howard & Charles Osgood. 1959. Hesitation phenomena in spontaneous English speech. Word 15(1). 19–44. https://doi.org/10.1080/00437956.1959.11659682.Search in Google Scholar

Mosel, Ulrike & Stefan Schnell. 2015. Multi-CAST Teop. In Geoffrey Haig & Stefan Schnell (eds.). Multi-CAST: Multilingual corpus of annotated spoken texts. Version 2108. Bamberg: University of Bamberg.Search in Google Scholar

Nespor, Marina & Irene Vogel. 2007. Prosodic phonology: With a new foreword. Berlin: De Gruyter.10.1515/9783110977790Search in Google Scholar

Paschen, Ludger. 2023. Two apparent inefficiencies of silent and filled pauses. Talk presented at the workshop “Efficiency in grammar”. Freiburg Institute of Advanced Studies (FRIAS), University of Freiburg. https://laurabecker.gitlab.io/efficiency-workshop/slides_paschen.pdf (accessed 5 February 2024).Search in Google Scholar

Petrone, Caterina, Hubert Truckenbrodt, Caroline Wellmann, Julia Holzgrefe-Lang, Isabell Wartenburger & Barbara Höhle. 2017. Prosodic boundary cues in German: Evidence from the production and perception of bracketed lists. Journal of Phonetics 61. 71–92. https://doi.org/10.1016/j.wocn.2017.01.002.Search in Google Scholar

R Core Team. 2021. R: A language and environment for statistical computing. Vienna: R Foundation for Statistical Computing.Search in Google Scholar

Rochester, Sherry R. 1973. The significance of pauses in spontaneous speech. Journal of Psycholinguistic Research 2(1). 51–81. https://doi.org/10.1007/BF01067111.Search in Google Scholar

Sacks, Harvey, Emanuel A. Schegloff & Gail Jefferson. 1974. A simplest systematics for the organization of turn-taking for conversation. Language 50(4). 696–735. https://doi.org/10.2307/412243.Search in Google Scholar

Schafer, Amy J., Shari R. Speer, Paul Warren & S. David White. 2000. Intonational disambiguation in sentence production and comprehension. Journal of Psycholinguistic Research 29(2). 169–182. https://doi.org/10.1023/A:1005192911512.10.1023/A:1005192911512Search in Google Scholar

Schiborr, Nils Norman. 2022. English (Southern England) DoReCo dataset. In Frank Seifart, Ludger Paschen & Matthew Stave (eds.). Language documentation reference corpus (DoReCo) 1.2. Berlin & Lyon: Leibniz-Zentrum Allgemeine Sprachwissenschaft & laboratoire Dynamique Du Langage (UMR5596, CNRS & Université Lyon 2).Search in Google Scholar

Schleef, Erik. 2019. The evaluation of unfilled pauses: Limits of the prestige, solidarity and dynamism dimensions. Lingua 228. 102707. https://doi.org/10.1016/j.lingua.2019.06.008.Search in Google Scholar

Seifart, Frank, Ludger Paschen & Matthew Stave (eds.). 2022. Language Documentation Reference Corpus (DoReCo) 1.2. Berlin & Lyon: Leibniz-Zentrum Allgemeine Sprachwissenschaft & laboratoire Dynamique Du Langage (UMR5596, CNRS & Université Lyon 2).Search in Google Scholar

Selkirk, Elisabeth. 1984. Phonology and syntax: The relation between sound and structure. Cambridge: MIT Press.Search in Google Scholar

Selkirk, Elisabeth. 2011. The syntax-phonology interface. In John Goldsmith, Jason Riggle & Alan Yu (eds.), The handbook of phonological theory, 2nd edn., 435–484. Oxford: Blackwell.10.1002/9781444343069.ch14Search in Google Scholar

Speer, Shari R. & Kiwako Ito. 2009. Prosody in first language acquisition – acquiring intonation as a tool to organize information in conversation: Prosody in first language acquisition. Language and Linguistics Compass 3(1). 90–110. https://doi.org/10.1111/j.1749-818X.2008.00103.x.Search in Google Scholar

Šturm, Pavel & Jan Volín. 2023. Occurrence and duration of pauses in relation to speech tempo and structural organization in two speech genres. Languages 8(1). 23. https://doi.org/10.3390/languages8010023.Search in Google Scholar

Taboada, Maite. 2006. Spontaneous and non-spontaneous turn-taking. Pragmatics 16(2-3). 329–360. https://doi.org/10.1075/prag.16.2-3.04tab.Search in Google Scholar

Tannen, Deborah. 1985. Silence: Anything but. In Deborah Tannen & Muriel Saville-Troike (eds.), Perspectives on silence, 93–111. Norwood, NJ: Ablex.Search in Google Scholar

Thieberger, Nick & Timothy Brickell. 2019. Multi-CAST Nafsan. In Geoffrey Haig & Stefan Schnell (eds.), Multi-CAST: Multilingual corpus of annotated spoken texts. Version 2108. Bamberg: University of Bamberg.Search in Google Scholar

Truckenbrodt, Hubert. 1995. Phonological phrases: Their relation to syntax, focus, and prominence. Cambridge, MA: MIT dissertation.Search in Google Scholar

Truckenbrodt, Hubert. 2007. The syntax-phonology interface. In Paul de Lacy (ed.), The Cambridge handbook of phonology, 435–456. Cambridge: Cambridge University Press.10.1017/CBO9780511486371.019Search in Google Scholar

Vollmer, Maria. 2020. Multi-CAST Mandarin. In Geoffrey Haig & Stefan Schnell (eds.), Multi-CAST: Multilingual corpus of annotated spoken texts. Version 2108. Bamberg: University of Bamberg.Search in Google Scholar

Walker, Anne. 1985. The two faces of silence: The effect of witness hesitancy on lawyers’ impressions. In Deborah Tannen & Muriel Saville-Troike (eds.), Perspectives on silence, 55–75. Norwood, NJ: Ablex.Search in Google Scholar

Watson, Duane & Edward Gibson. 2004. The relationship between intonational phrasing and syntactic structure in language production. Language and Cognitive Processes 19(6). 713–755. https://doi.org/10.1080/01690960444000070.Search in Google Scholar

Weilhammer, Karl & Susen Rabold. 2003. Durational aspects in turn taking. In Maria-Josep Solé, Daniel Recasens & Joaquín Romero (eds.), Proceedings of the International Conference of Phonetic Sciences, 2145–2148. Barcelona: International Phonetic Association.Search in Google Scholar

Wennerstrom, Ann & Andrew F. Siegel. 2003. Keeping the floor in multiparty conversations: Intonation, syntax, and pause. Discourse Processes 36(2). 77–107. https://doi.org/10.1207/S15326950DP3602_1.Search in Google Scholar

Yang, Li-chiung. 2004. Duration and pauses as cues to discourse boundaries in speech. In Proceedings of speech prosody 2004, 267–270. Nara, Japan: ISCA.10.21437/SpeechProsody.2004-62Search in Google Scholar

Zellner, Brigitte. 1994. Pauses and the temporal structure of speech. In Eric Keller (ed.), Fundamentals of speech synthesis and speech recognition, 41–62. Chichester: John Wiley.Search in Google Scholar

Received: 2022-12-14
Accepted: 2024-06-05
Published Online: 2024-08-20

© 2024 the author(s), published by De Gruyter, Berlin/Boston

This work is licensed under the Creative Commons Attribution 4.0 International License.

Articles in the same Issue

  1. Frontmatter
  2. Editorial
  3. Editorial 2024
  4. Phonetics & Phonology
  5. The role of recoverability in the implementation of non-phonemic glottalization in Hawaiian
  6. Epenthetic vowel quality crosslinguistically, with focus on Modern Hebrew
  7. Japanese speakers can infer specific sub-lexicons using phonotactic cues
  8. Articulatory phonetics in the market: combining public engagement with ultrasound data collection
  9. Investigating the acoustic fidelity of vowels across remote recording methods
  10. The role of coarticulatory tonal information in Cantonese spoken word recognition: an eye-tracking study
  11. Tracking phonological regularities: exploring the influence of learning mode and regularity locus in adult phonological learning
  12. Morphology & Syntax
  13. #AreHashtagsWords? Structure, position, and syntactic integration of hashtags in (English) tweets
  14. The meaning of morphomes: distributional semantics of Spanish stem alternations
  15. A refinement of the analysis of the resultative V-de construction in Mandarin Chinese
  16. L2 cognitive construal and morphosyntactic acquisition of pseudo-passive constructions
  17. Semantics & Pragmatics
  18. “All women are like that”: an overview of linguistic deindividualization and dehumanization of women in the incelosphere
  19. Counterfactual language, emotion, and perspective: a sentence completion study during the COVID-19 pandemic
  20. Constructing elderly patients’ agency through conversational storytelling
  21. Language Documentation & Typology
  22. Conative animal calls in Macha Oromo: function and form
  23. The syntax of African American English borrowings in the Louisiana Creole tense-mood-aspect system
  24. Syntactic pausing? Re-examining the associations
  25. Bibliographic bias and information-density sampling
  26. Historical & Comparative Linguistics
  27. Revisiting the hypothesis of ideophones as windows to language evolution
  28. Verifying the morpho-semantics of aspect via typological homogeneity
  29. Psycholinguistics & Neurolinguistics
  30. Sign recognition: the effect of parameters and features in sign mispronunciations
  31. Influence of translation on perceived metaphor features: quality, aptness, metaphoricity, and familiarity
  32. Effects of grammatical gender on gender inferences: Evidence from French hybrid nouns
  33. Processing reflexives in adjunct control: an exploration of attraction effects
  34. Language Acquisition & Language Learning
  35. How do L1 glosses affect EFL learners’ reading comprehension performance? An eye-tracking study
  36. Modeling L2 motivation change and its predictive effects on learning behaviors in the extramural digital context: a quantitative investigation in China
  37. Ongoing exposure to an ambient language continues to build implicit knowledge across the lifespan
  38. On the relationship between complexity of primary occupation and L2 varietal behavior in adult migrants in Austria
  39. The acquisition of speaking fundamental frequency (F0) features in Cantonese and English by simultaneous bilingual children
  40. Sociolinguistics & Anthropological Linguistics
  41. A computational approach to detecting the envelope of variation
  42. Attitudes toward code-switching among bilingual Jordanians: a comparative study
  43. “Let’s ride this out together”: unpacking multilingual top-down and bottom-up pandemic communication evidenced in Singapore’s coronavirus-related linguistic and semiotic landscape
  44. Across time, space, and genres: measuring probabilistic grammar distances between varieties of Mandarin
  45. Navigating linguistic ideologies and market dynamics within China’s English language teaching landscape
  46. Streetscapes and memories of real socialist anti-fascism in south-eastern Europe: between dystopianism and utopianism
  47. What can NLP do for linguistics? Towards using grammatical error analysis to document non-standard English features
  48. From sociolinguistic perception to strategic action in the study of social meaning
  49. Minority genders in quantitative survey research: a data-driven approach to clear, inclusive, and accurate gender questions
  50. Variation is the way to perfection: imperfect rhyming in Chinese hip hop
  51. Shifts in digital media usage before and after the pandemic by Rusyns in Ukraine
  52. Computational & Corpus Linguistics
  53. Revisiting the automatic prediction of lexical errors in Mandarin
  54. Finding continuers in Swedish Sign Language
  55. Conversational priming in repetitional responses as a mechanism in language change: evidence from agent-based modelling
  56. Construction grammar and procedural semantics for human-interpretable grounded language processing
  57. Through the compression glass: language complexity and the linguistic structure of compressed strings
  58. Could this be next for corpus linguistics? Methods of semi-automatic data annotation with contextualized word embeddings
  59. The Red Hen Audio Tagger
  60. Code-switching in computer-mediated communication by Gen Z Japanese Americans
  61. Supervised prediction of production patterns using machine learning algorithms
  62. Introducing Bed Word: a new automated speech recognition tool for sociolinguistic interview transcription
  63. Decoding French equivalents of the English present perfect: evidence from parallel corpora of parliamentary documents
  64. Enhancing automated essay scoring with GCNs and multi-level features for robust multidimensional assessments
  65. Sociolinguistic auto-coding has fairness problems too: measuring and mitigating bias
  66. The role of syntax in hashtag popularity
  67. Language practices of Chinese doctoral students studying abroad on social media: a translanguaging perspective
  68. Cognitive Linguistics
  69. Metaphor and gender: are words associated with source domains perceived in a gendered way?
  70. Crossmodal correspondence between lexical tones and visual motions: a forced-choice mapping task on Mandarin Chinese
Downloaded on 18.9.2025 from https://www.degruyterbrill.com/document/doi/10.1515/lingvan-2022-0156/html
Scroll to top button