Home Stress deletion or stress demotion? An acoustic study of stress in Spanish lexical compounds
Article Open Access

Stress deletion or stress demotion? An acoustic study of stress in Spanish lexical compounds

  • Dongmei Lin ORCID logo EMAIL logo and Carlos-Eduardo Piñeros
Published/Copyright: May 21, 2024
Become an author with De Gruyter Brill

Abstract

In Spanish lexical compounds, stress on the final constituent is consistently preserved; however, it is uncertain whether stress on the initial constituent is preserved as well. Some linguists claim that stress is deleted from the initial constituent, whereas a few argue, instead, that it is demoted from primary to secondary. Given that empirical evidence has been scarce, this issue remains unresolved. To inform the debate, the present study examines all known phonetic correlates of stress in both accented and unaccented contexts. The acoustic measures of lexical compounds are compared to those of non-compound words and phrases that have a similar segmental structure. The results demonstrate that the initial constituent of lexical compounds bears secondary stress. This secondary stress, as a result of stress demotion, differs from rhetorical secondary stress both phonetically and phonologically. These findings provide fresh insights into the definitional properties as well as the prosodic structure of Spanish compounds.

1 Introduction

The stress of Spanish lexical compounds has been the topic of a long-standing debate. Some linguists argue that the stress on the first word appearing in a lexical compound is deleted (1a), which would make such compounds prosodically analogous to non-compound words; they would bear a single primary stress (Bustos Gisbert 1986; Hualde 2007; Shwayder 2015). The alternative view is that the stress on the first word appearing in a lexical compound is merely demoted (1b); consequently, there would be a secondary stress in addition to the primary one (Rainer and Varela 1992; Val Álvaro 1999).

(1)
a.
Stress-Deletion Hypothesis:
[ˈla.βa] + [ˈpla.tos] = [la.βa.ˈpla.tos]
b.
Stress-Demotion Hypothesis:
[ˈla.βa] + [ˈpla.tos] = [ˌla.βa.ˈpla.tos]
lava platos lavaplatos
‘(s/he) washes’ ‘plates’ ‘dishwasher’

While most of the previous studies have addressed this matter based on the authors’ intuitions, only two of them have resorted to experimental evidence, i.e., Real Academia Española (hereafter RAE) (2011) and Rao (2015). RAE (2011) detected an initial pitch accent in three lexical compounds (i.e., bienvenido ‘welcome’, aguanieve ‘sleet’ and hierbabuena ‘mint’), from which it was inferred that there is a secondary stress. Rao (2015) concluded, by contrast, that stress deletion is the norm, as he did not consistently detect such a pitch accent in the 15 lexical compounds that he examined. Considering that both studies limited their search to a single phonetic correlate of stress, neither of those conclusions can be definitive.

To procure more reliable evidence, the present study inspects all known stress correlates using a comparative approach: the acoustic measures of the syllable that has the potential to bear secondary stress in lexical compounds are compared to those of the syllable located in the same structural position in phrases and non-compound words. Data collection took two important precautions to ensure comparability. The syllables to be compared consist of the exact same consonants and vowels and the intonational context where those syllables appear was also held constant. In this way, potential disturbance from segmental and positional phenomena was avoided.

The remainder of the introduction is organised as follows. Section 1.1 introduces Spanish lexical compounds by locating them in the larger context of Spanish compound words and by reviewing the different compound types. Section 1.2 presents an overview of the stress types in Spanish as well as their phonetic correlates. Finally, Section 1.3 specifies the research questions to be addressed in this study.

1.1 Spanish lexical compounds

Spanish compound words can be divided into two major classes: (i) lexical (also known as orthographic or proper) and (ii) syntactic (also known as phrasal or syntagmatic); they are mainly distinguished by the criterion of morpho-phonological integration (RAE 2011; Varela 2012). The words forming a lexical compound are morpho-phonologically integrated, as evinced by the preservation of a single primary stress and by structural adjustments taking place at word edges (e.g., identical vowel simplification in telaraña ‘spider web’, cf. *telaaraña); they are normally represented as a single unit.[1] Syntactic compounds generally show, by contrast, no signs of morpho-phonological integration (e.g., cuenta corriente ‘checking account’). It is often posited that they retain the stress of each constituent word, and they are orthographically represented as separate units.[2] The two categories of compound, according to Varela (2012), may also differ in their way of inflection. Lexical compounds always have their inflectional markers at the end of the whole compound, whereas in syntactic compounds it is their head that always receives the inflectional markers (p. 218).[3]

Within lexical compounds, two additional distinctions are relevant. The first one involves the notion of head, which is usually defined on syntactic and semantic grounds. Syntactically, the head determines the lexical category of the whole compound, and semantically, it functions as a hypernym. Depending on whether the head is internal or external, compounds may be endocentric or exocentric (Kornfeld 2009, p. 437; Rainer and Varela 1992, p. 122; Val Álvaro 1999, p. 4766; Varela 2012, p. 218; among others). In the former, one of the embedded words functions as the head (e.g., media noche ‘half night’ → medianoche ‘midnight’), whereas in the latter none of the embedded words plays that role (e.g., cara rota ‘face broken’ → cararrota ‘cheeky person’). The other relevant distinction is based on the relationship between the constituent words (Scalise and Bisetto 2009; Scalise and Vogel 2010). When the constituent words are of equal status, the compound is coordinate (e.g., agrio y dulce ‘bitter and sweet’ → agridulce ‘bittersweet’). When the constituent words differ in status, the compound may be subordinate, if one word complements the other (e.g., saca corchos ‘pulls corks’ → sacacorchos ‘corkscrew’), or attributive, if one word assigns an attribute to the other (e.g., noche buena ‘good night’ → nochebuena ‘Christmas Eve’). According to Bustos Gisbert (1986), Spanish compounds of different morphosyntactic and semantic types might exhibit different stress patterns (p. 184). To verify this hypothesis, the current study examines lexical compounds that represent all the different types as introduced above.

1.2 Spanish stresses and their phonetic correlates

Unlike English which has two levels of word stress (i.e., primary and secondary), Spanish has only one word stress, i.e., primary. It is noteworthy, however, that in the literature several authors have observed secondary rhythmic stresses falling on alternating syllables from the syllable bearing primary stress (e.g., Harris 1983, 1991; Navarro Tomás 1918; Roca 1986) or on the word-initial syllable (e.g., Bolinger 1962; Prieto and van Santen 1996; Stockwell et al. 1956). This type of stress has been often found associated with a falling pitch accent and an amplitude peak (Prieto and van Santen 1996). Linguists have recently defined this type of stress as rhetorical secondary stress for it being an optional post-lexical phenomenon used for attention-getting or authority-seeking (Hualde and Nadeu 2014). It has been reported in radio talks, news broadcasts, political discourses and lectures, but it is relatively rare in conversational speech styles (Fabregat and de la Mota 2010; Hualde and Nadeu 2014; Hualde 2007, 2009, 2010; Kuder 2020; Quilis 1981; Rao 2011). Moreover, it should be noted that one of the basic parameters of lexical stress is its distinctive function (Hualde 2012, p. 153). For example, the primary stress in the verb pa ra ‘he/she/it stops’ seems to be the only sound property distinguishing this word from the preposition para ‘for’ within a phrase. By contrast, the rhetorical secondary stress never contrasts words, and accordingly, it is not considered lexical stress.

In Spanish, all content words including nouns, adjectives, verbs, and adverbs, as well as some classes of function words invariably receive lexical (or primary) stress on one of their last three syllables, which is known as the Three-Syllable Window. Previous studies have identified four phonetic properties that may serve to cue lexical stress in Spanish: pitch, duration, intensity, and spectral tilt. They do not appear to be equally reliable, though. It has been observed that variations in pitch and duration correlate with variations in stress more systematically than variations in intensity and spectral tilt do.

In Spanish linguistics literature, the view that pitch accent is the most important indicator of lexical stress in Spanish is dominant (Bollinger and Hodapp 1961; Contreras 1963, 1964; Enríquez et al. 1989; Figueras and Santiago 1993; Gili Gaya 1975; Llisterri et al. 2005; Monroy-Casas 1977; Quilis 1971, 1981, 1993). The reason why pitch or f0 is recognised as a reliable stress cue is that in stress-accent languages like Spanish, there is a strong tendency for stressed syllables to be accompanied by pitch accents (e.g., Beckman and Edwards 1994; Beckman and Pierrehumbert 1986; Bolinger 1958; Gussenhoven 2004; Ladd 1996; Pierrehumbert 1980; etc.). In the case of Spanish, seven types of pitch accent have been recognised. Using the ToBI labelling scheme, they are transcribed as H*, L*, H + L*, L* + H, L + H*, L + ¡H* and L + <H* (Aguilar et al. 2009; Hualde and Prieto 2015).[4] Among them, the rising pitch accents are the most common types associated with lexical stress in Spanish words in declarative non-phrase-final position (Hualde 2003; Llisterri et al. 2005; etc.). Moreover, rising and high pitch accents are deemed perceptually more prominent than falling and low pitch accents. According to Prieto et al. (2005), rising and high pitch accents tend to be perceived as high tones, whereas falling and low pitch accents are perceived as low tones (p. 375).

Despite the interweaving of stress and pitch accent, one must bear in mind that stress and pitch accent embody different types of prominence. Stress signals prominence at word level, whereas pitch accent signals prominence at phrase level. It is thus possible for a syllable to be stressed or unstressed depending on the pattern of lexical prominence of its word. Additionally, it is possible for a stressed syllable to be accented or unaccented depending on the intonational pattern of the utterance where it is found (e.g., Beckman and Edwards 1994; Ladd 1996; Pierrehumbert 1980). Convergence of both types of prominence on the same syllable raises the problem of how to tease them apart. To ensure that the effects of stress are not confused with those of pitch accent, this study scrutinises the linguistic structures under investigation under two different conditions: in normal declarative sentences where pitch accents are expected to cue stress, and in post-focal position where no pitch accent is expected and any prosodic prominence on a syllable can be considered a reliable indication of stress (for deaccenting in post-focal position, see de la Mota 1995, 1997; Face 2001).

Duration has proven to be a reliable stress cue in numerous languages (Beckman and Edwards 1994, for English; van Heuven and Sluijter 1996, Sluijter et al. 1997, for Dutch; Dogil and Williams 1999, for German; Ortega-Llebaria 2006, Ortega-Llebaria and Prieto 2008, 2010, for Spanish; etc.). Stressed syllables and their vowels tend to be significantly longer than unstressed ones. This tendency holds strong in Spanish; in fact, studies such as Canellada and Kuhlman-Madson (1987), Garrido et al. (1995) and Díaz-Campos (2000) argue that duration is the main cue to Spanish stress. One must be cautious, however, because stress is not the only factor affecting duration. Segmental structure, position within words and phrases, intonation, and speech rate do so too (e.g., Clark et al. 2007, p. 333; Ortega-Llebaria 2006, p. 111; Yang 1998, p. 42). To prevent interference by those additional factors, the present study takes the following precautions: the syllables to be compared share similar segmental structure and within- word position, the words containing them are kept away from phrase-final position, the intonational context surrounding the words under comparison is held constant, and relative rather than raw duration values are employed.

As for intensity, there was a time when it was regarded as the main stress cue in Spanish. Back in the early 1900s, Navarro Tomás (1918) claimed that intensity is the strongest cue to stress in this language, and to underscore its centrality, he dubbed this type of prominence acento de intensidad (‘intensity stress’). This view has been endorsed by recent acoustic studies such as González et al. (2006) and Urrutia Cardenas (2007). Their statistical analyses revealed that intensity is the most important indicator of stress at both word and phrase levels. Not all recent studies agree, though. Ortega-Llebaria and Prieto (2008) found that while stressed and unstressed vowels differ in overall intensity within declarative sentences (i.e., accented contexts), that is not so within parenthetical sentences (i.e., unaccented context). In addition, Ortega-Llebaria and Prieto (2010) established that although stressed vowels have greater overall intensity than unstressed ones, the difference is virtually imperceptible.

Spectral tilt, the intensity difference between the lower and higher frequencies of the spectrum, has also been linked to stress; however, its role is controversial. Evidence that it serves to cue stress is provided in Ortega-Llebaria and Prieto (2008). They found that stressed syllables, regardless of context, tend to have greater intensity in the higher frequencies, and consequently, their spectral tilt tends to be flatter than that of their unstressed counterparts. In direct conflict with this are the findings of another study by the same authors. Ortega-Llebaria and Prieto (2010) concluded that while spectral tilt does correlate with vowel quality in Catalan, it does not correlate with stress in Spanish.

These controversial findings, as well as those presented above on intensity, might be attributed to the failure of those studies to control mouth-to-microphone distance during recording. It should be noted that intensity levels decrease significantly when sound source moves further away from microphone. Therefore, for intensity and spectral tilt being reliable amplitude measurements of vowels and syllables, the distance between the speaker’s mouth and the microphone must be kept constant. To this end, the current study used a head-mounted microphone for recording.

1.3 Research questions

The current study aims to provide solid acoustic evidence concerning the stress pattern of Spanish lexical compounds by conducting a comparative analysis of all known stress correlates across different types of linguistic structure. Three specific questions will be addressed. First, is the initial constituent of lexical compounds pronounced with stress? Second, if so, which level of stress is it and what are its phonetic cues? And last, are there any cross-compound-type variations in the prosodic behaviour of lexical compounds?

2 Materials and Methods

2.1 Materials

Eighteen lexical compounds representing different syntactic and semantic types were carefully selected from those used in the previous studies. Each of them was matched with a non-compound word or a phrase that has a similar segmental structure. In view of the fact that formant structure, duration and intensity are properties that vary from one segment to another, the only case in which it would be warranted to attribute differences in acoustic values to suprasegmental phenomena is when they arise in the same segmental context. Therefore, it was imperative to ensure that the compound words and their counterparts have the same phonological strings, at least in the syllables where stress cues are expected to arise. An example of such word group is given in (2) below, where the syllables to be examined and compared (i.e., the target syllables) are underlined and the primary-stressed syllables are indicated in bold.

(2)
a. nochebuena ‘Christmas Eve’ lexical compound
b. nochecita ‘night’ non-compound word
c. no che buena ‘good night’ phrase

Additionally, in the selection of words, mid to high-frequency words were preferred over low-frequency ones, utilising Mark Davies’ Corpus del Español: Web/Dialects (2016) for assistance. For example, sacapuntas, with 548 counts, was favoured over sacamuelas (‘tooth-puller’) with 51 counts. The purpose of adopting this criterion was to make it easier for participants to produce natural speech in the elicitation tasks. The complete list of the selected items is attached in Appendix A (37 items in total).

For each selected item, two tasks, each involving a scripted question and a picture, were designed (sample tasks are given in Appendix B). The first task was designed to elicit an answer using the target item in a normal declarative sentence, while the second was intended to elicit it in post-focal position, i.e., an unaccented context. Moreover, in order to avoid the interference of boundary tones, phrase accents and final-lengthening effects, the task design ensured that the target items were located away from phrase-final or prepausal position.

Tasks that incorporate pictures with partially scripted conversation were selected as our elicitation method for two reasons. Firstly, non-verbal materials such as pictures are more promising for the elicitation of spontaneous, natural and meaningful speech (e.g., Himmelmann and Ladd 2008, p. 266). Secondly, partially scripted conversation enables researchers to control which linguistic structures will be produced (Caldecott and Koch 2014, p. 225). A task of this kind is typically performed as follows: while presented with a picture, the speaker is asked to answer a scripted question based on what s/he sees on that picture. Since questions and pictures are designed ahead of time, researchers can select the lexical items to be elicited but are still able to attain natural and spontaneous productions with the help of non-linguistic prompts and conversational style.

2.2 Participants

Thirty native Spanish speakers (three male and three female from each of the following countries: Spain, Mexico, Colombia, Chile and Argentina) participated in the experiment. They were either visiting or residing in Auckland and all of them were maintaining consistent use of Spanish on a daily basis. Of these participants, 21 were university students, while the remaining 9 also possessed a high-level education background. None of them had speaking or hearing problems. Moreover, none of them had training in phonetics or linguistics; that is, all were naive speakers. Their participation in this study was approved by the Human Participants Ethics Committee of the University of Auckland (June 2017; reference number 018498).

2.3 Data collection

Data were collected in a sound-treated recording studio at the City Campus of the University of Auckland. The participants’ responses were audio-recorded using a lightweight head-mounted microphone (Senneheister HSP2), which was connected to a recorder through an in-line preamplifier. Given that head-mounted microphones allow to maintain a consistent mouth-to-microphone distance throughout the recording, they are often considered the best option for measuring intensity.

Prior to performing the tasks, the participants were asked to provide their personal information (e.g., occupation and educational background). This informal talk served to help the participants feel more comfortable with using the microphone. Prior to the formal recording, the participants were provided with instructions and a sample task. They were encouraged to practice with the sample task as many times as they wanted. They were also assured that production errors would not matter, for they could repeat if they felt they made a mistake. When unnatural or unclear pronunciations were detected, the researcher requested the participants to repeat.

2.4 Data processing

The recordings were processed using Praat (Boersma and Weenink 2019). Target items were firstly extracted from the recordings and saved as separate sound files, i.e., 2,220 tokens (37 items * 30 speakers * 2 conditions). Then, 334 tokens (which constitute 15 % of the data) were removed as they were found affected by recording or production problems such as unexpected noises, unnatural pronunciations or pauses and failure to produce deaccented words in post-focal position. The removed items included 179 lexical compounds, 121 non-compound words and 34 phrases. After removal of all the disqualified items, there were a total of 1,886 tokens.

For each of the 1,886 tokens, boundaries of syllables and syllabic nuclei were manually labelled on separate tiers in Praat textgrids. This was done manually on the basis of visual inspection of waveforms and spectrograms (display frequency range: 0–8,000 Hz, bandwidth: 260 Hz), as well as with reference to auditory evaluation. Even though manual labelling is in principle the most precise and reliable method, labelling consistency needs to be monitored. To reduce inter and intra-item labelling discrepancies, explicit segmentation criteria were set out following Machač and Skarnitzl’s (2009) Principles of Phonetic Segmentation and Hualde’s (2005) introduction to acoustic characterisation of Spanish sounds. The criteria are as follows. Vowel boundaries were placed with reference to the appearance of full formant structure in the spectrogram. When two vowels were in contact, the midpoint of the formant-transition phase in the spectrogram was taken as the separation point. As for liquids and nasals, a simpler wave shape, lower amplitude and drop in the formant structure were attended in labelling their boundaries. The segmentation criteria for fricatives were fairly straightforward: the presence of high-frequency formant columns in the spectrogram and ‘spiky’ or ‘hairy’ waveform. Absence of formant structure provides a reliable labelling indication for both voiceless stops /p t k/ and voiced stops /b d g/. However, the segmentation of the approximant allophones of /b d g/ was problematic. In cases where changes in the formant structure and the waveform did not allow unambiguous boundary placement, the boundaries were placed at the temporal midpoints of the transition phases. Moreover, to increase the intra-item consistency, the manual segmentation was performed on an item-by-item basis, rather than a speaker-by-speaker basis.

2.5 Acoustic measurements

The acoustic measurements were also made using Praat (Boersma and Weenink 2019). The procedure involved running several scripts to automatically extract information on f0, pitch slope, duration, intensity and spectral tilt from each token.

2.5.1 f0

Following work by Campbell and Beckman (1997), Štekauer et al. (2007) and Plag et al. (2011), f0 was measured by calculating the mean f0 throughout the measurement intervals. The mean f0 in hertz (Hz) was extracted for the nucleus of the target syllable (V1) (i.e., the voiced portion of the target syllable where formants showed stable values). The script that was created for that purpose incorporated the automatic pitch range estimation proposed by Hirst (2011), which is known as the two-pass method. On a first pass, a wide pitch range (i.e., pitch floor and pitch ceiling at 50 Hz and 700 Hz, respectively) is set to include all possible human voices and f0 is calculated on the basis of that pitch range. On a second pass, the values of q 1 and q 3 (i.e., the first and third quartile) of the f0 distribution are taken and the values for the pitch floor and pitch ceiling are reset at 0.75*q 1 and 2.5*q 3 , respectively. Compared to the traditional pitch range setting, which uses default parameters, this automatic pitch-estimation method allows more accurate analyses on pitch and related measures (Hirst 2011, p. 59).

The pitch values that were extracted through the Praat script were then converted from linear hertz into logarithmic semitone (ST) values relative to 100 Hz. The semitone scale consists of units corresponding to equal perceptual intervals, and among all psycho-acoustic scales, it is the most effective to represent the perception of intonational spans in speech (Nolan 2003). The formula that was used for pitch semitone transformation is that in (3), where X stands for the pitch value in Hz.

(3)
f0Hz−ST = 12 * log2 (X/100)

2.5.2 Pitch slope

While mean f0 values mainly reflect the pitch level of the target syllable, it does not inform us with the pitch change or the shape of pitch contour within a given measurement interval: in principle, a vowel accompanied by a falling, a level or a rising pitch contour may have the same mean f0. However, for experimental studies on stress, information on pitch change is crucial as well since, as discussed in Section 1.2, rising and falling pitch accents are perceived with different levels of prominence (Prieto et al. 2005). Therefore, following Plag et al. (2011), pitch slopes were calculated as a way to capture the difference between falling and rising pitch contours. For V1 (i.e., the nucleus of the target syllable of each token), pitch slope (in ST/s) is derived with the formula in (4), where f0max and f0min are the maximum f0 and minimum f0 in the interval which starts from the beginning of target syllable nucleus and finishes at the end of the following syllable, and tmax and tmin are the times at which the maximum and minimum pitches are observed. It is noteworthy that, in Spanish declaratives, the most common type of prenuclear pitch accent normally begins to rise on the stressed syllable and frequently peaks on the post-tonic syllable (Hualde 2005, p. 243). To capture the complete pitch accent configuration, it was imperative to include the post-tonic syllable in the measurement interval.

(4)
Pitch slope = (f0max – f0min) / (tmax – tmin)

2.5.3 Duration measurements

The duration measurements were straightforward: syllabic nucleus intervals were measured in milliseconds. However, if one takes into account that a speaker’s articulation rate is not completely consistent in natural speech, comparing the raw duration values of V1 across items might not be informative. To ward off contextual factors such as speech rate, relative duration values were calculated as the duration ratio between the nucleus of the target syllable (V1) and the nucleus of the following unstressed syllable (V2). In this way, the relative duration values would represent the length of the target syllable nucleus by taking the length of the nucleus of an unstressed syllable as reference.

2.5.4 Intensity measurements

Intensity was measured in decibels (dB) as the overall intensity (or mean intensity) of the measurement intervals. Pitch floor, which determines the duration of the Gaussian (or double Kaiser) window within which the average intensity is weighted, was set automatically for each token using the two-pass method proposed by Hirst (2011). Same as for duration, relative values were calculated for intensity with a view to warding off contextual factors. Given that the decibel is already a logarithmic unit, relative intensity was calculated as the difference, instead of the ratio, between V1 intensity and V2 intensity.

2.5.5 Spectral tilt

Following Sluijter et al. (1995), spectral tilt was measured as both H1-A2 and H1-A3, which calculate the difference between the intensity of the first harmonic (H1) and the intensity of the second formant (A2) and the difference between the intensity of the first harmonic (H1) and the intensity of the third formant (A3), respectively.[5] The values of H1, A2 and A3 were extracted from V1 and V2 of each token. The extraction of these acoustic values was done through a script based on Mayer (2014), where the automatic method for pitch range estimation proposed by Hirst (2011) was also adopted. Lastly, relative values were calculated as the difference in H1-A2 between V1 and V2, and the difference in H1-A3 between V1 and V2.

2.6 Statistical analysis

The acoustic measures that were entered into statistical analyses are summarised in Table 1 below. All the statistical analyses were performed using the statistical software R (R Core Team 2017).

Table 1:

The acoustic measures that were entered into statistical analyses.

f0 Mean f0 (i.e., mean f0 of V1)
Pitch contour Pitch slope (i.e., pitch slope surrounding V1)
Duration Relative dur (i.e., duration ratio between V1 and V2)
Intensity Relative int (i.e., difference in mean intensity between V1 and V2)
Spectral tilt Relative H1-A2 (i.e., difference in H1A2 between V1 and V2)
Relative H1-A3 (i.e., difference in H1A3 between V1 and V2)

To test for significant differences in the above acoustic measures among different linguistic structure types (i.e., lexical compound, non-compound word and phrase), linear mixed model (LMM) analyses were carried out using the “lmer” function from the lmerTest package, which extends the lme4 package to provide p-values for the fixed effects through Satterthwaite’s or Kenward–Roger’s degrees of freedom approximations (Kuznetsova et al. 2017). LMM analyses were performed using each of the above acoustic measures as response, with linguistic structure type as predictor, the effects of linguistic structure type within each word group as random slopes, and participants as random intercepts.[6] Among those LMM analyses, the analyses on f0 and pitch slope were conducted on accented data only since they are irrelevant in unaccented data. For duration, intensity and spectral tilt measures, LMM analyses were carried on the full set of data, and then repeated on the unaccented data only. It should also be noted that in these analyses, the diagnostic qq-plots (using “qqnorm” and “qqline” functions) for pitch slope measures show strong evidence of outliers; outliers were excluded from the corresponding analyses with the help of the interquartile range method (IQR). Moreover, the duration measures were log-transformed due to their skewed distribution; log transformations significantly improved the normality of the duration distributions.

It is also imperative to acknowledge that conducting multiple statistical tests on the same dataset can lead to the multiple comparison problem, consequently elevating the familywise error rate (FWER), which represents the likelihood of committing a Type I error. In response to this concern, we adopted the Bonferroni-adjusted significance level of 0.005, as determined by the formula presented in (5), to effectively control the FWER.

(5)
αBonferroni = α / m
α is the desired overall significance level (i.e., 0.05).
m is the number of tests conducted on the same dataset (i.e., 10).

Lastly, to assess potential variation among types of lexical compounds, LMM analyses were conducted on the acoustic measures of lexical compounds, with the types of compounds (i.e., endocentric vs. exocentric, attributive vs. coordinate vs. subordinate) serving as predictors, and items and participants as random intercepts. The Bonferroni-adjusted significance level of 0.005 was also applied in these analyses to identify statistically significant differences.

3 Results

This section reports the statistical analysis results. Differences in the acoustic measurements among lexical compounds, non-compound words and phrases are firstly presented in Section 3.1. Results on f0, pitch slope, duration, intensity and spectral tilt are reported separately, followed by a summary of significant acoustic differences. Section 3.2 is dedicated to the results on cross-compound-type variations.

3.1 Acoustic differences among lexical compounds, non-compound words and phrases

3.1.1 Results on f0

Table 2 presents the results of LMM analyses on the mean f0 of the target syllable nucleus (i.e., V1). The estimates (b) reveal that phrases have the highest mean f0. More specifically, the mean f0 of V1 in phrases is 0.81 ST higher than that in lexical compounds, and 0.78 ST higher than that in non-compound words. However, the observed p-values suggest that the differences fail to meet the threshold established by the Bonferroni correction, i.e., 0.005.[7]

Table 2:

Results of LMM on mean f0.

Measures Factor b SE df t p
Mean f0 Lexical compounds (intercept) 7.14 0.64 62 11.23 <0.001
Non-compound words 0.03 0.20 14 0.15 =0.89
Phrases 0.81 0.29 3 2.82 =0.07
Non-compound words (intercept) 7.17 0.66 69 10.89 <0.001
Lexical compounds −0.03 0.20 14 −0.15 =0.89
Phrases 0.78 0.28 4 2.78 <0.05

3.1.2 Results on pitch slope

The results on pitch slope surrounding V1 are shown in Table 3 below. Phrases exhibit the highest pitch slope value, which is approximately 21 ST/s higher than that of lexical compounds and non-compound words. This is also evident in Figure 1 below, where the pitch slope values are mostly positive in phrases but generally negative in lexical compounds and non-compound words. Furthermore, if we recall the way in which pitch slope was calculated (Section 2.5.2), here the positive pitch slope values in phrases suggest that V1 of phrases are generally surrounded by rising pitch contours, whereas the negative pitch slope values in lexical compounds and non-compound words are a natural consequence of the frequent falling pitch contours surrounding their initial syllable.

Table 3:

Results of LMM on pitch slope.

Measures Factor b SE df t p
Pitch slope Lexical compounds (intercept) −23.55 3.30 13 −7.13 <0.001
Non-compound words 0.56 2.84 10 0.20 =0.85
Phrases 21.10 4.48 21 4.71 <0.001
Non-compound words (intercept) −22.98 3.16 12 −7.26 <0.001
Lexical compounds −0.56 2.84 10 −0.20 =0.85
Phrases 20.54 4.50 16 4.57 <0.001
Figure 1: 
Pitch slope values in lexical compounds (LC), non-compound words (NC) and phrases (Pr).
Figure 1:

Pitch slope values in lexical compounds (LC), non-compound words (NC) and phrases (Pr).

3.1.3 Results on duration

When it comes to duration, the results on relative dur (i.e., duration ratio between V1 and V2) are presented in Table 4 below. Significant differences are found between non-compound words, on the one hand, and lexical compounds and phrases, on the other.

Table 4:

Results of LMM on relative dur.

Measures Factor b SE df t p
Relative dur (log-transformed) Lexical compounds (Intercept) −0.002 0.07 14 −0.04 =0.97
Non-compound words −0.24 0.04 12 −5.41 <0.001
Phrases 0.06 0.05 4 1.31 =0.27
Non-compound words (Intercept) −0.24 0.07 14 −3.40 <0.01
Lexical compounds 0.24 0.04 12 5.41 <0.001
Phrases 0.30 0.03 7 8.72 <0.001

Given that the relative duration measures were log-transformed due to their skewed distribution, the estimates (b) in the above table do not represent the actual differences across linguistic structure types. To turn them into interpretable results, they were converted into a percentage by using the formula in (6).

(6)
Difference in percentage = (exp(b) – 1) * 100

The application of this formula reveals that relative dur is 27 % greater in lexical compounds than in non-compound words, and it is 35 % greater in phrases than in non-compound words. By limiting the analyses to unaccented tokens only, as shown in Table 5 below, only the difference between lexical compounds and non-compound words is significant.[8] Upon the application of (6), the duration values are 35 % greater in lexical compounds than in non-compound words.

Table 5:

Results of LMM on relative dur (unaccented context only).

Measures Factor b SE df t p
Relative dur (log-transformed) (unaccented) Lexical compounds (Intercept) 0.01 0.07 14 0.21 =0.84
Non-compound words −0.30 0.05 11 −6.06 <0.001
Phrases −0.11 0.07 3 −1.60 =0.21
Non-compound words (Intercept) −0.29 0.07 14 −4.34 <0.001
Lexical compounds 0.30 0.05 11 6.06 <0.001
Phrases 0.19 0.09 2 2.24 =0.17

3.1.4 Results on intensity and spectral tilt

Regarding the results of amplitude parameters, which are given in Tables 6 8 below, there are no significant differences among the three types of linguistic structure.[9]

Table 6:

Results of LMM on relative intV.

Measures Factor b SE df t p
Relative intV Lexical compounds (intercept) 0.71 0.34 15 2.05 =0.06
Non-compound words 0.33 0.48 14 0.69 =0.50
Phrases −0.19 0.61 5 −0.31 =0.77
Relative intV (unaccented) Lexical compounds (intercept) 0.59 0.35 17 1.68 =0.11
Non-compound words 0.22 0.41 13 0.53 =0.60
Phrases −0.27 0.50 4 −0.55 =0.61
Table 7:

Results of LMM on relative H1-A2.

Measures Factor b SE df t p
Relative H1-A2 Lexical compounds (intercept) −1.05 1.19 14 −0.88 =0.39
Non-compound words 0.18 1.00 14 0.18 =0.86
Phrases 0.43 3.21 3 0.13 =0.90
Relative H1-A2 (unaccented) Lexical compounds (intercept) 0.08 0.92 11 0.09 =0.93
Non-compound words −0.97 1.28 11 −0.76 =0.47
Phrases 1.21 1.89 6 0.65 =0.55
Table 8:

Results of LMM on relative H1-A3.

Measures Factor b SE df t p
Relative H1-A3 Lexical compounds (intercept) 1.05 1.40 14 0.75 =0.47
Non-compound words 0.94 0.87 14 1.07 =0.30
Phrases 0.23 2.14 4 0.11 =0.92
Relative H1-A3 (unaccented) Lexical compounds (intercept) 0.86 1.21 13 0.71 =0.49
Non-compound words 1.13 1.25 11 0.91 =0.38
Phrases 1.01 1.54 14 0.65 =0.52

3.1.5 Summary

All statistically significant differences among lexical compounds, non-compound words and phrases are assembled in Table 9 below. These differences can be described as follows: phrases exhibit greater pitch slope values than lexical compounds and non-compound words; lexical compounds show greater relative duration values than non-compound words in both accented and unaccented contexts.

Table 9:

Summary of acoustic differences among linguistic structure types.

Pitch slope Phrases > lexical compounds and non-compound words
Relative dur Lexical compounds > non-compound words

3.2 Results on cross-compound-type variations

Significant differences across different compound types have only been found in the relative H1-A2 values with a significance threshold set at 0.05, as seen in Table 10. Coordinate compounds have lower relative H1-A2 values than attributive and subordinate compounds; however, with the adoption of a Bonferroni-adjusted significance level at 0.005, these differences are no longer considered statistically significant.

Table 10:

Results of LMM on relative H1-A2 in different types of compound.

Measures Compound type b SE df t p
Relative H1-A2 Coordinate (intercept) −7.22 1.91 14 −3.77 <0.01
Attributive 7.26 2.36 14 3.08 <0.01
Subordinate 8.03 2.22 14 3.62 <0.01
Relative H1-A2 (unaccented) Coordinate (intercept) −3.16 1.78 10 −1.78 =0.10
Attributive 3.96 2.20 11 1.80 =0.10
Subordinate 4.84 2.08 11 2.33 <0.05

4 Discussion and conclusions

This study has compared a set of possible acoustic correlates of stress in Spanish lexical compounds, non-compound words and phrases in order to determine the level of prominence on the initial compound constituent. The main findings are summarised and discussed in Section 4.1. Subsequently, theoretical implications are explored in Section 4.2. Finally, Section 4.3 outlines prospective avenues for further investigation.

4.1 Main findings

4.1.1 Acoustic evidence for a secondary stress in Spanish lexical compounds

The results presented in Section 3.1 have revealed significant differences among lexical compounds, non-compound words and phrases in pitch slope and duration. Pitch slope distinguishes phrases from lexical compounds and non-compound words (Section 3.1.2). The pitch slope values are positive in phrases, whereas those values in lexical compounds and non-compound words are generally negative. These findings suggest that the target syllable in phrases is normally accompanied by a rising pitch contour; by contrast, falling pitch contour is the most frequent type that surrounds the target syllable in lexical compounds and non-compound words. The relative duration values, on the other hand, differentiate lexical compounds and non-compound words (Section 3.1.3). The target syllable nucleus of lexical compounds is consistently longer than that of non-compound words. We saw in Tables 4 and 5 that in both accented and unaccented contexts, the target syllable nucleus of lexical compounds is more than 20 % longer than that of non-compound words.

These results indicate different levels of prominence. Compared to the target syllable nucleus in non-compound words, the target syllable nucleus in lexical compounds is consistently longer, a sign of greater prominence. If we recall that at the experimental design stage all contextual and segmental factors on duration were carefully controlled for, any difference in duration, such as the one addressed above, can only be attributed to stress. This difference in duration, furthermore, is perceptible according to just noticeable difference (JND) uncovered by previous studies (e.g., Fant 1960; Lehiste 1970; Stevens 2000; etc.). Regardless of the absolute segmental duration, it has been observed that 10 % is a good rule of thumb for the JND of auditory perception; in other words, one speech signal is auditorily perceived as lasting longer than another speech signal if the former is at least 10 % longer than the latter (Quéne 2004, pp. 153–156). The duration difference that we found between the target syllable nucleus of lexical compounds and that of non-compound words, which is greater than 20 %, is obviously above the JND threshold.

Now we see that the duration difference evidences that the target syllable in lexical compounds is more prominent than the target unstressed syllable in non-compound words, i.e., the initial constituent of lexical compounds bears a stress. However, the duration results are not enough to determine whether it corresponds to primary or secondary stress. To do so, we need to factor in the acoustic differences, if any, between the target syllable of lexical compounds and the corresponding primary stressed syllable in phrases. This is when the pitch slope results become relevant. As discussed above, the results indicate that rising pitch contours were present in most phrasal tokens, whereas falling pitch contours were more frequent in non-compound words and lexical compounds. This can be taken as evidence of lower prosodic prominence in lexical compounds than in phrases on the grounds that rising pitch accents are generally deemed perceptually more prominent than falling and low pitch accents (e.g., Prieto et al. 2005, p. 375). Therefore, these results lead us to conclude that the prominence detected on the target syllable of lexical compounds is secondary stress.

It is important to note that this secondary stress is different from what is commonly understood as secondary stress in Spanish. That is to say that it is not the same phenomenon as rhetorical secondary stress. The differences are both phonetic and phonological. The phonetic difference is that they draw on different cues. As established above, the secondary stress in lexical compounds is cued by duration (i.e., consistently lengthened vowel duration), whereas the rhetorical secondary stress does not have a lengthening effect; as previously shown by Prieto and van Santen (1996), it tends to be associated with a falling pitch accent and an amplitude peak. Phonologically, the occurrence of secondary stress in lexical compounds is obligatory, with a predetermined locus; it consistently occurs on the syllable that would carry primary stress were the initial compound constituent to stand as an independent word. This form of stress is lexical in nature, constituting an integral part of the compound’s phonological structure, with its placement dictated by the lexical properties of the compound constituents. Conversely, rhetorical secondary stress, characterised by its optional presence and flexible locus, functions as a post-lexical mechanism employed for communicative purposes such as drawing attention or asserting authority (Hualde and Nadeu 2014).

4.1.2 No variation across compound types

As presented in Section 3.2, acoustic differences across compound types were only found in the H1-A2 values among attributive, coordinate and subordinate compounds. However, under the application of the Bonferroni correction with an alpha level set at 0.005, these differences cease to retain statistical significance. Consequently, they do not provide sufficient grounds for asserting varying levels of prominence across different types of compound words.

4.1.3 Conclusions

Overall, it can be concluded that Spanish lexical compounds, in general, feature a secondary stress on their initial constituent, i.e., the stress demotion hypothesis in (1b). By having examined all possible stress correlates in both accented and unaccented contexts, this study demonstrates that the secondary stress in lexical compounds is cued by a longer vowel duration. Its phonetic cue distinguishes this stress from rhetorical secondary stress, which is cued by a falling pitch accent and an amplitude peak (Prieto and van Santen 1996). Lastly, the present study found no evidence indicating that compounds of different syntactic or semantic types vary in stress.

4.2 Theoretical implications

The findings have important theoretical implications. To begin with, the findings support the use of stress as a diagnostic for compoundhood (Lieber and Štekauer, 2009, p. 8). What makes this possible is the presence of stress on each constituent of lexical compounds. This prosodic property differentiates lexical compounds from non-compound words, which in Spanish, are limited to a single word stress. A prosodic diagnostic will be useful when dealing with borderline cases, such as telenovela ‘soap opera’, autoescuela ‘driving school’, fotocopia ‘photocopy’, and the like. In such words, the morphological status of the initial constituent is not clear. If that constituent is regarded as a root, the overall structure would be a compound word, but if that constituent is regarded as a derivational affix, the overall structure would be a non-compound word (Gifre 1984, p. 223; Rainer and Varela 1992, p. 121). Future experimental studies will be able to settle this issue by applying the prosodic diagnostic.

We also gain insight into the prosodification of Spanish compounds. The finding that Spanish lexical compounds are produced with a secondary stress on the initial constituent and a primary stress on the final constituent rules out some options. They cannot be represented as phonological phrases (PPhs) because that would require retention of primary stress on each constituent word. Their representation as prosodic word (PW) is not viable either, as that would overlook the fact that this language assigns only one stress per PW. An alternative to consider is the representation of Spanish lexical compounds as instances of prosodic word recursion (e.g., Itô and Mester 2007, 2009, 2012; Selkirk 2011). On this view, the compound would be a recursive PW, within which there would be two embedded PWs. These units would be prosodically asymmetrical. The second would function as the head, in virtue of which it would retain its primary stress. The first embedded PW would, by contrast, be a subordinate to the head, and thus, would inevitably have its stress demoted from primary to secondary. In a phonological model where PW recursion is not allowed, the option would be to represent the compound as an intermediate constituent between the PW and the PPh, e.g., a prosodic word group (Vigário 2003, 2010), or a composite group (Vogel 2009, 2010, 2020). The PWs arising from the compound constituents would then be embedded within this new prosodic category. This analysis would also involve prosodic subordination: the second embedded PW would be the head, in virtue of which it would retain its primary stress, whereas the first embedded PW would be a subordinate to the head, and thus, its stress would be demoted from primary to secondary.

4.3 Directions for future research

Although the current study offers fresh insight into the prosody of Spanish lexical compounds, it leaves a number of gaps to be filled in the future. Firstly, while having identified acoustic differences in the prosodic behaviour among non-compound words, lexical compounds and phrases, this study defers to subsequent research the task of determining the perceptual salience of these differences. This inquiry into perceptual salience is crucial for understanding not only how these prosodic variations are processed cognitively by listeners but also how phonetic cues contribute to the distinction of different linguistic structure types.

Secondly, recall that, for the purpose of obtaining natural speech, infrequent items were left out during word selection. As a result, the question of whether lexical compounds with relatively low usage frequencies exhibit the same stress pattern or not cannot be answered here. Lang (1992) suggests that the frequency of use is a useful criterion that helps to distinguish compounds from other types of linguistic structure (p. 94). This may imply that less frequently used compounds could present variations in stress pattern not observed in this study, highlighting the need for further investigation into how frequency impacts the prosodic behaviour of compounds.

Lastly, despite the implementation of measures to render the elicited speech as naturalistic as possible, it remains categorised as laboratory speech, which diverges from spontaneous speech in several phonetic dimensions (Face 2003). Therefore, subsequent research is necessary to examine compound stress patterns within spontaneous speech to determine the extent to which these findings are replicated in more naturalistic linguistic contexts.


Corresponding author: Dongmei Lin, School of Foreign Languages, Qingdao University, 308 Ningxia Road, Qingdao, 266071, China, E-mail:

Appendix A: List of selected items

The different types of compound including subordinate, attributive, coordinate, endocentric and exocentric are abbreviated as SUB, ATT, COO, END and EXO, respectively; the target syllable of each item is underlined.

Lexical compounds Non-compound words and phrases
aguanieve ‘sleet’ COO, END aguacero ‘downpour’
agua dulce ‘sweet water’
altavoz ‘speaker’ ATT, EXO albañil ‘bricklayer’
baloncesto ‘basketball’ SUB, END baloncito ‘small ball’
bocacalle ‘entrance into a street’ SUB, END bocanada ‘puff’
caradura ‘cheeky (person)’ ATT, EXO caramelo ‘candy’
coliflor ‘cauliflower’ COO, END colibrí ‘hummingbird’
lavamanos ‘washbasin’ SUB, EXO lavadoras ‘washing machine’
lavaplatos ‘dishwasher’ SUB, EXO
medianoche ‘midnight’ ATT, END medianero ‘middle’
nochebuena ‘Christmas Eve’ ATT, END noche buena ‘good night’
nochecita ‘night’
ochocientos ‘eight hundred’ ATT, END oponentes ‘opponent’
ocho dientes ‘eight teeth’
parabrisas ‘windscreen’ SUB, EXO paraguayos ‘Paraguayan’
pasaporte ‘passport’ SUB, EXO pasajero ‘passenger’
pasatiempo ‘pastime’ SUB, EXO
pelirrojo ‘red-haired’ ATT, EXO peligroso ‘dangerous’
sacacorchos ‘corkscrew’ SUB, EXO sacarina ‘saccharine’
sacapuntas ‘pencil sharpener’ SUB, EXO
veintiuna ‘twenty one’ COO, END veintena ‘twenty’
veinte años ‘twenty years’

Appendix B: Sample Tasks

1.
a.
A picture showing a Spanish passport
Question: ¿Qué es? (‘What’s this?’)
Expected answer: Es un pasaporte español.
(‘It’s a Spanish passport.’)
b.
A picture where a man is showing his Spanish passport to a woman
Question: ¿La mujer tiene un pasaporte español?
(‘Does the woman have a Spanish passport?’)
Expected answer: No, EL HOMBRE tiene un pasaporte español.
(‘No, the man has a Spanish passport.’)
2.
a.
A picture showing passengers on a plane
Question: ¿Cómo se llaman las personas que viajan por avión?
(‘How do you call the people who travel by plane?’)
Expected answer: Se llaman pasajeros de avión.
(‘They are called airplane passengers.’)
b.
A picture showing a pilot and a flight attendant
Question: ¿El piloto atiende a los pasajeros de avión?
(‘Does the pilot look after the airplane passengers?’)
Expected answer: No, LA AZAFATA atiende a los pasajeros de avión.
(‘No, the flight attendant looks after the airplane passengers.’)
3.
a.
A picture showing two basketball and a football
Question: ¿Qué son? (‘What are these?’)
Expected answer: Son dos baloncestos y un fútbol.
(‘They are two basketball and a football.’)
b.
A picture where a man is paying for the balls and the cashier is a woman
Question: ¿La mujer compra los baloncestos y el fútbol?
(‘Does the woman buy the basketballs and the football?’)
Expected answer: No, EL HOMBRE compra los baloncestos y el fútbol.
(‘No, the man buys the basketballs and the football.’)
4.
a.
A picture showing chocolate balls
Question: ¿Qué son? (‘What are these?’)
Expected answer: Son baloncitos de chocolate.
(‘They are chocolate balls.’)
b.
A picture where a man is paying for the chocolate balls
Question: ¿La mujer compra los baloncitos de chocolate?
(‘Does the woman buy the chocolate balls?’)
Expected answer: No, EL HOMBRE compra los baloncitos de chocolate.
(‘No, the man buys the chocolate balls.’)
5.
a.
A picture showing two green dishwashers
Question: ¿Qué son estos? (‘What are these?’)
Expected answer: Son dos lavaplatos verdes.
(‘These are two green dishwashers.’)
b.
A picture where a man is paying for the two green dishwashers
Question: ¿La mujer compra los lavaplatos verdes?
(‘Does the woman buy the green dishwashers?’)
Expected answer: No, EL HOMBRE compra los lavaplatos verdes.
(‘No, the man buys the green dishwashers.’)
6.
a.
A picture showing two black washing machines
Question: ¿Qué son? (‘What are these?’)
Expected answer: Son dos lavadoras negras.
(‘These are two black washing machines.’)
b.
A picture where a man is paying for the black washing machines
Question: ¿La mujer compra las lavadoras negras?
(‘Does the woman buy the black washing machines?’)
Expected answer: No, EL HOMBRE compra las lavadoras negras.
(‘No, the man buys the black washing machines.’)

References

Aguilar, Lourdes, Carme De-la-Mota & Pilar Prieto. 2009. Sp_ToBi training materials. Available at: http://prosodia.upf.edu/sp_tobi/.Search in Google Scholar

Beckman, Mary E. & Jan Edwards. 1994. Articulatory evidence for differentiating stress categories. Papers in Laboratory Phonology 3. 7–33. https://doi.org/10.1017/cbo9780511659461.002.Search in Google Scholar

Beckman, Mary E. & Janet B. Pierrehumbert. 1986. Intonational structure in Japanese and English. Phonology Yearbook 3. 255–309. https://doi.org/10.1017/s095267570000066x.Search in Google Scholar

Boersma, Paul & David Weenink. 2019. Praat: Doing phonetics by computer (6.0.19). Available at: http://www.praat.org/.Search in Google Scholar

Bolinger, Dwight L. 1958. A theory of pitch accent in English. Word 14(2–3). 109–149. https://doi.org/10.1080/00437956.1958.11659660.Search in Google Scholar

Bolinger, Dwight L. 1962. ‘Secondary stress’ in Spanish. Romance Philology 15(3). 273–279.Search in Google Scholar

Bollinger, Dwight L. & Marion Hodapp. 1961. Acento melódico, acento de intensidad. Boletín de Filología de la Universidad 13. 33–48.Search in Google Scholar

Bustos Gisbert, Eugenio. 1986. La composición nominal en español. Salamanca: Ediciones Universidad de Salamanca.Search in Google Scholar

Caballero, Gabriela & Lucien Carroll. 2015. Tone and stress in Choguita Rarámuri (Tarahumara) word prosody. International Journal of American Linguistics 81. 459–493. https://doi.org/10.1086/683157.Search in Google Scholar

Caldecott, Marion & Karsten Koch. 2014. Using mixed media tools for eliciting discourse in indigenous languages. Language Documentation & Conservation 8. 209–240.Search in Google Scholar

Campbell, Nick & Mary Beckman. 1997. Stress, prominence, and spectral tilt. In Antonis Botinis (ed.), Proceedings of the ESCA workshop intonation: Theory, models and applications, September 18–20, 67–71. Arthens. Available at: https://www.iscaspeech.org/archive_open/archive_papers/int_97/inta_067.pdf.Search in Google Scholar

Canellada, María Josefa & John Kuhlmann-Madsen. 1987. Pronunciación del español. Lengua hablado y literaria. Madrid: Editorial Castalia.Search in Google Scholar

Clark, John, Collin Yallop & Janet Fletcher. 2007. An introduction to phonetics and phonology, 3rd edn. Oxford: Wiley-Blackwell.Search in Google Scholar

Contreras, Heles. 1963. Sobre el acento en español. Boletín del Instituto de Filología de la Universidad de Chile 15. 223–237.Search in Google Scholar

Contreras, Heles. 1964. Tiene el español un acento de intensidad? Boletín del Instituto de Filología de la Universidad de Chile 16. 237–239.Search in Google Scholar

de la Mota, Carme. 1995. La representación gramatical de la información nueva en el discurso. Universitat Autònoma de Barcelona Doctoral dissertation.Search in Google Scholar

de la Mota, Carme. 1997. Prosody of sentences with contrastive new information in Spanish. In Antonis Botinis (ed.), Intonation: Theory, Models and Applications. Proceedings of an ESCA Workshop, 75–78. Athens, Greece.Search in Google Scholar

Díaz-Campos, Manuel. 2000. The phonetic manifestation of secondary stress in Spanish. In Héctor Campos, Elena Herburger, Alfonso Morales-Front & Thomas J. Walsh (eds.), Hispanic Linguistics at the turn of the millenium: Papers from the 3rd Hispanic Linguistic Symposium, 49–65. Somerville: Cascadilla.Search in Google Scholar

Dogil, Grzegorz & Briony Williams. 1999. The phonetic manifestation of word stress. In Harry van der Hulst (ed.), Word prosodic systems in the languages of Europe, 273–334. Berlin: De Gruyter Mouton. Available at: https://hdl.handle.net/2022/25830.Search in Google Scholar

Enríquez, Emilia-V., Celia Casado & Andrés Santos. 1989. La percepción del acento en español. Lingüística Española Actual 11. 241–269.Search in Google Scholar

Fabregat, Beatriz Belda & Carme de la Mota. 2010. Sobreacentuación en la locución de boletines informativos en radio: Radio Nacional de España. In Pablo Cano López (ed.), XXXIX Simposio Internacional de la Sociedad Espanola de Lingüıstica (SEL), 67. Santiago de Compostela: Facultad de Filología de la Universidad de Santiago de Compostela.Search in Google Scholar

Face, Timothy L. 2001. Contrastive focus and global intonation patterns in Spanish. Linguistic Association of Korea Journal 9(2). 1–21.Search in Google Scholar

Face, Timothy L. 2003. Intonation in Spanish declaratives: Differences between lab speech and spontaneous speech. Catalan Journal of Linguistics 2. 115–131. https://doi.org/10.5565/rev/catjl.46.Search in Google Scholar

Fant, Gunnar. 1960. Acoustic theory of speech production. The Hague: Mouton & Co.-S-Gravenhage.Search in Google Scholar

Figueras, Carolina & Marisa Santiago. 1993. Investigaciones sobre la naturaleza del acento a través del Visi-pitch. Estudios de Fonética Experimental 5. 81–112.Search in Google Scholar

Gabriel, Christoph, Ingo Feldhausen, Andrea Pešková, Laura Colantoni, Su-Ar Lee, Valeria Arana & Leopoldo Labastía. 2010. Argentinian Spanish intonation. In Pilar Prieto & Paulo Proseano (eds.), Transcription of intonation of the Spanish language, 285–317. München: Lincom.Search in Google Scholar

Garrido, Juan M., Joaquim Llisterri, Carme de la Mota, Rafael Marín & Antonio Ríos. 1995. Prosodic markers at syntactic boundaries in Spanish. In Proceedings of the 13th International Congress of Phonetic Sciences, Stockholm, Sweden, 13–19 August, 1995 (Vol. 2), 370–373.Search in Google Scholar

Gifre, Emma Martinell. 1984. De la complementación a la composición en el sintagma nominal. Revista Española de Lingüística 14(2). 223–244.Search in Google Scholar

Gili Gaya, Samuel. 1975. Elementos de fonética general. Madrid: Gredos.Search in Google Scholar

González, Luis Candia, Hernán Urrutia Cárdenas & Teresa Fernández Ulloa. 2006. Rasgos acústicos de la prosodia acentual del español. Boletín de Filología XLI. 11–44.Search in Google Scholar

Gussenhoven, Carlos. 2004. The phonology of tone and intonation. Cambridge: Cambridge University Press.10.1017/CBO9780511616983Search in Google Scholar

Hadfield, Jarrod D. 2010. MCMC methods for multi-response generalized linear mixed models: The MCMCglmm R package. Journal of Statistical Software 33(2). 1–22. https://doi.org/10.18637/jss.v033.i02.Search in Google Scholar

Harris, James Wesley. 1983. Syllable structure and stress in Spanish: A nonlinear analysis. Cambridge, Mass: MIT Press.Search in Google Scholar

Harris, James Wesley. 1991. With respect to metrical constituents in Spanish. In Héctor Campos & Fernando Martínez-Gil (eds.), Current studies in Spanish linguistics, 447–473. Washington D.C.: Georgetown University Press.Search in Google Scholar

Himmelmann, Nikolaus P. & D. Robert Ladd. 2008. Prosodic description: An introduction for fieldworkers. Language Documentation and Conservation 2(2). 244–274.Search in Google Scholar

Hirst, Daniel. 2011. The analysis by synthesis of speech melody. Journal of Speech Sciences 1(1). 55–83. https://doi.org/10.20396/joss.v1i1.15011.Search in Google Scholar

Hualde, José Ignacio. 2003. El modelo métrico y autosegmental. In Pilar Prieto (ed.), Teorías de la entonación, 155–184. Barcelona: Ariel.Search in Google Scholar

Hualde, José Ignacio. 2005. The sounds of Spanish. Cambridge: Cambridge University Press.Search in Google Scholar

Hualde, José Ignacio. 2007. Stress removal and stress addition in Spanish. Journal of Portuguese Linguistics 5(2)/6(1). 59–89. https://doi.org/10.5334/jpl.145.Search in Google Scholar

Hualde, José Ignacio. 2009. Unstressed words in Spanish. Language Sciences 31. 199–212. https://doi.org/10.1016/j.langsci.2008.12.003.Search in Google Scholar

Hualde, José Ignacio. 2010. Secondary stress and stress clash in Spanish. In Marta Ortega Llebaria (ed.), Selected proceedings of the 4th conference on laboratory approaches to Spanish phonology, 11–19. Somerville MA: Cascadilla Proceedings Project.Search in Google Scholar

Hualde, José Ignacio. 2012. Stress and rhythm. In José Ignacio Hualde, Antxon Olarrea & Erin O’Rour (eds.), The Handbook of Hispanic linguistics, 153–171. Oxford: Wiley-Blackwell.10.1002/9781118228098.ch8Search in Google Scholar

Hualde, José Ignacio & Marianna Nadeu. 2014. Rhetorical stress in Spanish. In Harry van der Hulst (ed.), Word stress: Theoretical and typological issues, 228–252. Cambridge: Cambridge University Press.10.1017/CBO9781139600408.010Search in Google Scholar

Hualde, José Ignacio & Pilar Prieto. 2015. Intonational variation in Spanish: European and American varieties. In Sónia Frota & Pilar Prieto (eds.), Intonation in romance, 350–391. Oxford: Oxford University Press.10.1093/acprof:oso/9780199685332.003.0010Search in Google Scholar

Itô, Junko & Armin Mester. 2007. Prosodic adjunction in Japanese compounds. MIT Working Papers in Linguistics 55. 95–111.Search in Google Scholar

Itô, Junko & Armin Mester. 2009. The extended prosodic word. In Artemis Alexiadou, T. Alan Hall, Janet Grijzenhout & Baris Kabak (eds.), Phonological domains: Universals and deviations, 135–194. Mouton de Gruyter.10.1515/9783110217100.2.135Search in Google Scholar

Itô, Junko & Armin Mester. 2012. Recursive prosodic phrasing in Japanese. In Prosody matters: Essays in honor of Elisabeth Selkirk, 280–303. Sheffield & Bristol, Conn.: Equinox.Search in Google Scholar

Kaland, Constantijn. 2019. Acoustic correlates of word stress in Papuan Malay. Journal of Phonetics 74. 55–74. https://doi.org/10.1016/j.wocn.2019.02.003.Search in Google Scholar

Kornfeld, Laura Malena. 2009. IE, Romance: Spanish. In Rochelle Lieber & Pavol Stekauer (eds.), The Oxford handbook of compounding, 417–435. Oxford: OUP.Search in Google Scholar

Kuder, Emily. 2020. Rhetorical stress in Spanish second language classroom instruction. Hispania 103(2). 225–244. https://doi.org/10.1353/hpn.2020.0034.Search in Google Scholar

Kuznetsova, Alexandra, Per B. Brockhoff & Rune Haubo Bojesen Christensen. 2017. lmerTest package: Tests in linear mixed effects models. Journal of Statistical Software 82(13). 1–26. https://doi.org/10.18637/jss.v082.i13.Search in Google Scholar

Ladd, D. Robert. 1996. Intonational phonology. Cambridge: Cambridge University Press.Search in Google Scholar

Lang, Mervyn F. 1992. Formación de palabras en español: Morfología derivativa productiva en el léxico moderno. Madrid: Cátedra.Search in Google Scholar

Lehiste, Ilse. 1970. Suprasegmentals. Cambridge, MA: The MIT Press.Search in Google Scholar

Lieber, Rochelle & Pavol, Štekauer. 2009. Introduction: Status and definition of compounding. In The Oxford handbook of compounding, 3–18. Oxford: OUP.Search in Google Scholar

Llisterri, Joaquim, María Machuca, Carme De la Mota, Montserrat Riera & Antonio Ríos. 2005. The perception of lexical stress in Spanish. In Maria-Josep Solé, Daniel Recasens & Joaquín Romero (eds.), Proceedings of the 15th International Conference of Phonetic Sciences, 2023–2026. Barcelona: Causal Productions.Search in Google Scholar

Machač, Pavel & Radek Skarnitzl. 2009. Principles of phonetic segmentation. Prague: Epocha.Search in Google Scholar

Mayer, James. 2014. Spectral_profile.txt Version 2014/05/02. Available at: http://praatpfanne.lingphon.net/.Search in Google Scholar

Monroy-Casas, Rafael. 1977. Acento de intensidad en español? Revista de Español Vivo 32. 18–26.Search in Google Scholar

Moyna, María Irene. 2021. The diachrony of Spanish compounding. In Fábregas Antonio, Víctor Acedo-Matellán, Grant Armstrong, María Cristina Cuervo & Isabel Pujol (eds.). The Routledge Handbook of Spanish Morphology. 303–316. New York: Routledge.10.4324/9780429318191-23Search in Google Scholar

Navarro Tomás, Tomás. 1918. Manual de pronunciación española. Madrid: Publicaciones de la Revista de filología española.Search in Google Scholar

Nolan, Francis. 2003. Intonational equivalence: An experimental evaluation of pitch scales. In Proceedings of the 15th International Congress of Phonetic Sciences, 771–774.Search in Google Scholar

Ortega-Llebaria, Marta. 2006. Phonetic cues to stress and accent in Spanish. In Selected Proceedings of the 2nd Conference on Laboratory Approaches to Spanish Phonetics and Phonology, 104–118.Search in Google Scholar

Ortega-Llebaria, Marta & Pilar Prieto. 2008. Disentangling stress from accent in Spanish. In Pilar Prieto, Joan Mascaró & Maria Josep Solé (eds.), Segmental and prosodic issues in Romance phonology, 155–175. Amsterdam: Benjamins.10.1075/cilt.282.11ortSearch in Google Scholar

Ortega-Llebaria, Marta & Pilar Prieto. 2010. Acoustic correlates of stress in central Catalan and Castilian Spanish. Language and Speech 54(1). 73–97. https://doi.org/10.1177/0023830910388014.Search in Google Scholar

Pierrehumbert, Janet. 1980. The phonology and phonetics of English intonation. MIT Doctoral thesis. Published 1988 by IULC, Bloomington, IN.Search in Google Scholar

Plag, Ingo, Gero Kunter & Mareile Schramm. 2011. Acoustic correlates of primary and secondary stress in North American English. Journal of Phonetics 39(3). 362–374. https://doi.org/10.1016/j.wocn.2011.03.004.Search in Google Scholar

Prieto, Pilar & Jan van Santen. 1996. Secondary stress in Spanish: Some experimental evidence. In Claudia Parodi, Carlos Quícoli, Mario Saltarelli & Maria Luisa Zubizarreta (eds.), Aspects of romance linguistics, 337–356. Washington, DC: Georgetown University Press.Search in Google Scholar

Prieto, Pilar, Mariapaola d’Imperio & Barbara Gili Fivela. 2005. Pitch accent alignment in Romance: Primary and secondary associations with metrical structure. Language and Speech 48(4). 359–396. https://doi.org/10.1177/00238309050480040301.Search in Google Scholar

Prieto, Pilar, Chilin Shih & Holly Nibert. 1996. Pitch downtrend in Spanish. Journal of Phonetics 24(4). 445–473. https://doi.org/10.1006/jpho.1996.0024.Search in Google Scholar

Quéne, Hugo. 2004. What is the Just noticeable difference of tempo in speech? In Hugo Quené & Vincent van Heuven (eds.), On Speech and language. Studies for Sieb G. Nooteboom, 149–158. Utrecht, The Netherlands: Netherlands Graduate School of Linguistics (LOT).Search in Google Scholar

Quilis, Antonio. 1971. Caracterización fonética del acento español. Travaux de Linguistique et de Littérature 9. 53–72.Search in Google Scholar

Quilis, Antonio. 1981. Fonética acústica de la lengua española. Madrid: Biblioteca Romántica Hisánica, Gredos.Search in Google Scholar

Quilis, Antonio. 1993. Tratado de fonología y fonética españolas. Madrid: Gredos.Search in Google Scholar

R Core Team. 2017. R: A language and environment for statistical computing. R Foundation for Statistical Computing. Available at: https://www.R-project.org/.Search in Google Scholar

Rainer, Franz. 1996. Inflection inside derivation: Evidence from Spanish and Portuguese. In Yearbook of Morphology 1995, 83–91. Dordrecht: Springer Netherlands.10.1007/978-94-017-3716-6_5Search in Google Scholar

Rainer, Franz & Soledad Varela. 1992. Compounding in Spanish. Revista Di Linguistica 4(1). 117–142.Search in Google Scholar

Rao, Rajiv. 2011. Intonation in Spanish classroom-style didactic speech. Journal of Language Teaching & Research 2(3). https://doi.org/10.4304/jltr.2.3.493-507.Search in Google Scholar

Rao, Rajiv. 2015. On the phonological status of Spanish compound words. Word Structure 8(1). 84–118. https://doi.org/10.3366/word.2015.0074.Search in Google Scholar

Real Academia Española. 2011. El acento. In Nueva Gramática de la Lengua Española Fonética y fonología, 355–433. Asociación de Academias de la Lengua Española, Espasa Libros.Search in Google Scholar

Real Academia Española. n.d. Portaviones. In Diccionario panhispánico de dudas, 2nd edn. https://www.rae.es/dpd/portaviones (accessed 21 October 2023).Search in Google Scholar

Roca, Iggy. 1986. Secondary stress and metrical rhythm. Phonology Yearbook 3. 341–370. https://doi.org/10.1017/s0952675700000683.Search in Google Scholar

Scalise, Sergio & Antonietta Bisetto. 2009. The classification of compounds. In The Oxford handbook of compounding, 34–53. Oxford: OUP.Search in Google Scholar

Scalise, Sergio & Irene Vogel. 2010. Why compounding? In Irene Vogel & Sergio Scalise (eds.), Cross-disciplinary issues in compounding, 1–18. Amsterdam/Philadelphia: John Benjamins Publishing.10.1075/cilt.311.02scaSearch in Google Scholar

Selkirk, Elisabeth. 2011. The syntax-phonology interface. In The handbook of phonological theory, 435–484.10.1002/9781444343069.ch14Search in Google Scholar

Shwayder, Kobey. 2015. Word-level recursion in Spanish compounds. University of Pennsylvania Working Papers in Linguistics 21(1). 1–10.Search in Google Scholar

Sluijter, Agaath M.C., Stefanie Shattuck-Hufnagel, Kenneth N. Stevens & Vincent J. van Heuven. 1995. Supralaryngeal resonance and glottal pulse shape as correlate of stress and accent. In Proceedings of 13th ICPhS, 630–633. Stockholm.Search in Google Scholar

Sluijter, Agaath M.C., Vincent J. Van Heuven & Jos J.A. Pacilly. 1997. Spectral balance as a cue in the perception of linguistic stress. The Journal of the Acoustical Society of America 101(1). 503–513. https://doi.org/10.1121/1.417994.Search in Google Scholar

Štekauer, Pavol, Július Zimmermann & Renáta Gregová. 2007. Stress in compounds: An experimental research. Acta Linguistica Hungarica 54(2). 193–215. https://doi.org/10.1556/ALing.54.2007.2.5.Search in Google Scholar

Stevens, Kenneth N. 2000. Acoustic phonetics. Cambridge, MA: The MIT Press.10.7551/mitpress/1072.001.0001Search in Google Scholar

Stockwell, Robert P., J. Donald Bowen & Ismael Silva-Fuenzalida. 1956. Spanish juncture and intonation. Language 32(4). 641–665. https://doi.org/10.2307/411088.Search in Google Scholar

Suñer, Margarita. 1975. Spanish adverbs: Support for the phonological cycle? Linguistic Inquiry 6(4). 602–605.Search in Google Scholar

Urrutia Cardenas, Hérnan. 2007. La naturaleza del acento en español: nuevos datos y perspectivas. Revista de Lingüística Teórica y Aplicada 45(2). 135–142. https://doi.org/10.4067/S0718-48832007000200010.Search in Google Scholar

Val Álvaro, José Francisco. 1999. La composición. In Ignacio Bosque & Violeta Demonte (eds.), Gramática descriptiva de la lengua española, 4757–4841. Madrid: Espasa.Search in Google Scholar

van Heuven, Vincent & Agaath Sluijter. 1996. Notes on the phonetics of word prosody. In Rob Goedemans, Harry van der Hulst & Ellis Visch (eds.), Stress patterns of the world, 233–269. Holland Academic Graphics. Available at: https://scholarlypublications.universiteitleiden.nl/access/item%3A2883471/view.Search in Google Scholar

Varela, Soledad. 2012. Derivation and compounding. In José Ignacio Hualde, Antxon Olarrea & Erin O’Rourke (eds.), The Handbook of Hispanic linguistics, 209–226. Wiley-Blackwell.10.1002/9781118228098.ch11Search in Google Scholar

Vigário, Marina. 2003. The prosodic word in European Portuguese. Berlin/Boston: Mouton de Gruyter.10.1515/9783110900927Search in Google Scholar

Vigário, Marina. 2010. Prosodic structure between the prosodic word and the phonological phrase: Recursive nodes or an independent domain? Linguistic Review 27(4). 485–530. https://doi.org/10.1515/tlir.2010.017.Search in Google Scholar

Vogel, Irene. 2009. The status of the clitic group. In Artemis Alexiadou, T. Alan Hall, Janet Grijzenhout & Baris Kabak (eds.), Phonological domains: Universals and deviations, 15–46. Berlin: Mouton de Gruyter. Available at: http://ebookcentral.proquest.com/lib/auckland/detail.action?docID=453852.10.1515/9783110217100.1.15Search in Google Scholar

Vogel, Irene. 2010. The phonology of compounds. In Irene Vogel & Sergio Scalise (eds.), Cross-disciplinary issues in compounding, 145–163. Amsterdam/Philadelphia: John Benjamins.10.1075/cilt.311.12vogSearch in Google Scholar

Vogel, Irene. 2020. Life after the strict layer hypothesis prosodic structure geometry. In Hongming Zhang & Youyong Qian (eds.), Prosodic studies: Challenges and prospects, 9–60. London: Routledge.10.4324/9781351212878-2Search in Google Scholar

Yang, Li-chiung. 1998. Contextual effects on syllable duration. In The Third ESCA/COCOSDA Workshop on Speech Synthesis. Available at: http://www.iscaaspeech.org/archive.Search in Google Scholar

Zagona, Karen. 1990. Mente adverbs, compound interpretation and the projection principle. Probus 2(1). 1–30. https://doi.org/10.1515/prbs.1990.2.1.1.Search in Google Scholar

Received: 2024-04-24
Accepted: 2024-04-25
Published Online: 2024-05-21
Published in Print: 2025-05-26

© 2024 the author(s), published by De Gruyter, Berlin/Boston

This work is licensed under the Creative Commons Attribution 4.0 International License.

Downloaded on 23.9.2025 from https://www.degruyterbrill.com/document/doi/10.1515/probus-2024-0002/html
Scroll to top button