Abstract
Phonological knowledge is influenced by a variety of cues that reflect predictability (e.g. semantic predictability). Listeners utilize various aspects of predictability when determining what they have heard. In the present paper, we ask how aspects of the acoustic phonetic signal (e.g. speaking rate) interact with other knowledge reflecting predictability (e.g. lexical frequency and collocation strength) to influence how speech is perceived. Specifically, we examine perception of function words by native and non-native speakers. Our results suggest that both native and non-native speakers are sensitive to factors that influence the predictability of the signal, including speaking rate, frequency, and collocation strength, when listening to speech, and use these factors to predict the phonological structure of stretches of ambiguous speech. However, reliance on these cues differs as a function of their experience and proficiency with the target language. Non-native speakers are less sensitive to some aspects of the acoustic phonetic signal (e.g. speaking rate). However, they appear to be quite sensitive to other factors, including frequency. We discuss how these results inform our understanding of the interplay between predictability and speech perception by different listener populations and how use of features reflecting predictability interacts with recovery of phonological structure of spoken language.
References
Baese-Berk, M. M., C. C. Heffner, L. C. Dilley, M. A. Pitt, T. H. Morrill & J. D. McAuley. 2014. Long-term temporal tracking of speech rate affects spoken-word recognition. Psychological Science 25(8). 1546–1553.10.1177/0956797614533705Search in Google Scholar
Baese-Berk, M. M., T. H. Morrill & L. C. Dilley. 2016. Do non-native speakers use context speaking rate in spoken word recognition? Proceedings of Speech Prosody 8. 979–983.10.21437/SpeechProsody.2016-201Search in Google Scholar
Barnes, R. & M. R. Jones. 2000. Expectancy, attention, and time. Cognitive Psychology 41. 254–311.10.1006/cogp.2000.0738Search in Google Scholar
Bates, D., M. Maechler, B. Bolker & S. Walker. 2014. lme4: Linear mixed-effects models using Eigen and S4. R package version 1(7). 1–23.10.18637/jss.v067.i01Search in Google Scholar
Best, C. T. 1995. A direct realist view of cross-language speech perception. In W. Strange (ed.), Speech perception and linguistic experience: Issues in cross-language research, 171–204. Timonium, MD: York Press.Search in Google Scholar
Boersma, P. & D. Weenink. 2014. Praat: Doing phonetics by computer (Version 5.3.70) [computer software]. Retrieved from http://www.praat.org/.Search in Google Scholar
Bosker, H. R. 2017. Accounting for rate-dependent category boundary shifts in speech perception. Attention, Perception & Psychophysics 79. 333–343.10.3758/s13414-016-1206-4Search in Google Scholar
Bradlow, A. R. & J. A. Alexander. 2007. Semantic and phonetic enhancements for speech-in-noise recognition by native and non-native listeners. The Journal of the Acoustical Society of America 1214. 2339–2349.10.1121/1.2642103Search in Google Scholar
Clopper, C. G., R. Turnbull & R. S. Burdin. 2018. Assessing predictability effects in connected read speech. Linguistics Vanguard 4(S2).10.1515/lingvan-2017-0044Search in Google Scholar
Cohen Priva, U. & T. F. Jaeger. 2018. The interdependence of frequency, predictability, and informativity. Linguistics Vanguard 4(S2).10.1515/lingvan-2017-0028Search in Google Scholar
Daland, R. & K. Zuraw. 2018. Loci and locality of informational effects on phonetic implementation. Linguistics Vanguard 4(S2).10.1515/lingvan-2017-0045Search in Google Scholar
Davies, M. 2008. The corpus of contemporary American English (COCA): 520 million words, 1990–present. Available online at http://corpus.byu.edu/coca/.Search in Google Scholar
Dilley, L. C., T. H. Morrill & E. Banzina. 2013. New tests of the distal speech rate effect: Examining cross-linguistic generalization. Frontiers in Psychology 4. 1002.10.3389/fpsyg.2013.01002Search in Google Scholar
Dilley, L. C. & M. A. Pitt. 2010. Altering context speech rate can cause words to appear or disappear. Psychological Science 21(11). 1664–1670.10.1177/0956797610384743Search in Google Scholar
Dupoux, E., K. Kakehi, Y. Hirose, C. Pallier & J. Mehler. 1999. Epenthetic vowels in Japanese: A perceptual illusion? Journal of Experimental Psychology: Human Perception and Performance 25(6). 1568–1578.10.1037/0096-1523.25.6.1568Search in Google Scholar
Flege, J. E. 1995. Second language speech learning: Theory, findings, and problems. In W. Strange (ed.), Speech perception and linguistic experience: Issues in cross-language research, 233–277. Timonium, MD: York Press.Search in Google Scholar
Gahl, S. 2008. Time and thyme are not homophones: The effect of lemma frequency on word durations in spontaneous speech. Language 84(3). 474–496.10.1353/lan.0.0035Search in Google Scholar
Gahl, S. & S. M. Garnsey. 2004. Knowledge of grammar, knowledge of usage: Syntactic probabilities affect pronunciation variation. Language 80(4). 748–775.10.1353/lan.2004.0185Search in Google Scholar
Hunnicutt, S. 1985. Intelligibility versus redundancy-conditions of dependency. Language and Speech 28(1). 47–56.10.1177/002383098502800103Search in Google Scholar
Jones, M. R. & J. D. McAuley. 2005. Time judgments in global temporal contexts. Attention, Perception, & Psychophysics 67. 398–417.10.3758/BF03193320Search in Google Scholar
Jurafsky, D., A. Bell, M. Gregory & W. D. Raymond. 2001. Probabilistic relations between words: Evidence from reduction in lexical production. In J. L. Bybee and P. J. Hopper (eds.), Frequency and the Emergence of Linguistic Structure, 229–254. Amsterdam: Benjamins.10.1075/tsl.45.13jurSearch in Google Scholar
Kleinschmidt, D. F. & T. F. Jaeger. 2015. Robust speech perception: Recognize the familiar, generalize to the similar, and adapt to the novel. Psychological Review 122(2). 148.10.1037/a0038695Search in Google Scholar
Lai, W. & L. Dilley. 2016. Cross-linguistic generalization of the distal rate effect: Speech rate in context affects whether listeners hear a function word in Chinese Mandarin. Proceedings of Speech Prosody 8. 1124–1128.10.21437/SpeechProsody.2016-231Search in Google Scholar
Large, E. W. & M. R. Jones. 1999. The dynamics of attending: How people track time-varying events. Psychological Review 106. 119–159.10.1037/0033-295X.106.1.119Search in Google Scholar
Liberman, A. M., P. C. Delattre, L. J. Gerstman & F. S. Cooper. 1956. Tempo of frequency change as a cue for distinguishing classes of speech sounds. Journal of Experimental Psychology 52(2). 127.10.1037/h0041240Search in Google Scholar
Lieberman, P. 1963. Some effects of semantic and grammatical context on the production and perception of speech. Language and Speech 6(3). 172–187.10.1177/002383096300600306Search in Google Scholar
Lotto, A. J., K. R. Kluender & K. P. Green. 1996. Spectral discontinuities and the vowel length effect. Attention, Perception & Psychophysics 58(7). 1005–1014.10.3758/BF03206828Search in Google Scholar
Mayo, L. H., M. Florentine & S. Buus. 1997. Age of second-language acquisition and perception of speech in noise. Journal of Speech, Language, and Hearing Research 40(3). 686–693.10.1044/jslhr.4003.686Search in Google Scholar
McAuley, J. D. & M. R. Jones. 2003. Modeling effects of rhythmic context on perceived duration: A comparison of interval and entrainment approaches to short-interval timing. Journal of Experimental Psychology: Human Perception and Performance 29. 1102–1125.10.1037/0096-1523.29.6.1102Search in Google Scholar
McAuley, J. D. & N. S. Miller. 2007. Picking up the pace: Effects of global temporal context on sensitivity to the tempo of auditory sequences. Attention, Perception & Psychophysics 69. 709–718.10.3758/BF03193773Search in Google Scholar
Miller, J. L. & A. M. Liberman. 1979. Some effects of later-occurring information on the perception of stop consonant and semivowel. Perception & Psychophysics 25(6). 457–465.10.3758/BF03213823Search in Google Scholar
Morrill, T. H., M. M. Baese-Berk, C. C. Heffner & L. C. Dilley. 2015. Interactions between distal speech rate, linguistic knowledge, and speech environment. Psychonomic Bulletin & Review 22(5). 1451–1457.10.3758/s13423-015-0820-9Search in Google Scholar
Peirce, J. W. 2007. PsychoPy – Psychophysics software in Python. Journal of Neuroscience Methods 162(1–2). 8–13.10.1016/j.jneumeth.2006.11.017Search in Google Scholar
Pickett, J. M. & L. R. Decker. 1960. Time factors in perception of a double consonant. Language and Speech 3(1). 11–17.10.1177/002383096000300103Search in Google Scholar
Pisoni, D. B., T. D. Carrell & S. J. Gans. 1983. Perception of the duration of rapid spectrum changes in speech and nonspeech signals. Perception & Psychophysics 34(4). 314–322.10.3758/BF03203043Search in Google Scholar
Pitt, M. A., C. Szostak & L. C. Dilley. 2016. Rate dependent speech processing can be speech specific: Evidence from the perceptual disappearance of words under changes in context speech rate. Attention, Perception & Psychophysics 78(1). 334–345.10.3758/s13414-015-0981-7Search in Google Scholar
R Development Core Team. 2014. R: A language and environment for statistical computing. Vienna: R Foundation for Statistical Computing. http://www.R-project.org.Search in Google Scholar
Reinisch, E. & M. J. Sjerps. 2013. The uptake of spectral and temporal cues in vowel perception is rapidly influenced by context. Journal of Phonetics 41. 101–116.10.1016/j.wocn.2013.01.002Search in Google Scholar
Sawusch, J. R. & R. S. Newman. 2000. Perceptual normalization for speaking rate II: Effects of signal discontinuities. Attention, Perception & Psychophysics 62(2). 285–300.10.3758/BF03205549Search in Google Scholar
Wade, T. & L. L. Holt. 2005. Perceptual effects of preceding nonspeech rate on temporal properties of speech categories. Perception & Psychophysics 67(6). 939–950.10.3758/BF03193621Search in Google Scholar
©2018 Walter de Gruyter GmbH, Berlin/Boston
Articles in the same Issue
- Predictability and phonology: past, present and future
- Predictability and perception for native and non-native listeners
- Mergers in Bardi: contextual probability and predictors of sound change
- Predictability of stop consonant phonetics across talkers: Between-category and within-category dependencies among cues for place and voice
- Assessing predictability effects in connected read speech
- The interdependence of frequency, predictability, and informativity in the segmental domain
- Loci and locality of informational effects on phonetic implementation
- Three steps forward for predictability. Consideration of methodological robustness, indexical and prosodic factors, and replication in the laboratory
- Distributional learning is error-driven: the role of surprise in the acquisition of phonetic categories
- Truncation in message-oriented phonology: a case study using Korean vocative truncation
- Durational contrast in gemination and informativity
- Practice makes perfect: the consequences of lexical proficiency for articulation
- Patterns of probabilistic segment deletion/reduction in English and Japanese
- The role of predictability in shaping phonological patterns
Articles in the same Issue
- Predictability and phonology: past, present and future
- Predictability and perception for native and non-native listeners
- Mergers in Bardi: contextual probability and predictors of sound change
- Predictability of stop consonant phonetics across talkers: Between-category and within-category dependencies among cues for place and voice
- Assessing predictability effects in connected read speech
- The interdependence of frequency, predictability, and informativity in the segmental domain
- Loci and locality of informational effects on phonetic implementation
- Three steps forward for predictability. Consideration of methodological robustness, indexical and prosodic factors, and replication in the laboratory
- Distributional learning is error-driven: the role of surprise in the acquisition of phonetic categories
- Truncation in message-oriented phonology: a case study using Korean vocative truncation
- Durational contrast in gemination and informativity
- Practice makes perfect: the consequences of lexical proficiency for articulation
- Patterns of probabilistic segment deletion/reduction in English and Japanese
- The role of predictability in shaping phonological patterns