Predictability and perception for native and non-native listeners

Melissa Baese-Berk; Tuuli H. Morrill; Laura Dilley

doi:10.1515/lingvan-2017-0022

Article

Predictability and perception for native and non-native listeners

Melissa Baese-Berk , Tuuli H. Morrill and Laura Dilley

Published/Copyright: September 8, 2018

Published by

Become an author with De Gruyter Brill

Submit Manuscript Author Information Explore this Subject

From the journal Linguistics Vanguard Volume 4 Issue s2

Abstract

Phonological knowledge is influenced by a variety of cues that reflect predictability (e.g. semantic predictability). Listeners utilize various aspects of predictability when determining what they have heard. In the present paper, we ask how aspects of the acoustic phonetic signal (e.g. speaking rate) interact with other knowledge reflecting predictability (e.g. lexical frequency and collocation strength) to influence how speech is perceived. Specifically, we examine perception of function words by native and non-native speakers. Our results suggest that both native and non-native speakers are sensitive to factors that influence the predictability of the signal, including speaking rate, frequency, and collocation strength, when listening to speech, and use these factors to predict the phonological structure of stretches of ambiguous speech. However, reliance on these cues differs as a function of their experience and proficiency with the target language. Non-native speakers are less sensitive to some aspects of the acoustic phonetic signal (e.g. speaking rate). However, they appear to be quite sensitive to other factors, including frequency. We discuss how these results inform our understanding of the interplay between predictability and speech perception by different listener populations and how use of features reflecting predictability interacts with recovery of phonological structure of spoken language.

Keywords: predictability; non-native speech; speaking rate; frequency; collocation strength

References

Baese-Berk, M. M., C. C. Heffner, L. C. Dilley, M. A. Pitt, T. H. Morrill & J. D. McAuley. 2014. Long-term temporal tracking of speech rate affects spoken-word recognition. Psychological Science 25(8). 1546–1553.10.1177/0956797614533705Search in Google Scholar

Baese-Berk, M. M., T. H. Morrill & L. C. Dilley. 2016. Do non-native speakers use context speaking rate in spoken word recognition? Proceedings of Speech Prosody 8. 979–983.10.21437/SpeechProsody.2016-201Search in Google Scholar

Barnes, R. & M. R. Jones. 2000. Expectancy, attention, and time. Cognitive Psychology 41. 254–311.10.1006/cogp.2000.0738Search in Google Scholar

Bates, D., M. Maechler, B. Bolker & S. Walker. 2014. lme4: Linear mixed-effects models using Eigen and S4. R package version 1(7). 1–23.10.18637/jss.v067.i01Search in Google Scholar

Best, C. T. 1995. A direct realist view of cross-language speech perception. In W. Strange (ed.), Speech perception and linguistic experience: Issues in cross-language research, 171–204. Timonium, MD: York Press.Search in Google Scholar

Boersma, P. & D. Weenink. 2014. Praat: Doing phonetics by computer (Version 5.3.70) [computer software]. Retrieved from http://www.praat.org/.Search in Google Scholar

Bosker, H. R. 2017. Accounting for rate-dependent category boundary shifts in speech perception. Attention, Perception & Psychophysics 79. 333–343.10.3758/s13414-016-1206-4Search in Google Scholar

Bradlow, A. R. & J. A. Alexander. 2007. Semantic and phonetic enhancements for speech-in-noise recognition by native and non-native listeners. The Journal of the Acoustical Society of America 1214. 2339–2349.10.1121/1.2642103Search in Google Scholar

Clopper, C. G., R. Turnbull & R. S. Burdin. 2018. Assessing predictability effects in connected read speech. Linguistics Vanguard 4(S2).10.1515/lingvan-2017-0044Search in Google Scholar

Cohen Priva, U. & T. F. Jaeger. 2018. The interdependence of frequency, predictability, and informativity. Linguistics Vanguard 4(S2).10.1515/lingvan-2017-0028Search in Google Scholar

Daland, R. & K. Zuraw. 2018. Loci and locality of informational effects on phonetic implementation. Linguistics Vanguard 4(S2).10.1515/lingvan-2017-0045Search in Google Scholar

Davies, M. 2008. The corpus of contemporary American English (COCA): 520 million words, 1990–present. Available online at http://corpus.byu.edu/coca/.Search in Google Scholar

Dilley, L. C., T. H. Morrill & E. Banzina. 2013. New tests of the distal speech rate effect: Examining cross-linguistic generalization. Frontiers in Psychology 4. 1002.10.3389/fpsyg.2013.01002Search in Google Scholar

Dilley, L. C. & M. A. Pitt. 2010. Altering context speech rate can cause words to appear or disappear. Psychological Science 21(11). 1664–1670.10.1177/0956797610384743Search in Google Scholar

Dupoux, E., K. Kakehi, Y. Hirose, C. Pallier & J. Mehler. 1999. Epenthetic vowels in Japanese: A perceptual illusion? Journal of Experimental Psychology: Human Perception and Performance 25(6). 1568–1578.10.1037/0096-1523.25.6.1568Search in Google Scholar

Flege, J. E. 1995. Second language speech learning: Theory, findings, and problems. In W. Strange (ed.), Speech perception and linguistic experience: Issues in cross-language research, 233–277. Timonium, MD: York Press.Search in Google Scholar

Gahl, S. 2008. Time and thyme are not homophones: The effect of lemma frequency on word durations in spontaneous speech. Language 84(3). 474–496.10.1353/lan.0.0035Search in Google Scholar

Gahl, S. & S. M. Garnsey. 2004. Knowledge of grammar, knowledge of usage: Syntactic probabilities affect pronunciation variation. Language 80(4). 748–775.10.1353/lan.2004.0185Search in Google Scholar

Hunnicutt, S. 1985. Intelligibility versus redundancy-conditions of dependency. Language and Speech 28(1). 47–56.10.1177/002383098502800103Search in Google Scholar

Jones, M. R. & J. D. McAuley. 2005. Time judgments in global temporal contexts. Attention, Perception, & Psychophysics 67. 398–417.10.3758/BF03193320Search in Google Scholar

Jurafsky, D., A. Bell, M. Gregory & W. D. Raymond. 2001. Probabilistic relations between words: Evidence from reduction in lexical production. In J. L. Bybee and P. J. Hopper (eds.), Frequency and the Emergence of Linguistic Structure, 229–254. Amsterdam: Benjamins.10.1075/tsl.45.13jurSearch in Google Scholar

Kleinschmidt, D. F. & T. F. Jaeger. 2015. Robust speech perception: Recognize the familiar, generalize to the similar, and adapt to the novel. Psychological Review 122(2). 148.10.1037/a0038695Search in Google Scholar

Lai, W. & L. Dilley. 2016. Cross-linguistic generalization of the distal rate effect: Speech rate in context affects whether listeners hear a function word in Chinese Mandarin. Proceedings of Speech Prosody 8. 1124–1128.10.21437/SpeechProsody.2016-231Search in Google Scholar

Large, E. W. & M. R. Jones. 1999. The dynamics of attending: How people track time-varying events. Psychological Review 106. 119–159.10.1037/0033-295X.106.1.119Search in Google Scholar

Liberman, A. M., P. C. Delattre, L. J. Gerstman & F. S. Cooper. 1956. Tempo of frequency change as a cue for distinguishing classes of speech sounds. Journal of Experimental Psychology 52(2). 127.10.1037/h0041240Search in Google Scholar

Lieberman, P. 1963. Some effects of semantic and grammatical context on the production and perception of speech. Language and Speech 6(3). 172–187.10.1177/002383096300600306Search in Google Scholar

Lotto, A. J., K. R. Kluender & K. P. Green. 1996. Spectral discontinuities and the vowel length effect. Attention, Perception & Psychophysics 58(7). 1005–1014.10.3758/BF03206828Search in Google Scholar

Mayo, L. H., M. Florentine & S. Buus. 1997. Age of second-language acquisition and perception of speech in noise. Journal of Speech, Language, and Hearing Research 40(3). 686–693.10.1044/jslhr.4003.686Search in Google Scholar

McAuley, J. D. & M. R. Jones. 2003. Modeling effects of rhythmic context on perceived duration: A comparison of interval and entrainment approaches to short-interval timing. Journal of Experimental Psychology: Human Perception and Performance 29. 1102–1125.10.1037/0096-1523.29.6.1102Search in Google Scholar

McAuley, J. D. & N. S. Miller. 2007. Picking up the pace: Effects of global temporal context on sensitivity to the tempo of auditory sequences. Attention, Perception & Psychophysics 69. 709–718.10.3758/BF03193773Search in Google Scholar

Miller, J. L. & A. M. Liberman. 1979. Some effects of later-occurring information on the perception of stop consonant and semivowel. Perception & Psychophysics 25(6). 457–465.10.3758/BF03213823Search in Google Scholar

Morrill, T. H., M. M. Baese-Berk, C. C. Heffner & L. C. Dilley. 2015. Interactions between distal speech rate, linguistic knowledge, and speech environment. Psychonomic Bulletin & Review 22(5). 1451–1457.10.3758/s13423-015-0820-9Search in Google Scholar

Peirce, J. W. 2007. PsychoPy – Psychophysics software in Python. Journal of Neuroscience Methods 162(1–2). 8–13.10.1016/j.jneumeth.2006.11.017Search in Google Scholar

Pickett, J. M. & L. R. Decker. 1960. Time factors in perception of a double consonant. Language and Speech 3(1). 11–17.10.1177/002383096000300103Search in Google Scholar

Pisoni, D. B., T. D. Carrell & S. J. Gans. 1983. Perception of the duration of rapid spectrum changes in speech and nonspeech signals. Perception & Psychophysics 34(4). 314–322.10.3758/BF03203043Search in Google Scholar

Pitt, M. A., C. Szostak & L. C. Dilley. 2016. Rate dependent speech processing can be speech specific: Evidence from the perceptual disappearance of words under changes in context speech rate. Attention, Perception & Psychophysics 78(1). 334–345.10.3758/s13414-015-0981-7Search in Google Scholar

R Development Core Team. 2014. R: A language and environment for statistical computing. Vienna: R Foundation for Statistical Computing. http://www.R-project.org.Search in Google Scholar

Reinisch, E. & M. J. Sjerps. 2013. The uptake of spectral and temporal cues in vowel perception is rapidly influenced by context. Journal of Phonetics 41. 101–116.10.1016/j.wocn.2013.01.002Search in Google Scholar

Sawusch, J. R. & R. S. Newman. 2000. Perceptual normalization for speaking rate II: Effects of signal discontinuities. Attention, Perception & Psychophysics 62(2). 285–300.10.3758/BF03205549Search in Google Scholar

Wade, T. & L. L. Holt. 2005. Perceptual effects of preceding nonspeech rate on temporal properties of speech categories. Perception & Psychophysics 67(6). 939–950.10.3758/BF03193621Search in Google Scholar

Received: 2017-05-02

Accepted: 2018-06-26

Published Online: 2018-09-08

You are currently not able to access this content.

Articles in the same Issue

https://doi.org/10.1515/lingvan-2017-0022

Keywords for this article

predictability; non-native speech; speaking rate; frequency; collocation strength