Startseite Perspectives on Speech Timing: Coupled Oscillator Modeling of Polish and Finnish
Artikel
Lizenziert
Nicht lizenziert Erfordert eine Authentifizierung

Perspectives on Speech Timing: Coupled Oscillator Modeling of Polish and Finnish

  • Zofia Malisz ORCID logo , Michael O’Dell , Tommi Nieminen und Petra Wagner
Veröffentlicht/Copyright: 23. Februar 2017

Abstract

This stud y was ai med at analyzing empirical duration data for Polish spoken at different tempos using an updated version of the Coupled Oscillator Model of speech timing and rhythm variability (O'Dell and Nieminen, 1999, 2009). We use Bayesian inference on parameters relating to speech rate to investigate how tempo affects timing in Polish. The model parameters found are then compared with parameters obtained for equivalent material in Finnish to shed light on which of the effects represent general speech rate mechanisms and which are specific to Polish. We discuss the model and its predictions in the context of current perspectives on speech timing.


verified



*Zofia Malisz, Department of Speech, Music and Hearing, KTH, SE-10044 Stockholm (Sweden), E-Mail malisz@kth.se

References

1 Abercrombie D (1973): A phonetician's view of verse structure; in Phonetics in Linguistics. London, Longman's Publishing Group.Suche in Google Scholar

2 Abraham RH, Shaw CD (2000): Dynamics: The Geometry of Behavior, ed 4. Aerial Press.Suche in Google Scholar

3 Aylett M, Turk A (2004): The smooth signal redundancy hypothesis: a functional explanation for relationships between redundancy, prosodic prominence, and duration in spontaneous speech. Lang Speech 47(pt 1):31-56.10.1177/00238309040470010201Suche in Google Scholar

4 Aylett M, Turk A (2006): Language redundancy predicts syllabic duration and the spectral characteristics of vocalic syllable nuclei. J Acoust Soc Am 119(5 pt 1):3048-3058.10.1121/1.2188331Suche in Google Scholar

5 Barbosa PA (2006): Incursões em torno do ritmo da fala [Investigations of speech rhythm]. Campinas, Pontes.Suche in Google Scholar

6 Barbosa PA (2007): From syntax to acoustic duration: a dynamical model of speech rhythm production. Speech Commun 49:725-742.10.1016/j.specom.2007.04.013Suche in Google Scholar

7 Beckman ME (1992): Evidence for speech rhythms across languages; in Tohkura Y, Vatikiotis-Bateson E, Sagisaka Y (eds): Speech Perception, Production and Linguistic Structure. Tokyo, OHM Publishing Co., pp 457-463.Suche in Google Scholar

8 Bertinetto PM, Bertini C (2010): Towards a unified predictive model of natural language rhythm; in Russo M (ed): Prosodic Universals: Comparative Studies in Rhythmic Modeling and Rhythm Typology. Naples, Aracne, pp 43-78.Suche in Google Scholar

9 Bolinger DLM (1965): Forms of English: Accent, Morpheme, Order. Cambridge, MA, Harvard University Press.Suche in Google Scholar

10 Bouzon C, Hirst D (2004): Isochrony and prosodic structure in British English. Proceedings of the 2nd International Conference on Speech Prosody, Nara, pp 223-226.Suche in Google Scholar

11 Brady MC, Port RF (2007): Quantifying vowel onset periodicity in Japanese. Proceedings of the 16th International Congress of Phonetic Sciences, Saarbrücken, pp 337-342.Suche in Google Scholar

12 Browman CP, Goldstein L (1992): Articulatory phonology: an overview. Phonetica 49:155-180.10.1159/000261913Suche in Google Scholar

13 Cetnarowska B (2000): On the (non-)recursivity of the prosodic word in Polish. ZAS Papers Linguist 19:1-21.10.21248/zaspil.19.2000.66Suche in Google Scholar

14 Cummins F (2011): Periodic and aperiodic synchronization in skilled action. Front Hum Neurosci 170.10.3389/fnhum.2011.00170Suche in Google Scholar

15 Cummins F, Port R (1998): Rhythmic constraints on English stress timing. J Phon 26:145-171.10.1006/jpho.1998.0070Suche in Google Scholar

16 Cutler A (1994): The perception of rhythm in language. Cognition 50:79-81.10.1016/0010-0277(94)90021-3Suche in Google Scholar

17 De Jong K (1995): The supraglottal articulation of prominence in English: linguistic stress as localized hyperarticulation. J Acoust Soc Am 97:491-504.10.1121/1.412275Suche in Google Scholar

18 Dellwo V, Steiner I, Aschenberner B, Dankovicova J, Wagner P (2004): BonnTempo-corpus and BonnTempo-tools: a database for the study of speech rhythm and rate. Proceedings of INTERSPEECH, Jeju, pp 777-780.10.21437/Interspeech.2004-294Suche in Google Scholar

19 Dellwo V, Wagner P (2003): Relations between language rhythm and speech rate. 15th International Congress of Phonetic Sciences, Barcelona, pp 471-474.Suche in Google Scholar

20 Dłuska M (1950): Fonetyka polska [The phonetics of Polish]. PWN Polskie Wydawnictwo Naukowe, Warszawa.Suche in Google Scholar

21 Dogil G (1979): Autosegmental Account of Phonological Emphasis. Carbondale, Illinois and Edmonton, Canada, Linguistic Research. Inc.Suche in Google Scholar

22 Eriksson A (1991): Aspects of Swedish Speech Rhythm; PhD thesis, University of Göteborg.Suche in Google Scholar

23 Fowler CA (1980): Coarticulation and theories of extrinsic timing. J Phon 8:113-133.10.1016/S0095-4470(19)31446-9Suche in Google Scholar

24 Gahl S, Yao Y, Johnson K (2012): Why reduce? Phonological neighborhood density and phonetic reduction in spontaneous speech. J Mem Lang 66:789-806.10.1016/j.jml.2011.11.006Suche in Google Scholar

25 Gelman A, Carlin JB, Stern HS, Rubin DB (2004): Bayesian Data Analysis, ed 2. Boca Raton, FL, Chapman & Hall/CRC.10.1201/9780429258480Suche in Google Scholar

26 Gelman A, Hill J (2007): Data Analysis Using Regression and Multilevel/Hierarchical Models. Cambridge, Cambridge University Press.10.1017/CBO9780511790942Suche in Google Scholar

27 Gibson JJ (1975): Events are perceivable but time is not; in Fraser JT, Lawrence N (eds): The Study of Time II: Proceedings of the Second Conference of the International Society for the Study of Time, Lake Yamanaka-Japan. Berlin, Heidelberg, Springer Berlin Heidelberg, pp 295-301.Suche in Google Scholar

28 Gordon M (2011): Stress systems; in Goldsmith J, Riggle J, Yu AC (eds): The Handbook of Phonological Theory, ed 2. Chichester, Wiley Blackwell, pp 141-163.Suche in Google Scholar

29 Hanson K, Kiparsky P (1996): A parametric theory of poetic meter. Language 72:287-335.10.2307/416652Suche in Google Scholar

30 Hayes B (1980): A Metrical Theory of Stress Rules; PhD thesis, Cambridge, MIT.Suche in Google Scholar

31 Hayes B, Puppel S (1985): On the rhythm rule in Polish; in van der Hulst H, Smith N (eds): Advances in Nonlinear Phonology. Dordrecht, Foris Publications, pp 59-81.Suche in Google Scholar

32 Hualde JI, Nadeu M (2014): Rhetorical stress in Spanish; in van der Hulst H (ed): Word Stress: Theoretical and Typological Issues. Cambridge, Cambridge University Press, p 228.Suche in Google Scholar

33 Jaeger TF, Buz E (2016): Signal reduction and linguistic encoding; in Fernández EM, Cairns HS (eds): Handbook of Psycholinguistics. Hoboken, NJ, Wiley-Blackwell.10.1002/9781118829516.ch3Suche in Google Scholar

34 Jassem W, Hill DR, Witten IH (1984): Isochrony in English speech: its statistical validity and linguistic relevance; in Gibbon D, Richter H (eds): Intonation, Accent and Rhythm: Studies in Discourse Phonology. Berlin, Walter de Gruyter, pp 203-225.Suche in Google Scholar

35 Jones MR, Boltz M (1989): Dynamic attending and responses to time. Psychol Rev 96:459-491.10.1037/0033-295X.96.3.459Suche in Google Scholar

36 Kelso JAS (1995): Dynamic patterns: the self organization of brain and behavior. Cambridge, MA, MIT Press.Suche in Google Scholar

37 Kim H, Cole J (2005): The stress foot as a unit of planned timing: evidence from shortening in the prosodic phrase. Proceedings of INTERSPEECH 2005, Lisbon, pp 2365-2368.10.21437/Interspeech.2005-37Suche in Google Scholar

38 Kochanski G, Loukina, A, Keane E, Shih C, Rosner B (2010): Long-range prosody prediction and rhythm. Proceedings of the 5th International Conference on Speech Prosody, Chicago, IL, pp 1-4.Suche in Google Scholar

39 Kopell N (1988): Toward a theory of modelling central pattern generators; in Cohen AH, Rossignol S, Grillner S (eds): Neural Control of Rhythmic Movements in Vertebrates. New York, John Wiley & Sons, pp 369-413.Suche in Google Scholar

40 Kuperman V, Ernestus M, Baayen H (2008): Frequency distributions of uniphones, diphones, and triphones in spontaneous speech. J Acoust Soc Am 124:3897-3908.10.1121/1.3006378Suche in Google Scholar

41 Lee MW, Gibbons J (2007): Rhythmic alternation and the optional complementiser in English: new evidence of phonological influence on grammatical encoding. Cognition 105:446-456.10.1016/j.cognition.2006.09.013Suche in Google Scholar

42 Lehiste I (1977): Isochrony reconsidered. J Phon 5:253-263.10.1016/S0095-4470(19)31139-8Suche in Google Scholar

43 Lehtonen J (1970): Aspects of Quantity in Standard Finnish. No. VI in Studia Philologica Jyväskyläensia. Jyväskylä, University of Jyväskylä.Suche in Google Scholar

44 Liberman M, Prince A (1977): On stress and linguistic rhythm. Linguist Inq 8:249-336.Suche in Google Scholar

45 Lindblom B (1990): Explaining phonetic variation: a sketch of the H&H theory; in Marchal A (ed): Speech Production and Speech Modeling. Dordrecht, Kluwer Academic Publishers.10.1007/978-94-009-2037-8_16Suche in Google Scholar

46 Louwerse MM, Dale R, Bard EG, Jeuniaux P (2012): Behavior matching in multimodal communication is synchronized. Cogn Sci 36:1404-1426.10.1111/j.1551-6709.2012.01269.xSuche in Google Scholar

47 Malisz Z (2011): Tempo differentiated analyses of timing in Polish. Proceedings of the 17th International Congress of Phonetic Sciences, Hong Kong, pp 1322-1325.Suche in Google Scholar

48 Malisz Z (2013): Speech Rhythm Variability in Polish and English: A Study of Interaction between Rhythmic Levels; PhD thesis, Adam Mickiewicz University, Poznań.Suche in Google Scholar

49 Malisz Z, Zygis M, Pompino-Marschall B (2013): Rhythmic structure effects on glottalisation: a study of different speech styles in Polish and German. Lab Phonol 4:119-158.10.1515/lp-2013-0006Suche in Google Scholar

50 McAuley JD, Fromboluti EK (2014): Attentional entrainment and perceived event duration. Philos Trans R Soc Lond B Biol Sci 369:20130401.10.1098/rstb.2013.0401Suche in Google Scholar

51 Newlin-Łukowicz L (2012): Polish stress: looking for phonetic evidence of a bidirectional system. Phonology 29:271-329.10.1017/S0952675712000139Suche in Google Scholar

52 Nolan F, Jeon HS (2014): Speech rhythm: a metaphor? Philos Trans R Soc Lond B Biol Sci 369:20130396.10.1098/rstb.2013.0396Suche in Google Scholar

53 Ntzoufras I (2002): Gibbs variable selection using BUGS. J Stat Softw 7:1-19.10.18637/jss.v007.i07Suche in Google Scholar

54 O'Dell M, Lennes M, Nieminen T (2008): Hierarchical levels of rhythm in conversational speech. Proceedings of the 4th International Conference on Speech Prosody, Campinas, pp 355-358.Suche in Google Scholar

55 O'Dell M, Lennes M, Werner S, Nieminen T (2007): Looking for rhythms in conversational speech. Proceedings of the 16th International Congress of Phonetic Sciences, Saarbrücken, pp 1201-1204.Suche in Google Scholar

56 O'Dell M, Nieminen T (1998): Reasons for an underlying unity in rhythm dichotomy. Linguistica Uralica 3:178-185.Suche in Google Scholar

57 O'Dell M, Nieminen T (2002): How long is a stress group? Cadernos de Estudos. Lingüísticos 43:93-108.Suche in Google Scholar

58 O'Dell M, Nieminen T (2006): Tahdin ajoitus suomessa oskillaattorimallin näkökulmasta [Timing of feet in Finnish from an oscillator model perspective]; in Aulanko R, Wahlberg L, Vainio M (eds): Fonetiikan päivät 2006 [The Phonetics Symposium 2006]. No. 53 in Helsingin Yliopiston Puhetieteiden Laitoksen Julkaisuja, pp 134-143.Suche in Google Scholar

59 O'Dell ML (2003): Intrinsic Timing and Quantity in Finnish; PhD thesis, University of Tampere.Suche in Google Scholar

60 O'Dell ML, Nieminen T (1999): Coupled oscillator model of speech rhythm. Proceedings of the 14th International Congress of Phonetic Sciences, San Francisco, pp 1075-1078.Suche in Google Scholar

61 O'Dell ML, Nieminen T (2009): Coupled oscillator model for speech timing: overview and examples. Nordic Prosody: Proceedings of the 10th Conference, Helsinki, pp 179-190.Suche in Google Scholar

62 O'Dell ML, Nieminen T, Mustanoja L (2011): The effect of synchronous reading on speech rhythm. Presented at the 13th International Rhythm Perception and Production Workshop, Leipzig.Suche in Google Scholar

63 Pate JK, Goldwater S (2015): Talkers account for listener and channel characteristics to communicate efficiently. J Mem Lang 78:1-17.10.1016/j.jml.2014.10.003Suche in Google Scholar

64 Pellegrino F, Coupé C, Marsico E (2011): A cross-language perspective on speech information rate. Language 87:539-558.10.1353/lan.2011.0057Suche in Google Scholar

65 Perkell JS, Klatt DH (1986): Invariance and Variability in Speech Processes. Lawrence Erlbaum.Suche in Google Scholar

66 Port R (2013): Coordinative structures for the control of speech production. www.cs.indiana.edu/∼port/teach/641/coord.strctr.html (last checked April 1, 2013).Suche in Google Scholar

67 Port R, Tajima K, Cummins F (1999): Speech and rhythmic behavior; in Savelsburgh GJP, van der Maas H, van Geert PCL (eds): The Non-Linear Analysis of Developmental Processes. Amsterdam, Elsevier, pp 5-45.Suche in Google Scholar

68 Port RF, Leary AP (2005): Against formal phonology. Language 81:927-964.10.1353/lan.2005.0195Suche in Google Scholar

69 Port RF, Van Gelder T (1995): Mind as Motion: Explorations in the Dynamics of Cognition. Cambridge, MA, MIT Press.Suche in Google Scholar

70 Quené H, Port RF (2005): Effects of timing regularity and metrical expectancy on spoken-word perception. Phonetica 62:1-13.10.1159/000087222Suche in Google Scholar

71 Rubach J, Booij G (1985): A grid theory of stress in Polish. Lingua 66:281-319.10.1016/0024-3841(85)90032-4Suche in Google Scholar

72 Sadeniemi M (1949): Metriikkamme perusteet [Foundations of our metrics]. No. 236 in SKS:n toimituksia. Helsinki, Suomalaisen Kirjallisuuden Seura.Suche in Google Scholar

73 Schlink B (1994): Selbs Betrug. Zürich, Diogenes Verlag AG.Suche in Google Scholar

74 Schlüter J (2005): Rhythmic Grammar. The Influence of Rhythm on Grammatical Variation and Change in English, Volume 46 of Topics in English Linguistics. Berlin, Mouton de Gruyter.10.1515/9783110219265Suche in Google Scholar

75 Seyfarth S (2014): Word informativity influences acoustic duration: effects of contextual predictability on lexical representation. Cognition 133:140-155.10.1016/j.cognition.2014.06.013Suche in Google Scholar

76 Shannon CE (1948): A mathematical theory of communication. Bell System Technical J 27:379-423, 623-656.10.1002/j.1538-7305.1948.tb00917.xSuche in Google Scholar

77 Shih SS (2014): Towards Optimal Rhythm; PhD thesis, Stanford University.Suche in Google Scholar

78 Shockley K, Richardson DC, Dale R (2009): Conversation and coordinative structures. Top Cogn Sci 1:305-319.10.1111/j.1756-8765.2009.01021.xSuche in Google Scholar

79 Sievers E (1893): Grundzüge der Phonetik zur Einführung in das Studium der Lautlehre der Indogermanischen Sprachen, ed 4. Leipzig, Breitkopf & Härtel.Suche in Google Scholar

80 Sovijärvi A (1946): Huomioita puherytmiikasta [Notes on speech rhythm]. Virittäjä 50:117-129.Suche in Google Scholar

81 Temperley D (2009): Distributional stress regularity: a corpus study. J psycholinguist Res 38:75-92.10.1007/s10936-008-9084-0Suche in Google Scholar

82 Tilsen S (2011): Metrical regularity facilitates speech planning and production. Lab Phonol 2:185-218.10.1515/labphon.2011.006Suche in Google Scholar

83 Turk A (2010): Does prosodic constituency signal relative predictability? A smooth signal redundancy hypothesis. Lab Phonol 1:227-262.10.1515/labphon.2010.012Suche in Google Scholar

84 Turk A, Shattuck-Hufnagel S (2013): What is speech rhythm? A commentary on Arvaniti and Rodriquez, Krivokapic and Goswami and Leong. Lab Phonol 4:93-118.10.1515/lp-2013-0005Suche in Google Scholar

85 Turk A, Shattuck-Hufnagel S (2014): Timing in talking: what is it used for, and how is it controlled? Philos Trans R Soc B Biol Sci 369:20130395.10.1098/rstb.2013.0395Suche in Google Scholar

86 Turvey MT (1990): Coordination. Am Psychol 45:938-953.10.1037/0003-066X.45.8.938Suche in Google Scholar

87 van der Hulst H (2014): Representing rhythm; in van der Hulst H (ed): Word Stress: Theoretical and Typological Issues. Cambridge, Cambridge University Press, p 325.Suche in Google Scholar

88 Vogel R, van de Vijver R, Kotz S, Kutscher A, Wagner P (2015): Function words in rhythmic optimisation; in van de Vijver R, Vogel R (eds): Rhythm in Cognition and Grammar: A Germanic Perspective. Berlin, De Gruyter, pp 253-274.Suche in Google Scholar

89 Wagner P (2012): Meter specific timing and prominence in German poetry and prose; in Niebuhr·(ed): Understanding Prosody, Language, Context and Cognition. Berlin, Walter de Gruyter, pp 219-236.Suche in Google Scholar

90 Wagner P, Malisz Z, Inden B, Wachsmuth I (2013): Interaction phonology - a temporal co-ordination component enabling representational alignment within a model of communication. Alignment in Communication. Towards a New Theory of Communication, pp 109-132.10.1075/ais.6.06wagSuche in Google Scholar

91 Wheeldon LR, Lahiri A (2002): The minimal unit of phonological encoding: prosodic or lexical word. Cognition 85:B31-B41.Suche in Google Scholar

92 White L (2014): Communicative function and prosodic form in speech timing. Speech Commun 63:38-54.10.1016/j.specom.2014.04.003Suche in Google Scholar

93 Windmann A, Šimko J, Wagner P (2014a): Probing theories of speech timing using optimization modeling. Proceedings of the 7th International Conference on Speech Prosody, Dublin, pp 346-350.10.21437/SpeechProsody.2014-57Suche in Google Scholar

94 Windmann A, Šimko J, Wagner P (2014b): A unified account of prominence effects in an optimization-based model of speech timing. Proceedings of INTERSPEECH 2014, Singapore.10.21437/Interspeech.2014-44Suche in Google Scholar

95 Windmann A, Šimko J, Wagner P (2015): Optimization-based modeling of speech timing. Speech Commun 74:76-92.10.1016/j.specom.2015.09.007Suche in Google Scholar

96 Zipf GK (1935): The Psycho-Biology of Language. Houghton, Mifflin.Suche in Google Scholar

Received: 2015-03-20
Accepted: 2016-09-15
Published Online: 2017-02-23
Published in Print: 2017-02-01

© 2017 S. Karger AG, Basel

Heruntergeladen am 9.9.2025 von https://www.degruyterbrill.com/document/doi/10.1159/000450829/html
Button zum nach oben scrollen