Modeling and Perception of ‘Gesture Reduction’
René Carré
and Pierre L. Divenyi
Abstract
The phenomenon of vowel reduction is investigated by modeling ‘gesture reduction’ with the use of the Distinctive Region Model (DRM). First, a definition is proposed for the term gesture, i.e. an acoustically efficient command aimed at deforming, in the time domain, the area function of the vocal tract. Second, tests are reported on the perception of vowel-to-vowel transitions obtained with reduced gestures. These tests show that a dual representation of formant transitions is required to explain the reduction phenomenon: the trajectory in the F<sub>1</sub>–F<sub>2</sub> plane and the time course of the formant changes. The results also suggest that time-domain integration of the trajectories constitutes an integral part of the auditory processing of transitions. Perceptual results are also discussed in terms of the acoustic traces of DRM gestures.
© 2000 S. Karger AG, Basel
Articles in this issue
- Special Section
- Title Page
- Foreword
- Acoustic Patterning of Speech: Its Linguistic and Physiological Bases
- Investigating Unscripted Speech: Implications for Phonetics and Phonology
- Emotive Transforms
- The Source-Filter Frame of Prominence
- The C/D Model and Prosodic Control of Articulatory Behavior
- Diverse Acoustic Cues at Consonantal Landmarks
- Perceptual Processing
- Modeling and Perception of ‘Gesture Reduction’
- General Auditory Processes Contribute to Perceptual Accommodation of Coarticulation
- Adaptive Dispersion in Vowel Perception
- Language Acquisition as Complex Category Formation
- Biology of Communication and Motor Processes
- Singing Birds, Playing Cats, and Babbling Babies: Why Do They Do It?
- The Phonetic Potential of Nonhuman Vocal Tracts: Comparative Cineradiographic Observations of Vocalizing Animals
- Dynamic Simulation of Human Movement Using Large-Scale Models of the Body
- En Route to Adult Spoken Language / Language Development
- An Embodiment Perspective on the Acquisition of Speech Perception
- Speech to Infants as Hyperspeech: Knowledge-Driven Processes in Early Word Recognition
- The Construction of a First Phonology
- Auditory Constraints on Sound Structures
- Searching for an Auditory Description of Vowel Categories
- Commentary
- Imitation and the Emergence of Segments
- Deriving Speech from Nonspeech: A View from Ontogeny
- Paper
- Developmental Origins of Adult Phonology: The Interplay between Phonetic Emergents and the Evolutionary Adaptations of Sound Patterns
- Further Section
- Publications Björn Lindblom
- Index autorum Vol. 57, 2000
- Contents Vol. 57, 2000