Chapter 12. From pitch stylization to automatic tonal annotation of speech corpora
-
Piet Mertens
Abstract
This chapter proposes a labeling scheme for pitch-related aspects of speech prosody and describes an automatic annotation system using this scheme.
In the labelling scheme, the fine-grained transcription provides labels indicating the pitch level and pitch movement of individual syllables. The pitch levels “bottom” and “top” indicate the boundaries of the speaker’s pitch range. Three additional levels – “low”, “mid”, “high” – are defined on the basis of pitch changes in the local context. For pitch movements, both simple and compound, the transcription indicates direction (rise, fall, level) and size (large and small melodic intervals), using size categories adjusted to the speaker’s pitch range.
The automatic tonal annotation system combines several processing steps: segmentation into syllabic nuclei, pause detection, pitch stylization, pitch range estimation, pitch movement classification, and pitch level assignment. It uses a rule-based procedure, which unlike commonly used supervized learning techniques does not require a labelled corpus to train the model.
Abstract
This chapter proposes a labeling scheme for pitch-related aspects of speech prosody and describes an automatic annotation system using this scheme.
In the labelling scheme, the fine-grained transcription provides labels indicating the pitch level and pitch movement of individual syllables. The pitch levels “bottom” and “top” indicate the boundaries of the speaker’s pitch range. Three additional levels – “low”, “mid”, “high” – are defined on the basis of pitch changes in the local context. For pitch movements, both simple and compound, the transcription indicates direction (rise, fall, level) and size (large and small melodic intervals), using size categories adjusted to the speaker’s pitch range.
The automatic tonal annotation system combines several processing steps: segmentation into syllabic nuclei, pause detection, pitch stylization, pitch range estimation, pitch movement classification, and pitch level assignment. It uses a rule-based procedure, which unlike commonly used supervized learning techniques does not require a labelled corpus to train the model.
Chapters in this book
- Prelim pages i
- Table of contents v
- Acknowledgments ix
- Preface xi
- Introduction 1
- Chapter 1. Collecting data for the Rhapsodie treebank 7
- Chapter 2. Orthographic and phonetic transcriptions of Rhapsodie recording 21
- Chapter 3. Syntactic annotation of the Rhapsodie corpus 35
- Chapter 4. Microsyntactic annotation 49
- Chapter 5. The annotation of list structures 69
- Chapter 6. Macrosyntactic annotation 97
- Chapter 7. Annotation tools for syntax 127
- Chapter 8. Prosodic annotation of the Rhapsodie corpus 147
- Chapter 9. The annotation of syllabic prominences and disfluencies 157
- Chapter 10. Segmentation into intonational periods 175
- Chapter 11. Derivation of the prosodic structure 213
- Chapter 12. From pitch stylization to automatic tonal annotation of speech corpora 233
- Chapter 13. Tonal annotation 251
- Chapter 14. Tools for fundamental frequency estimation in Rhapsodie 261
- Chapter 15. Exploration of the Rhapsodie corpus 271
- Chapter 16. Macrosyntax at work 285
- Chapter 17. The distribution of prosodic features in the Rhapsodie corpus 315
- Chapter 18. Syntax and prosody mapping: What and how? 339
- Chapter 19. Conclusion 365
- References 369
- Subject index 393
Chapters in this book
- Prelim pages i
- Table of contents v
- Acknowledgments ix
- Preface xi
- Introduction 1
- Chapter 1. Collecting data for the Rhapsodie treebank 7
- Chapter 2. Orthographic and phonetic transcriptions of Rhapsodie recording 21
- Chapter 3. Syntactic annotation of the Rhapsodie corpus 35
- Chapter 4. Microsyntactic annotation 49
- Chapter 5. The annotation of list structures 69
- Chapter 6. Macrosyntactic annotation 97
- Chapter 7. Annotation tools for syntax 127
- Chapter 8. Prosodic annotation of the Rhapsodie corpus 147
- Chapter 9. The annotation of syllabic prominences and disfluencies 157
- Chapter 10. Segmentation into intonational periods 175
- Chapter 11. Derivation of the prosodic structure 213
- Chapter 12. From pitch stylization to automatic tonal annotation of speech corpora 233
- Chapter 13. Tonal annotation 251
- Chapter 14. Tools for fundamental frequency estimation in Rhapsodie 261
- Chapter 15. Exploration of the Rhapsodie corpus 271
- Chapter 16. Macrosyntax at work 285
- Chapter 17. The distribution of prosodic features in the Rhapsodie corpus 315
- Chapter 18. Syntax and prosody mapping: What and how? 339
- Chapter 19. Conclusion 365
- References 369
- Subject index 393