Chapter 9. Cross-linguistic comparison of automatic detection of speech breaks in read and narrated speech in four languages
-
Plínio A. Barbosa
Abstract
This chapter tests an algorithm for the automatic detection of speech breaks in read and narrated speech in Brazilian Portuguese (BP), European Portuguese (EP), French, and German. The algorithm is independent of previous transcription or linguistic analysis (syllable, phone labeling and segmentation), requiring only the audio file. It operates in two stages: vowel onsets detection firstly, followed by V-to-V duration intervals normalization for smoothed duration z-scores. Peaks over 2.5 of the latter were considered speech breaks. Compared to human segmentation, hits for reading (70%) were higher than for narration (60%). Crosslinguistic results show EP and French having the highest proportion of hits. A test with the English Navy audio file reveals a hit proportion similar to German.
Abstract
This chapter tests an algorithm for the automatic detection of speech breaks in read and narrated speech in Brazilian Portuguese (BP), European Portuguese (EP), French, and German. The algorithm is independent of previous transcription or linguistic analysis (syllable, phone labeling and segmentation), requiring only the audio file. It operates in two stages: vowel onsets detection firstly, followed by V-to-V duration intervals normalization for smoothed duration z-scores. Peaks over 2.5 of the latter were considered speech breaks. Compared to human segmentation, hits for reading (70%) were higher than for narration (60%). Crosslinguistic results show EP and French having the highest proportion of hits. A test with the English Navy audio file reveals a hit proportion similar to German.
Chapters in this book
- Prelim pages i
- Table of contents vii
- Acknowledgments xi
- Introduction. In search of a basic unit of spoken language 1
-
Part I
- Chapter 1. Russian spoken discourse 35
- Chapter 2. The basic unit of spoken language and the interfaces between prosody, discourse and syntax 77
- Chapter 3. Prosody and the organization of information in Central Pomo, a California indigenous language 107
- Chapter 4. Syntactic and prosodic segmentation in spoken French 127
- Chapter 5. Design and annotation of two-level utterance units in Japanese 155
- Chapter 6. The pragmatic analysis of speech and its illocutionary classification according to the Language into Act Theory 181
- Chapter 7. Illocution as a unit of reference for spontaneous speech 221
- Chapter 8. Narrative discourse segmentation in clinical linguistics 257
- Chapter 9. Cross-linguistic comparison of automatic detection of speech breaks in read and narrated speech in four languages 285
-
Part II
- Same texts, different approaches to segmentation 303
- Chapter 1. Segmentation and analysis of the two English excerpts 309
- Chapter 2. Analysis of two English spontaneous speech examples with the dependency incremental prosodic structure model 327
- Chapter 3. Applying criteria of spontaneous Hebrew speech segmentation to English 337
- Chapter 4. Basic units of speech segmentation 349
- Chapter 5. Segmentation of the English texts Navy and Hearts with SUU and LUU 359
- Chapter 6. The Moscow approach to local discourse structure 367
- Chapter 7. Some notes on the Hearts and Navy excerpts according to the Language into Act Theory 383
- Chapter 8. Comparing annotations for the prosodic segmentation of spontaneous speech 403
- Index 433
Chapters in this book
- Prelim pages i
- Table of contents vii
- Acknowledgments xi
- Introduction. In search of a basic unit of spoken language 1
-
Part I
- Chapter 1. Russian spoken discourse 35
- Chapter 2. The basic unit of spoken language and the interfaces between prosody, discourse and syntax 77
- Chapter 3. Prosody and the organization of information in Central Pomo, a California indigenous language 107
- Chapter 4. Syntactic and prosodic segmentation in spoken French 127
- Chapter 5. Design and annotation of two-level utterance units in Japanese 155
- Chapter 6. The pragmatic analysis of speech and its illocutionary classification according to the Language into Act Theory 181
- Chapter 7. Illocution as a unit of reference for spontaneous speech 221
- Chapter 8. Narrative discourse segmentation in clinical linguistics 257
- Chapter 9. Cross-linguistic comparison of automatic detection of speech breaks in read and narrated speech in four languages 285
-
Part II
- Same texts, different approaches to segmentation 303
- Chapter 1. Segmentation and analysis of the two English excerpts 309
- Chapter 2. Analysis of two English spontaneous speech examples with the dependency incremental prosodic structure model 327
- Chapter 3. Applying criteria of spontaneous Hebrew speech segmentation to English 337
- Chapter 4. Basic units of speech segmentation 349
- Chapter 5. Segmentation of the English texts Navy and Hearts with SUU and LUU 359
- Chapter 6. The Moscow approach to local discourse structure 367
- Chapter 7. Some notes on the Hearts and Navy excerpts according to the Language into Act Theory 383
- Chapter 8. Comparing annotations for the prosodic segmentation of spontaneous speech 403
- Index 433