Chapter 15. Exploration of the Rhapsodie corpus

Data structure, formats and query tools
Anne Lacheret-Dujour, Sylvain Kahane, Rachel Bawden, Serge Fleury and Ilaine Wang
This chapter is in the book Rhapsodie, published by John Benjamins Publishing Company.

Abstract

This chapter describes the data structure of the Rhapsodie Treebank and discusses the methodological issues that stem from its complexity. The structure is articulated around three independent, non-aligned hierarchies (microsyntactic, macrosyntactic and prosodic), which raises challenging questions to be resolved. The chapter discusses the specific problems posed by the simultaneous processing of the phonological stream (prosodic level) and the orthographic stream (syntactic level), which are often far from isomorphic in French, and the related problem of processing disfluent and/or overlapped strings, which do not have the same representation in the syntactic and prosodic hierarchies. It then presents the formats adopted to encode prosodic and syntactic annotations and to query them simultaneously, given that the prosodic architecture is a non-recursive, time-aligned representation while the syntactic one is a recursive, tree-based representation.
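
To make the contrast between the two kinds of representation concrete, here is a minimal Python sketch. It is not the Rhapsodie format or query tool: the class names, the field layout, the labels IP1 and IP2, and the toy timings are all assumptions made for illustration. It models the prosodic level as a flat list of time intervals and the syntactic level as a dependency tree over time-stamped words, then uses time as the only bridge between them.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Interval:
    """One unit of a flat, time-aligned prosodic tier (hypothetical layout)."""
    label: str
    start: float  # seconds
    end: float


@dataclass(frozen=True)
class Token:
    """One word of a dependency tree, carrying its own time alignment."""
    idx: int
    form: str
    head: int  # index of the governor; 0 marks the root
    start: float
    end: float


def subtree(tokens, root_idx):
    """Collect the token root_idx and all its descendants (recursive walk)."""
    found = [t for t in tokens if t.idx == root_idx]
    for child in (t for t in tokens if t.head == root_idx):
        found.extend(subtree(tokens, child.idx))
    return sorted(found, key=lambda t: t.idx)


def overlapping_units(tier, tokens):
    """Return the tier intervals that overlap the time span of the tokens."""
    lo = min(t.start for t in tokens)
    hi = max(t.end for t in tokens)
    return [u for u in tier if u.start < hi and u.end > lo]


# Invented toy data: the prosodic boundary at 0.5 s falls inside "chat",
# so the two hierarchies do not align.
TOKENS = [
    Token(1, "le", 2, 0.0, 0.2),
    Token(2, "chat", 3, 0.2, 0.6),
    Token(3, "dort", 0, 0.6, 1.0),
]
TIER = [Interval("IP1", 0.0, 0.5), Interval("IP2", 0.5, 1.0)]

noun_phrase = subtree(TOKENS, 2)
labels = [u.label for u in overlapping_units(TIER, noun_phrase)]
```

In this toy case the subtree headed by "chat" spans two prosodic units, which is the kind of mismatch a joint query has to handle. The sketch treats a subtree as a single continuous time span, a simplification that would not hold for discontinuous constituents.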

Downloaded on 12.2.2026 from https://www.degruyterbrill.com/document/doi/10.1075/scl.89.16lac/html