Abstract
This article presents the first analysis of the word order of Early Old Catalan by examining four texts dating from the 10th and 11th centuries. While the texts vary in nature, they all fall within the legal register, and they are all deemed suitable for the study of syntax based on extra-linguistic criteria. Approaching the data from a generative and cartographic perspective, it is shown that (i) broadly speaking, Early Old Catalan patterns with contemporary Romance texts in exhibiting an active low V2 grammar; (ii) it shares with other Old Romance varieties the use of expletive elements to satisfy the V2 parameter; (iii) Early Old Catalan differs notably from 13th century Catalan in the use of the left periphery and the application of V2. The findings in this study cast light over the fragmentation of the Romance linguistic continuum at syntactic level and open new avenues of research on the matter.
1 Introduction
The clausal syntax of Catalan before the 13th century has never been explored in the literature concerned with the history of this language since literary texts have traditionally been the pool of data explored in studies concerned with diachronic Catalan syntax (Badia i Margarit 1951; Batllori 1993; Batllori et al. 2005; Batllori and Hernanz 2009, 2015; Fischer and Xavier in press; Par 1923; Pujol i Campeny 2025a; Pujol i Campeny, Afra 2018), and literary texts in Catalan language are not attested until the 13th century. In this article we examine the sentential syntax and information structure of four texts dating from the 10th and 11th centuries (to which we refer as Early Old Catalan or EOC) to establish that: (i) Early Old Catalan patterns with contemporary varieties in presenting low V2 syntax; (ii) it shares with other Old Romance varieties the use of expletive elements to satisfy the V2 parameter; (iii) Early Old Catalan differs notably from 13th century Catalan in the use of the left periphery and the distribution of V2, and the use of expletive elements. Three of the texts examined are written in Latin, and either display non-canonical Latin word order or syntactic glosses that inform us about the syntax of the Catalan of the period under study. The remaining text is written entirely in Catalan and offers a glimpse of the language of the turn of the 11th century. In incorporating texts in Latin, this article echoes Troberg and Whitman (2022) in defending the usefulness of Latin texts of early Romance provenance as a source of data for historical syntax research. These authors specifically engage with texts with syntactic glosses, but here we also engage with texts displaying non-canonical Latin word order, widening the pool of sources used to inform us about the syntax of Early Romance.
By maximising the evidence available to us, this article fills a gap in the literature on Old Catalan syntax and provides the broader field of Romance linguistics with a nuanced account of the clausal syntax of Early Old Catalan in order to consolidate our understanding of the evolution of the Romance languages in their early days. By comparing the robust low V2 syntax of Early Old Catalan to that of 13th century Catalan, where V2 only surfaces in archaising pockets such as embedded clauses (Pujol i Campeny 2025a, in press), contrasting with main declarative clauses' unmarked SVO, we are able to establish a syntactic periodisation of the language. This allows us to position Catalan diachronically in the Romance linguistic continuum and to reassess its syntactic fragmentation after the joint development of V2 by exploring how, when, and where V2 started to weaken within the continuum, which, by the 13th century, might be more appropriately referred to as linkage (François 2014; Ross 1988).
For the analysis of Old Catalan word order within the wider Romance context, we assume a generativist definition of V2 and a cartographic conception of the left periphery (Benincà 2006; Rizzi 1997). From this perspective, V2 is understood not as a linear phenomenon but as a syntactic requirement that involves two separate operations:
| a. A functional head in the left periphery attracts the finite verb. |
| b. This functional head has an EPP feature that triggers the re-merging of a constituent to its specifier position. |
| Holmberg (2015: 375) |
A language can be considered to be V2 when it exhibits the features highlighted in (1) in unmarked declarative main clauses (Cruschina and Sailor 2022; Roberts 1993; Sailor 2020). V2 might be present elsewhere, as is the case in most Modern Romance varieties, where verb movement to the left periphery is attested in constructions associated to marked informational readings (Cruschina and Sailor 2022; Leonetti and Escandell-Vidal 2009; Pujol i Campeny 2025a; Sailor 2020), but that does not render these varieties structurally V2.
Within the cartographic programme, the left periphery corresponds to the syntactic space located above the Inflectional Phrase. It acts as a hinge between the clause and the wider discourse, and it is further divided into the following projections and fields:
| [Frame/HT [ Force | [Top | [Foc | [ Fin | [IP… |
| Adapted from Wolfe (2022: 3) |
At the extremes of the left periphery, we find ForceP, a projection associated with a clause’s illocutionary force, and FinP, a projection connected to the clause’s finiteness and mood. Between ForceP and FinP, the Top(ic)P and Foc(us)P fields host constituents receiving different types of topical and focal readings. In the case of topics, we take them to be generally base-generated in the left periphery (Benincà 2004, 2006; Benincà and Poletto 2004; Cinque 1990; De Cat 2009), while constituents occurring in the Focus field undergo movement to the projection connected with their reading. Finally, above ForceP there is the Frame field, a scene-setting space that hosts constituents anchoring the proposition contained in the clause in the wider discourse in terms of speech participants and spatio-temporal deixis (Benincà and Poletto 2004).
Depending on the landing site targeted by the verb in the left periphery (Wolfe 2018b; Wolfe 2020, et seq.), two types of V2 grammars can be distinguished. On the one hand, in relaxed or low V2 languages the verb targets FinP. This allows constituents to be first-merged in the Topic and Frame fields, preceding the verb and the constituent moved to the specifier of Fino. Therefore, in low V2 languages, V3+ orders, where more than constituent precedes the verb, are licensed, as illustrated in (3), a 12th century Old French example:
| [Frame | En | icel | termine [Force [Topic | li | marchis | Bonifaces de Monferrat |
| in | that | time | the | marquis | Boniface de Monferrat | |
| remut | de | Salenique [Focus [Fin | si [Fin° | s’en ala. . .]]]]]] | ||
| come.back.ptcp | from | Salenica | si | rfl.3sg=adv.cl=left.3sg | ||
| ‘Then at that time, Marquess Boniface of Montferrat who had come from Salenica, left.’ | ||||||
| Villehardouin 270, 456, apud Wolfe (2018a, 2018b: 355) | ||||||
In high or strict V2 grammars, the verb raises to ForceP, where it is only preceded by the element merged in the specifier of the projection. V3 orders are only ever licensed when adverbs are first merged in the Frame field:[1]
| [Frame [SpecForce | si [Force o | en | fu [Topic [Focus [Fin [SpecIP | la | reïne |
| si | of.it.cl= | was.3sg | the | queen | |
| [vP Vi [VP | mout | carrociee]]]]] | |||
| very | angry | ||||
| ‘The queen was very upset about it’ | |||||
| Mort Artu, §166, apud Benincà (2006:64) | |||||
The presence of an active V2 grammar has been shown to correlate with the following syntactic phenomena (Klævik-Pettersen 2019; Poletto 2014; 2019; Wolfe 2018a):
| i. Subject inversion: whenever a constituent other than the subject is fronted and the subject is overtly realised in postverbal position, followed by elements that are unequivocally located in vP (such as low subjects or direct objects): [CP [SpecXP XP [X’ VAux [IP S [VP VLex |
| ii. Unspecified preverbal field: fronting of an informationally unspecified constituent other than the subject to a preverbal position; i.e., elements receiving new information focus can freely occur in preverbal position, contrasting with most Modern Romance languages; |
| iii. Object scrambling: [vP [VP V O]] > [vP O [VP V |
| iv. Asymmetric pro-drop between main and embedded clauses (found in Old Northern Romance, Ledgeway (2020: §2)); |
| v. Enclisis (application of Tobler-Mussafia Law); |
| vi. Presence of expletives of adverbial origin in low V2 languages, and pronominal origin in high V2 ones (Poletto 2019). |
It is key to highlight that the only unequivocal cue for the acquisition of a V2 grammar are incontestable cases of inversion, as described in (5.i) (Lightfoot 1999; Westergaard 2009; Yang 2000). The rest of the phenomena listed here correlate with the presence of an active V2 grammar but can be parsed under non-V2 grammars. In the light of this, this paper focuses on inversion structures and the nature of the preverbal field to determine whether the data at hand would allow for the acquisition of a V2 grammar or not. Additionally, other aspects connected to the active presence of a V2 grammar are also considered, albeit in less detail due to space constraints: the possibility of encoding information focus in the left periphery, as a symptom of an underspecified preverbal field; clitic distribution obeying Tobler–Mussafia law whereby clitics occur enclitically whenever the verb occurs in clause-initial position (Ledgeway 2010);[2] and the presence of expletives, which have been identified for Old Italian (Poletto 2005, 2019), Old French (Wolfe 2018b), and Old Occitan (Donaldson 2016; Pujol i Campeny 2025c). Each of these traits is explored at length where relevant in the discussion.
We cannot undertake this analysis of EOC clausal syntax without first considering the extra-linguistic variables that shape the limited pool of data available to us.
2 Syntax, register, genre, and the data at hand
The so-called ‘bad data problem’ (Weinreich et al. 1968) lays out one of the central issues that we face in diachronic linguistics: language change occurs in spoken language, and we have to rely on written data (where available) in order to reconstruct previous stages of a language. To bypass this problem, it has been suggested that researchers should focus on written documents that bring us closer to spoken language, such as familial correspondence, trial proceedings, and dialogues from prose drama (Culpeper and Kytö 2010; Elspaß et al. 2007; Kytö 2016).
However, for certain periods and varieties, as is the case for 10th-11th century Old Catalan, we only count with scant written traces of the language of the time, occurring exclusively in legal texts, which contrast heavily with oral-like texts in terms of conventionalisation and formalisation of their register and style. While in diachronic Romance linguistics there was scepticism regarding language variation connected to medium, register, and genre in early periods when vernacular Romance was not yet standardised nor ‘established’ (Posner 1997), recent studies have clearly shown that the pre-modern textual record does reflect written-medium tendencies (Ingham 2016: 383; Pujol i Campeny 2025a, in press) and is subject to register and genre constraints that intensely interact with the syntax of texts and should be taken into account.
In this article, we qualitatively examine four texts that can be described as legal given their format and content. They fall in two categories: three legal documents of Catalan provenance written in Latin, and one code of law, written in EOC. The three Latin texts of Catalan provenance were identified on the basis of philological analyses. Two of the texts, written in the 10th century, exhibit non-canonical Latin syntax and have been described as ‘oral-like’, ‘colloquial’, and narrative in nature (Alturo i Perucho and Alaix 2022, 2023). Therefore, despite their legal nature, these texts contrast with the bulk of written evidence of the period, which tends to display formulaic and relatively correct Latin syntax. The third text, the Liber Iudicorum (11th century), contains extensive syntactic glosses (Bellés i Sallent and Alturo i Perucho 2008) akin to those analysed in Troberg and Whitman (2022) in a manuscript of Occitan provenance. The fourth text, the Usatges de Barcelona, consists on a complete legal code written entirely in Catalan and compiled during the 11th century. This text lends itself to the study of clausal syntax thanks to its length and style.
In recent years, two other texts written in Old Catalan have been re-dated as having been produced in the 11th century and the turn of the 12th: the Greuges de Guitard Isarn and a fragment of a translation of the Liber Iudicum from la Seu d’Urgell. The Greuges de Guitard Isarn or Grievences of Guitard Isarn, newly dated as produced in 1105 by Alturo i Perucho and Alaix (2023), presents very particular syntax, with a prevalence of pro-drop V1 clauses headed by the verb rancurar-se ‘to complain’ and the phrase ésser rancurós ‘to be greived’, as Guitard Isarn complains about the offences committed against him by Guillem Arnau and his son Mir Arnau.[3] The fragment of the Liber Iudicum is characterised by the use of concatenated embedded clauses and the lack of canonical and complete declarative clauses. Therefore, neither text serves the objective of this article. Finally, feudal oaths have also been excluded from the article since, despite exhibiting Catalan lexis, it is simply inserted within a formulaic Latin syntactic frame (Russel-Gebbett 1965). Finally, the last potential candidate to appear in this study would be the Cançó de Santa Fe, an 11th century poem. Two reasons have brought us to dismiss it: on the one hand, meter requirements heavily interact with word order, rendering verse an unreliable source of information. On the other, its linguistic filiation (whether the text instantiates early Catalan or early Occitan) remains contested to this day (Lafont 1998; Riquer 1964: 197–300).
The first text examined here is a 10th century diploma written by a certain Presbyter Borrell in Vic on April 5th 933 (Vic, ABEV, AC, cal. 6, núm. 152). Diplomas were legal documents emitted by someone with legal authority (the precise title and function of the person emitting them depended on the geographical location)[4] where an event, agreement, or testimony was presented as a true fact. The text is studied from a philological perspective in Alturo i Perucho and Alaix (2021: 207), where they describe its language as follows:
| ‘a rather more colloquial expression (…) has a narrative tone less dependent on traditional diplomatic formulae, and thus, of a freer and more personal character’5 |
| Alturo and Alaix (2021: 207) |
- 5
Translations from Catalan are my own.
We then examine two documents produced by Judge Bonsom of Barcelona, active during the last quarter of the 10th century and the first quarter of the 11th century. The first document, written in the vicinity of Barcelona in 988, consists of a diploma of narrative nature telling the story of two brothers and how they came to inherit the property they owned. Alturo i Perucho and Alaix (2022) suppose that Bonsom used notes made by someone else when composing the text, given the presence of a few errors attributed to the copying process. Bonsom’s Latin is described as grammatically correct but not spotless.
The second document produced by Bonsom analysed here is his copy of the Liber Iudicorum, a 7th century code of law containing an admixture of Visigothic and Roman law, as well as laws deriving from Conciliar activity (Alturo i Perucho 2003; Bellés i Sallent and Alturo i Perucho 2008; Collins 2004). The code was applied in the Iberian Peninsula from the fall of the Roman empire, throughout the Arab invasion, and until the High Middle Ages (Bastardas 1984). By the 11th century, it was slowly being replaced by feudal costums. Bonsom’s copy, the Liber Iudiciorum Popularis, accompanies the core legal text with a variety of additional content, including a eulogy of St Isidore of Seville, a chronicon of Frankish kings, a calendar of religious festivities celebrated in Barcelona and the surrounding area, rendering the copy a pedagogical device in addition to a legal one. We focus on two fragments of the Liber, a metaphrase, found in the opening of the first book (fol. 12v-13r, 20 lines, 155 words), and the syntactic glosses of a fragment concerned with inheritance law (fol. 103v-105r, 65 lines, 728 words).
Finally, we discuss the syntax of the Usatges de Barcelona (the Customs of Barcelona). While in the opening of the text its composition is attributed to Ramon Berenguer I and his wife Almodis of the Marche in 1068, the first compilation of the text that has reached us is dated around 1179. Costums were added to the text until the 13th century, when it was fixed and canonised under the kingship of James I (Martí i Castell 2002). Therefore, the dating of this text is to be taken with caution, but as shown below, it exhibits archaic features that set it apart from 13th century Catalan and would place it in an earlier period. We follow Alturo i Perucho and Alaix’s (2023: 191) dating of the text, grounded on philological information.
In what follows, we analyse the syntax of each of these texts to see a picture of the sentential syntax of EOC emerge clearly: it had an active low V2 grammar, patterning with the results of research on the syntax of other Early Old Romance varieties,[6] but contrasting with 13th century Catalan, as described in Pujol i Campeny (2018, 2025a, in press). The steps followed for the analysis of each text depend on its length. Both Borrell’s diploma and Bonsom’s diploma are short texts of less than 1,000 words (300 for the former and 728 for the latter). Their syntax is analysed exclusively qualitatively, with frequency being solely taken as a potential indicator of clausal structure, and not a reflection of the underlying syntax. Texts are closely analysed to establish the presence or absence of the features listed in (5) for the identification of V2. Bonsom’s metaphrase and syntactic gloss lend themselves to a more nuanced analysis. Patterns of change from the initial Latin word order to the paraphrased or glossed word order are explored to determine whether the grammar reflected in the metaphrase exhibits features aligned with those of V2 or not. Finally, in the analysis of the Usatges, we explore the clausal syntax of 400 clauses contained in the first 10 folia of the text, belonging to the 11th century core. In this case, frequency of word order patterns is explored, but simply as an indicator of what the underlying syntax might be. The Usatges are then qualitatively analysed in order to identify the presence (or lack thereof) of structures that correlate with an active V2 grammar. Given their length and the fact that it is the first text written entirely in Catalan of its extension, it affords the possibility of a more detailed discussion clausal syntax and the conclusions reached in this paper pivot mainly on evidence from this text.
3 The 10th century: Judge Borrell’s diploma
Here, we examine the sentential word order of Judge Borrell’s diploma. Given the length of the text, evidence from this the text can only hint at the operating grammar of the time through the analysis of word order tendencies, clitic placement, and the use of expletives.
There is a clear preference for (S)VO orders in both main (8/12) and embedded clauses (12/14), contrasting with Classical Latin (Wanner 1987: 380):
| Et | ego | Adalgis | adquieui | tua | voluntate | ut (…) |
| and | I | Adalgis | agreed.1sg | by.your | will | that |
| ‘And I, Adalgis, agreed, as per your wishes, that (…)’ | ||||||
| Borrell, paragraph 2 | ||||||
| (…) | si | procreauerit | proles, |
| if | produced.3sg | offspring | |
| ‘if he produced offspring, (…)’ | |||
| Borrell, paragraph 3 | |||
Medieval Romance clitic pronouns abide by the so-called Tobler-Mussafia Law, which can be loosely formulated as the descriptive observation stating that object clitics not being able to appear in clause initial position. In Borrell’s diploma, the distribution of Latin demonstratives is akin to that of Medieval Romance clitics, consistently appearing proclitically when a suitable phonological host is available preverbally:
| Et | ego | Nectar | presbiter | sic | eum | redimo | de | mea |
| and | I | Nectar | presbiter | sic | him= | liberate.1sg | from | my |
| rem | propria | propter | Deum | et | remedium | anume | mee | |
| thing | own | for | God | and | salvation | soul | mine |
| Et | sic | illi | perdono | ipsos | solidos | XXX | ab | integrum, (…). |
| and | sic | to.him= | forgive.1sg | those | solidi | 30 | to | full |
| ‘And I, presbiter Nectar, I liberate him with my own money for God and the salvation of my soul. And I forgive him these 30 solidi fully (…).’ | ||||||||
| Borrell, paragraph 4 | ||||||||
In (9), the demonstratives eum and illi appear proclitically, as expected given that the adverb sic ‘thus, indeed’ preceds the verb. The Romance equivalent of sic, sí, sì, or si depending on the orthography, is found in preverbal position in the earliest Langue d’Oil text, the Oaths of Strasbourg (842), and in the Greuges de Guitard Isarn, an early 12th century Catalan text:
| Pro Deo amur | et | pro | christian | poblo | et | nostro | commun |
| for God love | and | for | Christian | people | and | our | common |
| saluament, | d’ist | di | en | auant, | in quant | Deus | |
| salvation | from;this | day | in | onwards | in as much | God | |
| sauir | et | podir | me | dunat, | si | saluarai | eo |
| knowledge | and | power | to.me= | gave.3sg | sic | save.1sg.fut | I |
| cist | meon | fradre | Karlo, (…). | ||||
| this | my | brother | Charles | ||||
| ‘For the love of God and the christian people and our joint salvation, from this day forward, in what knowledge and power God has bestowed upon me, I will stand by this brother of mine Charles, (…)’ | |||||||
| Oaths of Strasbourg | |||||||
| Et | fuit rancurós de Guilelm Arnal e | ssón·o | del | fil | quar | |
| and | was grudgy of Guilelm Arnal and | am=it.CL | of;the | son | since | |
| sí pres | mils | Mir Arnall | che | jo no·l | li | doné, |
| si took.3SG | better | Mir Arnal | that | I not=it.cl | to.him.cl= | gave.3sg |
| e·l | sí | fed | a | ssi | solidar (…) | |
| and he | si | did.3sg | to | himself | consolidate.inf | |
| ‘And I hold a grudge against Guillem Arnau and his son, since he took more from Mir Arnau than I had given him, and he did so to consolidate his position, (…)’ | ||||||
| Greuges de Guitard Isarn | ||||||
In (10), the earliest instance of si recorded in Old French, a low V2 language at the time (Wolfe 2018b) is reproduced. Si appears between a concatenation of scene setters and the verb. In (11), si occurs twice, first, within an embedded adverbial clause introduced by quar/car ‘since’, which can be described as a peripheral adverbial clause, as it does not modify the structure of the event contained in the main clause (Haegeman 2004: 61). Peripheral adverbial clauses are characterised by not being integrated in the core structure of the main clause, and by being inserted in the left periphery. Internally, they present a fully-fledged periphery and possess their own illocutionary force, which translates into them being able to accommodate main clause phenomena (Haegeman 2012: 173; Hooper and Thompson 1973). Therefore, in the case of the quar clause, V2 syntax is expected to apply (Haegeman 2012: 179), and si satisfies the requirement of filling SpecFinP, assuming that the verb raises to Fino, in a low V2 configuration.
In the most recent analysis of the particle in Old French, Wolfe (2018b; 2020) proposes that si starts off as a temporal deictic adverb that undergoes movement to the left periphery to satisfy the language’s V2 requirement in the 11th century, and then, grammaticalises as a FinP expletive by the late 12th century. As such, it merges directly in SpecFinP when there would be no constituent available to move to it (Wolfe 2018b: 349).
In examples (9–11), it is not possible to establish whether sic/ sí is an adverb moved to the left periphery or an expletive. According to Wolfe (2018b), Old French si does not reach expletive status until the 12th century. Nevertheless, there are no cases of postverbal si in the Early Catalan textual record,[7] which would be expected if it were still used as a full adverb like així/axí/aissi ‘thus’,[8] which does appear postverbally when not fronted for specific informational purposes. This suggests that in the examples above sí being used as an expletive.
Old Occitan si exhibits the same distribution as that shown in examples (8–10). Donaldson (2016) argues that si to fulfils the second requirement of Old Occitan’s low-V2 grammar that by appearing preverbally in sentences where a dislocated subject introduces a new discourse referent, as in (12), or where the verb is preceded by an element occurring in the Frame Field, as in (13):
| Bertrans de Born | si | fo | un | castellans… |
| Bertran de Born | si | was.3sg | a | nobleman |
| ‘Betran de Born was a nobleman (…)’ | ||||
| Vida XIA; Boutière 1964: 65, apud Donaldson (2016, example 18) | ||||
| E | [quan | lo | vescons s’en aperceup], | si s’estranjet. |
| and | when | the | viscount himself;of.it=realised.3sg | si himself=went away.3sg |
| ‘And when the viscount learned of it, he distanced himself’. | ||||
| Vida VIA; Boutière 1964: 20, apud Donaldson (2016: 170, example 19) | ||||
Old Occitan, in line with the other Medieval Romance varieties (Batllori et al. 2005; Benincà 2006), presents focus sensitive enclisis/proclisis variation (Batllori et al. 2005; Benincà 2006; Troberg and Whitman 2022; Wolfe 2021b). When si or a focalised constituent is present, it triggers proclisis, as in (13). In contrast, when the element satisfying the second step of the V2 requirement is topical in nature and si is not present, enclisis follows, as in (14):
| E | qant | s’en | venia | per | Roine | en | una |
| and | when | himself.3sg;of there= | came.3sg | by | Rhône | in | one |
| barca | preiren | lo | li | pescador | d’En | Aimar | |
| boat | took.3pl = | him | the | fishermen | of;Sir | Aimar | |
| ‘And when he came via the Rhone in a boat, Sir Aimar’s fishermen took him.’ | |||||||
| Vida LXXII; Boutière 1964: 485, apud Donaldson (2016: 485) | |||||||
On the basis of clitic placement, Donaldson supposes si to occur in FocP:
| [ForceP [FrameP [TopicP | Bertrans de Born [FocusP | si | [Focus o fo]]]]] | uns | castellans. |
| Bertrand of Born | si | was.3sg | a | castella | |
| ‘Bertran de Born was a nobleman.’ | |||||
| Vida XIA; Boutière (1964: 65), apud Donaldson (2016: 48) | |||||
Donaldson’s analysis of si in the Vidas matches the distribution that we find in Borrell’s diploma, where sic occurs between the subject ‘ego Nectar presbiter’ (which acts as discourse topic), and the verb, resulting in proclisis. This suggests that the grammar of this text aligns with that of Old Occitan, a low V2 language (Donaldson 2016; 2020; Wolfe 2015a, 2015b: chap. 4).
Summing up, Borrell’s diploma (i) displays a clear preference for (S)VO word orders in both main and embedded clauses, contrasting with Latin (S)OV; (ii) shows Latin demonstrative pronoun distribution patterning with the expected distribution of Romance clitic pronouns; (iii) the distribution of sic patterns with that of contemporary texts in general, and more particularly, with that of 13th century Old Occitan si as analysed by Donaldson (2016), where it is connected with the satisfaction of a low V2 requirement in specific syntactico-pragmatic configurations.
4 The 10th century: Judge Bonsom’s diploma
Composed 55 years later than Borrell’s diploma, Bonsom’s diploma presents the expected genre features (understood in the sense of Bieber and Conrad 2009: chap. 2) of this kind of legal document, but it is set apart by its eminently narrative nature (Alturo i Perucho and Alaix 2022). We first consider word order tendencies, then turn to clitic placement and function, and finally, we examine cases of XVS or subject-verb inversion.
Like Borrell’s diploma, Bonsom’s shows a preference for (S)VO word order, especially in narrative segments, as in (16) and (17). In contrast, the opening, closing and final paragraphs, where legal matters are addressed, (S)OV is dominant due to the use of Latin formulaic expressions.
| Et | prelibata | mater | nostra | non | ualuit | uiuere | sine |
| and | beloved | mother | ours | not | be.able.3sg | live.inf | without |
| uiro, | transiuit | ad | maritum | satis | simplicem, | ||
| husband | go.over.3sg | to | husband | rather | honest | ||
| nobilem | et | opulentissimum, (…) | |||||
| noble | and | very.rich | |||||
| ‘And our aforementioned mother was not able to live without a husband, she took a rather honest, noble and rich husband, (…).’ | |||||||
| Bonsom 998, paragraph 2 | |||||||
| Et | nos | transiuimus | ad | tuitionem | de | cuidam | auunculo |
| and | we | went over.1pl | to | guardianship | of | certain | uncle |
| nostro, | ad | quibus (…) | |||||
| ours | to | whom | |||||
| ‘And we passed over to the guardianship of a certain uncle of ours, to whom (…)’ | |||||||
| Bonsom 998, paragraph 2 | |||||||
In this diploma, demonstratives standing in loco Romance clitics also inform us about the syntax of the language of the time, being used in the core of the clause to resume clitic left dislocated elements, as expected in Medieval Romance (Benincà and Poletto 2004; Benincà 2006):
| Et | [prescriptas | uineas] i | uendidit eas i | per | cartulam |
| and | aforementioned | vines | sold.3sg=them.cl | by | certificate |
| uenditionis | ad | Geribertum, (…). | |||
| of.sale | to | Geribert | |||
| ‘And the aforementioned vines, she sold them by sale certificate to Geribert, (…)’ | |||||
| Bonsom 998, paragraph 8 | |||||
| (…) | [predictam | comunicationem | et | prenotatam |
| aforementioned | communication | and | marked | |
| uenditionem] i | laudamus | eas i , (…) | ||
| sale | praise.1pl | =those.cl | ||
| ‘(…)the aforementioned permutation and said sale, we praise them’. | ||||
| Bonsom 998, paragraph 9 | ||||
In both (18) and (19), the makeshift clitic, appearing in the guise of a Latin demonstrative, occurs postverbally. Following the classic Tobler–Mussafia law, it would be expected to occur preverbally, as the verb is preceded by a constituent. Nevertheless, if we follow Batllori et al. (2005); Benincà (2006) and Donaldson (2016) analysis of clitic placement in Medieval Romance, proclisis is not connected to the presence of any constituent in preverbal position, but dependant on the presence of a focalised element (be it a constituent or a head) in preverbal position, interacting with the information structure and the prosody of the clause. (18) and (19) abide by Batllori et al. (2005)’s analysis, in that the presence of a topicalised constituent in preverbal position correlates with enclisis instead of proclisis. The clitic distribution attested in Borrell’s diploma instantiates a grammar of Catalan that is already distinctive from that of Oil dialects, where clitic distribution abides by the Tobler–Mussafia law in the classical sense. This is confirmed by cases such as (20), where proclisis is triggered in the presence of a fronted focalised constituent:
| Cum | nos accepisset, | onerosum | ei fuit | nostram |
| when | us.cl=accepted.3sg | burdensome | to;him.cl=was.3sg | our |
| paruitatem | sufferre. | |||
| young age | endure.inf | |||
| ‘Upon accepting us, it was burdensome to him to endure our young age, (…)’ | ||||
| Bonsom 998, paragraph 2 | ||||
(20) also supports the presence of an active low V2 grammar in 10th century Catalan. The temporal clause acts as a scene setter, and as such, it is first-merged in the frame field and cannot check the EPP feature on FinP, assuming the low V2 analysis suggested by the presence of si and the distribution of clitics in Borrell’s diploma. The adjective onerosum ‘burdensome’, functioning as a subject complement, is fronted to the left periphery triggering linear XVS and satisfying the second requirement of low V2. The presence of onerosum ‘burdensome’ preverbally shows that the preverbal space can host information focus, which is not the case in Modern Catalan. However, since the precise position of the subject cannot be determined, as the object clause that follows it is heavy and its low position within the clause could be motivated solely by its weight, the analysis remains speculative.
In brief, Bonsom’s diploma reflects a grammar with a preference for (S)VO orders and informationally sensitive clitic placement aligned with Batllori et al. (2005)’s analysis. It further points at the presence of an active low V2 grammar with an informationally unspecified preverbal field.
5 The 11th century: Judge Bonsom’s Liber Iudiciorum Popularis
As mentioned above, Bonsom’s copy of the Liber Iudicorum was not a mere copy of the laws applying at the time, but a multidimensional pedagogical tool. Two fragments are of special interest for the study of the evolution of the syntax of Catalan (covered in more detail in Pujol i Campeny (2013; in Pujol i Campeny 2025b)): a metaphrase, found in the opening of the first book; and (fol. 12v-13r, 20 lines, 155 words), and the syntactic glosses of a fragment concerned with inheritance law (fol. 103v-105r, 65 lines, 728 words). The metaphrase consists in the reorganisation of the lexical items of the first paragraph of book 1 into EOC syntax. This mechanism for rendering the text accessible to a reader without full mastery of Latin allows for Bonsom to omit or introduce lexical items as required by EOC grammar, while striving to do his best to adhere to the Latin material as much as possible. The syntactic glosses do not afford Bonsom this flexibility. In them, Bonsom labels each individual word with a letter of the alphabet (or dots ‘·’, ‘:’, ‘⁝’, once he runs out of letters), as shown in Figure 1:

Reproduction of the beginning of the syntactic gloss, (Manuscript Z-II-2, Fol. 103v).
In order to establish the presence or absence of an active V2 grammar, we assess the syntax of the metaphrase (henceforth M) and the glosses (henceforth G) in relation to (i) their sentential word order patterns; (ii) the syntax and information structure of verb-initial clauses; (iii) the distribution and syntax of XV(S) clauses and the nature informational and categorial nature of their preverbal constituent.
Table 1 shows word order changes from the original Latin text to M and G in both main and embedded clauses:
Word order changes in M and G.
| Main clauses | Embedded clauses | Total | % | ||
|---|---|---|---|---|---|
| (S)OV to (S)VO | 17 | 11 | 28 | 50 % | |
| Other configuration to (S)VO | 6 | 6 | 12 | 21 % | |
| V1 to SVO | 2 | 0 | 2 | 4 % | |
| Other configuration to XVS | 2 | 1 | 3 | 5 % | |
| SVO to VSO | 0 | 2 | 2 | 4 % | |
| Other configuration to VS(O) | 0 | 2 | 2 | 4 % | |
| Same | VO | 1 | 0 | 1 | 2 % |
| XXVS | 1 | 0 | 1 | 2 % | |
| XSV | 0 | 1 | 1 | 2 % | |
| SXXV | 0 | 1 | 1 | 2 % | |
| SVX | 1 | 0 | 1 | 2 % | |
| SV | 0 | 1 | 1 | 2 % | |
| XXVX | 1 | 0 | 1 | 2 % | |
| 31 | 25 | 56 | 100 % | ||
Approximately half of main and embedded clauses undergo reorganisation from Latin (S)OV to (S)VO. Another 20 % of clauses, initially presenting other word order patterns, also surface as (S)VO in M and G; as well as two initially verb initial clauses. Thus, (S)VO makes up for 75 % of the total of clauses contained in M and G. Only 7 % of clauses present XVS orders that could point at cases of inversion. Five of them derive from other word order patterns and are reorganised as XVS, while one of them preserves XVS from the original text.
Examples (21) and (22) contain two cases of XVS:
| Original word order: |
| In | inprovisis | certe | acuta | se | expetit | ratio |
| in | unforseen | surely | sharp | itself | demands | reason |
| indagatione | cognosci; | in | non | ignotis | autem | |
| inquiry | be.known.inf | in | not | unknown | however | |
| experimento | faciendi | se | properat | reserari. | ||
| through.experience | by.doing | itself | promptly | be.manifested.inf | ||
| Fol. 12v | ||||||
| Metaphrase: | ||||||
| Certe expetit se ratio cognosci acuta indagatione in inprovisis; in non ignotis autem properat se reserari experimento fatiendi. | ||||||
| M, s. 5 | ||||||
| ‘In innovation, reason demands surely to be known through research, however, when it comes to known things, it promptly manifests itself through experience in acting’. | ||||||
| Original word order: |
| Cum | vero | expertos | usus | in | speculum | uisionis |
| when | in fact | experimental | practice | in | mirror | of.vision |
| fides | ueritatis | adducit. | Non | iam | materiam | forme |
| faith | of.truth | leads.3sg | not | already | material | through.form |
| ratiocinatione | dicti | sed | operationem | facti | deposcit. | |
| rationalisation | of.saying | but | act | of.doing | requires | |
| Fol. 12v | ||||||
| Metaphrase: | ||||||
| Cum vero adducit fides ueritatis expertos usus in speculum uisionis non iam deposcit materiam forme ratiocinatione dicti sed operationem facti. | ||||||
| M, s. 7 | ||||||
| ‘When faith in truth leads to an experimental practice in the mirror of contemplation, it does not require verbal disquisition but the act of doing’. | ||||||
In (21), the original sentence is reorganised for the adverb certe ‘certanly’ to appear preverbally. The verb expetit ‘demands’ is immediately followed by the subject, ratio ‘reason’, which is in turn followed by the infinitival object clause headed by cognosci ‘to be known’. If we assume the verb to be in FinP, certe then needs to be a manner adverb akin to Old Catalan cert ‘surely, with certitude’, generated in the low adverbial space (Cinque 1999) and moved to SpecFinP to satisfy the second requirement of low V2. From there, it raises to TopP. This supported by the position of the reflexive clitic pronoun se, which, unlike in the original sentence, is enclitic. As we saw in Sections 3 and 4, clitic placement appears to be sensitive to the saturation of FocP, and enclisis occurs when the element that precedes the verb is topical. The fact that se occurs in enclisis can be used as evidence of the topicality of certe.
In (22), the embedded clause headed by cum is reorganised to display XVSOX. vero is maintained as the first element of the clause, following the subordinating conjunction cum. Kroon (1995) describes vero as a subjective modality marker that is used to contrast the content of a proposition against the discourse background. As such, its natural position would be FocP, if available. Cum introduces a peripheral adverbial clause that constitutes a separate illocutionary act from that of the main clause (Haegeman 2012: sec. 4.5). In terms of internal syntax, peripheral adverbial clauses present a fully-fledged left periphery (contrasting with central adverbial clauses, which have a truncated left periphery). Therefore, cum is merged in ForceP, and all projections below this head are available, including FocP. This translates into peripheral adverbial clauses exhibiting Main Clause Phenomena (Hooper and Thompson 1973), amongst which, V2. The choice of vero as the element preceding the verb in the metaphrase could be motivated to the contrastive nature of the adverb, which, after moving to SpecFinP, would have moved to FocP to receive emphasis. The verb is immediately followed by the subject, which is in turn followed by the object. The fact that the subject is followed by the object renders the position of the former ambiguous between SpecIP and SpecVP, as in both cases, the object would follow the subject.
In terms of information structure, informationally new objects are consistently located in postverbal position. The only case in which it could be argued that informationally novel elements are located left-peripherally is (20), where the adverb certe is fronted. However, the subject, informationally old, is followed by an informationally new object. The postverbal subject in (21) is also informationally old. The possibility of having informationally old subjects postverbally highlights the fact that the preverbal field is not specialised for hosting informationally old subjects, as is the case in modern SVO Romance languages, where postverbal subjects tend to be informationally new.
Summing up, the Liber Iudiciorum exhibits cases of inversion consistent with an active V2 grammar, and the distribution of new and old information is compatible with it.
6 The 11th century: Usatges of Barcelona
The extension of this text affords us the chance to obtain a more fine-grained picture of EOC syntax in the 11th century and confirms that EOC had an active low V2 grammar, a fact that was hinted at in the data examined in the previous sections. First, we examine the position of the verb in main and embedded clauses. Then, we move onto the analysis of the information structure of the preverbal field and cases of inversion, and the syntax of V1 clauses. Finally, we consider the distribution of potential expletive elements.
Table 2 shows the frequency of linear verb position in the Usatges:
Verb position in Usatges.
| V1 | V2 | V3* | Total | |
|---|---|---|---|---|
| Main | 47 (23 %) | 110 (53 %) | 49 (24 %) | 206 |
| Embedded | 84 (42.7 %) | 84 (44.7 %) | 26 (13.7 %) | 194 |
In both main and embedded clauses, verb-second clauses are the dominant word order pattern. In main clauses, V1 and V3 clauses constitute roughly a quarter of the total. In embedded clauses, V1 clauses are almost as frequent as V2, constituting 42.7 % of the total, while V3 clauses are the least frequently attested word order pattern, occurring only in 13.7 % of cases. Within V1 main clauses, 35 are preceded by et (17 %), while 12 are absolute verb-initial clauses (6 %).
We now turn to the analysis of the preverbal field and cases of inversions. The nature of the preverbal element of V2 main clauses is shown in Table 3:
The preverbal field of main V2 clauses in the Usatges.
| % | XVS/non pro-drop | ||
|---|---|---|---|
| Subject | 34 | 32 % | – |
| Object | 5 | 5 % | 2/2 |
| Adjunct PPs | 15 | 14 % | 5/6 |
| Adverbs | 12 | 10 % | 2/2 |
| Adverbial CP | 3 | 3 % | – |
| Protases | 39 | 36 % | 3/6 |
| Total | 108 | 100 % | 11/15 |
The most common preverbal element in V2 clauses are protases (36 %). This particularity is connected to the nature of the fragments of the Usatges examined, which present the reader with specific situations and their corresponding legal response in the form of ‘if x, then y’. Protases are followed in frequency by subjects, which constitute 33 % of preverbal constituents. They include noun phrases (28/36), pronouns (4/36), and subject que-clauses (2/36). They are followed in frequency by adjunct PPs, adverbs, and adverbial CPs. Within the total of V2 clauses, 15 present postverbal subjects, and in 11 of these cases, the subject appears adjacent to the verb.
Table 4 shows the different possible combinations of elements in the preverbal field of the Usatges:
The preverbal field of main V3* clauses in the Usatges.
| 1st XP | 2nd XP | 3rd XP | Total | Non pro-drop XVS |
|---|---|---|---|---|
| Adverb | Adjunct PP | – | 5 | – |
| Object | 1 | – | ||
| Subject | – | 2 | – | |
| Adjunct PP | 1 | – | ||
| Adv | Subject | 1 | – | |
| Subject | Adv | Adv | 1 | – |
| – | 3 | – | ||
| Object | – | 1 | – | |
| Adjunct PP | – | 2 | – | |
| Participle | – | 1 | – | |
| Protasis | Adverb | – | 3 | – |
| Protasis | – | 1 | – | |
| Subject | – | 6 | – | |
| Adjunct PP | – | 2 | – | |
| Object | Adverb | 1 | 1 | |
| NP-HT | Object | – | 1 | – |
| NP-ClLD | Protasis | – | 1 | 1 |
| Subject | – | 2 | – | |
| Adjunct PP | Adverb | – | 2 | – |
| Subject | – | 8 | – | |
| Protasis | – | 1 | – | |
| Object | – | 2 | 1 | |
| PP | – | 1 | 1 | |
| Total | 49 | 5/29 |
It can be observed that adjuncts (adverbial or prepositional), scene setters, and other elements associated to the frame field (clitic left dislocated noun phrases, nominal hanging topics, protases, and dislocated subjects) tend to occur in sentence initial position and can follow and precede each other. This is so because multiple scene setters can be merged in the Frame field. Whenever objects appear preverbally, they are verb adjacent (with one exception, where an adverb intervenes between the object and the verb). Only 5/29 cases of XVS have an overt postverbal subject. Of these, four are verb adjacent, and one is not.
For the purposes of determining whether the Usatges display an active low V2 grammar, we focus on cases of XVS with overt subjects. By examining them closely, it can be established that: (i) the verb raises to the low left periphery across subjects in SpecIP and below elements in the upper left periphery; (ii) the preverbal field is informationally unspecified (it can host both rhematic and thematic constituents), and (iii) there are three subject positions available for Old Catalan subjects: the left periphery, SpecIP, and SpecvP.[9]
Cases of XVS in the Usatges show that the preverbal field is informationally unspecified: in (23), the preverbal constituent introduces a new referent that has not been discussed before, the village of Les Roques, while in (24) it contains an aboutness topic using Frascarelli and Hinterhölzl (2007: 1) topicality scale. In both cases, the subject surfaces postverbally. Assuming an underlying V2 analysis, this would show that any constituent, regardless of its informational status, can satisfy the V2 requirement.
| [Les Roques] | agen | los | princeps | en | aytal | senyoria |
| the rocks | have.3pl | the | princes | in | such | lordship |
| ‘May the princes have the Rocks in such lordship’. | ||||||
| Usatges, Fol. 7v | ||||||
| [De | tots | homes | esters, | caualers | de | burgenses | et | de battles |
| of | all | men | lay | knights | from | towns | and | of mayors |
| et | pageses], | stabliren | los | sobredits | princeps | |||
| and | knights | established.3pl | the | aforementioned | princes | |||
| quels | senyors | dels | agen | la | tercera | part | ||
| that;the | lords | of;them | have.3pl.sbjv | the | third | part | ||
| en | la | esmena | dels | ocises, (…) | ||||
| in | the | amending | of;the | killed | ||||
| ‘Regarding all laymen, town knights, and about mayors and peasants, the aforementioned princes established that their lords should have a third of their death compensation, (…)’. | ||||||||
| Usatges, Fol. 10r | ||||||||
Supporting that the verb is located in the left periphery are several cases of Germanic-style inversion, where the subject appears between a modal verb and a lexical verb,[10] as in (25–26):
| [Daltres | bayes | o | mals | fetes | quis | poden | ||
| of;other | offences | or | wrongs | done | which=rfl.3pl | can | ||
| esmenar | ho | redeger] | deu | hom | fer | dret | a | son |
| amend.inf | or | rectify.inf | must.3sg | one | do.inf | right | to | his |
| senyor | axi | com | es | costuma | daquesta | terra | ||
| lord | thus | as | is | custom | of;this | land | ||
| ‘Regarding other offences or wrongdoings that may be amended or rectified, one must legally do so as is customary in this land’. | ||||||||
| Usatges, Fol. 4v | ||||||||
| Si·ls | richs | homes | repten | los | lurs | cavalers, | |
| if;the | rich | men | challenge | the | their | knights | |
| deuen | los | caualers | fer | a | els | axi | con |
| must | the | knights | do.inf | to | they | thus | as |
| farien | si | eren | reptats | del | princep. | ||
| do.3pl.cond | if | were.3PL | challenged | of;the | prince | ||
| ‘If the rich men challenge their knights, the knights must answer to them as they would to the prince’ | |||||||
| Usatges, Fol. 5r | |||||||
In (25), inversion is borne with a fronted direct object that can be contrastively interpreted against the background occurring preverbally. The modal finite verb is followed by the pronominal subject hom ‘one’, which is in turn followed by the non-finite form fer ‘to do’ and the indirect object. In (26), the finite modal is preceded by a protasis and followed by a finite, informationally old subject. Like in (25), the non-finite verb is followed by verbal complements, in this case, an indirect object and an adverbial embedded clause that acts as direct object (ordering between the two probably being determined by the DO’s heaviness).
Contrasting with (25–26), in (27) the presence of a preverbal protasis does not trigger inversion. Instead, the subject appears preverbally, and the object follows the verb, in a typical (S)VO pattern.
| E | [si | alcuna | cosa | fara | mal | a | aqueles] | lo |
| and | if | some | thing | do.3sg.fut | harm | to | those | the |
| princep | fasza | esmena | en | doble, (…). | ||||
| prince | do.3sg.sbjv | amend | in | double | ||||
| ‘And if something damages them, the prince will compensate it with twice the value, (…).’ | ||||||||
| Usatges, Fol. 5r | ||||||||
Finally, (28) contains an example of a postverbal subject occurring after the non-finite verb form, followed by the lexical verb’s complements. Therefore, we can establish that subjects can occur preverbally, as in (27), and postverbally, in two positions: in SpecIP, as illustrated by (25) and (26) and its base-generated position, in SpecvP, as in (28).
| Si | caualers | no | tendran | aquels | feus | e·ls demanen, | ||
| if | knights | not | have.3pl.fut | those | fiefs | and;them=ask.for.3pl | ||
| deuen | prouar | aquels | per | testimoni | ho | per | cartes | |
| must.3pl | prove.inf | those | by | witness | or | by | letters | |
| que | aquels | feus | lus | fossen | donats | per | los | senyors |
| that | those | fieds | to;them= | were.3pl.sbjv | given | by | the | lords |
| ‘Were the knights not to be in possession of those fiefs and be asked for them, they must prove by means of witnesses or letters that those fiefs were bestowed upon them by their lords’. | ||||||||
| Usatges, Fol. 5v | ||||||||
All cases of inversion occur with transitive verbs. This is noteworthy because it shows that inversion in Usatges is not aligned with inversion in Modern pro-drop Romance languages, where unaccusative predicates favour postverbal subjects, and discards the possibility of Romance-style inversion, which is predicate and information structure conditioned, with new subjects tending to occur postverbally (Corr 2016; Hulk and Pollock 2001; Pinto 1997; Sheehan 2010; Tortora 2001).
Elements other than the subject can appear between modal auxiliaries and non-finite verbs. Consider (29) and (30):
| E | per | aquesta | aytal | manera | pot | pare | ho | aui |
| and | for | this | such | way | can.3SG | father | or | grandfather |
| [son fil | ho | sa | fiyla | ho | son | net | ho | sa |
| his son | or | his | daughter | or | his | grandson | or | his |
| neta] | meyorar | |||||||
| granddaughter | improve | |||||||
| ‘And according to this very usage a father or grandfather can improve [the inheritance of] their son or daughter or grandson or granddaughter’. | ||||||||
| Usatges, Fol. 8r | ||||||||
| Los | feus | que | tenen | los | caualers | si·ls | senyors |
| the | fiefs | that | have.3pl | the | knights | if;the | lords |
| ho atorgaran | deuen | los | caualers | a | aquels | auerar. | |
| it=grant.3pl.fut | must.3pl | the | knights | to | those | prove.inf | |
| Regarding the fiefs held by knights, if granted by lords, the knights must prove it [holding the fiefs] to them. | |||||||
| Usatges, Fol. 5v | |||||||
In (29) and (30), the subject is followed by a verbal complement: a direct object in (29) and an indirect object in (30), where the direct object is dropped. In these two cases, the object has been moved to the middle field, the syntactic space between IP and vP, which according to Belletti (2006), and Poletto 2006, could be understood as a low left periphery containing projections correlating with different information categories. Leaving the analysis of Old Catalan middle field for later work, what is worth noting from these two examples is the respective order between the constituents occurring between the auxiliary and the infinitive: the subject always occurs verb-adjacent and is followed by any other element that might appear between the two verbs. This is so because the subject is not located in the middle field or the lower left periphery, but in SpecIP, and the auxiliary has moved across it to the low left periphery. Were the subject to be located in the middle field, we would expect object–subject sequences to be possible, but they are not attested in the Usatges.[11]
As shown in (30), the verb can be preceded by more than one constituent. When that is the case, elements preceding the verb always display the same ordering, which aligns with descriptions of the Medieval Romance left periphery:
| [Force Co [Relwh Co]/{Frame [ScSett][HT] Co} {Topic [LD] [LI] Co} {Focus[I Focus] |
| [II Focus]/[Interrwh] Co} [Fin Co |
| Benincà (2006: 77) |
This is further confirmed by the following V3 cases:
| [Daqui | auant]ScStt | [qualque | cosa | juren]LD | deuen-ho |
| from;here | onwards | whatever | thing | swear.3pl | must.3pl=it |
| escondir | per | batayla | |||
| defend.inf | by | battle | |||
| ‘Henceforth, whatever they swear, they must defend it in combat’. | |||||
| Usatges, Fol. 5v | |||||
| [Si | alcuna | cosa | faran | mal | a | aquels] ScStt | [lo | princep]LD |
| if | some | thing | do.3pl | wrong | to | those | the | prince |
| fasza | esmena | per | miya | plagua | ||||
| do.3sg.sbjv | amend | by | half | wound | ||||
| ‘and if they do something wrong to them, the prince shall amend it with half a wound’ | ||||||||
| Usatges, Fol. 5v | ||||||||
Given that cases of inversion like (25–26) provided unequivocal evidence of V-to-C movement, and that the verb can be preceded by more than one constituent, as in (31–32), it is safe to assume that like Early Old French (until the 13th century, following Wolfe 2020: Ch. 4) and Old Occitan (Donaldson 2016; Wolfe 2018b, 2020) in EOC main clauses, the verb raises to the low left periphery.
Turning now to V1 clauses: they occur in two contexts: (i) at the beginning of a discourse unit, as in (34); (ii) after the coordinating conjunctions e, mas, sinó, o, and ne, all with subject continuity with the preceding clause, as in (35) and (36):
| Manaren | los | damondits | princeps | que | tuit | |
| ordered.3pl | the | aforementioned | princes | that | all | |
| se | garden | de | perjuries | |||
| themselves= | keep.3pl.sbjv | of | perjuries | |||
| ‘The aforementioned princes ordered that all should refrain from committing perjury’. | ||||||
| Usatges, Fol. 8v | ||||||
| Si | negu | forza | puncela, | ho | la | prena | per | muyler, | ||
| if | anyone | forces | maiden | or | her= | take.3sg.sbjv | as | wife | ||
| si | ela | et | sos | pares | ho | uolen | et | li | donen | exouar, |
| if | she | and | her | parents | it= | want | and | to;her= | give.3pl | dowry |
| hó | li | do | marit | de | sa | ualor | ||||
| or | to;her | gives | husband | of | her | value | ||||
| ‘If anyone forced a maiden, either he must take her as his wife, if she and her parents agree and they give her a dowry, or he shall give her a husband of her value’. | ||||||||||
| Usatges, Fol. 10v | ||||||||||
| Sil | senyor | uol | que | lom | seu | li | fassa | maior | |
| if;the | lord | wants | that | the;man | his | to;him | =does.sbjv | greater | |
| seruici | que | no | deu, | lo | senyor | deu | li | crexer | |
| service | than | not | has | the | lord | must.3sg | to;him | increase.inf | |
| lo | benifet, | e | si | no | ho fa | no li | es | obligat | |
| the | benefit | and | if | not | it=does | not to;him | is | obliged |
| sino | del | seruici | que | li | es | tengut | e | seruesca | a son |
| but | of;the | service | that | to;him | is | held | and | serve.3sg.sbjv | to his |
| senyor | axi | com | hom | deu | seruir | senyor | |||
| lord | thus | as | one | must.3sg | serve.inf | lord | |||
| ‘If a lord wants a man of his to make him a greater service than he should, his lord must increase his benefits, and if he does not, he should not but do the service that he committed to and he must serve his lord as one should serve a lord’. | |||||||||
| Usatges, Fol. 4r | |||||||||
V1 clauses fit a low V2 analysis of Old Catalan if we assume, for verb initial clauses at the beginning of a discourse unit, the existence of a null narrative operator (Wolfe 2015a, 2015b: 157), and a Null Topic for V1 preceded by a coordinating conjunction.[12]
As expected for a low V2 Romance language, embedded clauses exhibit a clear preference for preverbal subjects, which precede the verb in 75 % of clauses. Cases of inversion are scarce, and exclusively occur in adverbial clauses (Table 5).
The preverbal field of embedded V2 clauses in the Usatges.
| Frequency | % | XVS/non pro-drop | |
|---|---|---|---|
| Subject | 64 | 75 % | – |
| Object | 8 | 9.5 % | 1/1 |
| Adjunct PPs | 6 | 7.1 % | 1/1 |
| Adverbs | 4 | 4.8 % | 1/1 |
| Subject complement | 3 | 3.6 | 1/1 |
| Total | 85 | 100 % | 4/4 |
This stark asymmetry between main and embedded clauses is often explained by the lack of verb movement to the left periphery in central adverbial clauses, which constitute the bulk of embedded clauses contained in the database (see the protases contained in (35) and (36) above, for instance). Since the verb remains in IP, subjects canonically located in SpecIP precede it.
Cases of inversion, however, suggest that V-to-C is possible, at least, in peripheral adverbial clauses, as in (37):
| cor | [en | les | ligs | godes] | troba | hom | homenatge. |
| since | in | the | laws | Gothic | finds | one | homage |
| ‘since the Gothic laws contain ‘homage’’. | |||||||
| Usatges, Fol. 8v | |||||||
In (37), the adjunt PP en les ligs godes ‘in the Gothic laws’ is fronted to the left periphery, preceding the verb, which in turn, precedes the subject, hom ‘one’, followed by the direct object, homenatge ‘homage’. This adverbial clause contains an illocutionary act that is independent from that of the main clause (the fact that the legal act of homage is listed in Gothic laws), as expected for a peripheral adverbial clause. As discussed above, peripheral adverbial clauses can exhibit main clause phenomena, amongst which, V2.
As shown in Table 2, half of embedded clauses are verb initial. Only 4/84 clauses have overt postverbal subjects. Of the four cases with overt postverbal subjects, three present predicates that are prone to VS orders (subjects in situ with the verb in IP in Modern Romance), such as passive constructions:
| si·l | tenian | assetyat | ses | enamics |
| if;him= | had.3pl | besieged | his | enemies |
| ‘and if his enemies held him under siege, (…)’ | ||||
| Usatges, Fol. 8v | ||||
| e | axi | com | li | es | dat | fer | justícia, | axi;li | es | |
| and | thus | as | to.him= | is | given | do.inf | justice | thus;to.him= | is | |
| dat | acui | se uuyla. | ||||||||
| given | to.whomever | rfl.3sg=want.3sg.sbjv | ||||||||
| ‘and as it is granted to him to carry out justice, so it is granted to him to pardon whomever’. | ||||||||||
| Usatges, Fol. 9v | ||||||||||
In (38), the subject occurs postverbally, as expected for this type of construction. Similarly, in (39), the subject of the verb donar ‘to give, grant’, in the passive form, follows the verb. In this case, the subject is an infinitival clause. Therefore, VS in embedded clauses with V-to-T correlates with the same triggers as it does in Modern Romance, contrasting with VS in main clauses, where it correlates with V2 Germanic-inversion contexts, in which the verb raises to the left periphery.
From our overview of word order in embedded clauses, we can establish that while V-to-T is the dominant word order pattern in embedded clauses, explaining the prevalence of V1 and SVO, V-to-C is also attested (see (38) and (39) above), depending on the central/peripheral status of the clause, that postverbal subjects obey the distributional rules expected for SVO languages.
Before concluding our discussion of the Usatges, we need to consider the distribution of two elements: si and e. Si has already been introduced in Section 3, where we saw it behave like an expletive within a low V2 grammar. We have not found any examples of si being used in the same way in the Usatges. Nevertheless, we have found two sentences where e’s acts as an expletive-like element, as it did in Old Italian (Poletto 2005) and not as a coordinating conjunction. Consider (40) and (41):
| E | [si | puys | altra | uegada | sen | periura] | e | |
| and | if | then | other | time | rfl.3sg;of.it= | perjures | e | |
| deu | perdre | la | ma | o | deu | reembre | ·c· | sols |
| must.3sg | lose | the | hand | or | must.3sg | redeem.inf | 100 | sols |
| o | deu | perdre | la | quarta | part | de | tot | quant |
| or | must.3pl | lose.inf | the | fourth | part | of | all | as much |
| aia, | o | (…) | ||||||
| have.3sg.sbjv | or | |||||||
| ‘And if, after this, he perjures of this again, he must lose his hand or must surrender 100 sols or must lose a fourth of all he might own, or (…)’. | ||||||||
| Usatges, Fol. 6v | ||||||||
| [Cor | aquel | qui | acusa | lo | juhij | de | la | cort | falsa |
| since | that | who | accuses | the | judge | of | the | court | falsifies |
| la | cort | e | dapna | lo | princep] | e | sia | ||
| the | court | and | dammages | the | prince | e | be.3sg.sbjv | ||
| punit | e | sia | dapnat | per tots temps | el | e | |||
| punished | and | be.3sg.sbjv | dammaged | for ever | he | and | |||
| la | sua | liyada. | |||||||
| the | his | lineage | |||||||
| ‘Since he who accuses the judge from the court, desauthorises the court and harms de prince, let him and his lineage be forever punished’. | |||||||||
| Usatges, Fol. 9v | |||||||||
In both (40) and (41), e occurs between a peripheral embedded clause, in (40) a protasis, and in (41) a reason clause, and the verb. Poletto (2005) finds similar cases of e in Old Italian:
| e | quando | avea | forbiti | i | piedi | ed | elli | ||
| and | when | had.3sg | cleaned | the | feet | and | he |
| tornava | fuori | e | rinfangavalisi | vie più | e | tornava | |||
| returned.3sg | outside | and | remudded.3sg | more and more | and | returned.3sg | |||
| a | ricalpitare | il | letto. | E | partisi | e | disse | a | Platone: (…) |
| to | step on.inf | the | bed | and | left.3sg | and | said.3sg | to | Plato |
| ‘and when he had cleaned his feet, went back outside, put mud on them, came back inside and went up onto bed. He left and told Plato: (…) | |||||||||
| FF, 124, apud Poletto (2005: 227, ex. 40) | |||||||||
In her analysis, Poletto proposes that in Old Italian e behaves like a topic continuity expletive occurring between the core of the clause and a scene setter. The fact that e is followed by enclisis further supports this idea. However, the two examples contained in the Usatges database do not contain clitic pronouns, and is, therefore, impossible to establish whether it triggers proclisis, like si, or enclisis (as expected for a topical element). Nevertheless, its occurrence in two pro-drop clauses that pick up on a referent active in the wider discourse and the parallelisms with Old Italian suggest that this might be a possible analysis for a particle that has not been noted in the literature.
Summing up, data from the Usatges displays an asymmetric low V2 grammar with an unspecified preverbal field. The evidence presented covered inversion patterns, clitic placement, verb-initial clauses, and main/embedded clause asymmetries. Germanic-style inversion is found when a constituent other than the subject saturates SpecFinP. Clitic placement does not abide by the Tobler–Mussafia law in the classic sense, and is not only connected to the overt saturation of the preverbal field (enclisis is systematically borne in verb initial clauses), but to the informational value of the preverbal constituent.[13] Verb initial clauses show the distribution pattern expected for low V2, either being discourse initial or showing topic continuity. There is a sharp asymmetry between the word order pattern of main and embedded clauses, the latter only exhibiting V2 when peripheral. Otherwise, they present V-to-T, with a tendency to display preverbal subjects, postverbal subjects only occurring in contexts that favour them in SVO languages. It has further been suggested that the Usatges present a topic continuity marker, e.
7 Early Old Catalan in the early Romance context and beyond
Sections 3–6 have offered an overview of the syntax of EOC from the 10th century to the 11th. Borrell’s diploma, dating of 933, contains features that correspond to an asymmetric low V2 grammar with an unspecified preverbal field. While Bonsom’s texts do not offer the same clear picture, they provide data compatible with this analysis and with analyses provided for contemporary Romance varieties. Finally, the Usatges provide us with conclusive evidence of the passage of Old Catalan through a low V2 phase that seems operative throughout the period under study. But how does V2 emerge in Old Catalan, and how does this help us understand the evolution of the Romance languages syntactically as a continuum?
Ledgeway (2017) analyses the Peregrinatio Egeriae, a 4th century text written in Latin, as containing signs of an incipient V2 grammar: there is systematic V-to-C yielding VSO strings, the first step of V2 as defined by Holmberg (2015: 375). The second step associated to an active V2 grammar, that is, the movement of a constituent to the left of the verb, is, at this stage, optional. This is illustrated in (43):
| a. [CP Ø] episcopus perleget omnem ipsam allocutionem ⇒ | |
| b. [CP [C’
perleget
]] episcopus |
|
| c. [SpecCP
omnem ipsam allocutionem
[C’ perleget]] episcopus |
(original sentence) |
| Pereg. 33.2 | |
| Adapted from Ledgeway (2017: 170, example 4) |
If a V2 grammar had partially developed by the 4th century, the main change from the scenario sketched by Ledgeway to the EOC data presented here is the consolidation of the second step of V2 as a requirement of all unmarked main declarative clauses, which would have occurred during a period in which incipient Early Romance would have still exhibited syntactic continuity. Ledgeway explains the predominance of SVO in embedded clauses by suggesting that SpecIP became a subject position before V-to-C was generalised, thus rendering embedded clauses archaising with respect to main clauses.
Other authors have proposed alternative analyses for Late Latin syntax, sketching alternative hypotheses for the changes attested from Latin SOV to Medieval Romance word order. Klævik-Pettersen (2019) argues that Late Latin VSO strings are instances of V-to-I, with the subject in situ and an emergent SpecIP as a canonical subject position, accounting for SVO oscillations. According to this author, the emergence of V2 in Old French is connected to Germanic influence and syntactic transfer at acquisition level before the end of the Merovingian period. This hypothesis cannot be transferred to the case of Old Catalan, given the stark differences in the sociohistorical context and their impact on the language landscape, including the arrival of Visigoths upon the fall of the Roman Empire, who took control of the administrative government of the area, but represented only up to 10 % of the population (Collins 2004), followed by the arrival of the Arabs in the 8th century. Given that we see low V2 emerge in both Old French and Old Catalan, for the Germanic hypothesis to be applicable to both languages, both Frankish and the language and the Visigoths would have to have had a full active V2 grammar, and the proportion of Germanic population and linguistic pressure to ensure the kind of bilingualism that would grant syntactic transfer would have had to been greater than that of 10 % of the population, broadly corresponding to a ruling class that adopted Latin as their administrative language. Therefore, this hypothesis is discarded for Old Catalan. Here, V2 is assumed to have emerged within a Romance syntactic continuum pre-9th century. Further evidence for this perspective comes from the word order pattern of Old Sardinian, a syntactically archaising language within Medieval Romance. Wolfe (2015b) shows that Old Sardinian presented V-to-C, the first step associated with V2, without an EPP feature associated to the projection targeted by the verb.
By the 11th century, Old Catalan, Old Occitan, Old French, and Old Italo-Romance did not only boast low V2 systems,[14] but they did also have an expletive use of si, as shown in examples (7) and (8). Si is not present in all Romance varieties: it surfaces, west to east, in Catalan, Occitan, French, and Italo-Romance.[15] This suggests that the Medieval Romance languages evolve jointly until the consolidation of the V2 grammar, as it is found in across all of them, but the grammaticalisation of si in these three languages and Italo-Romance suggests a syntactic split between Ibero-Romance and Gallo- and Italo-Romance.[16] The analysis presented here, therefore, presents evidence for the inclusion of Old Catalan in the Gallo-Romance continuum from a syntactic perspective during the period under study, and for viewing Occitan as a central variety in this dialectal linkage.
By the 13th century, Old Catalan’s low V2 system exhibits the first signs of weakening (Pujol i Campeny 2025a, in press), Old French’s low V2 gets reanalysed as a high or strict V2 system (Wolfe 2018a), while Old Occitan retains a low V2 grammar (Vance et al. 2010; Wolfe 2017, 2018a), and Old Italo-Romance varieties follow different, individual paths (Ledgeway 2008; Poletto 2014; Singh 2021; Matticchio 2024). At the same time, Old Spanish, like Old French, developed into a strict or high V2 system by the 14th century (Wolfe 2015a, 2015b: Ch. 6). Therefore, the 13th century appears as a key turning point in the evolution of this syntactic continuum, where each of the three languages takes different paths at sentential word order level. It is possible that from the geographical centre of the Romance linguistic continuum, Catalan acted as an innovative pole, being the first of the recorded Medieval Romance varieties to lose V2 after having presented a low V2 system akin to that of Old Occitan and Old French, including an expletive-like element in the earliest documents, that is, nonetheless lost by the 11th century.
The analysis presented here makes the most out of the little bad data that has reached us for the period under study: by exploring the seemingly poor syntax of legal texts written in Latin, and comparing them amongst themselves, we have been able to positon Catalan within the wider Early Romance picture, opening up new questions regarding the evolution of this linguistic continuum.
8 Conclusions and further research avenues
In this article, we have explored the syntax of four texts from the historical region of Catalonia, dating from the 10th to the 11th century: three in Latin, and one written in Early Old Catalan. It has been shown that the seemingly broken syntax of Borrell’s 10th century diploma hides several telltale signs associated with the syntax of Medieval Romance, and specifically, that of Gallo-Romance: a low V2 grammar with si, an expletive like elements used as last resort strategies in order to satisfy the second requirement of an active V2 grammar, and Romance clitic placement. Bonsom’s 10th and 11th century texts do not present si, but and display a grammar with a strong theme/rheme division between the preverbal and the postverbal space, but they contain cases of inversion that are consistent with a Romance low V2 system. The presence of an active V2 grammar is further confirmed by the Usatges de Barcelona, compiled in the 11th. The Usatges present all the features of a low V2 system, where the verb systematically raises to FinP in unmarked declarative clauses, the preverbal field is unspecified, allowing for the occurrence of informationally new constituents preverbally, with canonical Germanic-style inversion, and expletive use of the topic continuity marker e.
Thus, the evidence presented here clearly demonstrates that up to the 12th century Old Catalan patterned with its neighbouring varieties in exhibiting a low V2 grammar, suggesting that up to this point in time, central Romance varieties, understood as the Gallo-Romance continuum, including Old Catalan and at least some Italo-Romance varieties, formed a syntactic continuum, excluding Old Spanish and Old Portuguese, and potentially Old Romanian (which is not documented for this period).
This highlights the need to study the syntax of Early Romance holistically in order to get better insights on the nature of their syntactic fragmentation. It also shows the importance of adding Old Catalan into comparative Romance studies. Without this often overlooked variety, our understanding of the evolution of this language family remains incomplete.
Funding source: British Academy
Award Identifier / Grant number: PFOS20\200664
Acknowledgments
This work has been supported by the British Academy [grant number: PFOS20\200664].
-
Research ethics: Not applicable.
-
Informed consent: Not applicable.
-
Author contributions: Not applicable.
-
Use of Large Language Models, AI and Machine Learning Tools: Not applicable.
-
Conflict of interest: Not applicable.
-
Research funding: This work has been supported by the British Academy [grant number: PFOS20\200664].
-
Data availability: Publicly available.
References
Alturo i Perucho, Jesús. 2003. Les glosses i el glossari del manuscrit Z.II.2 de El Escorial. In Jesús Alturo i Perucho (ed.), Liber Iudicum Popularis, ordenat pel jutge Bonsom de Barcelona, 256–270. Barcelona: Generalitat de Catalunya. Departament de Justícia i Interior.Search in Google Scholar
Alturo i Perucho, Jesús & Tània Alaix. 2021. El canonge Adanagell de Vic (ca. 860-925), llavor de nova semences (Textos i Estudis de Cultura Catalana). Barcelona: Publicacions de l’Abadia de Montserrat.Search in Google Scholar
Alturo i Perucho, ,Jesús & Tània Alaix. 2022. Un text escrit per Bonsom l’any 998, en nova edició i traduït al català. Caplletra. Revista Internacional de Filologia (72). 15–36.10.7203/caplletra.72.22958Search in Google Scholar
Alturo i Perucho, Jesús & Tània Alaix. 2023. Lletres que parlen: Viatge als orígens del català. Barcelona: La Magrana.Search in Google Scholar
Ayres‐Bennett, Wendy. 2005. A history of the French language through texts. A history of the French language through texts. London: Routledge.10.4324/9780203986738Search in Google Scholar
Badia i Margarit, Antoni M. 1951. Gramática histórica catalana. Gramática histórica catalana. Barcelona: Noguer.Search in Google Scholar
Bastardas, Joan. 1984. Usatges de Barcelona: el codi a mitjan segle XII: establiment del text llatí i edició de la versió catalana del manuscript del segle XIII de l’Arxiu de la Corona d’Aragó de Barcelona. Barcelona: Fundació Noguera.Search in Google Scholar
Batllori, Montserrat. 1993. Participle preposing and other related phenomena in Old Spanish and Old Catalan. Girona: University of Girona.Search in Google Scholar
Batllori, Montserrat & Maria Lluïsa Hernanz. 2009. Foco oracional y polaridad. Entorno a una asimetría entre el español y el catalán. Santiago de Compostela: (Presentation handout presented at the VIII Congreso Internacional de Historia de la Lengua Española, Santiago de Compostela).Search in Google Scholar
Batllori, Montserrat & Maria Lluïsa Hernanz. 2015. Weak focus and polarity: Asymmetries between Catalan and Spanish. In Syntax over time: lexical, morphological and information structural interactions, 280–298. Oxford: Oxford University Press.10.1093/acprof:oso/9780199687923.003.0017Search in Google Scholar
Batllori, Montserrat, Narcís Iglésias & Ana Maria Martins. 2005. Sintaxi dels clítics pronominals en català medieval. Caplletra. Revista Internacional de Filologia (38). 137–177. https://doi.org/10.7203/caplletra.38.4877.Search in Google Scholar
Bellés i Sallent, Joan & Jesús Alturo i Perucho (eds.). 2008. Llibre dels judicis: traducció catalana moderna del “Liber iudiciorum” (Textos jurídics catalans Lleis i costums. 1,2 24). 1st edn. Barcelona: Parlament de Catalunya.Search in Google Scholar
Belletti, Adriana. 2006. Aspects of the low IP area. In Luigi Rizzi (ed.), The Structure of CP and IP: The cartography of syntactic structures, vol. 2, 16–51. New York, Oxford: Oxford University Press.10.1093/oso/9780195159486.003.0002Search in Google Scholar
Benincà, Paola. 2004. The left periphery of Medieval Romance. Studi Linguistici e Filologici Online 2(2). 243–297.Search in Google Scholar
Benincà, Paola. 2006. A detailed map of the left periphery of Medieval Romance. In Crosslinguistic research in Syntax and semantics: Negation, tense and clausal architecture, 53–86. Washington: Georgetown University Press.Search in Google Scholar
Benincà, Paola & Cecilia Poletto. 2004. Topic, focus, and V2. In Luigi Rizzi (ed.), The structure of CP and IP, vol. 1, 52–75. Oxford, New York: Oxford University Press.Search in Google Scholar
Biber, Douglas & Susan Conrad. 2009. Register, genre, and style (Cambridge textbooks in linguistics). Cambridge: Cambridge University Press.10.1017/CBO9780511814358Search in Google Scholar
Boutière, Jean. 1964. Biographies des troubadours : textes provençaux des XIIIe et XIVe siècles (Classiques d’oc ; 1). Biographies des troubadours : textes provençaux des XIIIe et XIVe siècles (Classiques d’oc ; 1). Édition refondue / augmentée d’une traduction française, d’un appendice, d’un lexique, d’un glossaire et d’un index des termes concernant le “trobar,” par Jean Boutière ... avec la collaboration d’I.-M. Cluzel. Paris: Nizet.Search in Google Scholar
Cinque, Guglielmo. 1990. Types of A’ Dependencies. Cambridge, MA: MIT Press.Search in Google Scholar
Cinque, Guglielmo. 1999. Adverbs and functional heads: A cross-linguistic perspective. New York: Oxford University Press.10.1093/oso/9780195115260.001.0001Search in Google Scholar
Collins, Roger. 2004. Visigothic Spain 409-711 (A History of Spain). New York: Wiley.10.1002/9780470754610Search in Google Scholar
Corr, Alice. 2016. Wide-focus subject-verb inversion in Ibero-Romance: A locative account. Glossa: A Journal of general linguistics. Open Library of Humanities 1(1). https://doi.org/10.5334/gjgl.85.Search in Google Scholar
Cruschina, Silvio & Craig Sailor. 2022. What is “residual verb second”? And what does Romance have to do with it? Isogloss. Open Journal of Romance Linguistics 8(3). 1–28. https://doi.org/10.5565/rev/isogloss.235.Search in Google Scholar
Culpeper, Jonathan & Merja Kytö. 2010. Early modern English dialogues: Spoken interaction as writing. Cambridge: Cambridge University Press.Search in Google Scholar
De Cat, Cécile. 2009. French Dislocation: Interpretation, Syntax, Acquisition. New York: Oxford University Press.Search in Google Scholar
Donaldson, Bryan. 2016. Preverbal subjects, information structure, and object clitic position in Old Occitan. Journal of Linguistics 52(1). 37–69. https://doi.org/10.1017/S0022226714000619.Search in Google Scholar
Donaldson, Bryan. 2020. Clitic position in Old Occitan affirmative verb-first declaratives coordinated by e: A variationist analysis. Journal of Historical Linguistics 10(3). 389–426. https://doi.org/10.1075/jhl.19002.don.Search in Google Scholar
Elspaß, Stephan, Nils Langer, Joachim Scharloth & Wim Vandenbussche (eds.). 2007. Germanic language histories “from below” (1700-2000). Berlin, New York: De Gruyter.10.1515/9783110925463Search in Google Scholar
Fischer, Susann & Xavier, Villalba. In press. L’oració simple: propietats bàsiques, ordre i elisió. Gramàtica històrica de la llengua catalana.Search in Google Scholar
François, Alexandre. 2014. Trees, waves and linkages: Models of language diversification. In Claire Bowern & Bethwyn Evans (eds.), The Routledge handbook of historical linguistics, 161–189. London: Routledge.Search in Google Scholar
Frascarelli, Mara & Roland Hinterhölzl. 2007. Types of topics in German and Italian. In Kerstin Schwabe & Susanne Winkler (eds.), On information structure, meaning and form: Generalizations across languages (linguistik aktuell/linguistics today 100), 87–116. Amsterdam: John Benjamins Publishing Company.10.1075/la.100.07fraSearch in Google Scholar
Haegeman, Liliane. 2004. The syntax of adverbial clauses and its consequences for topicalisation. In Martine Coene, Gretel De Cuyper & Yves D’Hulst (eds.), Current studies in comparative romance linguistics (APiL 107), 61–90. Belgium: University of Antwerp.Search in Google Scholar
Haegeman, Liliane. 2012. Adverbial clauses, main clause phenomena, and composition of the left periphery: The Cartography of syntactic structures, volume 8 (Oxford studies in comparative Syntax). Adverbial clauses, main clause phenomena, and composition of the left periphery (Oxford studies in comparative Syntax). United Kingdom: Oxford University Press.10.1093/acprof:oso/9780199858774.001.0001Search in Google Scholar
Holmberg, Anders. 2015. Verb second, in kiss, Tibor & Artemis Alexiadou Syntax: Theory and analysis. An international handbook, 342–383. Berlin: De Gruyter.10.1515/9783110377408.342Search in Google Scholar
Hooper, Paul & Sandra A. Thompson. 1973. On the applicability of root transformations. Linguistic Inquiry 4. 465–497.Search in Google Scholar
Hulk, Aafke & Jean-Yves Pollock. 2001. Subject inversion in Romance and the theory of universal grammar (Oxford studies in comparative Syntax). Oxford, England: Oxford University Press.10.1093/oso/9780195142693.001.0001Search in Google Scholar
Ingham, Richard. 2016. Investigating language change using Anglo-Norman spoken and written register data. Linguistics 54(2). 381–410.10.1515/ling-2016-0004Search in Google Scholar
Klævik-Pettersen, Espen. 2019. Inversion, V-to-C, and verb-second: An investigation into the syntax and word order of old French and late Latin. University of Oslo.Search in Google Scholar
Kroon, Caroline. 1995. Discourse particles in Latin: A study of nam, enim, autem, vero and at (Amsterdam studies in classical philology), vol. 4. Leiden: Brill. https://brill.com/view/title/13504.10.1163/9789004408999Search in Google Scholar
Kytö, Merja. 2016. Register in historical linguistics. Register Studies 1(1). 136–167.10.1075/rs.18011.kytSearch in Google Scholar
Lafont, Robert. 1998. La Chanson de Sainte Foi: texte occitan du XIe siècle. Edité, traduit, présenté et annoté par Robert Lafont (Textes Littéraires Français 490). (Ed. Droz, Genève).Search in Google Scholar
Ledgeway, Adam. 2008. Satisfying V2 in early Romance: Merge vs. Move. Journal of Linguistics 44(2). 437–470. https://doi.org/10.1017/S0022226708005173.Search in Google Scholar
Ledgeway, Adam. 2010. Syntactic and morphosyntactic typology and change. In Martin Maiden, John Charles Smith & Adam Ledgeway (eds.), The Cambridge history of the Romance languages, 1st edn. 382–471. Cambridge: Cambridge University Press.10.1017/CHOL9780521800723.010Search in Google Scholar
Ledgeway, Adam. 2017. Late Latin verb second: The sentential word order of the itinerarium Egeriae. Catalan Journal of Linguistics 16. 163–216. https://doi.org/10.5565/rev/catjl.212.Search in Google Scholar
Ledgeway, Adam. 2020. The north-south divide: Parameters of variation in the clausal domain. L’Italia Dialettale 81. 29–77. https://doi.org/10.17863/CAM.45096.Search in Google Scholar
Leonetti, Manuel & M. Victoria Escandell-Vidal. 2009. La expresión del Verum Focus en español. Español Actual 92. 11–46.Search in Google Scholar
Lewis, Charlton Thomas & Charles Short. 1969. A Latin dictionary: Founded on Andrews’ edition of Freund’s Latin dictionary. A Latin dictionary: Founded on Andrews’ edition of Freund’s Latin dictionary. Oxford: Clarendon Press.Search in Google Scholar
Lightfoot, David. 1999. The Development of language. Acquisition, change, and evolution. Oxford: Blackwell Publishers Ltd.Search in Google Scholar
Martí i Castell, Joan. 2002. Estudi lingüístic dels Usatges de Barcelona. El codi a mitjan segle XII (Textos i Estudis de Cultura Catalana). Barcelona: Curial Edicions Catalanes/Publicacions de l’Abadia de Montserrat.Search in Google Scholar
Matticchio, Andrea. 2024. V1 clauses and EPP in Old Italian. Isogloss. Open Journal of Romance Linguistics 10(3). 1–22. https://doi.org/10.5565/rev/isogloss.341.Search in Google Scholar
Par, Anfós. 1923. Sintaxi catalana segons los escrits en prosa de Bernat Metge (Die Bethefte Zur Zeitachrift Für Roman!Ache Philologie). Halle (Saale): Max Niemeyer Verlag.10.1515/9783112324929Search in Google Scholar
Pinto, Manuela. 1997. Licensing and interpretation of inverted subjects in Italian. (PhD Thesis). University of Utrecht.Search in Google Scholar
Poletto, Cecilia. 2005. Sì and e as Expletives in Old Italian. In Montserrat Batllori, Maria Lluïsa Hernanz, Carme Picallo & Francesc Roca (eds.), Grammaticalization and parametric variation, 206–235. Oxford: Oxford University Press.10.1093/acprof:oso/9780199272129.003.0013Search in Google Scholar
Poletto, Cecilia. 2006. Parallel phases: A study on the high and low left periphery of Old Italian. In Mara Frascarelli (ed.), Phases of interpretation, 261–294. Berlin, New York: De Gruyter Mouton.10.1515/9783110197723.4.261Search in Google Scholar
Poletto, Cecilia. 2014. Word order in old Italian (Oxford studies in diachronic and historical linguistics 7). Oxford, New York: Oxford University Press.Search in Google Scholar
Poletto, Cecilia. 2019. More than one way out: On the factors influencing the loss of V to C movement. Linguistic Variation. 19(1). 47–81. https://doi.org/10.1075/lv.16001.pol.Search in Google Scholar
Posner, Rebecca. 1997. Linguistic change in French. Linguistic change in French. Oxford: Clarendon Press.Search in Google Scholar
Pujol i Campeny, Afra. 2018. Word order in Old Catalan. PhD thesis. University of Cambridge.Search in Google Scholar
Pujol i Campeny, Afra. 2019a. E sí la hoïren tots: sí and emphatic positive polarity in Old Catalan. Isogloss – Open Journal of Romance Linguistics 5. 1–32. https://doi.org/10.5565/rev/isogloss.73.Search in Google Scholar
Pujol i Campeny, Afra. 2019b. V1 clauses in old Catalan. In Anne Breitbarth, Miriam Bouzouita, Melissa Farasyn & Lieven Danckaert (eds.), The Determinants of diachronic stability. Linguistik aktuell/linguistics today, vol. 254, 157–190. Amsterdam: John Benjamins Publishing Company.10.1075/la.254.07pujSearch in Google Scholar
Pujol i Campeny, Afra. 2023. The Emergence and Syntax of “sí que” in Catalan. Journal of Historical Syntax 7(6). 1–30. https://doi.org/10.18148/hs/2023.v7i6-19.153.Search in Google Scholar
Pujol i Campeny, Afra. 2025a. Fronting in Old Catalan: Narrative text and reported speech. Transactions of the Philological Society 123(1). 1–28. https://doi.org/10.1111/1467-968X.12299.Search in Google Scholar
Pujol i Campeny, Afra. 2025b. The syntactic glosses of the Liber Iudicum Popularis: A window into the syntax of early romance? In Giovanibattista Galdi, Simon Aerts & Alessandro Papini (eds.), Varietate delectamur: Multifarious approaches to synchronic and diachronic variation in Latin. Selected papers from the 14th international colloquium on Late and Vulgar Latin (Ghent, 2022), 639–653. Turnhout: Brepols Publishers.10.1484/M.LVLT-EB.5.143331Search in Google Scholar
Pujol i Campeny, Afra. 2025c. Si in Old Catalan and Old Occitan. Beyond verb position and polarity. Revue Romane: Online First. https://doi.org/10.1075/rro.25010.puj.Search in Google Scholar
Pujol i Campeny, Afra. In press a. Syntactic variation, linguistic register and literary genre in Old Catalan. To appear in the peer-reviewed volume by Pierre Larrivée & Francesco, Pinzin (eds.), Syntactic change through text types. Publisher: De Gruyter Mouton.Search in Google Scholar
Rainsford, Thomas. 2024. Proclisis and enclisis in early Gallo-Romance: Evidence from sandhi phenomena. In Janice Carruthers, Mairi McLaughlin & Olivia Walsh (eds.), Historical and sociolinguistic approaches to French. Oxford: Oxford University Press.10.1093/oso/9780192894366.003.0002Search in Google Scholar
Reenen, Pieter van & Lene Schløsler. 2000. The pragmatic functions of the Old French particles ainz, apres, donc, lors, or, puis and si. In Susan C. Herring (ed.), Textual parameters in older languages. Amsterdam: John Benjamins Publishing Company.Search in Google Scholar
Riquer, Martí de. 1964. Història de la literatura catalana. Barcelona: Ariel.Search in Google Scholar
Rizzi, Luigi. 1997. The fine structure of the left periphery. In Adriana Belletti (ed.), Structures and beyond. The cartography of syntactic structures, vol. 3, 223–251. Oxford: Oxford University Press.Search in Google Scholar
Roberts, Ian. 1993. Verbs and diachronic syntax. A comparative History of English and French. Dordrecht: Kluwer Academic Publishers.10.1007/978-94-011-2910-7Search in Google Scholar
Ross, Malcolm D. 1988. Proto oceanic and the Austronesian languages of Western Melanesia. Canberra: Pacific Linguistics.Search in Google Scholar
Russel-Gebbett, Paul Stanley Nigel. 1965. Medieval Catalan texts. Oxford: Dolphin Book Co Ltd.Search in Google Scholar
Sailor, Craig. 2020. Rethinking ‘residual’ verb second. In Rebecca Woods & Sam Wolfe (eds.), Rethinking verb second, 126–149. Oxford: Oxford University Press.10.1093/oso/9780198844303.003.0006Search in Google Scholar
Salvi, Giampaolo. 2002. Il problema di <si> e l’uso riflessivo di essere. Verbum Analecta Neolatina IV(2). 377–398.10.1556/Verb.4.2002.2.10Search in Google Scholar
Sheehan, Michelle. 2010. Free’ inversion in Romance and the null subject parameter. In Theresa Biberauer, Anders Holmberg & Ian Roberts (eds.), Parametric variation: Null subjects in minimalist theory, 231–262. Cambridge: Cambridge University Press.10.1017/CBO9780511770784.007Search in Google Scholar
Singh, Onkar. 2021. Verb second in old Venetian. Isogloss. Open Journal of Romance Linguistics 7. https://doi.org/10.5565/rev/isogloss.110.Search in Google Scholar
Sluckin, Benjamin L. 2025. Revisiting the syntax and development of Kiezdeutsch V3: A new perspective. Journal of Germanic Linguistics. 1–61. https://doi.org/10.1017/S1470542724000114.Search in Google Scholar
Torruella, Joan, Manuel Pérez Saldanya & Josep Martines. 2009. Corpus Informatitzat del Català Antic. Corpus Informatitzat del Català Antic. http://cica.cat/ (accessed 11 October 2020).Search in Google Scholar
Tortora, Cristina. 2001. Evidence for a null locative in Italian. In Guglielmo Cinque & Giampaolo Salvi (eds.), Current studies in Italian Syntax: Essays offered to Lorenzo Renzi, 313–326. Amsterdam: Elsevier.10.1163/9780585473949_018Search in Google Scholar
Troberg, Michelle & John Whitman. 2022. Syntactic glosses and historical syntax. In Laura Grestenberger, Charles Reiss, Hannes A. Fellner & Gabriel Z. Pantillon (eds.), Linguistic studies in honor of Mark R. Hale, 369–394. Weisbaden: Reichert Verlag.Search in Google Scholar
Vance, Barbara, Bryan Donaldson & Britanny Devan Steiner. 2010. V2 loss in Old French and Old Occitan: The role of fronted clauses. In Sonia Colina, Antxon Olarrea & Ana Maria Carvalho (eds.), Romance linguistics 2009: Selected papers from the 39th linguistic symposium on Romance languages (LSRL), Tucson, Arizona, march 2009 (current issues in linguistic theory 315), 301–320. Amsterdam: John Benjamins Publishing Company.10.1075/cilt.315.18vanSearch in Google Scholar
Villalba, Xavier & Laia Mayol. 2013. Right-dislocation in Catalan: Tails, polarity and activation. International Review of Pragmatics 5(1). 87–117. https://doi.org/10.1163/18773109-13050104.Search in Google Scholar
Wanner, Dieter. 1987. From Latin to Old Romance. Berlin, Boston: De Gruyter Mouton.Search in Google Scholar
Weinreich, Uriel, William Labov & Marvin Herzog. 1968. Empirical foundations for a theory of language change. In Winfred Lehmann & Yakov Malkiel (eds.), Directions for historical linguistics, 95–189. Austin: University of Texas Press.Search in Google Scholar
Westergaard, Marit. 2009. Word Order in Old and Middle English: The role of information structure and first language acquisition. Diachronica 26(1). 65–102. https://doi.org/10.1075/dia.26.1.03wes.Search in Google Scholar
Wolfe, Sam. 2015a. Microvariation in medieval Romance syntax: A comparative study. (PhD Thesis). Cambridge: University of Cambridge.Search in Google Scholar
Wolfe, Sam. 2015b. The Old Sardinian Condaghes: A syntactic study. Transactions of the Philological Society 113(2). 177–205. https://doi.org/10.1111/1467-968X.12046.Search in Google Scholar
Wolfe, Sam. 2017. Syntactic variation in two sister languages: A study of word order in old French and old Occitan. In Rodica Zafiu, Gabriela Dindelegan, Adina Dragomirescu & Alexandru Nicolae (eds.), Romance Syntax. Comparative and diachronic perspectives, 53–85. Newcastle: Cambridge Scholars Press.Search in Google Scholar
Wolfe, Sam. 2018a. Verb second in medieval romance. Oxford: Oxford University Press.10.1093/oso/9780198804673.001.0001Search in Google Scholar
Wolfe, Sam. 2018b. Probing the Syntax of a Problematic Particle: Old French ‘si’ Revisited. Transactions of the Philological Society 116(3). 332–362. https://doi.org/10.1111/1467-968X.12123.Search in Google Scholar
Wolfe, Sam. 2020. Reconsidering variation and change in the Medieval French subject system. Glossa: A Journal of General Linguistics 5(1). 1–29. https://doi.org/10.5334/gjgl.942.Search in Google Scholar
Wolfe, Sam. 2021a. Syntactic change in French (Oxford studies in diachronic and historical Linguistics), 1st edn., vol. 47. Oxford: Oxford University Press.Search in Google Scholar
Wolfe, Sam. 2021b. Syntactic change in French. Oxford: Oxford University Press.10.1093/oso/9780198864318.001.0001Search in Google Scholar
Wolfe, S. 2022. Residual verb Second in French and romance. Isogloss. Open Journal of Romance Linguistics 8(3). 1–34. https://doi.org/10.5565/rev/isogloss.208.Search in Google Scholar
Yang, Charles. 2000. Knowledge and learning in natural language. (Doctoral Thesis). Cambridge, MA: Massachusetts Institute of Technology, Department of Linguistics and Philosophy.Search in Google Scholar
© 2025 the author(s), published by De Gruyter, Berlin/Boston
This work is licensed under the Creative Commons Attribution 4.0 International License.