Abstract
Middle voice systems are characterized by the distribution of a middle marker over two macro-classes of verbs: oppositional and non-oppositional verbs. In diachronic studies, it has been proposed that the historical link between the two groups is unidirectional, with middle marking spreading from oppositional to non-oppositional verbs. In this paper, we challenge this assumption and discuss two case studies, one from Anatolian and one from Paman languages that show the opposite development, that is, from non-oppositional to oppositional. In both cases, we argue, constructions that originally had a lexically determined distribution develop grammatical functions connected with valency reduction.
1 Introduction
Middle markers (henceforth MMs) constitute an atypical case of voice construction (in the sense of Zúñiga and Kittilä 2019), because their use is partly lexically and partly grammatically determined. As repeatedly pointed out in the typological literature (see e.g. Inglese 2022; Kemmer 1993; Klaiman 1991; Kulikov 2013), middle voice systems (MVSs) are characterized by a split distribution of MMs over two classes of verbs: those for which middle marking operates as a valency-changing strategy in opposition to a basic bivalent verb, e.g. Italian colpir- si ‘hit oneself’ from colpire ‘hit (something)’, and those that obligatorily carry middle marking, e.g. Italian accorger- si ‘realize’. These two classes go by the name of oppositional versus non-oppositional middles (or alternating vs. non-alternating in Klaiman 1991: 106).
From a historical perspective, the peculiar distribution of MMs has traditionally been explained as the result of a diachronic process whereby originally oppositional markers of valency change (typically reflexives) progressively broaden their functional domain and undergo a process of lexicalization whereby they end up being obligatorily associated with specific (classes of) verbs (see extensive discussion in Kemmer 1993: Chap. 5). An often-cited example is that of the development of the Latin reflexive pronoun se into the MM of Romance languages (as in the Italian example above). The cross-linguistic evidence in support of the oppositional > non-oppositional shift is such that this development is commonly held to be unidirectional (e.g. Haspelmath 2003; Kaufmann 2007).
In this paper, we challenge the unidirectionality of the oppositional > non-oppositional development and present two case studies that offer compelling evidence for the opposite development in the rise of MVSs, that is non-oppositional > oppositional. The first case study is from Hittite (Anatolian, Indo-European), which shows the inflectional middle voice typical of ancient Indo-European languages. While the distribution of the Hittite middle voice at first sight looks compatible with a reflexive (oppositional) origin, a quantitative analysis of middle verbs in a corpus of chronologically ordered Hittite texts shows that in Old Hittite verbal voice was in all likelihood confined to non-oppositional verbs indicating uncontrolled events, and that oppositional functions only developed at a later stage. The second case study focuses on the so-called ‘impersonal’ construction in Umpithamu (Paman, Pama-Nyungan), a constructional type of middle and one of the few instances of middle voice systems found in Pama-Nyungan languages. This structure originates in contact with the neighboring subgroup of Lamalamic languages, where it is restricted to a lexical function, viz. forming idioms for involuntary bodily processes, like ‘have hiccups’ or ‘have a headache’. The structure is transferred to Umpithamu with its basic lexical function, where it also develops a secondary passive-like function, thereby entering the domain of oppositional valency-reducing functions. The case of Umpithamu is further interesting in that the type of construction we discuss is not normally taken into account in analyses of middle marking, which usually focus on verbal morphology rather than construction-level marking. The results of our study contribute to a better understanding of the diachronic typology of MMs, which remains under-studied in the literature (though see Inglese 2023).
The paper is structured as follows. In Section 2, we briefly sketch the mainstream views on the nature and development of MVSs. Section 3 focuses on the history of the middle voice in Hittite. After a brief overview of the synchronic properties of the Hittite middle inflection (Section 3.1), in Section 3.2, we discuss the result of a detailed corpus analysis of the distribution of middle verbs in the history of Hittite. We argue that while non-oppositional verbs constitute the larger group in the earliest stage of documentation, they are progressively outnumbered by oppositional middles over the course of time. We elaborate on these findings in Section 3.3, where we propose our own account of the emergence and development of verbal voice over the documented history of Hittite. Section 4 deals with the Umpithamu data. Section 4.1 discusses the basic properties of impersonal structures and demonstrates their status as a middle voice system, while Section 4.2 analyses the structures’ origins in contact with Lamalamic, and the late development of a passive-like function in Umpithamu. Section 5 recapitulates the findings of this study and puts them in a broader perspective.
2 Middle voice systems in diachrony
From a cross-linguistic perspective, MVSs consist of structures (middle markers or MMs) that are characterized by a split distribution over morphosyntax and the lexicon (see Inglese 2022; Kemmer 1993; Klaiman 1991). On the one hand, MMs provide a morphosyntactic alternation for unmarked (bivalent) verbs, typically with a valency-reducing function, while on the other hand, it is lexically obligatory with some (typically monovalent) verbs that lack an unmarked counterpart. Data from Italian illustrates this distribution quite well. The enclitic pronoun si, typically described as Reflexive, in fact behaves as a MM, in the sense that (mostly transitive) verbs may take on si to express reflexive, reciprocal, anticausative, and passive functions (among other things), as in (1a), whereas other verbs obligatorily occur with si and do not have an unmarked counterpart, as in (1b).[1]
si colpiscono ‘they hit themselves, each other’ < colpire ‘hit’ | [reflexive, reciprocal] |
si scioglie ‘it melts (intr.)’ < sciogliere ‘melt’ | [anticausative] |
si vendono molte case ‘many houses are sold’ < vendere ‘sell’ | [passive] |
arrabbiar si ‘get angry’, accorger si ‘realize’, inginocchiar si ‘kneel down’, fidar si ‘trust’, pentir si ‘repent’ |
As already mentioned, verbs in (1a) and (1b) can be referred to as oppositional and non-oppositional middles, respectively. The former instantiate a transparent process of morphosyntactic alternation for valency change (see Bahrt 2021; Kulikov 2010; Zúñiga and Kittilä 2019), and the meaning of the middle marked verb can be more or less compositionally derived from the meaning of the verb base plus the middle marker. By contrast, non-oppositional verbs are best thought of as lexicalized, in the sense that middle marking must be lexically stored with individual verbs. Indeed, a non-middle counterpart is lacking, and the combination of the verb base plus middle marking is not easily segmentable into distinct meaning components (Haspelmath and Müller-Bardey 2004: 1139).
The split distribution of Italian si illustrated in (1) is the outcome of a well-studied historical process (e.g. Cennamo 1993). Italian si originates in the Latin reflexive pronoun se/sibi. In Latin, se was mostly used in reflexive contexts proper, and to a lesser extent also in anticausative and reciprocal function (see Cennamo et al. 2015; Martínez Rojas et al. 2021). In other words, in origin, se was predominantly used in oppositional contexts. However, in Late Latin, one already finds se being added to intransitive verbs without any discernible semantic contribution, e.g. (se) ambulare ‘walk’ (Cennamo 1999). This situation is continued to varying extents in Romance languages (see Kemmer 1993: Chap. 5 for discussion), where reflexes of Latin se acquire new oppositional functions, such as passive, impersonal, and antipassive (Cennamo 2020; Giacalone Ramat and Sansò 2011; Janic 2016; Wolfsgruber 2017), and also become obligatory with some intransitive verbs (de Benito Moreno 2023; Inglese and Wolfsgruber 2021). Overall, this process can be described as the expansion of an erstwhile oppositional marker to the non-oppositional domain.
The type of development discussed for Latin se finds strong parallels in several languages of the world. In fact, Inglese (2023) found that in a sample of 149 MMs, a reflexive (or other valency changing operation) origin can be reasonably assumed for at least 50 out of 100 MMs for which a historical source can be discerned. Considering the original connection with valency change of the source constructions, all of these must instantiate the oppositional > non-oppositional development.[2] The attention given to the reflexive > mm shift is such that there is a widespread belief that MVSs must indeed originate in oppositional voice markers. The following quote by Kaufmann (2007: 1688) is representative of this line of reasoning:
Historical data provide evidence that the media tantum [i.e. non-oppositional middles] develop after both the direct reflexive and the anticausative reading in languages that encode the middle function by reflexive verbs (…). I therefore, assume that the existence of media tantum in middle marking languages is a consequence of a reinterpretation of the device which derives the differential readings [i.e. oppositional functions] (…).
While the frequency and spread of the oppositional > non-oppositional development can hardly be denied, in the remainder of this paper we discuss two case studies that challenge the purported unidirectionality of this shift in its stronger formulation (as per Kaufmann 2007 above), and show that the opposite development, that is, non-oppositional > oppositional, can also take place.
3 The middle voice in Hittite: a diachronic perspective
Hittite is the most anciently attested language of the Indo-European (henceforth, IE) language family. It belongs to the Anatolian branch of the family (nowadays extinct) and was the main administrative language of the Hittite Kingdom, which ruled over the Anatolian peninsula (and Northern Syria) in the second millennium BCE. The Hittite corpus consists of cuneiform tablets retrieved from the archives of the capital city Hattusa. Based on the existing documentation, scholars commonly distinguish three stages of the language: Old Hittite (OH, ca. 1650–1450 BCE), Middle Hittite (MH, ca. 1450–1350), and New Hittite (NH, ca. 1350–1190) (Hoffner and Melchert 2008: xvii).
As is common to most ancient IE languages, the Hittite verb features a dual system of inflectional voice (see Clackson 2007: Chap. 5). In Hittite, every verb must be marked for voice, which is realized through distinct sets of verbal endings that simultaneously express other grammatical categories, viz. tense, mood, person, and number (Hoffner and Melchert 2008: Chap. 11).[3] The two voices are referred to as Active and Medio-passive or Middle, respectively.[4] The distinction between the Active and the Middle endings is exemplified in Table 1.
The Active and Middle inflections in Hittite.
Present indicative | ||
---|---|---|
Active | Middle | |
walh- ‘hit’ | iya- ‘go’ | |
1sg | walh- mi | iya-hha(ri) |
2sg | walh- si | iya-tta(ri) |
3sg | walh- zi | iya-tta(ri) |
1pl | walh- weni | iya-wasta |
2pl | walh- tani | iya-dduma |
3pl | walh- anzi | iya-nta(ri) |
3.1 The functions of the Hittite middle voice
Studies on the Hittite middle voice usually set up a distinction between two main classes of Middle verbs: media tantum and alternating verbs (Friedrich 1960; Hoffner and Melchert 2008: 303–305; Inglese 2020; Luraghi 2012; Neu 1968a). This means that the Hittite middle voice effectively functions as a MM as defined in Section 2.
The group of media tantum correspond to what we call non-oppositional verbs. These are verbs that only inflect in the middle voice throughout the history of the language. Examples of media tantum are given in (2) (for an exhaustive list see Inglese 2020: 108–114):
ar- ta(ri) ‘stand’, es- a(ri) ‘sit down’, ye/a- tta(ri) ‘go, march’, istu- ā(ri) ‘get out, become known’, ki- tta(ri) ‘lie, be laid’, kis- a(ri) ‘become, happen’, kist- ā(ri) ‘perish’, tarra- tta(ri) ‘be able, can (+infinitve)’, tukk- āri ‘be visible, be important’, ur- āri ‘burn’, wakk- āri ‘be lacking’, wess- tta ‘wear’, zē- a(ri) ‘cook (intr.)’ |
As shown in (2), Hittite media tantum cover situation types that are typical of middle marking (as per Kemmer 1993), including verbs of position and posture, e.g. ar- ta(ri) ‘stand’ and es- a(ri) ‘sit down’, verbs of motion, e.g. ye/a- tta(ri) ‘go’, and spontaneous events, e.g. ur- āri ‘burn’. Generalizing over the semantics of the media tantum, a clear pattern emerges whereby they encode either states, e.g. ki- tta(ri) ‘lie’, or, more frequently, uncontrolled change-of-state events, e.g. kīs- a(ri) ‘become’ (Inglese 2020: 119–122; Luraghi 2012; contra Neu 1968a: 117–119). Syntactically, most media tantum are intransitive verbs of the unaccusative type, defined in Hittite as a specific subset of intransitive verbs that require 3rd person enclitic subject pronouns in the absence of an overt subject (Garrett 1996).[5] In this respect, they differ from both transitive and intransitive unergative verbs, which do not allow enclitic subjects under any circumstance. Consider example (3) in which the Middle verb es- ‘sit down’ is found with the subject clitic =as.
n= as =san | GIŠ hulikanniya | esa | ||
conn=3sg.nom=ptc | carriage.all | sit.prs.mid.3sg | ||
‘And he sits down on the carriage.’ | [KBo 20.18+ rev. 7][6] |
With verbs that inflect both in the Active and in the middle voice, voice alternation encodes a number of valency-reducing operations (Hoffner and Melchert 2008: 303–305; Inglese 2020: 131–151): reflexive, reciprocal, passive, and anticausative (as defined in Zúñiga and Kittilä 2019). In the first place, oppositional Middle verbs may have a reflexive and a reciprocal interpretation. In (4), the Middle form suppiahhut ‘purify oneself’ is the reflexive counterpart of the transitive active verb suppiahh- i ‘purify’, while in (5), the plural form zahhanda is a reciprocal verb based on Active transitive zahh- ‘hit’.
it | suppiahhut |
go.imp.2sg | purify.imp.mid.2sg |
‘Go, purify yourself!’ | [KBo 3.16 iii 8] |
takku | LÚMEŠ | zahhanda | |
if | man(pl) | hit.prs.mid.3pl | |
‘If (two) men strike each other (and one of them dies).’ | [KBo 6.26 ii 16] |
Another, comparatively frequent, function of the middle voice is passive. In the passage in (6), the verb handāittari ‘be determined’ is the passive counterpart of transitive active handai- ‘determine (through oracle)’.
n=at | mahhan | IŠTU | dUTU-ŠI | handāittari |
conn=3sg.nom.n | as | by | his_majesty | determine.prs.mid.3sg |
‘As it is determined by his Majesty, (should I treat it likewise?).’ | ||||
[KBo 16.97 + KBo 40.48 rev. 38–39] |
In (6), a passive interpretation is confirmed by the occurrence of the Agent phrase IŠTU dUTU-ŠI ‘by his Majesty’. Note further that in this example, the verb handāittari triggers the use of the subject enclitic pronoun =at, which demonstrates that oppositional Middle verbs are syntactically intransitive unaccusatives, much in the same way as intransitive media tantum like es- in (3). This proves that the middle voice effectively operates as a valency-reducing device in Hittite.
Finally, as also discussed by Luraghi (2012), voice alternation can also be associated with anticausativization. Consider example (7).
kuitman=ma | gimmanza | nāwi | zinnattat |
until=ptc | winter.nom | not_yet | finish.prs.mid.3sg |
‘And before winter is over.’ | [KBo 2.5 iv 11] |
In (7), the Middle form zinnattat encodes a spontaneous event ‘come to an end’ (this being the only possible interpretation, given the nature of the subject gimmanza ‘winter’) as opposed to the transitive Active counterpart zinni- zi ‘complete, finish (tr.)’, which indicates an externally induced event. Notably, middle forms of several verbs are ambiguous between an anticausative and a passive interpretation, e.g. Active tarupp- zi ‘gather (tr.)’ versus Middle tarupp- ta(ri) ‘gather (intr.), be gathered’.[7]
3.2 The chronological distribution of the Hittite middle
From a panchronic perspective, i.e. if one takes into consideration the entire Hittite corpus, oppositional Middles are more frequent than non-oppositional ones, as reported in Table 2.[8] At first sight, this would seem to suggest that the oppositional component of the system is diachronically primary. Zooming in on oppositional verbs, as the data in Table 3 shows, the passive function is the most widespread, closely followed by the anticausative, while occurrences of reflexives and reciprocals are quite marginal.
The distribution of Middle verbs in the Hittite corpus (data from Inglese 2020).
Non-oppositional | Oppositional | |
---|---|---|
Type frequency | 71 | 94 |
Oppositional functions of Middle verbs (adapted from Inglese 2020: 191).
Anticausative | Passive | Anticausative/passive | Reciprocal | Reflexive | |
---|---|---|---|---|---|
Type frequency | 28 | 39 | 17 | 3 | 7 |
Things become much more interesting once the data is put into a chronological perspective. Thanks to advances in techniques of paleographic text dating, we can now rely on a careful distinction of Hittite texts into three distinct language stages: Old, Middle, and New Hittite (see Inglese 2020: 62–63; van den Hout 2009). This means that, with respect to earlier studies such as Neu (1968a, 1968b), the chronological distribution of Middle verbs can now be explored in detail over the entire history of the language.
Let us first consider the distribution of the two macro-classes of Middle verbs. Overall, as shown in Table 2, non-oppositional verbs are slightly outnumbered by oppositional Middles. The chronological distribution offers a radically different picture, however, as suggested by a comparison between Tables 2 and 4.[9] The most striking result is that the distribution of the two classes is not even over time. In OH, most Middles belong to the non-oppositional class, while oppositional verbs have a rather limited distribution. This imbalance is also reflected in token frequency at this stage: non-oppositional Middles amount to 191 tokens in the OH corpus, versus 24 tokens of oppositional verbs. The OH distribution is radically altered over time, leading to the NH situation, which can be considered the mirror image of OH. At this stage, non-oppositional Middles constitute the minority, while oppositional verbs show signs of a strong expansion.
Chronological distribution of Middle verbs (from Inglese 2020: 220).
Non-oppositional | Oppositional | |
---|---|---|
OH | 28 | 7 |
MH | 16 | 19 |
NH | 13 | 32 |
The internal distribution of oppositional Middles also varies over time. Table 3 suggests that passive is the most frequent function of oppositional Middle verbs in the entire corpus. However, a closer look at the chronology of oppositional Middles, as in Table 5, shows that passives are relatively limited in OH, and become increasingly common over time (anticausatives also increase over time but at a less pronounced rate).[10] Reflexives and reciprocals remain rare at all stages.
Chronological distribution of oppositional Middles (from Inglese 2020: 221).
Anticausative | Passive | Anticausative/passive | Reciprocal | Reflexive | |
---|---|---|---|---|---|
OH | 3 | 2 | 0 | 1 | 1 |
MH | 5 | 7 | 7 | 0 | 1 |
NH | 8 | 13 | 9 | 1 | 0 |
3.3 The origin of the middle voice in Hittite
Taken at face value, the panchronic distribution of the functions of the Hittite middle voice is similar to that of Italian si discussed in (1). In both languages the MM applies to oppositional verbs with passive, anticausative, reflexive, and reciprocal functions as well as to a class of non-oppositional verbs. Based on this similarity, and keeping in mind the general validity of the reflexive > mm shift, one could speculate that the Hittite middle voice originated as a reflexive marker. In fact, a reflexive origin has sometimes been proposed for the Proto-Indo-European Middle inflection as a whole (e.g. Meier-Brügger 2010: 396).
However, the Hittite corpus does not fully support this reconstruction. To begin with, it must be mentioned that even synchronically, voice alternation is only weakly connected with reflexivity in Hittite. Instead, reflexivity is primarily encoded by means of the so-called Reflexive particle =za in combination with Active verbs (see Cotticelli Kurras and Rizza 2013; Hoffner and Melchert 2008: 357–360; Inglese 2020: 81–86). The reflexive usage of =za is exemplified in (8), where the particle combines with the otherwise transitive Active verb walh- ‘hit’ to yield a reflexive reading ‘hit oneself’.
nu= za | 6-ŠU | walhanzi | ||
conn=refl | six_times | hit.prs.3pl | ||
‘(Afterwards the two priests of the God Zilipuri come) and they beat themselves six times.’ | [KUB 1.14 ii 8] |
Moreover, even the few oppositional Middles that license a reflexive reading tend to co-occur with =za in reflexive contexts (see Inglese 2020: 147–148). In fact, the example in (4), with the bare Middle suppihhut in reflexive function, is the exception, as the same verb suppiyahh- ‘purify’ also occurs with the particle when used reflexively, as in (9).
LUGAL-us= za | suppiyahhati | |
king.nom=refl | purify.pst.mid.3sg | |
‘The king has purified himself.’ | [KBo 25.112 ii 14] |
The low frequency of Middle verbs in reflexive function, combined with the existence of a more productive alternative for the encoding of reflexivity, makes it doubtful that reflexivity could have been the original semantic core of the Middle inflection. In addition, as remarked in Section 3.2, in the oldest stage of the language Middle verbs are predominantly non-oppositional, and oppositional ones gain ground only at a later stage. In other words, the Hittite corpus data hardly supports the oppositional > non-oppositional shift.
Based on these observations, the Hittite data can best be explained by assuming a different development, as already proposed by Luraghi (2012) and Cotticelli Kurras and Rizza (2013), among others. The OH data presented in Table 4, in which most middle verbs belong to the media tantum, supports the hypothesis that verbal voice might have been lexically determined in origin, with verbs only inflecting either in the Active or in the middle voice (this idea is not entirely new in IE studies; see e.g. Lazzeroni 2004: 142). In other words, the Middle inflection was originally confined to non-oppositional Middles, thus suggesting that oppositional Middles are an innovation.
As already noted, a remarkable feature of media tantum in OH is that they are mostly connected with spontaneous change-of-state events involving inanimate subjects, e.g. kis- ‘happen’, kist- ‘go out (of fire)’ (see Inglese 2020: 124). Spontaneous Middles of this sort could have easily been perceived as being in opposition to Active verbs indicating the corresponding externally caused event. Indeed, a pattern of voice opposition contrasting Active and Middle verbs already exists between intransitive non-oppositional Middles and their transitive causative counterparts built with derivational suffixes such as -nu- and Active endings, e.g. kist- ā(ri) ‘go out’ versus kistanu- zi ‘put out, extinguish’ (on this pattern see Luraghi 2012). Similarly, Active versus Middle pairs also include suppletive pairs of the type kis- a(ri) ‘happen.mid’ versus iya- zi ‘cause to happen.act’. The existence of this polarization between the Active and the middle voice in the lexicon could have led to the use of the same verbal root in both voices to indicate the anticausative alternation. This might be how voice alternation arose in the first place in the language, making it reasonable to assume that the anticausative function was the first oppositional usage of the middle voice that emerged from the original lexical distribution.
Once established as an anticausativization strategy, voice alternation could have spread to other oppositional functions as well (see Inglese 2020: 228–241 for a full account). The shift from anticausative to passive does not need much comment, as it is typologically rather trivial (see e.g. Bahrt 2021: 206–207; Haspelmath 1990; Inglese 2020: 231–234). Links towards reflexives and reciprocals are less straightforward, and can be explained as follows (see also Bahrt 2021: 203–206).
The shift anticausative > reflexive/reciprocal can be better understood by adopting a finer-grained distinction of anticausatives in terms of decausatives and autocausatives (Creissels 2006: 10; Geniušienė 1987: 86–89, 98–104). Decausative verbs such as ‘melt (intr.)’ typically involve non-controlling Patient subjects, while the subject of autocausatives can be construed as partly exerting control on the onset of the change-of-state event. Autocausatives typically include verbs of self-induced motion, e.g. ‘turn’, which offer a possible bridging context between the anticausative and the reflexive domain.
Verbs of motion arguably offer the key bridging contexts from the anticausative to the reflexive function (a similar point is also made by Holvoet 2020: 118–119). At the pre-Hittite stage, as we have noted, voice alternation was likely restricted to verbs expressing a spontaneous change of state, i.e., decausatives. While verbs denoting a physical change of state (e.g. ‘melt’) constitute the core of the category, decausative verbs also include verbs of motion with inanimate subjects, on the ground that change of location also entails a change of state. Crucially, however, motion events differ from prototypical decausatives (physical change-of-state events), in that their single participant shares properties of prototypical Agents, i.e. agenthood and control, and Patients, i.e. undergoing a change of location (e.g. Luraghi 2010: 140). In other words, unlike events of physical change, motion events can more easily be construed as featuring either a non-controlling (decausative) or controlling (autocausative) subject. It is the autocausative use with motion verbs that represents the first step towards the reflexive use. Consider the behavior of oppositional Middle forms of the Hittite verb nini(n)k- zi ‘raise, set in motion’ in (10).
karittes | nininkanta | |
flood.nom.pl | raise.prs.mid.3pl | |
‘Floods will get moving.’ | [KUB 8.1 iii 21] |
nu | mān | LÚKÚR | kuiski | niniktari | ||||
conn | if | enemy | indf.nom | raise.prs.mid.3sg | ||||
‘If some enemy mobilizes (and goes to attack these borders).’ | [FHL 57+ iii 46–47] |
In (10a), the Middle form ninikanta occurs with the inanimate subject karittes ‘flood’ and the entire situation is depicted as a spontaneous change-of-state event. By contrast, in (10b) the very same verb is used with an animate subject LÚKÚR ‘enemy’, which is construed as agentively bringing about the motion event. Starting from autocausative contexts such as in (10b), voice alternation could have been further extended to reflexive contexts proper, leading to the final stage of the development, in which voice alternation applies to events that cannot possibly be construed as spontaneous and that exclusively involve animate controlling subjects, as in e.g. (4) above. A parallel cline of development can also be assumed for the reciprocal function, with autocausative verbs of the spatial lexical reciprocal type serving as bridging contexts, e.g. tarupp- ‘gather’ and sarra- ‘split, divide’ (see also Inglese 2017). The development of the oppositional functions of the Hittite Middle inflection can be summarized as in Figure 1.

The development of oppositional functions of the Hittite middle voice (adapted from Inglese 2020: 240).
The development proposed here is also wholly compatible with the distribution of Middle verbs in the Hittite corpus, as discussed in Section 3.2. In the first place, assuming a non-oppositional origin of the Middle inflection best explains why non-oppositional Middles are more frequent in OH, and why oppositional ones become more frequent at later stages, as per Table 4. In particular, the fact that non-oppositional Middles seem to decrease over time is also due to the fact that some of these develop a historically new pattern of voice alternation, thereby entering the oppositional class. For example, the verb lazziye/a- tta(ri) ‘recover, become good’ is only non-oppositional in OH, but in NH, one also finds an Active oppositional counterpart (with causative meaning) lazziye/a- zi ‘set straight’.
By contrast, if one were to reconstruct an original reflexive function, one would have to assume that oppositional functions developed into non-oppositional verbs, which then became more prominent, only for oppositional functions to regain ground at a later stage. While this reconstruction is not per se impossible, it is unnecessarily complicated and does not rest on any independent evidence to postulate a reflexive origin of the Middle inflection.[11]
In the second place, the development of oppositional functions sketched in Figure 1 is also compatible with the distribution of oppositional functions in the Hittite corpus as described in Table 5. Indeed, anticausatives are relatively common in OH already and remain so at later stages. The passive, which is the function most closely related to the anticausative, is a secondary development, as it becomes increasingly more frequent especially in NH. The extension to reflexive and reciprocal contexts, which, as discussed, follows a less mainstream path of development, never gained much ground, possibly on account of the existence of dedicated strategies for the encoding of these two functions (Inglese 2020: 156–163).
A typological peculiarity of the Hittite Middle also concerns the timing of the development of the reflexive function. Kemmer (1993: 229) argued that “middle markers from non-reflexive sources will not develop into markers of reflexive semantics”. However, Hittite offers evidence for precisely this type of unexpected development, with a middle marker originating in spontaneous/anticausative events extending to the reflexive domain only secondarily (for further parallels from unrelated languages, see Inglese 2023). It is also noteworthy that lack of directionality in the reflexive > mm development is also compatible with a general lack of directionality in the voice domain that has been advocated in more recent studies. In fact, as pointed out by Bahrt (2021: Chap. 7), it is reductive to claim that reflexives are unidirectionally the preferred source of other voice operations, as patterns of voice syncretism can in fact come about following different paths and taking different voice operations as input. Why it is specifically in the voice domain, and not in others, that we fail to see directionality in diachronic developments, is a question that we leave for future research (see Giomi and Inglese forthcoming for some preliminary remarks).
4 The middle voice in Umpithamu
Umpithamu is a Pama-Nyungan (Paman > Middle Paman) language of north-eastern Australia (Verstraete 2020). It is one of the few Pama-Nyungan languages that could be said to have a middle voice system as defined in this paper (as demonstrated in the sample-based analysis in Inglese 2022). The relevant structure has a non-oppositional function in the lexicon, where it forms idioms designating involuntary physical processes for humans (like hiccups or cramps), and an oppositional function in morphosyntax, where it functions productively as a passive-like structure for human Patients. What is interesting in the context of this study is that we have good information about its diachrony: the structure originates in contact with Lamalamic languages, another Paman subgroup (Pama-Nyungan > Paman > Lamalamic) to the south of Umpithamu, where only the lexical uses are attested. In other words, the morphosyntactic oppositional function of the middle construction in Umpithamu is new and can be demonstrated to originate in its lexical non-oppositional function. Section 4.1 introduces the relevant structures and demonstrates their status as a middle voice system, and Section 4.2 discusses the structure’s historical development, from idioms in the lexicon to a productive passive-like structure.
4.1 Impersonals as a middle voice system
The relevant structures are illustrated in (11) and (12) below, in contrast with the standard transitive structures in the language illustrated in (13) and (14). We will initially refer to them as ‘impersonal’ structures, in line with previous work (Verstraete 2011a), but their middle status will be demonstrated below.
waarruthu | ayngki-n=inguna | ||
bad | throw-pst=3sg.gen | ||
‘He became incapacitated.’ (lit. ‘(it) threw him bad’) | [fieldnotes, text] |
yitha-n=antyangana | Bamaga | |||
leave-pst=1pl.exc.gen | (placename) | |||
‘We were left at Bamaga.’ (lit. ‘(they) left us at Bamaga’) | [fieldnotes, text] |
woypu-mpal | ayngki-ngka=ina | |
ghost-erg | throw-prs=3pl.nom | |
‘It’s the ghosts who throw them.’ | [fieldnotes, text] |
yitha-n=ina-ingku | apiyi |
leave-pst=3pl.nom-3sg.acc | here |
‘They left him here.’ | [fieldnotes, text] |
Standard transitive structures in Umpithamu are characterized by a number of features, two of which are relevant in this context. First, they typically have cross-referencing pronouns, organized on a nominative-accusative basis. The system can be called ‘cross-referencing’ because it is not as systematic as full-fledged agreement (compare Siewierska 2004): Nominative pronouns are present in about 2/3 of verb-headed clauses (as in (13) and (14) above), while accusative pronouns are only present in about 1/3 of clauses (compare (13), without an accusative pronoun, with (14), which has one). Pronouns form complexes that are typically encliticized to the verb, as in (13) and (14), but they can also be located at the start of the clause, a characteristic to which we will return in Section 4.2 below. The second feature of standard transitive clauses concerns nominals, which are organized on an ergative-absolutive basis, as shown in (13) above for the ergative marker. Ergative marking is not obligatory in transitive clauses, but organized along principles of animacy and information structure, as is typical of so-called ‘optional ergative’ systems (Chappell and Verstraete 2019; McGregor 2010; Verstraete 2010). Inanimates take obligatory ergative case, while animates only take ergative case in specific information-structural contexts, like the focus context in (13), where the cause of a disease has just been established as a foreign object in the body, and the ergative-marked participant in (13) identifies the entity responsible for inserting it.
The impersonal structures in (11) and (12) use the same transitive verbs as in (13) and (14), but they differ from standard transitive structures in a few important respects. First, they never take nominative pronouns, as shown in (11) and (12). In other words, there is no trace of subject marking, although in some cases an Agent-type role can be discerned discursively, and more rarely expressed with a (non-cross-referenced) ergative-marked nominal, as in (15) below. Second, they take genitive pronouns instead of accusative ones for the Patient argument, and express these obligatorily rather than optionally. If such arguments are present in lexical form, they take absolutive case, again as illustrated in (15) with apa’ala ‘lower leg’, in an external possession relation with the Patient.[12]
aatyarra-mpal | atha-n=athuna | apa’ala | |||
cramp-erg | bite-pst=1sg.gen | lower.leg | |||
‘I had a cramp in my lower leg.’ (lit. ‘cramp bit me in the lower leg’) | [fieldnotes, elicited] |
Morphosyntactically, the properties of the structures in (11), (12), and (15) can be interpreted as pointing to impersonal status, as argued in Verstraete (2011a), to which we refer for more details, since the argument for impersonal status is less directly relevant to the present discussion. What is more relevant, however, is the semantics of impersonal structures. The use of an oblique (‘genitive’) pronoun instead of the standard accusative one points to the presence of an Experiencer role: what all of these structures have in common is a feature of involuntary experience for a human Patient.[13] This feature manifests itself semantically in two ways. In instances like (11), (15), and (16)–(17) below, impersonal structures use transitive ‘impact’ verbs to build idioms designating the experience of involuntary physical processes in human beings, like cramps (15), hiccups (16), sudden weaknesses (11), or pains (17).
manu | watyu-ngka=athuna | |||
neck | spear-prs=1sg.gen | |||
‘I have hiccups.’ (lit. ‘(it) spears me in the neck’) | [fieldnotes, elicited] |
yaarri | yangka-ngka=athuna | |||
forehead | pull-prs=1sg.gen | |||
‘I have a headache.’ (lit. ‘(it) pulls me on the forehead’) | ||||
[fieldnotes, volunteered] |
In instances like (12) and (18) below, by contrast, impersonal structures provide an alternative construal for any transitive verb, where the basic semantics of the verb is maintained (unlike in the idioms above), but overlaid with a semantic layer that highlights the involuntary experience of the Patient in the process (and at the same time demotes the Agent). Structures like (12) and (18) are either used in contexts where a human being is inherently involuntary in experiencing an activity (as in (18), where an unconscious person is carried away), or in contexts where involuntary involvement is discursively highlighted (as in (12), which describes a government-ordered deportation away from people’s home country).
kali-n=athuna | yinthu | |||
carry-pst=1sg.gen | inside | |||
‘I was carried inside [after fainting].’ (lit. ‘(They) carried me inside’) | ||||
[fieldnotes, text] |
This double function of the structure is precisely what makes it a middle voice system from a typological perspective, in the sense used in this paper. It may not look like the canonical middle, since it does not use a dedicated inflectional marker (as in the case of Hittite in Section 3.1). Instead, we are dealing with a construction that can only be defined in terms of a specific configuration of features, viz. a transitive verb with obligatory absence of nominative cross-reference, and obligatory presence of genitive cross-reference for the Patient. But it is a middle system nonetheless, since it combines a non-oppositional distribution in the lexicon with a voice- or valency-related oppositional function in morphosyntax. The lexical function relates to the building of idioms for the very specific semantic domain of involuntary physical processes in human beings. The verbs involved are not themselves verbs of physical experience, but the constructions as a whole function as idioms for a range of sudden bodily processes over which people have no control. The morphosyntactic function relates to the use as an alternative construal for any transitive structure with a human Patient. Like most languages in the region, Umpithamu does not have any valency-changing morphology, but this structure is the functional equivalent of a more classic valency-reducing process like a passive. For one thing, structures like (12) and (18) demote the Agent: Ergative-marked nominals are rarely expressed, and even if they are, they are never cross-referenced by a nominative pronoun, which is the basic way to establish a subject-predicate relation in the language (compare 19a, which is a noun phrase, with 19b, which is a clause).
mayi | eentinti | |
fruit | small | |
‘a small fruit’ | [fieldnotes, text] |
mayi | eentinti | iluwa | |
fruit | small | 3sg.nom | |
‘The fruit is small.’ | [constructed] |
Secondly, the structure also highlights the Patient: cross-reference is obligatory (unlike with accusative pronouns), and the genitive case overlays it with Experiencer semantics, as being involuntarily involved in the activity. The main difference from a canonical passive is that this argument does not have any morphological subject characteristics, but there are, in fact, typological equivalents of such passives, where the promoted argument behaves like a subject in some ways, even if its morphological coding is not that of a subject (as shown in Malchukov 2008). In any case, the basic point is that this structure has all the properties of a middle voice system as defined in this paper: a restricted set of uses in the lexicon, in a set of idioms for a specific semantic domain, and a productive use in morphosyntax, as a passive-like structure that provides an alternative construal for any transitive structure with a human Patient.
4.2 Diachronic development
From the perspective of this paper, the Umpithamu structure is interesting in that we can reconstruct its historical development: This shows that the oppositional (valency-reducing) function developed from the non-oppositional (lexical) one rather than the other way round. The key in this argument is the system of cross-referencing pronouns that defines the middle voice system in Umpithamu, which can be shown to originate in contact with languages from the neighboring Lamalamic subgroup to the south. This is, in fact, just one instance of profound Lamalamic influence on the morphosyntax of Umpithamu, fueled by long-standing social links and patterns of multilingualism (see Rigsby 1997; Verstraete 2011a, 2012, 2018 for more examples and details of the contact situation).
The first step in the argument about origins concerns the basic morphosyntax of cross-referencing pronouns in Umpithamu. As already mentioned, pronouns invariably form complexes if there is more than one in a clause, and they can be found in two positions: Single pronouns and complexes are most typically encliticized to the verb, as in all of the examples discussed above, but they can also be found in clause-initial position in specific information-structural contexts, as in (20) below. What is peculiar is that the whole complex moves to initial position as soon as one of the participants is in focus or contrastive topic position, as is the case with the focused Agent participant in (20).
ilu-ungku | yawul-ntha-n | |||
3sg.nom-3sg.acc | big-caus-pst | |||
‘He was the one who raised him [the boy].’ | [fieldnotes, text] |
This particular morphosyntactic configuration is specific to Umpithamu within its subgroup. It is not found in any other Middle Paman language, and not in any other Australian language in fact, apart from the Lamalamic languages mentioned below (see Verstraete 2012 for evidence). There is a whole range of first-position phenomena affecting pronouns in Australia (see Mushin and Simpson 2008), but crucially not with the morphosyntactic specifics as found in Umpithamu. The only exception are the Lamalamic languages just south of Umpithamu, all three of which have exactly this configuration: Pronouns form complexes that can either be encliticized to the verb, or move to clause-initial position as soon as one of the participants referenced in the complex is in focus or contrastive topic. This is illustrated in (21) and (22) below for Umbuygamu and Lamalama, the two best-described languages in the subgroup.
Umbuygamu |
nya-n=na-la | ||
hit-pst=1sg.acc-3sg.nom | ||
‘He hit me.’ | [fieldnotes, text] |
na-la | naparra-m | |||
1sg.acc-3sg.nom | sneak-prs | |||
‘He sneaks up on me.’ | [fieldnotes, text] |
Lamalama |
karra-rr=la, | balpa-n=ya-ngunh | |||
go-irr=3sg.nom | stop-pst=1sg.nom-3sg.acc | |||
‘He would have gone, but I stopped him.’ | [Sommer 1999: 44] |
yaw-ngunh | li-y | |||
1sg.nom-3sg.acc | eat-fut | |||
‘I will eat it.’ | [Sommer 1999: 56] |
In other words, this particular configuration appears to be an innovation within Lamalamic. Given how specific the configuration is, the fact that it is also found in Umpithamu just north of Lamalamic obviously suggests a contact scenario. The analysis in Verstraete (2012) provides linguistic and ethnographic evidence to show that the cross-reference pattern in Umpithamu does in fact originate in contact with Lamalamic languages. On the one hand, there are morphosyntactic peculiarities in Umpithamu that point to a relatively recent origin of the cross-referencing system: The most notable feature is the length of the pronoun complexes in enclitic position, which suggests inherited free pronouns that have been slotted into a borrowed pattern relatively recently. And on the other hand, there is good ethnographic evidence for patterns of intermarriage between members of Umpithamu and Lamalamic clans, leading to systematic multilingualism with Umpithamu and at least one Lamalamic language. The fine detail of the argument can be consulted in Verstraete (2012), but the basic point is that the morphosyntactic pattern of pronominal cross-reference in Umpithamu originates in Lamalamic, with the inherited Middle Paman pronoun material slotted into the pattern.
The second step in the argument concerns how these cross-referencing pronouns are used in the language. The impersonal configuration found in Umpithamu – in particular the systematic lack of nominative cross-reference and its associated Experiencer semantics – is not found in any of its Middle Paman relatives. Partially parallel structures known as ‘Experiencer object’ constructions are found further afield in Australia, mainly in some non-Pama-Nyungan families in the continent’s central north (see Evans 2004; Merlan 1985; Walsh 1987 for more details). In the immediate region, however, there is one almost exact parallel, once again in the Lamalamic languages. All three Lamalamic languages have a construction characterized by obligatory absence of nominative cross-reference, and obligatory presence of a non-nominative pronoun for the Patient, either an oblique form, like the dative in Rimanggudinhma in (23), or an accusative, as in the Umbuygamu structure in (24). Verstraete (2011a) provides more details on these structures.
Rimanggudinhma | |||||
noka | rhi-m=thum | ||||
neck | spear-prs-1sg.dat | ||||
‘I have hiccups.’ (lit. ‘(it) spears me in the neck’) | [Godman 1993: 91] |
Umbuygamu | |||
marr-unyenh-u | tha-m=na | ||
nose-sneeze-erg | bite-prs=1sg.acc | ||
‘I am sneezing.’ (lit. ‘sneeze bites me’) | [fieldnotes, elicited] |
In terms of semantics, the structures are an exact match for the non-oppositional (lexical) part of the Umpithamu structure: Again, they use basic transitive verbs in an impersonal structure, to build idioms for the domain of involuntary physical processes. In other words, it looks as if, along with the basic patterns of pronominal morphosyntax, more specific configurations like impersonal constructions were borrowed from Lamalamic into Umpithamu, as a very specific instance of pattern replication in verbal constructions (compare Grossman 2021). In most cases, even the transitive verbs used in the idioms match between the languages, e.g. ‘spear’ for hiccups, ‘pull’ for headaches or ‘bite’ for cramps.
Given this scenario about origins, we can now evaluate the diachronic development of the middle voice system in Umpithamu, which has both restricted non-oppositional uses in the lexicon and a productive oppositional use as a passive-like structure in morphosyntax. The equivalent structures in Lamalamic have the exact same lexical uses in building idioms for involuntary physical processes, but there is no trace of a productive passive-like use as in Umpithamu. This leads to the conclusion that only the lexical uses could have been carried over from Lamalamic to Umpithamu, and that the morphosyntactic function of Umpithamu’s middle voice system is new – or at least newer than its lexical function, and most likely developed out of this function. We cannot determine the trajectory with certainty (for instance, there might have been something else in the language that latched onto the incoming structure), but there is one small indication of how the development could have happened. There is a minor pattern, in Umpithamu and the Lamalamic languages, that resembles the impersonal construction in most ways, and that could have served as a bridge from the lexical function attested in all four languages to the morphosyntactic one found in Umpithamu. As already mentioned in Section 4.1, structures with inanimate Agents are special in that they take obligatory ergative marking for the Agent nominal (rather than optional marking, as for all other types of Agents). If an inanimate Agent affects a human Patient, moreover, there is a second set of features that sets apart such structures: The inanimate Agent is not cross-referenced with a nominative pronoun, and the human Patient is obligatorily cross-referenced, with an accusative or oblique pronoun.[14] Some of these structures are illustrated in (25) and (26) below, for Umpithamu and Umbuygamu.
yuma-mpal | anthi-ku=ingkuna | |||
fire-erg | burn-pot=2sg.gen | |||
‘The fire will burn you.’ | [fieldnotes, elicited] |
Umbuygamu | ||
oharr-aw | θe-m=ungan | |
mud-erg | bog-prs=3sg.obl | |
‘Mud is bogging him.’ | [Sommer 1998: 22; orthography partly adjusted] |
In other words, except for the obligatory use of an ergative-marked nominal, these structures are very similar to the impersonal structures discussed in the rest of this section.[15] They are also interesting semantically, because they can be related to the lexical functions found in all four languages, as well as to the productive passive-like function found in Umpithamu, which would make them a potential bridge between the two. What links these structures to the idioms for involuntary physical processes is the typically involuntary involvement of the human Patient. Inanimates are either natural forces with unpredictable ‘generalized’ agency, or non-motive entities that are not typically monitored for agency (see Fauconnier 2012: 37–38 on some relevant grammatical differences). In either case, the activities they instigate more typically ‘overcome’ people than actions instigated by other humans, which are monitored for agency and are not generalized sources of undirected or unpredictable ‘spontaneous’ agency. In this sense, such activities are not unlike involuntary physical processes, which affect a human being beyond their control. On the other hand, there are also a number of links to voice mechanisms. Most generally, spontaneous events are a common historical source for passive constructions, as shown, for instance, in Haspelmath (1990). More specifically, what links structures like (25) and (26) to voice mechanisms is the fact that they have a very particular configuration of participants, which typically triggers some sort of voice or valency change in languages that have the relevant structures. From a broad typological perspective, inanimate Agents acting on human Patients run counter to the classic participant hierarchies often assumed to govern case marking (Silverstein 1976), and they are a configuration that tends to be associated with special voice options and related phenomena (like hierarchical alignment, compare Gildea and Zuñiga 2016). And in the immediate region, structures with inanimate Agents are often associated with the same valency-reducing morphology that characterizes passives, antipassives and the like, though sometimes with different case frames. This is illustrated in the Guugu Yimidhirr structure in (27) below, where the verb is marked for reduced valency with the reflexive suffix, in combination with standard transitive case frames for the arguments (vs. distinct case frames for canonical reflexives, see further in Verstraete 2011b).
Guugu Yimidhirr | ||||||
nganhi | wagi-idhi | naaybu-unh | ||||
1sg.acc | cut-refl.pst | knife-inst | ||||
‘I got cut on the knife.’ | [Haviland 1979: 123] |
In other words, the very minor category of inanimate Agents acting on human Patients could, by virtue of its formal and semantic links with the other categories, have served as a kernel or a ‘bridge’ towards the passive-like function in Umpithamu, as shown in Figure 2. This scenario remains speculative of course, but even if the origins of the passive-like use lie elsewhere, it is clear from the contact-induced origins of the structure that, unlike the lexical uses, the passive-like use was not borrowed along with the formal structures from Lamalamic, and must have originated later than these lexical uses.

Scenario for development of middle voice system in Umpithamu.
5 Conclusions
This paper has analyzed the development of middle voice systems in Hittite and Umpithamu, in the light of a widespread assumption of unidirectionality in the development of middles. The two languages feature middle voice systems that are synchronically rather different. The Hittite Middle inflection is an example of what is perhaps the best studied type of middle voice, that is, the inflectional Middle of ancient Indo-European languages (Zúñiga and Kittilä 2019: 168–171). The Umpithamu system is structurally quite different, since it does not use a dedicated marker and can only be defined in terms a specific configuration of features, viz. a transitive verb with obligatory absence of nominative cross-reference, and obligatory presence of genitive cross-reference for the Patient. Overall, Hittite and Umpithamu represent two radically different ways of marking voice, which are at two ends of a continuum between more synthetic (Hittite) versus more analytic (Umpithamu) structures. Analytic middles are rare from a typological perspective. A sample of 149 MMs studied in Inglese (2022) only has 8 instances of analytical marking, mostly using pronouns as in Italian (1). In this sense, the description of the Umpithamu construction calls for a refinement of the morphosyntactic typology of middle voice systems, which may reveal more languages that instantiate a similar constructional pattern. In addition to marking, the two languages are also quite distinct in terms of the distribution of functions. While in Hittite the Middle inflection is associated with a rich class of non-oppositional middles and may perform several valency-reducing functions, the functional range of the Umpithamu construction is comparatively narrow, being restricted to idioms for involuntary physical processes on the non-oppositional side and a passive-like function on the oppositional side.
In spite of these differences, both languages are similar in terms of their historical development, as shown in Sections 3 and 4. The construction that ultimately develops into a middle voice system originally has an essentially lexical distribution and develops grammatical functions only at a later stage. In the case of Hittite, as also shown by the distribution of verbs in the Hittite corpus, it is clear that the middle voice is originally confined to non-oppositional middles, while Active/Middle voice alternation in oppositional function gains ground over time. This shift can be understood as an increase in the paradigmaticization of voice, in the sense that at the end of the process virtually every transitive verb can participate in the pattern of voice alternation, thereby making verbal voice a full-fledged inflectional category.[16] As for Umpithamu, the origin of the middle in contact with Lamalamic clearly demonstrates a lexical origin, as Lamalamic languages only use the relevant structure for idioms designating involuntary physical processes and do not show any passive-like use. The passive-like use is only attested in Umpithamu, and must therefore postdate transfer of the lexical pattern from Lamalamic to Umpithamu.
From a more general perspective, the developments we propose for Hittite and Umpithamu enrich our knowledge of the diachronic typology of MVSs and specifically support the existence of an alternative path of development than the one commonly assumed in the literature (see Section 2), viz. a non-oppositional > oppositional shift. Evidence for this type of alternative development is not limited to Hittite and Umpithamu, but is backed up by comparable processes attested in other languages as well (see Inglese 2023). Indeed, while ruled out in accounts that postulate a rigidly unidirectional oppositional > non-oppositional shift (e.g. Kaufmann 2007), which tend to be biased towards research on Indo-European languages, the opposite pattern is perfectly compatible with our understanding of grammaticalization processes, which, broadly speaking, typically involve a shift from a lexical to a grammatical distribution (e.g. Hopper and Traugott 2003).
As discussed at length by Inglese (2023), this richer cross-linguistic empirical evidence casts serious doubts on classic claims about the semantic connection between middles and reflexives. The fact that MMs may emerge out of non-reflexive sources calls into question the purported “closer cognitive connection” (Kemmer 1993: 197) between reflexives and MMs. Adopting a source-oriented perspective (e.g. Cristofaro 2019; Sansò 2018), one could argue that the connection between reflexives and MMs is the result of a number of historical facts, which have little to do with semantic closeness between these categories per se. In particular, the fact that MVSs emerge through distinct processes also suggests that one should be more cautious in explaining MVSs as instantiating a single overarching functional domain across languages, but rather, that each MVS emerges as the by-product of specific historical developments (see also Holvoet 2020: 225; Post and Modi 2022).
In this connection, another striking similarity in the developments described for Hittite and Umpithamu is the role played by the class of spontaneous events as a bridging context between the non-oppositional and the oppositional domain. In Hittite, anticausativization was possibly the first oppositional function to arise out of non-oppositional verbs expressing spontaneous change-of-state events, and similarly, in Umpithamu, the development of the passive function is likely linked to a minor intermediate pattern with inanimate Agents affecting human Patients, i.e. apparently spontaneous events with undirected or unpredictable agency. Again, this behavior is by no means isolated, as spontaneous events and anticausatives have been reported as possible sources of MMs in other languages (see Inglese 2023). Keeping in line with the source-oriented approach, one may argue that is the diachronic versatility of the class of anticausatives/spontaneous events, and not some abstract semantic property of the middle voice domain, that explains why the MVSs of Hittite and Umpithamu show a comparable distribution over oppositional and non-oppositional verbs.
The pivotal role of spontaneous events and anticausatives in the development of middles also raises the question where they stand in regard to the interface between lexicon and grammar. For anticausatives, for instance, one might argue that this is the valency-reducing operation that most strongly affects the lexicon, in the sense that it manipulates the inventory of semantic roles of a given verb by removing the Agent. As a result, verb pairs involved in an anticausative alternation usually depict two different events altogether, to the extent that the event of burning in a fire burns is semantically distinct from the causative event someone lights a fire, which is more complex and entails at least one additional participant (e.g. Tubino Blanco 2020; Zúñiga and Kittilä 2019: 41–43). In this respect, anticausatives differ from passives, as the latter construe the same event from different perspectives (Zúñiga and Kittilä 2019: 83). Indeed, anticausativization has been described in term of derivation (see discussion in Alexiadou et al. 2015: 3–4; Tubino Blanco 2020 with references), and the partly lexical nature of anticausatives is also confirmed by the fact that anticausativization may also be realized via suppletion, e.g. die versus kill (Haspelmath 1993; Nichols et al. 2004). A similar argument could be made for the potential intermediate structure in Umpithamu, which is semantically more specific than the average construction, with animacy values specified for the basic roles of Agent and Patient. It goes beyond the scope of this paper to investigate these intermediate structures in detail, but their pivotal role in the development of middles suggests that they have a peculiar status in grammatical systems, which warrants further work on their synchronic and diachronic typology (see, for instance, Fauconnier 2012; McGregor 1999; Verstraete 2011b).
Acknowledgments
Authorship is shared, resulting from close collaboration between the two authors. For academic purposes, GI is responsible for Sections 2 and 3, whereas JCV is responsible for Section 4. Sections 1 and 5 were written jointly. We thank Ellison Luk, Silvia Luraghi, two anonymous reviewers, and the participants of Description de l’oral et méthodes d’analyse linguistique: Hommage à Claire Blanche-Benveniste (Nancy, October 2021) for comments and questions on earlier versions of (parts of) this paper. We also thank the late Florrie Bassani, Bobby Stewart and Daisy Stewart for teaching JCV about Umpithamu, Umbuygamu and Lamalama, and the younger Lamalama people for continuing support and encouragement. Work on this paper was funded by an FWO postdoctoral fellowship (GI, grant no. 12T5320N) and grant C14/18/034 of the Research Council of the University of Leuven (JCV).
Abbreviations
- 1
-
first person
- 2
-
second person
- 3
-
third person
- acc
-
accusative
- act
-
active
- all
-
allative
- caus
-
causative
- conn
-
connective
- dat
-
dative
- erg
-
ergative
- exc
-
exclusive
- fut
-
future
- gen
-
genitive
- IE
-
Indo-European
- imp
-
imperative
- indf
-
indefinite
- inst
-
instrumental
- intr.
-
intransitive
- irr
-
irrealis
- MH
-
Middle Hittite
- mid
-
middle
- MM
-
middle marker
- MVS
-
middle voice system
- n
-
neuter
- NH
-
New Hittite
- nom
-
nominative
- obl
-
oblique
- OH
-
Old Hittite
- pl
-
plural
- pot
-
potential
- prs
-
present
- pst
-
past
- ptc
-
particle
- refl
-
reflexive
- sg
-
singular
- tr.
-
transitive
References
Alexiadou, Artemis, Elena Anagnostopoulou & Florian Schäfer. 2015. External arguments in transitivity alternations: A layering approach. Oxford: Oxford University Press.10.1093/acprof:oso/9780199571949.001.0001Search in Google Scholar
Bahrt, Nicklas. 2021. Voice syncretism. Berlin: Language Science Press.Search in Google Scholar
de Benito Moreno, Carlota. 2023. From (semi-)oppositional to non-oppositional middles: The case of Spanish reír(se). STUF/Language Typology and Universals 76(2). 121–164.10.1515/stuf-2023-2006Search in Google Scholar
Cennamo, Michela. 1993. The reanalysis of reflexives: A diachronic perspective. Napoli: Liguori.Search in Google Scholar
Cennamo, Michela. 1999. Late Latin pleonastic reflexives and the unaccusative hypothesis. Transactions of the Philological Society 97(1). 103–150. https://doi.org/10.1111/1467-968x.00046.Search in Google Scholar
Cennamo, Michela. 2020. The actualization of new voice patterns in Romance. Persistence in diversity. In Bridget Drinka (ed.), Historical Linguistics 2017, 109–142. Amsterdam & Philadelphia: John Benjamins.10.1075/cilt.350.06cenSearch in Google Scholar
Cennamo, Michela, Thórhallur Eythórsson & Jóhanna Barðdal. 2015. Semantic and (morpho)syntactic constraints on anticausativization: Evidence from Latin and Old Norse-Icelandic. Linguistics 53(4). 677–729. https://doi.org/10.1515/ling-2015-0015.Search in Google Scholar
Chappell, Hillary & Jean-Christophe Verstraete. 2019. Optional and alternating case marking: Typology and diachrony. Language and Linguistics Compass 13. e12311. https://doi.org/10.1111/lnc3.12311.Search in Google Scholar
Clackson, James. 2007. Indo-European linguistics: An introduction. Cambridge: Cambridge University Press.10.1017/CBO9780511808616Search in Google Scholar
Cotticelli Kurras, Paola & Alfredo Rizza. 2013. Reconstructing Proto-Indo-European categories: The reflexive and the middle in Hittite and in the Proto-language. Journal of Historical Linguistics 3(1). 7–27. https://doi.org/10.1075/jhl.3.1.02cot.Search in Google Scholar
Creissels, Denis. 2006. Syntaxe générale: une introduction typologique. Paris: Lavoisier.Search in Google Scholar
Cristofaro, Sonia. 2019. Taking diachronic evidence seriously: Result-oriented versus source-oriented explanations of typological universals. In Karsten Schmidtke-Bode, Natalia Levshina, Susanne Maria Michaelis & Ilja A. Seržant (eds.), Explanation in typology: Diachronic sources, functional motivations and the nature of the evidence, 25–46. Berlin: Language Science Press.Search in Google Scholar
van den Hout, Theo. 2009. A century of Hittite text dating and the origins of the Hittite Cuneiform Script. Incontri Linguistici 32. 11–35.Search in Google Scholar
Evans, Nicholas. 2004. Experiencer objects in Iwaidjan languages (Australia). In Peri Bhaskararao & Karumurri V. Subbarao (eds.), Non-nominative subjects, vol. 1, 169–192. Amsterdam & Philadelphia: John Benjamins.10.1075/tsl.60.10evaSearch in Google Scholar
Fauconnier, Stefanie. 2012. Constructional effects of involuntary and inanimate agents: A cross-linguistic study. Leuven: University of Leuven PhD Dissertation.Search in Google Scholar
Friedrich, Johannes. 1960. Hethitisches Elementarbuch 1. Teil: Kurzgefaßte Grammatik, 2nd edn. Heidelberg: Winter.Search in Google Scholar
Garrett, Andrew J. 1996. Wackernagel’s Law and unaccusativity in Hittite. In Aaron L. Halpern & Arnold M. Zwicky (eds.), Approaching second: Second position critics and related phenomena, 83–133. Stanford: CSLI Publications.Search in Google Scholar
Geniušienė, Emma Š. 1987. The typology of reflexives. Berlin & New York: De Gruyter.10.1515/9783110859119Search in Google Scholar
Giacalone Ramat, Anna & Andrea Sansò. 2011. From passive to impersonal. A case study from Italian and its implications. In Andrej Malchukov & Anna Siewierska (eds.), Impersonal constructions: A cross-linguistic perspective, 189–228. Amsterdam & Philadelphia: John Benjamins.10.1075/slcs.124.07ramSearch in Google Scholar
Gildea, Spike & Fernando Zuñiga. 2016. Referential hierarchies: A new look at some historical and typological patterns. Linguistics 54. 483–529. https://doi.org/10.1515/ling-2016-0007.Search in Google Scholar
Giomi, Riccardo & Guglielmo, Inglese. Forthcoming. Vagueness and ambiguity of voice markers: synchrony and diachrony. In Ilaria, Fiorentini & Chiara, Zanchi (eds.), Vagueness, ambiguity and their relationship. Theory, methods, discourse strategies. Amsterdam & Philadelphia: John Benjamins.Search in Google Scholar
Godman, Irene. 1993. A sketch grammar of Rimanggudinhma. A language of the Princess Charlotte Bay region of Cape York Peninsula. Brisbane: University of Queensland BA Hons thesis.Search in Google Scholar
Grestenberger, Laura. 2016. Reconstructing Proto-Indo-European deponents. Indo-European Linguistics 4(1). 98–149. https://doi.org/10.1163/22125892-00401001.Search in Google Scholar
Grossman, Eitan. 2021. Transitivity, diachrony, and language contact. In Silvia Luraghi & Elisa Roma (eds.), Valency over time. Diachronic perspectives on valency patterns and valency orientation, 13–29. Berlin & Boston: De Gruyter Mouton.10.1515/9783110755657-002Search in Google Scholar
Haspelmath, Martin. 1990. The grammaticization of passive morphology. Studies in Language 14(1). 25–72. https://doi.org/10.1075/sl.14.1.03has.Search in Google Scholar
Haspelmath, Martin. 1993. More on the typology of the inchoative/causative verb alternations. In Bernard Comrie & Maria Polinsky (eds.), Causatives and transitivity, 87–120. Amsterdam & Philadelphia: John Benjamins.10.1075/slcs.23.05hasSearch in Google Scholar
Haspelmath, Martin. 2003. The geometry of grammatical meaning: Semantic maps and crosslinguistic comparison. In Michael Tomasello (ed.), The new psychology of language: Cognitive and functional approaches to language structure, 211–242. Mahwah, NJ: Erlbaum.Search in Google Scholar
Haspelmath, Martin & Thomas Müller-Bardey. 2004. Valency change. In Geert E. Booij, Christian Lehmann & Joachim Mugdan (eds.), Morphology: An international handbook on inflection and word formation, vol. 2, 1130–1145. Berlin & New York: De Gruyter.10.1515/9783110172782.2.14.1130Search in Google Scholar
Haviland, John B. 1979. Guugu Yimidhirr. In Robert M. W. Dixon & Barry J. Blake (eds.), Handbook of Australian languages, vol. 1, 26–180. Oxford: Oxford University Press.10.1075/z.hal1.06havSearch in Google Scholar
Hoffner, Harry A. & Harold Craig Melchert. 2008. A grammar of the Hittite language. Winona Lake (Indiana): Eisenbrauns.10.5325/j.ctv240djsfSearch in Google Scholar
Holvoet, Axel. 2020. The middle voice in Baltic. Amsterdam & Philadelphia: John Benjamins.10.1075/vargreb.5Search in Google Scholar
Hopper, Paul J. & Elizabeth Closs Traugott. 2003. Grammaticalization. Cambridge: Cambridge University Press.10.1017/CBO9781139165525Search in Google Scholar
Inglese, Guglielmo. 2017. A synchronic and diachronic typology of Hittite reciprocal constructions. Studies in Language 41(4). 956–1006. https://doi.org/10.1075/sl.17019.ing.Search in Google Scholar
Inglese, Guglielmo. 2020. The Hittite middle voice: Synchrony, diachrony, typology. Leiden: Brill.10.1163/9789004432307Search in Google Scholar
Inglese, Guglielmo. 2022. Towards a typology of middle voice systems. Linguistic Typology 26(3). 489–531. https://doi.org/10.1515/lingty-2020-0131.Search in Google Scholar
Inglese, Guglielmo. 2023. The rise of middle voice systems: A study in diachronic typology. Diachronica 40(2). 195–237. https://doi.org/10.1075/dia.20058.ing.Search in Google Scholar
Inglese, Guglielmo & Anne Wolfsgruber. 2021. The rise of non-oppositional middle verbs: The case of Old French. Paper presented at the workshop Cognitive mechanisms driving language change, 54th SLE Meeting, digital meeting 30 August–3 September 2021.Search in Google Scholar
Janic, Katarzyna. 2016. L’antipassif dans les langues accusatives. Bruxelles: Peter Lang.10.3726/978-3-0352-6598-9Search in Google Scholar
Jasanoff, Jay H. 2003. Hittite and the Indo-European verb. Oxford: Oxford University Press.10.1093/acprof:oso/9780199249053.001.0001Search in Google Scholar
Kaufmann, Ingrid. 2007. Middle voice. Lingua 117(10). 1677–1714. https://doi.org/10.1016/j.lingua.2006.10.001.Search in Google Scholar
Kemmer, Suzanne. 1993. The middle voice. Amsterdam & Philadelphia: John Benjamins.10.1075/tsl.23Search in Google Scholar
Klaiman, Miriam H. 1991. Grammatical voice. Cambridge: Cambridge University Press.Search in Google Scholar
Kulikov, Leonid. 2010. Voice typology. In Jae Jung Song (ed.), The Oxford handbook of linguistic typology, 368–398. Oxford: Oxford University Press.10.1093/oxfordhb/9780199281251.013.0019Search in Google Scholar
Kulikov, Leonid. 2013. Middle and reflexive. In Silvia Luraghi & Claudia Parodi (eds.), The Bloomsbury Companion to syntax, 261–280. London: Bloomsbury.10.5040/9781472542090.ch-015Search in Google Scholar
Lazzeroni, Romano. 2004. Inaccusatività indoeuropea e alternanza causativa vedica. Archivio Glottologico Italiano 89(2). 139–164.Search in Google Scholar
Luraghi, Silvia. 2010. Transitivity, intransitivity and diathesis in Hittite. Indo-European Linguistics and Classical Philology 16(2). 133–154.Search in Google Scholar
Luraghi, Silvia. 2012. Basic valency orientation and the middle voice in Hittite. Studies in Language 36(1). 1–32. https://doi.org/10.1075/sl.36.1.01lur.Search in Google Scholar
Luraghi, Silvia. 2019. Basic valency orientation, the anticausative alternation, and voice in PIE. In Melanie Malzahn (ed.), Proceedings of the 15. Fachtagung der Indogermanischen Gesellschaft. Wien (Austria), 13–16 September 2016, 259–274. Wiesbaden: Reichert.Search in Google Scholar
Malchukov, Andrej. 2008. Split intransitives, experiencer objects, and ‘transimpersonal’ constructions: (Re-)establishing the connections. In Mark Donohue & Søren Wichmann (eds.), The typology of semantic alignment, 76–100. Oxford: Oxford University Press.10.1093/acprof:oso/9780199238385.003.0003Search in Google Scholar
Martínez Rojas, Lucía, Veróniqua Orqueda, Francisca Toro Varela & Berta González Saavedra. 2021. Desarrollo diacrónico de las funciones de se y sibi del latín arcaico al latín clásico. Boletín de Filología 56(2). 483–517. https://doi.org/10.4067/s0718-93032021000200483.Search in Google Scholar
McGregor, William. 1999. The medio-active construction in Nyulnyulan languages. Studies in Language 23. 531–567. https://doi.org/10.1075/sl.23.3.04mcg.Search in Google Scholar
McGregor, William. 2010. Optional ergative case marking systems in a typological-semiotic perspective. Lingua 120. 1610–1636. https://doi.org/10.1016/j.lingua.2009.05.010.Search in Google Scholar
Meier-Brügger, Michael. 2010. Indogermanische Sprachwissenschaft. Berlin & Boston: De Gruyter Mouton.10.1515/9783110251449Search in Google Scholar
Merlan, Francesca. 1985. Split intransitivity: Functional oppositions in intransitive inflection. In Johanna Nichols & Anthony C. Woodbury (eds.), Grammar inside and outside the clause, 324–362. Cambridge: Cambridge University Press.Search in Google Scholar
Mushin, Ilana & Jane Simpson. 2008. Free to bound to free? Interactions between pragmatics and syntax in the development of Australian pronominal systems. Language 84. 566–596. https://doi.org/10.1353/lan.0.0048.Search in Google Scholar
Neu, Erich. 1968a. Das hethitische Mediopassiv und seine indogermanischen Grundlagen. Wiesbaden: Harrassowitz.Search in Google Scholar
Neu, Erich. 1968b. Interpretation der hethitischen mediopassiven Verbalformen. Wiesbaden: Harrassowitz.Search in Google Scholar
Nichols, Johanna, David A. Peterson & Jonathan Barnes. 2004. Transitivizing and detransitivizing languages. Linguistic Typology 8(2). 149–211. https://doi.org/10.1515/lity.2004.005.Search in Google Scholar
Post, Mark & Yankee Modi. 2022. Subject autonomy marking in Macro-Tani and the typology of middle voice. Linguistics 60(1). 215–238. https://doi.org/10.1515/ling-2020-0218.Search in Google Scholar
Rigsby, Bruce. 1997. Structural parallelism and convergence in the Princess Charlotte Bay languages. In Patrick McConvell & Nicholas Evans (eds.), Archaeology and linguistics. Aboriginal Australia in global perspective, 169–178. Melbourne: Oxford University Press.Search in Google Scholar
Sansò, Andrea. 2018. Explaining the diversity of antipassives: Formal grammar versus (diachronic) typology. Language and Linguistics Compass 12(6). e12277. https://doi.org/10.1111/lnc3.12277.Search in Google Scholar
Siewierska, Anna. 2004. Person. Cambridge: Cambridge University Press.10.1017/CBO9780511812729Search in Google Scholar
Silverstein, Michael. 1976. Hierarchy of features and ergativity. In Robert M. W. Dixon (ed.), Grammatical categories in Australian languages, 112–171. Canberra: AIAS.Search in Google Scholar
Sommer, Bruce. 1998. Umbuygamu. Townsville: Ethnografix Australia.Search in Google Scholar
Sommer, Bruce. 1999. Lamalama. Townsville: Ethnografix Australia.Search in Google Scholar
Tubino Blanco, Mercedes. 2020. Causative-inchoative in morphology. In Oxford research encyclopedia of linguistics. Oxford: Oxford University Press.10.1093/acrefore/9780199384655.013.595Search in Google Scholar
Van Valin, Robert & David Wilkins. 1996. The case for ‘Effector’: Case roles, agents and agency revisited. In Masayoshi Shibatani & Sandra A. Thompson (eds.), Grammatical constructions: Their form and meaning, 289–322. Oxford: Clarendon Press.10.1093/oso/9780198235392.003.0011Search in Google Scholar
Verstraete, Jean-Christophe. 2010. Animacy and information structure in the system of ergative marking in Umpithamu. Lingua 120. 1637–1651. https://doi.org/10.1016/j.lingua.2009.05.011.Search in Google Scholar
Verstraete, Jean-Christophe. 2011a. Impersonal constructions in Umpithamu and the Lamalamic languages. In Andrej Malchukov & Anna Siewierska (eds.), Impersonal constructions: A cross-linguistic perspective, 601–618. Amsterdam & Philadelphia: John Benjamins.10.1075/slcs.124.21verSearch in Google Scholar
Verstraete, Jean-Christophe. 2011b. Experienced action constructions in Umpithamu: Involuntary experience, from bodily processes to externally instigated actions. Cognitive Linguistics 22. 275–302. https://doi.org/10.1515/cogl.2011.011.Search in Google Scholar
Verstraete, Jean-Christophe. 2012. Contact-induced restructuring of pronominal morphosyntax in Umpithamu (Cape York Peninsula, Australia). Diachronica 29. 326–358. https://doi.org/10.1075/dia.29.3.03ver.Search in Google Scholar
Verstraete, Jean-Christophe. 2018. ‘Perhaps’ in Cape York Peninsula. Ignoratives and verbs of visual perception in epistemic marking. In Daniel Van Olmen, Tanja Mortelmans & Frank Brisard (eds.), Aspects of linguistic variation, 247–267. Berlin & Boston: De Gruyter Mouton.10.1515/9783110607963-010Search in Google Scholar
Verstraete, Jean-Christophe. 2020. A dictionary of Umpithamu, with notes on Middle Paman. Canberra: Aboriginal Studies Press.Search in Google Scholar
Walsh, Michael. 1987. The impersonal verb construction. In Ross Steel & Terry Threadgold (eds.), Language topics. Essays in honour of Michael Halliday, 425–438. Amsterdam & Philadelphia: John Benjamins.10.1075/z.lt2.40walSearch in Google Scholar
Wolfsgruber, Anne. 2017. Medieval Romance se. Its grammaticalization and syntactic status. Salzburg: University of Salzburg PhD Dissertation.Search in Google Scholar
Zúñiga, Fernando & Seppo Kittilä. 2019. Grammatical voice. Cambridge: Cambridge University Press.10.1017/9781316671399Search in Google Scholar
© 2023 the author(s), published by De Gruyter, Berlin/Boston
This work is licensed under the Creative Commons Attribution 4.0 International License.
Articles in the same Issue
- Frontmatter
- Introduction: towards a diachronic typology of the middle voice
- From (semi-)oppositional to non-oppositional middles: the case of Spanish reír(se)
- The Turkic middle voice system: deponency and paradigm reorganization
- Middle voice in Bantu: in- and detransitivizing morphology in Kagulu
- From verbalizer to middle marker: the diachrony of middle voice in Malayic
- Evidence against unidirectionality in the emergence of middle voice systems
Articles in the same Issue
- Frontmatter
- Introduction: towards a diachronic typology of the middle voice
- From (semi-)oppositional to non-oppositional middles: the case of Spanish reír(se)
- The Turkic middle voice system: deponency and paradigm reorganization
- Middle voice in Bantu: in- and detransitivizing morphology in Kagulu
- From verbalizer to middle marker: the diachrony of middle voice in Malayic
- Evidence against unidirectionality in the emergence of middle voice systems