On the persistence of SVO: the case of Modern Eastern Armenian

Pollet Samvelian; Pegah Faghiri; Victoria Khurshudyan

doi:10.1515/ling-2021-0013

Article Open Access

On the persistence of SVO: the case of Modern Eastern Armenian

Pollet Samvelian , Pegah Faghiri and Victoria Khurshudyan

Published/Copyright: March 13, 2023

Published by

Become an author with De Gruyter Brill

Submit Manuscript Author Information Explore this Subject

From the journal Linguistics Volume 61 Issue 3

Abstract

This study addresses the issue of unmarked word order in Modern Eastern Armenian (MEA), typologically considered a flexible (S)OV language due to its being strongly left-branching as well as the syntactic properties of its VP (focus, bare objects, low adverbs). However, Armenian grammars generally consider (S)VO to be the canonical order. We have conducted a corpus study and two sentence production experiments to tackle this controversy. These studies show that the placement of direct objects (DOs) is mainly triggered by definiteness. While definite DOs are overwhelmingly postverbal, indefinite DOs display a strong preference for preverbal placement. This implies a “typological discrepancy”: although MEA is a strongly left-branching language, the unmarked placement of definite DOs is postverbal. We account for this “discrepancy” based on areal, historical and cognitive factors. Contact with OV languages has resulted in a consistent shift from right to left-branching in Armenian, whereas word order at the clausal level has resisted the shift because MEA makes optimal use of each order in accordance with their cognitive advantages. The evolution of word order in MEA is an illustration of the universal crosslinguistic bias toward SVO.

Keywords: Armenian; syntax; typology; word-order variation

1 Introduction

The canonical or unmarked word order at the clausal level in Modern Eastern Armenian^[1] (MEA hereafter) is a matter of controversy. While many typological and theoretical syntactic studies group Armenian with (S)OV (flexible) languages, others characterize it as unspecified between (S)OV and (S)VO. More interestingly, Armenian grammars and handbooks generally consider (S)VO to be the canonical word order in MEA. The goal of this paper is twofold: (i) Identifying the parameters that determine the choice between VO and OV using quantitative methods and arguing that each ordering can thus be considered “canonical” depending on the criteria considered; (ii) explaining the current word order preferences based on areal diffusion, historical background and universal cognitive parameters.

In languages with flexible word order, one of the possible orders is generally considered to be “canonical”, “basic” or “unmarked”, while the others are said to be “marked”. When the designation of the basic order in a given language is not a matter of controversy, the criteria for its identification need not be given further thought. The problem arises, however, in languages for which various word orders have been proposed to be basic, which is the case of MEA.

In Section 2, we will present two main sets of arguments that have been evoked in favor of MEA being (S)OV. Typological studies have generally focused on the branching direction of various constituents in MEA, which is a strongly left-branching language, hence OV (Section 2.1). Studies on the syntax of MEA (mainly generativist) have put forward syntactic properties of the VP that are assumed to be characteristic of OV languages, for instance, the position of the inherent focus or bare/indefinite objects (Section 2.2).

On the other hand, as we will see in Section 3, the claim that the unmarked position of the direct object is postverbal in MEA is very common in Armenian grammars and descriptive studies. This is mainly because they take into account another criterion, namely the default position of definite DOs, which they claim to be postverbal. This stance could be justified in light of studies that have argued that the identification of the unmarked word order should take into account the position of canonical or prototypical DOs, that is, definite DOs of event-denoting verbs.

Whatever the definition of the canonical word order is, as we will discuss in detail in Section 3, frequency remains a key concept for its identification. However, up to now the designation of a given order as the unmarked or canonical one in MEA has been either almost exclusively based on the perceived frequency, which could be erroneous, or on some other aspects of Armenian syntax. Nevertheless, the variation between OV and VO is nontrivial in MEA as it can easily be observed in available usage data. For instance, in the Universal Dependencies (UD) treebank of MEA (Yavrumyan et al. 2017),^[2] a sample of 570 sentences, (S)OV and (S)VO occur at a comparable rate: 39 and 46% respectively (see also Faghiri and Samvelian 2020b).^[3]

In Section 4, we will present a quantitative investigation of word order in MEA that we carried out to identify which is the unmarked order, VO or OV, as well as to determine the context(s) that favor each order. This investigation includes a corpus study (Section 4.1), and two sentence production experiments (Section 4.2). The findings of these studies show that: (i) (S)VO order cannot be characterized as a marked option in MEA; (ii) the placement of the DO is not random and is triggered by the (in)definiteness of the DO. In other words, MEA is not unspecified between OV and VO.

These observations imply a “typological discrepancy” in MEA, which is left-branching in almost all its constituents but resorts to (S)VO as its unmarked order for definite DOs at the clausal level. In Section 5, we will account for this discrepancy. To this end, we first consider areal diffusion, in particular contact with Russian, which seems to encourage the VO order in various OV languages of the area (Section 5.1). However, the areal factor cannot be considered the only reason for the high rate of postverbal DOs in MEA. We argue that despite the consistent shift from right-branching to left-branching between Classical Armenian (5th–10th centuries) and Modern Armenian, word order at the clausal level has resisted change and remained VO for definite DOs. Armenian, which is we hypothesize that the same parameters were also at work in the choice of order between OV and VO in Classical Armenian (Section 5.2). We provide a cognitive-based account to explain the persistence of VO (Section 5.3). In this respect, the situation of MEA supports the general directional bias towards VO order advocated in many studies in language typology and/or evolution.

2 Arguments for OV basic order in MEA

MEA can be considered a flexible language with respect to the ordering possibilities of the three major constituents S, O, and, V. This means that all permutations are grammatical, giving rise to six possible orders, SOV, SVO, VSO, VOS, OVS, and OSV. Two of the latter occur with a higher frequency, SOV and SVO. Given that S occurs in the initial position in both cases, the controversy involves the relative order of O and V: Is MEA an OV or VO language?

With very few exceptions (Dryer 2013; Dum-Tragut 2009: 555), typological and descriptive studies have generally grouped Armenian with SOV languages (Der-Houssikian 1978: 227–228; Dryer 1998: 286, 310; Dum-Tragut 2002; Hawkins 1979: 625, 1983: 286; Kozintseva 1995: 8; Minassian 1980: 263, among others).^[4] The same holds for theoretical syntactic studies of Armenian, mainly within the generative framework, which consider MEA a head-final language (Giorgi and Haroutyunian 2016: 190; Hodgson 2013: 6; Kahnemuyipour and Megerdoomian 2011, 2017: 81; Tamrazian 1991: 101, 1994: 7): SOV is thus considered to be the “basic” order, while SVO is “derived” via movement (extraposition) to the postverbal position. In other words, for these studies, MEA (and Armenian in general) belongs to the languages in which word order is influenced by information structure, the postverbal positioning of the constituents corresponding to various information packaging possibilities.

2.1 MEA and word order correlations

With respect to word order correlations (or harmonies) highlighted in the typological literature (Dryer 1992; Greenberg 1966), Armenian (Eastern and Western) is highly consistent in most of the operator/operand features associated with typologically SVO languages: it is quite consistently left-branching (Donabedian 2010) and classified as SOV/Po/GN/AN by Hawkins (1983: 286). Relying on comparable pairs of correlations, Der-Houssikian (1978: 227–228) claims, “The evidence shows that whereas neither the SOV nor the SVO holds consistently, Armenian is nevertheless predominantly an SOV language.”^[5] As illustrated in Table 1, adapted from Dum-Tragut (2002: 180) with some additions, the only correlation pair that does not hold in MEA is (N, Rel), since the relative clause follows the head noun.

Table 1:

Correlation pairs in Classical, Middle and Modern Armenian (adapted from Dum-Tragut [2002: 180]).

Pattern	Classical Armenian (V–XI cen.)	Middle Armenian (XII–XVI cen.)	Modern Armenian (XVII cen.–up to present)
Basic word order	(S)VO, “rather free”	(S)VO, (S)OV	SOV, “non-rigid”
Adpositions	PREP	PREP	POST
Adjective and noun	Preferred ADJ + N	Predominant ADJ + N	Rigid ADJ + N
Genitive and noun	N + GEN	Predominant N + GEN	Rigid GEN + N
Relative clause and noun	N + REL	N + REL	N + REL
Comparison constructions	ADJ + marker + standard	ADJ + marker + standard	Standard + marker + ADJ
Auxiliary and verb	Predominant V + AUX	V + AUX	V + AUX
Predicate and copula	Predominant COP + Pred	(COP + Pred/ Pred + COP)	Predicate + Copula

2.2 The VP level

Apart from branching direction, a bundle of syntactic properties at the VP level has been put forward in studies of the syntax of various languages within the generative framework to distinguish between basic OV and VO orders (see Skopeteas and Fanselow 2010, for instance). Some of these properties have also been evoked in the above-mentioned studies of MEA. A survey of the VP in MEA reveals that it mainly qualifies as an OV language with respect to said properties. The latter will be presented after a brief outline of nominal determination.

Nouns in MEA can be definite, indefinite or non-determined/quantified, i.e., bare. The latter are unspecified for number and have a nonspecific reading, existential (non-specific indefinite), as in (1a), or generic (kind-level), as in (1b).^[6]

(1)

Armen-ə	xnʒor	gn-ec’
Armen-DEF	apple	buy-AOR.3SG
‘Armen bought an apple/apples.’

Armen-ə	xnʒor	sir-um	ē
Armen-DEF	apple	like-IPFV	be.AUX.3SG
‘Armen likes apples.’

Indefiniteness in MEA can also be expressed by the indefinite article mi ‘a(n)’, which is prepositive to the head noun and can be separated from it by adnominal attributes. Definiteness is expressed by a suffixed definite article –ǝ or –n. MEA displays differential object marking: Non-human DOs (indefinite or definite) are always in the nominative case, (2) and (3), which is the default case, i.e., zero marking. Definite human DOs occur in the dative case (4a) and indefinite human DOs occur in the dative case when specific (4b) or in the nominative case otherwise (4c).

(2)

Armen-ə	mi	(has-ac)	xnʒor	gn-ec’
Armen-DEF	a	(ripen-RES)	apple	buy-AOR.3SG
‘Armen bought a (ripe) apple.’

(3)

Armen-ə	tes-av	tun-ə
Armen-DEF	see-AOR.3SG	house-DEF
‘Armen saw the house.’

(4)

Armen-ə	tes-av	aʁǯk-a-n
Armen-DEF	see-AOR.3SG	girl-DAT-DEF
‘Armen saw the girl.’

Armen-ə	mi	aʁǯk-a	tes-av
Armen-DEF	a	girl-DAT	see-AOR.3SG
‘Armen saw a certain girl.’

Armen-ə	(mi)	aʁǯik	tes-av
Armen-DEF	(a)	girl	see-AOR.3SG
‘Armen saw a girl.’

2.2.1 The preverbal position of the focus

The fact that the primary focus in MEA is considered preverbal has served as a crucial piece of evidence in favor of its classification as an (S)OV language. The correlation between the preverbal focus phenomenon and (S)OV order has been well documented in the literature since the early 70s (cf. Dezső 1974; Herring and Paolillo 1995; Kim 1988, for a general discussion, and Arregui-Urbina 2002 for Basque, Erguvanlı 1984 for Turkish, among many others), where it has been claimed that (S)OV languages are more likely to encode focus preverbally than (S)VO languages.

Comrie (1988: 268) claims that “the basic rule in Modern Eastern Armenian is that the focus must immediately precede the finite verb form.”^[7] This claim has been maintained in subsequent works (Dum-Tragut 2009: 630;^[8] Giorgi and Haroutyunian 2016; Kahnemuyipour and Megerdoomian 2011, 2017; Tamrazian 1994).

The main arguments in favor of the preverbal position of the focus are:

The position of (WH-) interrogative pronouns

Interrogative pronouns always occur in the preverbal domain, most commonly immediately before the finite verb, (5), but they can also be separated from the verb by other constituents (6).^[9]

(5)

O?v ^[10]	er	Marina	Zelyonkina-n,	inč’pe?s
Who	be.COP.PST.3SG	Marina	Zelyonkina-DEF,	how
ev	inč’u?	haytn-v-ec’	Hayastan-um
and	why	appear-MED-AOR.3SG	Armenia-LOC
ovk‘-e?r	ev	inč’u?	span-ec’in	nran
who-PL	and	why	kill-AOR.3PL	she.DAT
‘Who was Marina Zelyonkina, how and why she appeared in Armenia, who killed her and how?’

(6)

Nerkayac’-v-ac	t’eknacu-ner-ic’	o?v	p’oʁ-i	karik’
represent-MED-RES	candidate-PL-ABL	who	money-GEN	need
un-i
have-3SG
‘Who needs money among the represented candidates?’

The position of the auxiliary verb

In its affirmative form, the auxiliary verb in MEA is an enclitic that carries tense and agreement features and attaches to various elements.^[11] Tamrazian (1994: 12) provides the following principle for the placement of the auxiliary in MEA, “Being a head final language, the auxiliary verb in this language appears as the rightmost element in declarative sentences,” or, in other words, the auxiliary follows the lexical verb.^[12] Note, however, that, as mentioned by Tamrazian herself, this is not the only possibility for the placement of the auxiliary. The latter is placed after a variety of elements depending on various factors:^[13]

In focus-neutral (or unmarked) affirmative sentences, the auxiliary:
1. appears on the lexical verb, in intransitive constructions, (7).
2. attaches to the preverbal element of the “compound verb” or the “complex predicate”, be it a noun, an adverb or any element forming a “semantic predicate” with the verb, (8).
3. is placed after the direct object, in case the latter is bare or indefinite, (9).
4. attaches to the verb, in transitive constructions with a definite direct object, (10).^[14]

(7)

Pieŕ-ə	k’ayl-um	ēr	hyurasrah-um
Peter-DEF	walk-IPFV	be.AUX.PST.3SG	sitting.room-LOC
‘Peter was walking in the sitting room.’

(8)

Durs	em	gal-is
out	be.AUX.1SG	come-IPFV
‘I am going out.’

(9)

Xot	ē	ut-um	kov-i	pes
grass	be.AUX.3SG	eat-IPFV	cow-GEN	like
‘He eats the grass like a cow.’

(10)

Davit’-ə	čaš-n	ut-um	ē
David-DEF	meal-DEF	eat-IPFV	be.AUX.3SG
yev	voskor-ə	šprt-um	pat-i	koʁm-n…
and	bone-DEF	throw-IPFV	wall-GEN	side-DEF
‘David is eating his meal and throwing the bone towards the wall…’

Crucially, in marked (non-focus-neutral) sentences, the auxiliary attaches to the focused constituent of the sentence (Hodgson 2013: 27; Tamrazian 1994: 14, among many others), as illustrated by the placement of the auxiliary after the subject in (11).

(11)

Na	ē	gal-is
he	be.AUX.3SG	come-IPFV
‘It is him who comes.’

On the basis of similar facts, Comrie (1988) proposes that MEA has a syntactic position for focus in immediate preverbal position. He nevertheless acknowledges that focus can also be postverbal in MEA. Hodgson (2013: 27) argues that preverbal and postverbal focus “involve different syntactic operations with preverbal focus taking place without movement of the focused element, while postverbal focus involves movement to the specifier of a functional projection.” She also notes that the preverbal position is preferred for contrastive focus on the object, while the postverbal position is rather “used for narrow information focus on the object” (2013: 27). We will address the issue of postverbal constituents later.

2.2.2 The position of bare objects

Although all studies admit the possibility for DOs to occur postverbally, bare DOs seem to enjoy less freedom and display a strong preference for the immediately preverbal position (Badikyan 1976: 156–157; Dum-Tragut 2009: 562, among others).

(12)

Apa	ev	paterazmec’-in,	mard	spanec’-in
then	and	wage.war-AOR.3PL	man	kill-AOR.3PL
‘And then [they] waged war and killed humans.’

(13)

Erek	garnanayin	k’aʁc’r	ptuʁ	ker-ank’
yesterday	springtime	sweet	fruit	eat-AOR.1PL
‘Yesterday, we ate springtime sweet fruits.’

However, unlike WH- words, bare objects may also occur in postverbal position, as illustrated by the following example from the EANC:

(14)

Mek’ena-n	et	ber-el-u	npatak-ov	gn-ec’	šaržič’
car-DEF	back	bring-INF-GEN	goal-INST	buy-AOR.3SG	motor
‘For getting back the car [he] bought a motor.’

2.2.3 Word order in VP idioms

The word order in VP idioms can be a cue to the basic word order in flexible languages (Skopeteas and Fanselow 2010: 1378–1379). Since idiomatic sequences are generally semantically opaque (i.e., non-compositional), their components cannot bear specific discourse properties (e.g., topicality, givenness, etc.), which implies that contextually triggered word order variations cannot apply to idiomatic sequences. In other words, word order in such sequences is syntactically determined and corresponds to the basic word order in a given language.

Idiomatic VPs display OV order in MEA, as illustrated in (15a)–(16a). Postposition of the components of the idiom either yields odd, if not ungrammatical sequences (15b), or leads to a compositional (i.e., non-idiomatic) reading, (16b).

(15)

ʒeŕ aŕnel ‘to make fun of’ (lit. ‘hand take’)

Na	mez	ʒeŕ	ē	aŕn-um
(s)he	we.DAT	hand	be.AUX.3SG	take-IPFV
‘(S)he is making fun of us’

* Na	mez	aŕn-um	ē	ʒeŕ
(s)he	us	take-IPFV	be.AUX.3SG	hand
(intended) ‘(S)he is making fun of us.’

(16)

Durs gnal ‘go out’ (lit. ‘outside go’)

Na	durs	gn-ac’
(s)he	outside	go-AOR.3SG
‘(S)he went out.’

Na	gn-ac’	durs
he	go-AOR.3SG	out
‘(S)he went outside.’

2.2.4 The position of low adverbs

Low adverbs are adverbs whose scope is the VP, and therefore are placed ‘low’ in the syntactic structure.^[15] In OV languages, e.g., Georgian (Skopeteas and Fanselow 2010: 1377–1378) or Persian (Kahnemuyipour 2009), they generally occur in the vicinity of the verb, and either immediately precede it or are placed at the left edge of the focus phrase (or the vP). Their appearance in other positions yields a marked interpretation. In MEA, these adverbs have also been assumed to mark the edge of VP (Tamrazian 1994: 172), as illustrated in (17).

(17)

Nstec’-ink’	ŕetine	t’et’ev	navak-ǝ ev arag
sit.down-AOR.1PL	rubber	light	boat-DEF and quickly
heŕac’-ank’	ap’-ic’
leave-AOR.1PL	bank-ABL
‘[We] took the light rubber boat and quickly left the bank.’

To sum up, the branching direction of various constituents designates MEA as a left-branching, hence (S)OV language. Likewise, the set of syntactic properties that has been assumed to be a hallmark of (S)OV languages (i.e., the preverbal position of the focus, bare and indefinite DOs, and low adverbs, as well as the ordering preferences within VP idioms) also hold in MEA. But, as we will see in the next section, the sheer evidence provided by the convergence of these criteria is in sharp contrast to the widespread claim in Armenian grammars that the canonical position for DOs in MEA is postverbal.

3 The VO/OV controversy and methodological issues

In contrast to the studies mentioned in the previous sections, some other studies claim that MEA has no dominant word order and thus is unspecified for OV or VO (see for instance Dryer 2013; Dum-Tragut 2009: 555). More interestingly, Armenian grammars and handbooks consider SVO to be the canonical or “recommended” word order in Armenian (Abrahamyan et al. 1975; Arakelyan 1958; Badikyan 1976; Papoyan and Badikyan 2003), indicating that preverbal placement is usually reserved for either focus marking or indefinite objects.

Although all studies acknowledge the possibility for all NP DOs to occur pre- or postverbally, some studies have claimed that the placement is not random and is triggered by the (in)definiteness of the DO. Badikyan (1976) is one of the first and most detailed studies on the matter and the first attempt to provide corpus-based generalizations. Based on eight selected corpora of various “genres” or types (legal, fiction, scientific…) and of about 185 words each, Badikyan (1976: 154) claims that “in the common order, the direct object is placed after the predicate in all types of texts.” In two cases, however, the preverbal position of the DO is preferred or even obligatory:

The speaker wants to draw the attention of their interlocutor to the direct object and to emphasize it. In this case, the DO bears what Badikyan calls the “logical accent” of the sentence.
The DO is indefinite.

The first case can be considered as a marked order, in which the DO is the identificational focus. Badikyan also mentions that the auxiliary is placed after the DO and thus precedes the lexical verb, which is another cue for considering this order marked. In other words, the preverbal position of the DO in this case is a deliberate choice on behalf of the speaker to focus the DO.

The second case, on the other hand, should not be considered a ‘variation’ or a ‘choice’, as explicitly stated by Badikyan, who claims that the OV order for definite DOs is “unfamiliar” or “strange” in MEA. To sum up, according to Badikyan:

Indefinite DOs must occur preverbally;
All other DOs occur postverbally in the basic or unmarked word order. Preverbal placement implies a focal status for definite DOs.

If these generalizations are shown to be empirically sound, then OV and VO orders do not have the same distribution. The relevance of definiteness to the pre- or postverbal position of the DO has also been noted by Dum-Tragut (2009: 562) “Both in written and spoken MEA, native speakers, with overwhelming frequency, prefer the word order SVO for definite direct objects and SOV for indefinite or non-specific direct objects.” Dum-Tragut also notes that even indefinite DOs tend to occur postverbally provided there are several DOs (i.e., coordination of indefinite DOs or a sequence of multiple indefinite DOs), which could be considered a case of “heavy NP shift” (p. 563).

Faghiri and Samvelian’s (2020b) corpus study using the Eastern Armenian National Corpus (EANC) also confirms Badikyan’s and Dum-Tragut’s claim regarding the preferred post-verbal position of definite DOs. Faghiri and Samvelian investigate a sample of 570 transitive sentences containing the verbs sirel ‘to like’ and spanel ‘to kill’, 338 of which are construed with a periphrastic verbal form and 231 with a simple verb.^[16] They observe that: 1) SVO is by far the most frequent order (78.9%) regardless of the verbal form: 87.5% (vs. 12.5% for SOV) in sentences with simple verbal forms and 75.5% (vs. 24.5% for SOV) in those with the periphrastic form; 2) SVO order maps into all-focus sentences in a fair share of cases; 3) For definite DOs, SVO is clearly the most frequent order (79.1 vs. 20.9% SOV). Based on these results, Faghiri and Samvelian (2020b) conclude that the SVO order is too frequent to be considered a “marked” order and join Badikyan (1976) in claiming that the unmarked word order in sentences with definite DOs is SVO.

To summarize, the claim that the canonical position of the DO is postverbal in MEA is mainly based on the behavior of definite DOs, which is not irrelevant given that it is in line with the definition of the canonical order according to many typological studies, for instance, Siewierska (1988: 8), according to whom the basic order is “[…] the order that occurs in stylistically neutral, independent, indicative clauses with full noun phrase (NP) participants, where the subject is definite, agentive and human, the object is a definite semantic patient, and the verb represents an action, not a state or an event.”

A major challenge for the definition of the canonical order is identifying what constitutes a “stylistically neutral” sentence among semantically equivalent sentence pairs, which is not straightforward. Siewierska (1988) notes that many authors tend to equate the basic order with the dominant order, which can be identified using frequency, a much more accessible criterion. Indeed, as also noted by Dryer (1995), in most cases the order characterized as unmarked is also the one that is perceived as more frequent than the others, so the most effective way for defining the unmarked word order in a given language is to rely on frequency. Note that this view goes back to Greenberg (1966: 67).

However, some studies have claimed that the unmarked word order in a given language may not match the most frequent one. Discussing these cases, Dryer (1995) as well as Siewierska (1988), point out two main problems: (i) in many languages, prototypical transitive sentences (with full NPs) are infrequent and (ii) word order frequencies may vary in the same language from one type of corpus (text or discourse type) to another. Considering that frequency plays no role in the grammar of particular languages, Dryer (1995: 105–106) suggests defining the pragmatically unmarked word order in terms of the ‘elsewhere condition’: a given word order can be considered pragmatically unmarked if “it is the default word order if there are concise ways to characterize the situations in which other word orders are used, with the pragmatically unmarked word order being most easily characterized as the order that is used elsewhere.” According to this view, the notion of canonical word order is (theoretically) independent of relative frequency.

Lambrecht (1996: 17) provides quite a similar definition of pragmatically unmarked word order: “given a pair of allosentences,^[17] one member is pragmatically unmarked if it serves two discourse functions, while the other member is positively specified for some pragmatic feature, the unmarked member is neutral with respect to this feature.” However, contrary to Dryer (1995), Lambrecht claims that a greater overall frequency of occurrence is a corollary of the greater overall frequency of the pragmatically unmarked word order. In other words, the least (pragmatically) marked word order will also be the most frequent one, by virtue of having more distributional freedom.

Following Lambrecht (1996), and in line with almost all other recent studies of word order variation, we adopt a frequency-based approach in this paper. This means that we consider frequency to be a key notion in the identification of the canonical word order. However, our purpose is not to identify the most frequent order (in a given sample of sentences) as the basic word order. Instead, frequencies of competing orders (OV and VO here) are compared in a controlled manner in order to identify the contexts that favor each one.

4 Our empirical investigation

One of the goals of our paper is to provide a reliable picture of word order distribution in MEA, as well as different factors that may be involved in the choice of OV versus VO order, allowing for the identification of the unmarked word order in this language in an empirically well-founded manner. In particular, our aim is to establish the distribution of word order in all-focus transitive sentences for different DO types, namely definite, indefinite and bare DOs. We also aim to determine whether functional factors such as animacy and length, known to trigger ordering preferences crosslinguistically, are also relevant for MEA.

To this end, we first carried out a preliminary corpus-based study in order to get a better picture, reflecting the usage of MEA speakers, of the distribution of word order for different DO types. In corpus data, it is costly to identify all-focus sentences and/or to include the information structure of sentences as a factor in the quantitative study. We, therefore, carried out constrained experimental sentence production studies to collect out-of-the-blue sentences. In addition, the experimental approach allows us to test the hypotheses drawn from the corpus data in a controlled manner. Indeed, corpora do not always provide us with balanced datasets, with enough tokens for each contrast to draw reliable conclusions. Furthermore, attested sentences display idiosyncrasies, not only at the discursive level, that may explain a particular order. For example, in our data, on the one hand, we have sentences where the preverbal (definite) object and the verb form an idiom, on the other hand, we have sentences with long and complex postverbal (non-definite) DOs.

The auxiliary could constitute a challenge for a quantitative study of word order in MEA. In particular, addressing its mobility (see Section 2.2.1) is not straightforward in sentence production experiments. Therefore, it is reasonable to check whether it is necessary for the issue at stake to include the (presence and the mobility of the) auxiliary as a factor in our experiments. Another goal of our corpus study is to determine the extent to which the presence of the auxiliary affects the choice of OV versus VO. If corpus data do not suggest that the distribution of word order is substantially different in sentences with simple versus periphrastic verbal forms, we can safely limit our experimental study to sentences with simple verbs and assume that the results generalize.

4.1 Corpus data: a spoken sample of transitive sentences

As mentioned in the previous section, in their corpus investigation based on a sample of 570 transitive sentences from the EANC, Faghiri and Samvelian (2020b) report that for definite DOs, SVO is by far the most frequent order (79.1 vs. 20.9% SOV). They nevertheless note that their study does not allow for any conclusive generalization on the placement of indefinite and bare DOs, because the latter are quite scarce in their sample (respectively, 17 and 23 vs. 340 definite DOs). This is due to the fact that the periphrastic verbal forms in the sample are limited to cases where the auxiliary immediately follows the lexical verb, i.e., the verb and the auxiliary are adjacent, which has a significant consequence on the composition of the sample by disfavoring bare and indefinite DOs. Recall that in focus-neutral sentences, the auxiliary is placed after indefinite/bare objects, and not after the lexical verb (as it is when the DO is definite). While we can safely assume that for definite DOs, word order variation is not likely to be affected by the presence of the auxiliary, for nondefinite DOs, we cannot rely on Faghiri and Samvelian (2020b).

In order to determine whether the presence of the auxiliary affects the choice of OV versus VO, we studied word order variation in a small-scale spoken corpus, in which we identified and annotated all transitive sentences. We used a task-oriented spoken corpus including recordings of picture-based storytelling from 10 native speakers of MEA (see Khurshudyan 2006). The corpus includes 20 narratives, 2 for each participant, a first version in which the participants are asked to tell a story described by a series of pictures, and a second one, taking place after a lapse of 6–8 hours, in which they are asked to narrate the same story from memory.

We identified 231 finite declarative transitive sentences with SOV or SVO orders that we annotated for the presence and position of the auxiliary and object type (see Table 2).

Table 2:

Word order and verbal form by DO type.

Word order	SOV			SVO			All
DO type	Periphrastic	Simple	All	Periphrastic	Simple	All	All
Bare	14	15	29	2	5	7	36
Indef	7	3	10	11	9	20	30
Def	11	9	20	12	18	30	50
Infinitives	–	–	–	69	39	108	108
Finite clauses	–	–	–	2	5	7	7
Total	32	27	59	96	76	172	231

In Table 3, we focus on sentences that include an auxiliary, where it is placed mainly on the verb (in 68.0% of sentences) or on the object (in 14.1% of sentences). “Other” refers to sentences with a negative auxiliary or sentences in which the auxiliary appears on other constituents, e.g., adverbs.

Table 3:

Word order in periphrastic forms by DO type.

Auxiliary on	Verb			Object			Others			All
Word order	SOV	SVO	All	SOV	SVO	All	SOV	SVO	All	All
Nominal DOs	11	22	33	16	0	16	5	3	8	57
Bare	3	2	5	11	0	11	0	0	0	16
Indef	1	10	11	4	0	4	2	1	3	18
Def	7	10	17	1	0	1	3	2	5	23
Infinitives	0	52	52	0	2	2	0	15	15	69
Finite clauses	0	2	2	0	0	0	0	0	0	2
Total	11	76	87	16	2	18	6	18	23	128

As expected, finite clausal complements (seven cases in our sample) always appear in SVO order, and, in the two sentences including an auxiliary, the latter is placed on the verb.

In the case of definite DOs, in sentences with an auxiliary, the rate of SVO is 52.2%. We looked more closely into SOV sentences and found that in four sentences, there is an idiomatic relation between the verb and the object, which explains the SOV order, and in one sentence, the verb is focused. In sentences with simple verbs, the rate of SVO is 66.7%. Nevertheless, the difference between the distributions of word order in sentences with periphrastic versus simple verb form (11:12 vs. 9:18) is not statistically significant (χ ² (1) = 0.567, p-value = 0.452).

In the case of indefinite objects, in sentences including an auxiliary (placed on the verb in 61.1% of sentences), the rate of SOV is 38.9%. In sentences without an auxiliary, the rate of SOV is 25%. Likewise, the difference between the two distributions (7:11 vs. 3:9) is not statistically significant (χ ² (1) = 0.156, p-value = 0.693/Fisher’s exact test p-value = 0.694). Note that, as we will see below, the surprisingly/unexpectedly low rate of SOV sentences in this sample is plausibly due to an effect of length or heaviness (see Section 5.3 for more details on this effect), that is, caused by the inherent variation in the length of the DOs (roughly, the number of words that constitute the object NP) that appear in those sentences. In the case of bare objects, in sentences with an auxiliary (placed on the object in 64.7% of sentences) the rate of SOV is 87.5%, and in sentences with a simple verb, SOV occurs at the rate of 75.0%. Here again, the difference (14:2 vs. 15:5) is not significant (χ ² (1) = 0.268, p-value = 0.605/Fisher’s exact test p-value = 0.426). Hence, with respect to our working hypothesis, it seems safe to conclude that there is no substantial difference between sentences with simple and periphrastic verbs, given that, if the presence of the auxiliary affected word order for bare and indefinite objects, it would only reinforce the SOV order.

We took a closer look at sentences with bare or indefinite DOs with respect to the length of the DO. We observe that in 7 out of 10 SOV sentences the indefinite DO is simple (i.e., composed of the head noun and the indefinite article only), while in 16 out of 20 SVO sentences the indefinite DO is not simple (i.e., the head noun is modified by an attributive modifier), as in (18) and (19). Accordingly, if we consider sentences with a simple indefinite DO, SOV occurs at the rate of 63.6%.

(18)

(…)	na	tes-unm	e	mi	geʁec’ik	avtomek’ena
	he	see-IPFV	be.AUX.3SG	a	nice	car
‘(…) he sees a nice car.’

(19)

(…)	ev	na	gn-ec’	mi	šat
	and	he	buy-AOR.3SG	a	very
geʁec’ik	suvenir	ir	knoǯ	hamar
nice	souvenir	he.GEN	woman.GEN	for
‘(…) and he bought a very nice souvenir for his wife…’

In the case of bare DOs, the DO is simple (i.e., one word only) in 21 out of 29 SOV sentences, while in 5 out of 7 SVO sentences it is not simple (i.e., the noun is modified by an attributive modifier), as in (20) and (21). Accordingly, if we consider the 23 sentences with a simple bare DO, SOV occurs at the rate of 91.3%.

(20)

(…)	ev	poxziǯman	vorpes	tarberak
	and	compromise.GEN	as	option
aŕn-um	ē	pok’r	xaʁalik’	avto
buy-IPFV	be.AUX.3SG	small	toy	car
‘…and as an option of compromise [he] buys a small toy car…’

(21)

I verǯo	tʁamard-ə	knoǯ	hamar
Finally	man-DEF	woman.GEN	for
gn-ec’	p’ok’rik	xaʁalik’	avtomek’ena…
buy-AOR.3SG	small	toy	car
‘Finally, the man bought a small toy car for the woman…’

The findings of our corpus study present a serious challenge for any assumption of the markedness of the (S)VO order in MEA. Even though there may be a corpus-related bias, the word order distribution in our sample still undermines such an assumption. The rate of SVO order is too high to qualify as a marked option. In particular, the fact that definite DOs are overwhelmingly postverbal is in line with Badikyan’s (1976) and Dum-Tragut’s (2009) claim, as well as with Faghiri and Samvelian’s (2020) observation that the placement of the DO is not random and is triggered by the (in)definiteness of the DO, with indefinite/bare DOs being preferably preverbal, while definite DOs tend to occur post-verbally.

Furthermore, in our corpus data, we observe that long and/or heavy bare and indefinite DOs appear frequently in the SVO order, while simple (that is short or minimal) bare and indefinite DOs are more likely to appear in the SOV order. Nevertheless, our corpus data do not allow us to establish a direct relationship between SVO and length/heaviness of the DO. Hence, in addition to the hypothesis regarding the preferred position of these DOs, we also need to verify the effect of phrasal length on their postposition.

To confirm (or reject) these hypotheses in a controlled manner and to verify that the observed tendencies truly hold in all-focus transitive sentences in MEA, we carried out sentence production experiments. In these experiments, we controlled for the verbal lemma and tense. As for the auxiliary, given that our corpus data do not show any discrepancies that need to be further investigated, our stimuli only include simple verbal forms.

4.2 Experimental data

Our corpus data provided a general idea of the distribution of word order in MEA, where SVO appears at a high rate, especially with definite DOs, while bare and indefinite DOs seem to favor the SOV order when they are not long/heavy. In this section, we tested these hypotheses in two-sentence production experiments. Furthermore, corpus samples naturally contain all types of (declarative) sentences, including all-focus (unmarked) sentences, but also sentences that are pragmatically marked. In our two experiments, we used sentence production protocols designed to collect out-of-the-blue sentences.

Another relevant factor that remains totally uninvestigated in previous studies, as well as in our corpus data, is the effect of animacy. The latter is shown to have an effect on word order preferences (according to the well-known universal ‘animate-before-inanimate’ preference or the ‘animate first’ principle, see e.g., Tomlin 1986). Importantly, studies on word order evolution suggest that SVO is more efficient in cases where both the subject and the object have animate referents and, consequently, are likely to be confused (see Gibson et al. 2013). In other words, animate DOs may favor the SVO order. Given that corpus data is generally not well-balanced with respect to animacy, there might be an undiscovered bias, and it is therefore important to independently check whether there is any interference.

Our first experiment (Experiment 1) uses a cued sentence recall protocol and was carried out in Yerevan. Our second experiment (Experiment 2) uses a constrained sentence production protocol and was carried out via a web-based questionnaire. In Experiment 1 we compare definite and indefinite DOs and in Experiment 2 we compare indefinite and bare DOs.

4.2.1 Experiment 1

The goal of this experiment is to compare definite and indefinite DOs and, at the same time, to study the effect of the animacy of the DO. Animacy is a factor known to have an effect on word order variation crosslinguistically, and importantly, studies on word order change and evolution suggest that SVO is likely to be favored over SOV when the subject and the object both have animate referents (see Gibson et al. 2013). In other words, it is possible that animate DOs favor the SVO order, yet we did not investigate this correlation in our corpus study.

4.2.1.1 Procedure

We used a cued sentence recall protocol, presented as a memory task. Our protocol follows a well-known task used in various experimental studies on word order preferences (e.g., Stallings and MacDonald 2011). Participants were asked to make a sentence with given constituents that appear on a computer screen and to produce it from memory with the help of a cue from the sentence, after a short lapse of time during which, for distraction, they were presented with a basic arithmetic operation. Figure 1 presents the sequencing of different screens as they were presented to participants.

Figure 1:

Sequence of screens as presented to participants in Experiment 1.

Participants were recorded individually. In each session, the experiment was run on one laptop (using the Ibex Farm platform) and recorded via a headset connected to another laptop. The instructions were given verbally and a practice item was presented by the instructor. The participant first filled out a profiling form with information such as their age, language(s), and region. Then they began the trial with a “warm up” session including 3 practice (filler-type) items. Afterward, the participant was asked to start when ready. Each trial included a total of 65 items (20 target items and 45 fillers).

4.2.1.2 Design and material

We manipulated two factors, following a 2 × 2 factorial design: (i) the definiteness of the DO on two levels: indefinite mi astʁ ‘a star’, versus definite astʁ-ə ‘the star’ (see Section 2.1), and (ii) the animacy of the DO on two levels: astʁ ‘star’, versus kin ‘woman’. Table 4 summarizes our four experimental conditions.

Table 4:

Experimental design (Experiment 1).

Condition		Subject	Verb	DO
Cond 1	Indef/Anim	Arpi	saw	a woman
Cond 2	Indef/Inan	Arpi	saw	a star
Cond 3	Def/Anim	Arpi	saw	the woman
Cond 4	Def/Inan	Arpi	saw	the star

We prepared four lists of 20 transitive sentences according to our four experimental conditions, as in (22).

(22)

Arp’i-n	astʁ-ə	tes-av
Arpi-DEF	star-DEF	see-AOR.3SG
‘Arpi saw the star.’

Arp’i-n	mi	astʁ	tes-av
Arpi-DEF	a	star	see-AOR.3SG
‘Arpi saw a star.’

Arp’i-n	knoǯ-ə	tes-av
Arpi-DEF	woman.DAT-DEF	see-AOR.3SG
‘Arpi saw the woman.’

Arp’i-n	mi	kin	tes-av
Arpi-NOM	a	woman	see-AOR.3SG
‘Arpi saw a woman.’

We used the Armenian equivalent of the following verbs (or pair of verbs), each repeated four times: see, find, remember, kill (with animate DOs)/break, beat (with animate DOs)/throw. In 10 items the verb was in the aorist tense (e.g., tesav ‘saw’) and in the other 10 items it was in the conditional/future tense (e.g., ktesni ‘shall see’).

Note that among the three constituents, one is always a proper noun in the nominative case, so the only way to produce a grammatical sentence is to choose the latter as the subject of the sentence. Recall that proper nouns, when used as DOs, must be in the dative case given that MEA displays differential object marking (see Section 2.2).

As illustrated in Figure 2, the subject appeared in the middle of the screen in all the items. The verb appeared at the top (and the DO at the bottom) in 10 items and the reverse in the other 10 items. The cue to produce the sentence was always the subject, hence all the sentences produced by the speakers were expected to begin with the subject. The order of the verb and the DO constituted our dependent variable, with two possible values: (S)VO and (S)OV.

Figure 2:

Screenshot of an item from Experiment 1.

4.2.1.3 Participants

38 native speakers of MEA (living in Armenia) were recorded – including 27 female participants and 11 male participants, with a mean age of 30 years (ranging from 18 to 66 years).^[18] The data were collected during a research stay in Yerevan in October 2019. The recording took place at the Institute of Philology and Ethnography of the Armenian National Academy of Sciences.

4.2.1.4 Results and discussion

Out of a total of 760 (potential) sentences, we gathered a sample of 732 produced sentences – discarding incomplete and/or erroneous sentences, as well as 4 isolated occurrences displaying an unexpected order (OVS, OSV). Table 5 provides the distribution of orders in this sample. We observe that definite DOs strongly favor the SVO order, while indefinite DOs show a clear preference for the SOV order. As for animacy, we only see a small difference between animate and inanimate DOs, with animate DOs slightly favoring SVO. As we can see in Table 6, this difference is larger for definite DOs than for indefinite DOs.

Table 5:

Word order by definiteness and animacy (Experiment 1).

Order	Definite DOs			Indefinite DO			All
Order	Anim	Inan	All	Anim	Inan	All	All
SOV	26	37	63	114	113	227	290
SVO	155	146	301	66	75	141	442
Total	181	183	364	180	188	368	732

Table 6:

%SVO by definiteness × animacy (Experiment 1).

Condition	Definite	Indefinite	All
Animate	85.6%	36.7%	61.2%
Inanimate	79.8%	39.9%	59.6%
All	82.7%	38.3%	60.4%

The statistical analyses of the results, using a logistic mixed model – with sum-coded fixed effects and SVO coded as success, including items and participants as random effects – shows a general preference for SVO (Est. = 0.74, p < 0.01), and a strong main effect of definiteness, definite DOs favoring SVO (Est. = 1.42, p < 0.001).^[19] The results of this experiment hence confirm the findings of our corpus study showing that in all-focus transitive sentences, definite DOs strongly favor the SVO order, while indefinite DOs are more likely to occur in the SOV order.

With respect to animacy, the results of this experiment are also very interesting: while the main effect of animacy is not significant (p = 0.23), there is a marginally significant interaction (Est. = 0.20, p = 0.055) indicating that animacy has an effect on the order with definite DOs: animate definite DOs favor SVO more than inanimate definite DOs do. In other words, there exists a tendency to put animate DOs in the postverbal domain, i.e., the direction predicted by Gibson et al. (2013), only with definite DOs, which are more likely to be confused with the subject.

4.2.2 Experiment 2

In this experiment, we focused on the difference between indefinite and bare DOs, as well as on the effect of length. In corpus data, we observe that long/heavy indefinite and bare DOs appear more frequently in the SVO order. The aim of this experiment is, first, to determine the preferred order for simple bare and simple indefinite DOs, and, second, to verify if longer (nondefinite) DOs favor SVO. To this end, we compared simple DOs with “minimally” long DOs, i.e., DOs modified by a single attributive modifier. In their experimental study on the effect of phrasal length on word order in Persian, where definiteness also plays a role on the linear position of the DO, Faghiri and Samvelian (2020a: 18–22)^[20] showed that when an attributive modifier is added to bare or indefinite DOs they are significantly more likely to behave like definite DOs.^[21] Hence, if the results of our experiment show that minimally longer DOs are more likely to appear in the SVO order, we have enough evidence to confirm that phrasal length favors SVO.

4.2.2.1 Procedure

Experiment 2 was carried out on the Ibex Farm platform (Drummond 2013) as a web-based self-administrated questionnaire.^[22] In this experiment, we used an online constrained sentence production task, following the protocol proposed by Faghiri (2016). This protocol proposes a task that is fairly similar to the cued sentence recall production (see Faghiri and Samvelian 2020a: 17–18).

In this task, participants were presented with a preamble and were asked to complete the latter with phrases that appeared on the screen in order to make a sentence. They simultaneously saw a preamble, an incomplete sentence represented by three blank boxes, and a vertical list of phrases (presented in a counterbalanced order, as was the case in Experiment 1). A screenshot of an item (with English translations) is given in Figure 2. The list of options contained four elements, that is, one element more than the number of blanks, in order to prevent participants from guessing the purpose of the experiment and to push them to concentrate on the content of each sentence, so that they produce reasonably natural sentences.

As in Experiment 1, the questionnaire included a profiling form, as well as some practice items, after which the participants were asked to start when ready. Likewise, each trial included a total of 65 items (20 target items and 45 fillers).

4.2.2.2 Design and material

Experiment 2 also follows a 2 × 2 design, with the two following variables: 1) bareness, contrasting bare, astʁ ‘star’, with indefinite, mi astʁ ‘a star’, DOs, and 2) length, contrasting simple/short (one word) DOs with DOs modified by an attributive modifier, such as, p’aylun astʁ ‘bright star’ as in (23).

(23)

Yerekoyan	Arp’i-n	(mi)	(p’aylun)	astʁ	tes-av
evening	Arpi-DEF	a	bright	star	see-AOR.3SG
‘In the evening, Arpi saw (a) (bright) star.’

As illustrated in Figure 3, in this experiment, an adverbial scene-setting phrase constituted the preamble, and the list of four phrases included the subject, two formally identical choices for the DO (DO1 and DO2), and the verb.

Figure 3:

Screenshot of an item from Experiment 2.

We prepared four lists of 20 transitive sentences according to our four experimental conditions, as in Table 7. The stimuli are entirely based on those used in Experiment 1. Apart from the adverbial preamble, all elements are identical. The two choices of DO match the DOs in animate and inanimate conditions. The same set of verbs was used, but, in items where we had different verbs in animate and inanimate conditions in Experiment 1 (kill/break, beat/throw), we kept the one that required an inanimate DO. The verb appeared at the top of the list and the subject at the bottom in 10 items and the reverse in the other 10 items. The DOs appeared in the middle, with the inanimate choice appearing above the animate choice, as in Figure 3.

Table 7:

Experimental design (Experiment 2).

Condition		Preamble	Subject	Verb	DO1	DO2
Cond 1	Bare/Short	In the evening	Arpi	saw	star	woman
Cond 2	Bare/Long	In the evening	Arpi	saw	bright star	old woman
Cond 3	Indef/Short	In the evening	Arpi	saw	a star	a woman
Cond 4	Indef/Long	In the evening	Arpi	saw	a bright star	an old woman

4.2.2.3 Participants

28 native speakers of MEA from Armenia filled out our online questionnaire. We had 19 female and 9 male participants, from 27 to 70 years old.

4.2.2.4 Results and discussion

Out of a total of 560 (potential) sentences, we collected a sample of 525 produced sentences – discarding erroneous sentences as well as sentences with infrequent orders. Table 8 shows the distribution of word orders in this sample. To analyze the effect of our two experimental factors, we put aside the 32 OVS sentences. It is worth noting that neither DO type shows a bias toward this order in this sample.

Table 8:

Word order by bareness and modification/length (Experiment 2).

Order	Bare DOs			Indefinite DO			Total
Order	Simple	Modified	Total	Simple	Modified	Total	Total
SOV	98	89	187	99	84	183	370
SVO	27	33	60	24	39	63	123
OVS	10	7	17	9	6	15	32
Total	135	129	264	132	129	261	525

We observe a strong general preference for SOV order with a trivial difference between the two types (see Table 9).^[23] There is, however, a more important contrast triggered by modification. As expected, the rate of SOV is lower with modified/longer DOs.

Table 9:

%SOV by bareness × length (Experiment 2).

Condition	Bare	Indefinite	All
Simple	78.4%	80.5%	79.4%
Modified	73%	68.3%	70.6%
All	75.7%	74.4%	75.1%

The statistical analyses of the results, using a logistic mixed model with sum-coded fixed effects and SOV coded as success and including items and participants as random effects,^[24] shows a significant general preference for the SOV order (Est. = 1.56, p < 0.01) and the main effect of length/modification (Est. = −0.40, p < 0.01), but no main effect of the DO type (p = 0.55) nor any interaction between the two factors (p = 0.17).

These results confirm the preference of nondefinite DOs for the SOV order, also observed in our corpus study, indicating furthermore that indefinite and bare DOs are equally likely to favor the SOV order. Interestingly, our results also confirm the observation that non-simple DOs are more likely to appear in the SVO order. In other words, long non-definite DOs tend to behave more like definite DOs.

5 Explaining the persistence of SVO in MEA

Our quantitative investigation shows that:

The rate of SVO order is too high, in both corpus and experimental data, to qualify as a marked option.
Our sentence production experiments confirm that SVO order can be produced out of the blue, corresponding to an all-focus utterance.
The placement of the DO is not random and is triggered by the (in)definiteness of the DO, with indefinite (including bare) DOs being preferably preverbal, while definite DOs are postverbal.
Other factors such as phrasal length and animacy are also relevant. We observe that longer indefinite and bare DOs favor postverbal position. As for animacy, we observe that animate definite DOs appear in SVO order even more frequently than inanimate definite DOs.

Thus, OV is in the minority not only from a strictly distributional and quantitative point of view (especially if clausal and infinitive complements are also taken into account), but also from the point of view of prototypicality, if we consider definite/specific DOs to be prototypical since the default position of definite/specific DOs is postverbal. Consequently, SVO can be considered to be the unmarked word order for definite DOs in MEA. This implies a “typological discrepancy”: while MEA is consistently left branching (or head final) in all its non-clausal constituents, DOs (except for short non-definite ones) tend to be postverbal. The question is why and how this discrepancy occurred and/or survived in MEA.

5.1 The role of areal diffusion

One hypothesis is that the high rate of postverbal (definite) DOs in MEA is an areal diffusion phenomenon, due to contact with Russian, a VO language.^[25] Under this putative view, in the same way that Classical Armenian shifted from (S)VO flexible to (S)OV flexible, probably due to long-standing contact with OV languages, i.e., Iranian languages and Turkish, MEA is now shifting from (S)OV to (S)VO. This assumption is appealing, especially if we consider word order preferences in other OV languages of the area that have been in a similar contact situation with Russian and for which studies on word order variation are available.

The case of Modern Georgian is particularly interesting in this respect. Word order preferences and change have been thoroughly studied in Georgian (see Harris 2000, for an overview). While both OV and VO orders may occur out of the blue, Asatiani and Skopeteas (2012: 128) note that “corpus studies show a clear preference for OV orders in written discourse (Apridonidze 1986: 136–143; Vogt 1971: 222) and same frequency for OV and VO in texts that originate in oral tradition (fairytales).” The situation is thus comparable to MEA, all DOs taken together (i.e., regardless of definiteness): both MEA and Georgian are left-branching languages in most of their constituents but display a very high frequency of VO. The two languages are also comparable in their historical evolution. While Old Georgian (5th–11th centuries) is said to be SVO flexible, Modern Georgian is described as SOV.^[26] Studies on word order variation in Georgian have mainly focused on information structure, especially on the issue of post-verbal focus (Skopeteas and Fanselow 2010), but have not looked into definiteness.^[27] Nonetheless, it is noteworthy that two genetically different languages of the area have followed the same path of evolution.

The case of Azerbaijani, a Turkic language generally considered to be rigid OV, spoken in the Republic of Azerbaijan, in the South Caucasus, but also in greater numbers in Iranian Azerbaijan, is also particularly relevant. Stilo (2014) addresses the occurrence of both pre- and postverbal DOs in the colloquial Azerbaijani of Tabriz (Iran) and Masally (Republic of Azerbaijan) and also in some other non-Turkic languages of the Araxes-Iran Linguistic Area (AILA,^[28] see Stilo 2018), taking into account the factor of definiteness. Stilo’s investigation is based on corpora containing on average ca. 500–600 clauses per (variety of) language. His study reveals a significant difference in the rate of postverbal definite DOs in the two varieties of Azerbaijani. While 100% of indefinite DOs are preverbal in both varieties, 20% of definite DOs occur after the verb (VO) in the variety spoken in Massally, against only 4.6% in the variety spoken in Tabriz.

In order to check whether this difference is coincidental or whether other factors such as areality (or diffusion) are at play, Stilo extends his investigation to non-Turkic languages of the AILA. This investigation shows that, regardless of their genetic affiliation, languages spoken in the north^[29] AILA have an overall significantly higher rate of postverbal DOs than those spoken in the south.^[30] Stilo concludes that there is an isogloss line in the area that separates the AILA languages into a southern zone and a northern zone with regard to postverbal DOs. In other words, the postverbal placement of DOs, and more specifically definite DOs, is an areal phenomenon.

Finally, let us consider Caucasian Urum, an endangered variety of Anatolian Turkish spoken in Georgia. Schröter (2019) examines the effect of focus on word order preferences in this language, in comparison to its substrate language, Turkish, and its contact language, Russian. The results show that both OV and VO can occur in similar discourse contexts in Urum and that focus can occur both pre- and postverbally. This flexibility of Urum is claimed to be a result of its contact with Russian.

To resume, various studies of the languages of the South Caucasus have highlighted the fact that postverbal DOs are a common feature of OV languages of the area. Consequently, it does not seem unreasonable to assume that the high rate of postverbal definite DOs in MEA is also a manifestation of this areal convergence. However, in MEA, contrary to other OV languages of the area, VO is not a marginal option, but the default order for definite DOs. Therefore, areal diffusion, though effective in the reinforcement of the VO order, cannot be the only explanation for its prevalence.

5.2 Word order distribution in Classical Armenian

Our hypothesis is that OV has never been the unmarked order for definite DOs in MEA. While a consistent shift took place from right-branching to left-branching in almost all constituents, historically, word order remained almost/substantially unchanged at the clausal level. Crucially, we will argue that the parameters favoring VO or OV order are analogous in Classical Armenian and MEA.

As previously mentioned, Classical Armenian is generally said to be SVO flexible, or, more precisely, right-branching (cf. Table 1). Note, however, that, like MEA, Classical Armenian displays a high degree of flexibility at the clausal level, so determining which order (VO or OV) was dominant is not a trivial task. Hayrapetyan (1981) investigates this variation in a number of texts from the fifth century, showing that the preference for each ordering may vary considerably depending on 1) whether the text is translated (from the Koine Greek); 2) the period. In 791 sentences from the New Testament (translated from Koine Greek), 619 sentences are in the VO order, and only 172 display the OV order, while in a sample of 732 sentences from texts originally written in Classical Armenian,^[31] OV is slightly prevalent, 390 OV versus 312 VO. Hayrapetyan (1981) reports that the balance changes in favor of VO in the post-Mesropian era (i.e., beginning in the second half of the fifth century):^[32] in a sample of 566 sentences taken from post-Mesropian texts,^[33] VO prevails with 338 occurrences, while 228 sentences display the OV order.

Interestingly, Hayrapetyan (2005) reports comparable rates for OV and VO orders in Classical Armenian and MEA (Table 10). If we abstract from the subject, VO represents 58% and OV 42% in Classical Armenian, which is quite similar to MEA rates, 56% VO and 44% OV.

Table 10:

Comparison of word order variation in Classical Armenian and in MEA (adapted from Hayrapetyan [2005: 32]).

	SVO	SOV	VSO	VOS	OVS	OSV
Early Classical Armenian	35	31	18	5	6	5
Modern (Eastern) Armenian	51	37	2	3	3	4

Hayrapetyan (1981) investigates syntactic, discourse and functional parameters favoring pre- and postverbal orders, such as part of speech (e.g., interrogative pronouns occur preverbally), length (long DOs favor VO order), verb type (e.g., causative and resultative verbs favor VO order), the “construction” of the sentence (e.g., the non-realization or postverbal placement of the subject favors the preverbal placement of the DO), information status (focused DOs are preverbal), etc.

Hayrapetyan (1981) does not mention definiteness per se as a parameter at play in the placement of the DO. In order to check whether definiteness was a relevant parameter in the placement of DOs in Classical Armenian, as it is in MEA, we conducted a small corpus-based investigation of about 50 finite sentences in texts dating from the fifth to seventh centuries.^[34] In this sample, 26% of sentences show OV order, and interestingly, nondefinite DOs appear in the preverbal domain twice as often as definite DOs: out of 29 sentences with a definite DO, only 6 show OV order (21%), while out of 13 sentences with a nondefinite DO, 6 show OV order (46%). The main observation is that non-definite DOs already display a tendency to occur in preverbal position, while definite DOs are placed after the verb.

In sum, despite its high degree of flexibility with respect to the positioning of the DO, Classical Armenian nevertheless displayed a preference for placing definite DOs after the verb. Interestingly, parameters such as length and focus, which Hayrapetyan (1981) mentioned as playing a role in the placement of the DO, are also at work in MEA.

5.3 Cognitive pressures behind the SVO–SOV split in MEA

The persistence of (S)VO from Classical to Modern Eastern Armenian calls for an explanation. We account for this persistence in terms of a general crosslinguistic bias favoring SVO order highlighted in typological and historical studies. Based on an impressive body of works on word order change in various languages,^[35] studies of the dynamics of word order change converge on the view that languages do not change toward all orders equally often. More precisely, while languages drift away from SOV (but not toward it), they tend to drift toward SVO (Li 1977; Givón 1979; Gell-Mann and Ruhlen 2011, among others). This is not to say that the shift from SVO to SOV never happens, but if it does, it is the result of contact (i.e., external pressure),^[36] while change to SVO may occur naturally.

Various accounts have been proposed for this general bias in favor of SVO order. Studies in cognitive science and language processing show that from the point of view of communicative efficiency, that is, with respect to the effort imposed on working memory, SVO provides important advantages that can explain the direction of word order evolution from SOV to SVO (see Gibson et al. 2019: 8–10 for a review).

Our empirical study showed that although the default position of bare and indefinite DOs is preverbal, longer indefinite and bare DOs favor the postverbal position. This is not surprising given general crosslinguistic observations that show that word order variation tends to reduce parsing complexities and cost (or maximize parsing efficiency), by means of minimizing the length between the verb and its dependents, also known as dependency length minimization (e.g., Gibson 1998; Hawkins 1994; Temperley 2008).^[37] Consequently, under this view, generally speaking, a sentence is more efficient in SVO order than in SOV, everything else being equal (cf. Gibson et al. 2019: 8–10 for a review).^[38]

On the other hand, SOV also presents other important cognitive advantages, for instance from the point of view of information theory, by allowing maximization of the predictability of the head verb (Ferrer-i-Cancho 2015). Note that this can explain the initial preference for this basic order. With a simple (short) DO, the pressure for dependency-length minimization is smaller. With long DOs, however, as illustrated by the pair of sentences in Example (24) compared to Example (25), placing a long DO in the postverbal domain results in a less complex sentence, by greatly reducing the distance between the verb and its subject.^[39]

(24)

_s[x x] _o[x x x x x x] v

_s[x x] v _o[ x x x x x x]

(25)

_s[x x] _o[x x] v

_s[x x] v _o[ x x]

Importantly, as argued by Ferrer-i-Cancho (2014, 2015 and Ferrer-i-Cancho and Gomez-Rodrigez (2021), word order preferences in different languages could reflect a tradeoff between two main pressures: 1) Minimizing the distance between the verb and its dependents, in order to reduce the processing cost, favoring SVO and 2) Maximizing the predictability of the verb, favoring SOV. The evolution of word order in MEA can thus be explained by the interaction between these cognitive constraints.

Classical Armenian was already a highly flexible language with respect to word order at the clausal level, taking advantage of both pre- and postverbal placement of DOs for various purposes, depending on a set of syntactic and functional parameters. Longer DOs (DOs containing modifiers, but also clausal and infinitive complements) favored SVO over SOV as regards the arrangement of constituents. The same holds for definite DOs, which displayed a preference for postverbal position. Preverbal position, on the other hand, was dedicated to short nondefinite DOs.

The preverbal position of bare/indefinite DOs in MEA was probably reinforced due to contact with Iranian languages,^[40] which resulted in an abundant use of VP idioms or so-called ‘complex predicates’ in Armenian. Iranian languages generally speaking, and Persian in particular, are well known for abundantly resorting to noun-verb sequences that display the semantic properties of a lexeme while behaving syntactically as two distinct entities, comparable to a verb and its (bare) direct object. In Classical Armenian, ‘phraseological calques’ of these complex predicates, where Iranian material was replaced with Armenian counterparts, were used to coin new lexical units (see Bolognesi 1993: 264–266; Meyer 2017: 27–31; Schmidt 1983: 104, among others). It is reasonable to assume that these constructions reinforced the preverbal placement of bare/indefinite direct objects in Classical Armenian since there is no clear-cut distinction between complex predicates formed by a bare noun and a verb on the one hand and any syntactic sequence formed by a bare object and a verb on the other hand (Samvelian 2012).^[41]

The regularization of preverbal position for bare/indefinite DOs had its own cognitive advantages. The preverbal nominal element of a complex predicate contributes to a great extent to its argument structure, lexical conceptual meaning, and event type, thereby playing a crucial role in building the meaning of the complex predicate.^[42] Placing the noun before that verb not only accelerates the retrieval of the meaning of the predicate but also increases the predictability of the verb (cf. Ferrer-i-Cancho 2015). Given that the number of nouns that are likely to form a complex predicate is much greater than the number of verbs that are likely to appear in such constructions,^[43] having first access to the noun is a more efficient cue for predicting the verb than the reverse. The same reasoning holds mutatis mutandis for bare objects, which have been argued in several studies to be part of the predicate or the event described by the verb rather than being arguments of the latter.^[44] Like the nominal element of complex predicates, the bare object is an important cue for the interpretation of the whole predicate since it narrows or specifies the meaning of the verb, thereby accelerating access to its meaning. Therefore, its placement before the verb maximizes the predictability of the latter (Ferrer-i-Cancho 2015).

To summarize, Classical Armenian, being SVO flexible, could make use of both pre- and postverbal placement of the DO for various discourse purposes, also depending on functional parameters optimizing processing. Contact with Persian reinforced the use of the preverbal position for bare/indefinite DOs but did not result in the abandonment of the postverbal position. It strengthened the division of labor between the two domains, giving rise to what can, loosely speaking, be called a complementary distribution. The fact that both orders, OV and VO, are efficient in themselves and in interaction with each other explains the persistence of their coexistence from Classical to Modern Eastern Armenian.

6 Conclusion

In this study, we carried out a quantitative investigation (corpus-based and experimental) of word order in MEA to address the controversy regarding the unmarked word order in this language. Our results converged on a (loosely speaking) complementary distribution between the two orders, with definite DOs overwhelmingly occurring in the VO order, while indefinite/bare DOs are predominantly placed in the preverbal position. Consequently, if one opts for the definition of the unmarked word order as that containing a “definite semantic patient” (Siewierska 1988: 8), then (S)VO is the unmarked word order in MEA. We also highlighted the effect of length, which favors the postposing of the direct object, regardless of its definiteness.

Furthermore, we attempted to provide an explanation for the “typological discrepancy” of word order in MEA, a language that is consistently head-final (or left-branching) in almost all its constituents, except at the clausal level. We argued that while contact with OV languages has resulted in a consistent shift from right to left branching in Armenian, word order at the clausal level has resisted the shift. The reason for this seemingly unexpected or even paradoxical outcome was, we argued, that Armenian made optimal use of each order with respect to its cognitive advantages.

Our research, which is the first quantitative investigation on the matter, paves the road to new research issues to be addressed regarding word order in MEA.

One of the questions left to be addressed is whether the same distribution is observed in spoken spontaneous speech. Armenian linguists we consulted have the intuition that the (S)OV order is favored in spoken spontaneous speech and colloquial register. If this intuition were to be confirmed by quantitative studies, then it could be the case that the difference is based, among other things, on the way the speaker anchors (or does not anchor) themselves in their speech, in terms of the distinction elaborated by Benveniste (1966) between discours (discourse) and récit (narrative). The first type of speech, where the enunciator (speaker) “appears” in their speech, is characterized as “subjective” by Benveniste. The récit mode, on the other hand, is intended to be objective: the speaker “hides” behind their utterance, which is presented as self-evident and true, without any need to be asserted by a specific speaker. Both our corpus and experimental data belong to the récit type, which could have disfavored the (S)OV order. It would thus be interesting to investigate further the distribution of word order in spontaneous contexts to counterbalance the present study.

Another issue to be addressed is the position of the focus. Although focus is said to be preverbal in MEA, various studies (Comrie 1988; Hodgson 2013) nevertheless mention the existence of a postverbal focus. This is expected given that (S)VO is the unmarked order for definite DOs. Thorough investigation is needed in order to identify the parameters involved in the placement of the focus in MEA.

Supplementary Material

Three files can be found at the following Open Science Framework link: https://osf.io/vg2mq/?view_only=10dc8f10212b4b94ac71384bf9813bee.

Supplementary file 1. Corpus sample.

Supplementary file 2. Experimental stimuli and responses.

Supplementary file 3. Statistical analyses.

Corresponding author: Pollet Samvelian, ILPGA, Université Sorbonne Nouvelle, 8 avenue de Saint-Mandé, 75012 Paris, France, E-mail: pollet.samvelian@sorbonne-nouvelle.fr

Acknowledgments

We would like to express our deep gratitude to Tork Dalalyan and Nvard Vardanyan from the Institute of Archeology and Ethnography of the National Academy of Sciences of Armenia, without whose support and hospitality this research could not have been carried out. They offered us the opportunity to conduct our experiments during a one-week stay in October 2019 at the Academy by providing us with an office and by helping us recruit participants. We also thank all participants, undergraduate and graduate students, researchers and professors, who volunteered to take part in our experiments. Our special thanks to Jorj Abrahamian, whose contribution was invaluable to our online experiment. He helped us post our call for participants on social networks and recruited most of the participants for our second experiment. We are also thankful to two anonymous reviewers whose comments and suggestions greatly improved the paper. This work is partially supported by a public grant overseen by the French National Research Agency (ANR) as part of the program “Investissements d’Avenir” (reference: ANR-10-LABX-0083). It contributes to the IdEx Université de Paris (ANR-18-IDEX-0001). This work was completed while the second author held a postdoctoral position successively at Universität zu Köln and Université Sorbonne Nouvelle.

Author contributions: PS defined the research problem and took the lead in writing the manuscript. PF designed the quantitative studies. VK Provided expertise on Armenian grammar and data. PS and PF wrote the paper. VK provided feedback at all editorial stages. All authors took part in the final revision of the paper.

References

Abrahamyan, Sergey, Varag D. Arakelyan & Vagharshak Kosyan. 1975. Hajoch lezu: šarahjusuthjun [The Armenian Language: Syntax], vol. 2. Yerevan: Haykakan SSR GA Hratarakchutyun.Search in Google Scholar

Apridonidze, Shukia. 1986. Sit’q’vatganlageba axal kartulši [word order in Modern Georgian]. Tbilisi: Mecniereba.Search in Google Scholar

Arakelyan, Varag D. 1958. Hajereni šarahjusuthjun [Armenian Syntax], vol. 1. Yerevan: Haykakan SSR GA Hratarakchutyun.Search in Google Scholar

Arregui-Urbina, Karlos. 2002. Focus on Basque movements. Cambridge, MA: Massachusetts Institute of Technology dissertation.Search in Google Scholar

Asatiani, Rusudan & Stavros Skopeteas. 2012. The information structure of Georgian. In Manfred Krifka & Renate Musan (eds.), The expression of information structure, 127–158. Berlin & New York: De Gruyter Mouton.10.1515/9783110261608.127Search in Google Scholar

Aydemir, Yasemin. 2004. Are Turkish preverbal bare nouns syntactic arguments? Linguistic Inquiry 35(3). 465–474. https://doi.org/10.1162/0024389041402607.Search in Google Scholar

Badikyan, Khachik. 1976. Žamanakakich hajereni parz naχadasuthjan šaradasuthjunə [The word order in the simple sentences in Modern Armenian]. Yerevan: Yerevani hamalsarani hratarakchutyun.Search in Google Scholar

Benveniste, Emile. 1966. Problèmes de linguistique générale, vol. 1. Paris: Gallimard.Search in Google Scholar

Bolognesi, Giancarlo. 1993. Calques iraniens en arménien. In Bela Brogyanyi & Reiner Lipp (eds.), Comparative-historical linguistics: Indo-European and Finno-Ugric, 247–255. Amsterdam & Philadelphia: John Benjamins.10.1075/cilt.97.24bolSearch in Google Scholar

Cinque, Guglielmo. 1999. Adverbs and functional heads: A cross-linguistic perspective. Oxford: Oxford University Press.10.1093/oso/9780195115260.001.0001Search in Google Scholar

Comrie, Bernard. 1988. Topics, grammaticalized topics, and subjects. Berkeley Linguistics Society 14. 265–279. https://doi.org/10.3765/bls.v14i0.1798.Search in Google Scholar

Creissels, Denis. 2006. Syntaxe générale: Une introduction typologique. Paris: Hermès Sciences–Lavoisier.Search in Google Scholar

Dayal, Veneeta. 1999. Bare NP’s, reference to kinds, and incorporation. Proceedings of SALT 9. 35–51. https://doi.org/10.3765/salt.v9i0.2816.Search in Google Scholar

Der-Houssikian, Haig. 1978. The semantic significance of S, O, V sequences in Armenian. In Mohammad A. Jazayery, Edgar C. Polomé & Werner Winter (eds.), Linguistic and literary studies, vol. 2: Descriptive Linguistics, 227–236. Lisse: Peter de Ridder Press.10.1515/9783110800432.227Search in Google Scholar

Dezső, László. 1974. Topics in syntactic typology. In Milan Romportl, Vladimír Skalička, Jaroslav Popela & Bohumil Palek (eds.), Linguistica generalia, vol. 1: Studies in linguistic typology, 191–210. Prague: Charles University.Search in Google Scholar

Donabedian, Anaid. 2010. Nom nu et tropisme typologique : Le cas de l’arménien. In Franck Floricic (ed.), Essais de typologie et de linguistique générale: Mélanges offerts à Denis Creissels, 403–416. Lyon: ENS Editions.Search in Google Scholar

Drummond, Alex. 2013. Ibex farm (free hosting for Internet-based experiments), version 0.3. Available at: http://www.spellout.net/ibexfarm.Search in Google Scholar

Dryer, Matthew S. 1992. The Greenbergian word order correlations. Language 68(1). 81–138. https://doi.org/10.2307/416370.Search in Google Scholar

Dryer, Matthew S. 1995. Frequency and pragmatically unmarked word order. In Pamela Downing & Michael Noonan (eds.), Word order in discourse, 105–135. Amsterdam & Philadelphia: John Benjamins.10.1075/tsl.30.06drySearch in Google Scholar

Dryer, Matthew S. 1998. Aspects of word order in the languages of Europe. In Siewierska Anna (ed.), Constituent order in the languages of Europe, 283–319. Berlin & New York: Mouton de Gruyter.10.1515/9783110812206.283Search in Google Scholar

Dryer, Matthew S. 2013. Order of subject, object and verb. In Matthew S. Dryer & Martin Haspelmath (eds.), The world atlas of language structures online. Leipzig: Max Planck Institute for Evolutionary Anthropology. http://wals.info/chapter/81 (accessed 9 December 2020).Search in Google Scholar

Dum-Tragut, Jasmine. 2009. Armenian: Modern Eastern Armenian. Amsterdam & Philadelphia: John Benjamins.10.1075/loall.14Search in Google Scholar

Dum-Tragut, Jasmine. 2002. Word order correlations and word order change: An “applied-typological” study on literary Armenian variants. Munich: Lincom.Search in Google Scholar

Erguvanlı, Eser E. 1984. The function of word order in Turkish Grammar (University of California Publications in Linguistics 106). Berkeley & Los Angeles: University of California Press.Search in Google Scholar

Faghiri, Pegah. 2016. La variation de l’ordre des constituants dans le domaine préverbal en persan: Approche empirique. Paris: Université Sorbonne Nouvelle dissertation.Search in Google Scholar

Faghiri, Pegah, Pollet Samvelian & Barbara Hemforth. 2018. Is there a canonical order in Persian ditransitive constructions? Corpus-based and experimental studies. In Agnes Korn & Andrej Malchukov (eds.), Ditransitive constructions in a cross-linguistic perspective, 165–185. Wiesbaden: Reichert.Search in Google Scholar

Faghiri, Pegah & Pollet Samvelian. 2020a. Word order preferences and the effect of phrasal length in SOV languages: Evidence from sentence production in Persian. Glossa: A Journal of General Linguistics 5(1). 86. https://doi.org/10.5334/gjgl.1078.Search in Google Scholar

Faghiri, Pegah & Pollet Samvelian. 2020b. Modern Eastern Armenian: OV or VO. In Sho Akamine (ed.), Proceedings of the Thirty first Western Conference on Linguistics (WECOL 2019), 11–18. Fresno: Department of Linguistics California State University.Search in Google Scholar

Ferrer-i-Cancho, Ramon. 2014. Why might SOV be initially preferred and then lost or recovered? A theoretical framework. In Erica A. Cartmill, Sean Roberts, Heidi Lyn & Hannah Cornish (eds.), The evolution of language: Proceedings of the 10th international conference (EVOLANG10), 66–73. Vienna: Wiley.10.1142/9789814603638_0007Search in Google Scholar

Ferrer-i-Cancho, Ramon. 2015. The placement of the head that minimizes online memory: A complex systems approach. Language Dynamics and Change 5(1). 114–137. https://doi.org/10.1163/22105832-00501007.Search in Google Scholar

Ferrer-i-Cancho, Ramon & Carlos Gómez-Rodríguez. 2021. Anti-dependency distance minimization in short sequences. A graph theoretic approach. Journal of Quantitative Linguistics 28(1). 50–76. https://doi.org/10.1080/09296174.2019.1645547.Search in Google Scholar

Gell-Mann, Murray & Merritt Ruhlen. 2011. The origin and evolution of word order. Proceedings of the National Academy of Sciences 18. 17290–17295. https://doi.org/10.1073/pnas.1113716108.Search in Google Scholar

Gibson, Edward. 1998. Linguistic complexity: Locality of syntactic dependencies. Cognition 68(1). 1–76. https://doi.org/10.1016/S0010-0277(98)00034-1.Search in Google Scholar

Gibson, Edward, Steven T. Piantadosi, Kimberly Brink, Leon Bergen, Eunice Lim & Rebecca Saxe. 2013. A noisy-channel account of crosslinguistic word-order variation. Psychological Science 24(7). 1079–1088. https://doi.org/10.1177/0956797612463705.Search in Google Scholar

Gibson, Edward, Richard Futrell, Steven P. Piantadosi, Isabelle Dautriche, Kyle Mahowald, Leon Bergen & Roger Levy. 2019. How efficiency shapes human language. Trends in Cognitive Sciences 23(5). 389–407. https://doi.org/10.1016/j.tics.2019.09.005.Search in Google Scholar

Gildea, Daniel & David Temperley. 2010. Do grammars minimize dependency length? Cognitive Science 34(2). 286–310. https://doi.org/10.1111/j.1551-6709.2009.01073.x.Search in Google Scholar

Giorgi, Alessandra & Sona Haroutyunian. 2016. Word order and information structure in Modern Eastern Armenian. Journal of the Society for Armenian Studies 25. 185–201.Search in Google Scholar

Givón, Talmy. 1979. On understanding grammar. New York: Academic Press.Search in Google Scholar

Givón, Talmy. 1977. The drift from VSO to SVO in Biblical Hebrew: The pragmatics of tense-aspect. In Charles Li (ed.), Mechanisms of syntactic change, 181–254. Austin, TX: University of Texas Press.10.7560/750357-007Search in Google Scholar

Greenberg, Joseph H. 1966. Language universals: With special reference to feature hierarchies. The Hague: Mouton.Search in Google Scholar

Harris, Alice C. 2000. Word order harmonies and word order change in Georgian. In Rosanna Sornicola, Erich Poppe & Ariel Shisha-Halevy (eds.), Stability, variation and change of word-order patterns over time, 133–163. Amsterdam & Philadelphia: John Benjamins.10.1075/cilt.213.13harSearch in Google Scholar

Hawkins, John A. 1979. Implicational universals as predictors of word order change. Language 55(3). 618–648. https://doi.org/10.2307/413320.Search in Google Scholar

Hawkins, John A. 1983. Word order universals. New York: Academic Press.Search in Google Scholar

Hawkins, John A. 1994. A performance theory of order and constituency. New York: Cambridge University Press.10.1017/CBO9780511554285Search in Google Scholar

Hayrapetyan, Sergo A. 1981. Ugig χndri šaradasuthjunə dasakan grabarum [The order of the direct object in classical Armenian]. Herald of Social Sciences 9. 71–84.Search in Google Scholar

Hayrapetyan, Sergo A. 2005. Grabari šarahyusut’yun [Syntax of Grabar. V century]. Yerevan: HH GAA “Gitutyun” Hratarakchutyun.Search in Google Scholar

Herring, Susan C. & John C. Paolillo. 1995. Focus position in SOV languages. In Pamela Downing & Michael Noonan (eds.), Word order in discourse, 163–198. Amsterdam & Philadelphia: John Benjamins.10.1075/tsl.30.08herSearch in Google Scholar

Hodgson, Katherine. 2013. Discourse configurationality in Eastern Armenian: The nominal projection. Cambridge: University of Cambridge MPhil thesis.Search in Google Scholar

Kahnemuyipour, Arsalan. 2009. The syntax of sentential stress. Oxford: Oxford University Press.10.1093/acprof:oso/9780199219230.001.0001Search in Google Scholar

Kahnemuyipour, Arsalan & Karine Megerdoomian. 2011. Second position clitics and the vP phase: The case of the Armenian auxiliary. Linguistic Inquiry 42(1). 152–162. https://doi.org/10.1162/ling_a_00033.Search in Google Scholar

Kahnemuyipour, Arsalan & Karine Megerdoomian. 2017. On the positional distribution of an Armenian auxiliary: Second-position clisis, focus, and phases. Syntax 20(1). 77–97. https://doi.org/10.1111/synt.12129.Search in Google Scholar

Khurshudyan, Vicktoria. 2006. Средства выражения хезитации в устном армянском дискурсе в типологической перспективе [Hesitation means in oral Armenian discourse in typological perspective]. Moscow: Russian State University for the Humanities dissertation.Search in Google Scholar

Kim, Alan Hyun-Oak. 1988. Preverbal focusing and type XXIII languages. In Michael Hammond, Edith Moravcsik & Jessica Wirth (eds.), Studies in syntactic typology, 147–169. Amsterdam & Philadelphia: John Benjamins.10.1075/tsl.17.12kimSearch in Google Scholar

Kozintseva, Natalia. 1995. Modern Eastern Armenian. Munich: Lincom Europa.Search in Google Scholar

Lambrecht, Knud. 1996. Information structure and sentence form: Topic, focus, and the mental representations of discourse referents. Cambridge: Cambridge University Press.Search in Google Scholar

Li, Charles. 1977. Mechanisms of syntactic change. Austin, TX: University of Texas Press.10.7560/750357Search in Google Scholar

Meyer, Robin. 2017. Iranian-Armenian language contact in and before the 5th century CE. Oxford: University of Oxford dissertation.Search in Google Scholar

Minassian, Martiros. 1980. Grammaire d’arménien oriental. New York: Delmar.Search in Google Scholar

Papoyan, Artashes & Khachik Badikyan. 2003. Žamanakakich hajoch lezvi šarahjusuthjun [Modern Armenian Syntax]. Yerevan: Yerevani hamalsarani hratarakchutyun.Search in Google Scholar

Samvelian, Pollet. 2012. Grammaire des prédicats complexes: Les constructions nom-verbe. Paris: Editions Hermès-Lavoisier.Search in Google Scholar

Schmidt, Karl Horst. 1983. Iranisches Lehngut im Armenischen. Revue des Études Arméniennes 17. 73–112.Search in Google Scholar

Schröter, Stefanie. 2019. The syntax of focus in Caucasian Urum. Lingua 229. https://doi.org/10.1016/j.lingua.2019.06.012.Search in Google Scholar

Semenova, Xenia. 2014. Fokusnoye markirovaniye v sovremennom vostochnom armyanskom yazyke «na dagestanskom fone» (Focus marking in Modern Eastern Armenian “on Dagestanian background”). In Yazyk. Konstanty. Peremennyye. Pamyati Aleksandra Yevgenʹyevicha Kibrika [Language. Constants. Variables. In memory of Alexander Evgenievich Kibrik], 407–429. Saint-Petersburg: Aleteiya.Search in Google Scholar

Siewierska, Anna. 1988. Word order rules. London: Croom Helm.Search in Google Scholar

Skopeteas, Stavros & Gisbert Fanselow. 2010. Focus in Georgian and the expression of contrast. Lingua 120. 1370–1391. https://doi.org/10.1016/j.lingua.2008.10.012.Search in Google Scholar

Stallings, Lynne M. & Maryellen C. MacDonald. 2011. It’s not just the “heavy NP”: Relative phrase length modulates the production of heavy-NP shift. Journal of Psycholinguistic Research 40(3). 177–187. https://doi.org/10.1006/jmla.1998.2586.Search in Google Scholar

Stilo, Donald. 2014. Areal factors and postverbal patients in spoken Azerbaijani: A corpus-based study. In Nurettin Demir, Birsel Karakoç & Astrid Menz (eds.), Turcology and linguistics: Eva Agnes Csato Festschrift, 417–430. Ankara: Hacettepe Üniversitesi Yayınları.Search in Google Scholar

Stilo, Donald. 2018. Investigating shared features in the Araxes-Iran linguistic area and its subareas. In Christiane Bulut (ed.), Linguistic minorities in Turkey and Turkic-speaking minorities of the periphery, 427–452. Wiesbaden: Harrassowitz.10.2307/j.ctvckq4v1.23Search in Google Scholar

Tamrazian, Armine. 1991. Focus and WH-movement in Armenian. UCL Working Papers in Linguistics 3. 102–121.Search in Google Scholar

Tamrazian, Armine. 1994. The syntax of Armenian: Chains and the auxiliary. London: University College London dissertation.Search in Google Scholar

Temperley, David. 2008. Dependency-length minimization in natural and artificial languages. Journal of Quantitative Linguistics 15(3). 256–282. https://doi.org/10.1080/09296170802159512.Search in Google Scholar

Tomlin, Russell S. 1986. Basic word order: Functional principles. London: Croom Helm.Search in Google Scholar

Tuite, Kevin. 1998. Kartvelian morphosyntax: Number agreement and morphosyntactic orientation in the South Caucasian languages. Munich: Lincom Europa.Search in Google Scholar

Vogt, Hans. 1971. Grammaire de la langue géorgienne. Oslo: Universitetsforlaget.Search in Google Scholar

Yavrumyan, Marat, Hrant Khachatrian, Anna Danielyan & Gor Arakelyan. 2017. ArmTDP: Eastern Armenian treebank and dependency parser. In XI International Conference on Armenian Linguistics, Abstracts. Yerevan.Search in Google Scholar

Received: 2021-01-19

Accepted: 2022-02-19

Published Online: 2023-03-13

Published in Print: 2023-05-25

This work is licensed under the Creative Commons Attribution 4.0 International License.

Articles in the same Issue

https://doi.org/10.1515/ling-2021-0013

Keywords for this article

Armenian; syntax; typology; word-order variation

Creative Commons

BY 4.0