An improved test of the constant rate hypothesis: late Modern American English possessive have

Richard Zimmermann

doi:10.1515/cllt-2021-0038

Article Open Access

An improved test of the constant rate hypothesis: late Modern American English possessive have

Richard Zimmermann

Published/Copyright: June 20, 2022

Published by

Become an author with De Gruyter Brill

Submit Manuscript Author Information Explore this Subject

From the journal Corpus Linguistics and Linguistic Theory Volume 19 Issue 3

Abstract

The Constant Rate Hypothesis (CRH) predicts that a linguistic innovation should spread at identical rates of change in all grammatical contexts in which it is used (Kroch 1989. Reflexes of grammar in patterns of language change. Language Variation and Change 1(3). 199–244). Weaknesses in previous tests of the CRH are identified. A new study is conducted that improves upon them. It utilizes a syntactic change in late Modern American English possessive have, which altered its realization in the grammar-theoretically related contexts negation, inversion, VP-adjunction and VP-ellipsis. Data sets are collected from the Corpus of Historical American English (Davies 2010. The corpus of historical American English: 400 million words, 1810-2009. http://corpus.byu.edu/coha/ (accessed 10 September 2016)) and analyzed with mixed-effects logistic regression models. The statistical analysis reveals that it is indeed plausible to assume that each of the contexts innovates the use of possessive have at the same speed. Implications of the findings for the CRH are discussed.

Keywords: big data; constant rate hypothesis; corpus linguistics; diachronic syntax; late Modern English; mixed-effects logistic regression; possessive have; syntactic change

In memory of Tony Kroch

1 Introduction

Competition is a frequent form of language change that proceeds as an opposition between variants of a linguistic variable (e.g., Kroch 1994;^[1] Zehentner 2019: Ch. 6.3). Most research has focused on the dynamics of two-variant systems. Competition normally arises from a small modification of a linguistic pattern, which thereby creates a new virtually synonymous alternative that is incompatible with the old form and hence vies with it for the same opportunities to be uttered (e.g., Mufwene 2009: 20, similarly Ritt 1995: 54). The innovative variant begins to co-exist side by side with the conservative one in the mind of a speaker leading to variation within an individual (e.g., Roeper 1999). It also spreads from one speaker to another so that the competing grammatical options propagate through the population resulting in variation within the speech community (e.g., Labov 2001). While several outcomes may develop from this state of affairs, such as a failed change, stable differentiation or free variation (e.g., De Smet et al. 2018; Postma 2017; Roberts 2007: Ch. 4), the most salient consequence and focus of this paper is the complete replacement of the old with the new form across several generations during a transitional period, a time when both variants have substantial currency in a population.

The Constant Rate Hypothesis (CRH) is the prediction that “when one grammatical option replaces another with which it is in competition across a set of linguistic contexts, the rate of replacement, properly measured, is the same in all of them” (Kroch 1989: 200). More technically speaking, the CRH hypothesizes that a regression of the probability of a new form as a function of time and grammatical context will return the time parameter as independent of the grammatical context parameter(s) even though contexts may be significantly different from each other. My own formulation of the CRH takes the form of the following argument:

Premise (i) – Distinct grammatical structures represented in the mind can access the innovative/conservative variant in competition^[2] thereby creating different grammatical contexts for the change.
Premise (ii) – The frequency of use of the innovative/conservative variant in competition (in any context) depends solely on its entrenchment.^[3] In the case of a replacement change, entrenchment, shared between the two variants, gradually strengthens/weakens^[4] for the new/old form to the point of its complete dominance/non-availability over successive lifespans and generations of speakers.
Premise (iii) – There are diachronically stable factors that influence whether a particular context would occur with the innovative/conservative variant in the first place.^[5]
Conclusion – The rising/falling diachronic pattern of the frequency of use of the innovative/conservative variant is independent of grammatical contexts.^[6]

The following points concerning the CRH are particularly noteworthy. First, the CRH stands as an alternative to the claim that rates of change tend to be inherently variable across contexts, or are perhaps predictable from some feature of contexts. For example, Bailey uses a concept of “weighting”, defined by markedness, rule ordering or frequency, to propose that “environments are […] faster and slower according to their relative heaviness” and that changes “operate ‘faster’ in heavier environments than in lighter ones” (1973: 44–5, 82, 110–3, 143). Second, the CRH’s implication – if a single innovation is expressed in distinct grammatical contexts, it changes at the same rate in all of them – is sometimes turned around so that identity in the rates of change is taken as evidence for the relatedness of contexts. This logical inversion is not legitimate (e.g., Kauhanen and Walkden 2018: 508) and reasoning along such lines should be avoided, or at least be signposted explicitly as an extension of the CRH, not the CRH itself. Third, the CRH is frequently (mis)interpreted to claim that a change is actuated earlier in some contexts than in others (e.g., “changes may occur in a particular syntactic context before spreading to another”, Kastovsky 2012: 158). Rather, it asserts that once actuated, a change has full potential to be used in all contexts and independent factors explain why corpora record a new form more frequently in one over some other context. Fourth, the CRH has been criticized as merely relating “to an emergent property of language at the level of aggregate corpus data” (anonymous, p.c.). This is unlikely to be correct because the theoretical underpinnings of the CRH (as explicated in premises (i) to (iii) above) are rooted in individual minds. If constant rate effects are not caused by cognitive mechanisms in individuals, one would leave their emergence unexplained. Finally, the CRH is quite special in that it posits the truth of a null hypothesis (Paolillo 2011: 264). Null hypothesis significance testing has the potential to provide evidence for significantly different rates of change across contexts but cannot demonstrate that the rate of change is not different across contexts. Analytical advancements to address this issue in the context of logistic regression have been discussed in the literature (e.g., Ecay 2015: 36–40, 63–66; Kauhanen and Walkden 2019 for remarks on power analysis), but have not yet become standard practice. Hence, while this paper is careful to use state-of-the-art techniques in well controlled conditions to detect significant contextual divergences in the rate of change, it cannot resolve the fundamental problem that its methodology makes it impossible to prove conclusively the absence of such an effect.

Taken at face value, the CRH is a straightforward, empirically testable hypothesis. It can be refuted directly by observing that the rate of change depends on the grammatical contexts. In practice, the CRH comes with a number of conceptual and methodological challenges so difficult that the field has seen great confusion about the CRH’s interpretation and controversy about its applicability. Some researchers endorse the CRH, writing that “the notion of a constant rate has accumulated enough support over the last three decades for this to be referred to as the Constant Rate Effect” (Kauhanen and Walkden 2018: 484, referencing Pintzuk 2003). Others express great skepticism towards the CRH, stating that “[c]hanges may occur at different times and at different rates (pace Kroch 1989)” (Traugott and Trousdale 2010: 26) and that “the Constant Rate Hypothesis should not be considered valid unless tested on more reliable data” (Feltgen 2017:133). The purpose of this paper is therefore, firstly, to list common misconceptions surrounding the treatment of the CRH in previous case studies and, secondly, to present a new, improved enquiry that is free of the problems thus identified.

Section 2 will present a summary of the most important shortcomings of previous studies and derive a list of desiderata that an investigation of the CRH should observe. Section 3 reports an improved test of the CRH that meets those desiderata. Its empirical basis concerns possessive have in late Modern to Present-Day American English (c. 1800–today). I will outline the relevant linguistic developments as well as an elementary syntactic model, describe the database used for the collection of the material, the Corpus of Historical American English (Davies 2010), and evaluate the CRH using mixed-effect logistic regression models. Section 4 discusses the findings. Section 5 concludes.

2 Problems of previous studies and desiderata for a new study

This section discusses important requirements for an empirical test of the CRH.

2.1 Discussion of previous studies

It is my contention that all previous examinations of the CRH have serious shortcomings. I will list and exemplify the most important ones in the following paragraphs.

2.1.1 Grammatical contexts

Tests of the CRH must derive contexts from an explicit grammatical theory: “the set of contexts that change together [must be] defined […] by a shared syntactic structure, whose existence can only be the product of an abstract grammatical analysis” (Kroch 1989: 201). Appropriate syntactic distinctions can largely be captured by their form, frequently modelled as transformations, phrase structure rules, constructions, parameters, attribute-value matrices, word order patterns, and suchlike.^[7] This is so for the following reasons:

The CRH has been postulated for different structures but is silent on other distinctions. In order to test the CRH, one must stay within its domain.
Using grammatical contexts explains independence between contexts and the competing variants. Grammatical contexts are created as a combination of syntactic representations, like passive bystanders, with the changing forms in competition. Since the embedding system itself, as a reasonable idealization, tends to be stable, the changing variants will permeate all patterns in which they can be used equally rapidly.
Defining contexts by form is necessary for methodological reasons. Paolillo believes that “the analyst has methodological freedom to operationalize the object of study in ways that partition consideration of related changes into different historical processes. […] The availability of such analytical options means that the analyst can always posit an analysis consistent with the CR[H]” (2011: 266). If the contexts are soundly deduced from a formal theory, their operationalization and resulting partition of the data is highly restricted and will not actually be at risk of the analyst’s cherry-picking.

Despite the requirement to test the CRH with contexts that are grammatically motivated, researchers have often used contexts in a wide sense, as any independent variable that a linguist hypothesizes has a statistically significant effect on the progression of a competitive change. For example, Corley (2014) reports constant rates of change in the development of Early Modern English negation across extra-grammatical categories, such as gender and social class, using data from Nevalainen and Raumolin-Brunberg (2003). Nothing prevents a change from occasionally developing at the same rate in different dialects or social groups, perhaps on account of stable levels of interaction between them (“pseudo constant rate effects,” Kauhanen and Walkden 2018). However, there is no theoretical expectation for constant rates to emerge across sociolinguistic contexts by default, and in fact they frequently do not.

Functional divisions along categories such as information structure, weight or processing constraints are likewise not directly in the purview of the CRH. For instance, Kauhanen and Walkden (2018: 511, referring to work by John Hawkins) discuss a hypothetical change from OV to VO influenced by object weight so that a light object would have a bias to be produced in an OV configuration, a heavy object in a VO configuration (actual study, Taylor and Pintzuk 2012). For this functional bias to translate into varying context intercepts, the measured contexts would themselves have to be defined according to this bias, here as a division between heavy versus light objects. However, the grammatical analysis for the variation simply consists of some account of the order of object and verb, i.e., a single grammatical context. Thus, the distinction is in a sense trivial: Since there is only one context, it is not surprising that the change would be expressed at the same speed for heavy and light objects. An important advantage of functional contexts is that their influence can be expected to be constant throughout a transitional period since they “are shared by all speakers across all languages and are not subject to change” (Kauhanen and Walkden 2018: 512). This is the reason why functional effects have often been linked to the CRH. Nevertheless, these contexts are not ideal. The challenge lies in building a theory that explains how functional factors can influence grammatical contexts rather than to define contexts as functional categories. Functional factors also lead to methodological difficulties: What counts as heavy versus light? Should the predictor not be regarded as continuous? Etc.

Finally, random variables have been used as contexts with respect to the CRH but are likewise not pertinent for this purpose. The most important case are individual lexical items. For instance, Feltgen (2017: 272–279) defines contexts as nominal complements after French en plein ‘in plain’, or dans ‘in.’ Ogura rejects the CRH on the basis of lexical diffusion concluding that “Kroch’s claim for constancy in the rate of change, therefore, does not stand up upon closer scrutiny” (Ogura 1993: 51; Ogura and Wang 1996). In reality, it is to be expected that distinct lexical units interact with surrounding words in different ways, show effects of frequency and semantic similarity, appear in distinct structures altogether, vary in the degree to which they form compositional versus idiomatic expressions, have a different probability to fossilize as archaisms, and hence may display significantly different rates of change. Where individual lexical items are encountered in grammatical contexts, they should be included as random effects in the statistical models. It would be even better if the CRH was tested with a change that involves a single lexeme since this eliminates the need for diffusion control entirely.

2.1.2 Sample size

A major point of criticism that leads Paolillo to deny the CRH the status of an empirical prediction is that “[t]he principal cause of the failure to find nonindependence in the data is the variance of the different series, whose wandering easily overwhelms any estimate of difference between their slopes” or that “claiming independence invites type 2 error of unknown probability” (2011: 265–266). In other words, failure to reject the CRH may be due simply to low power, but absence of evidence cannot count as evidence of absence (Paolillo 2011: 264, 272). The only way to inoculate against this risk is to rely on very large data sets that result in high certainty in the estimates of the true rates of change despite the noisy “wandering” of the observed data.

Although large data sets are needed to test the CRH, many investigations have relied on rather small amounts of data. For example, Fruehwald et al. (2013) claim to demonstrate a constant rate effect in a medieval German sound change based on only 500 examples spread over three phonological contexts and four time points. This is an example of a fascinating extension of the CRH whose conclusions nevertheless remain doubtful because of small sample size.

2.1.3 Unambiguous forms

The old and new forms should be identifiable unambiguously by their form. Santorini (1993) tests the CRH with a change in Yiddish verb positioning, conservative verb-final SXV versus innovative verb-medial SVX. However, the SVX variant is ambiguous between real verb-medial order and some form of constituent extraposition of X. The number of SVX tokens must thus be corrected on the basis of the frequency of other extraposition phenomena. This methodology, while perfectly plausible, introduces an additional complicating step into the testing procedure, may open the door for post-hoc theorizing, and could also introduce additional noise to the measurements.

2.1.4 Replacement

The linguistic development used to test the CRH should result in replacement. The reason is that the CRH may not hold for cases of stable variation with semantic differentiation whereas replacement is a strong indicator for largely synonymous variants.^[8] As far as I can see, there is currently no solid understanding that would allow researchers to predict a priori that competition will lead to replacement. Therefore, the empirical change must already have occurred (for discussion of relevant issues in the context of be-like quotatives, see Gardner et al. 2020).

An example of a study that does not meet this requirement is Tagliamonte (2013). She conducts an apparent-time study on the rise of the going to future versus other future forms such as will in contemporary York English across contexts such as sentence type (conforming to the CRH) or grammatical person (contradicting the CRH). Since her data shows an increase in the use of going to from 17% in the oldest cohort to about 35% in the youngest cohort, she speculates that going to is becoming more frequent. However, the observed age effect could be caused by factors other than ensuing replacement, such as lack of control for individual lexical items or style. The two forms have co-existed for centuries and there is in fact evidence of semantic differentiation between them (e.g., Gries and Stefanowitsch 2004; Haegeman 1989).

2.1.5 Monotonic spread

The increase in the new variant must proceed steadily without oscillations in the form of an S-shape curve. In other words, the change should display a monotonic spread. The reason for this requirement is methodological. The standard evaluation of the CRH uses a logistic regression model that includes a single time parameter. It models the relation between the competing forms and time as an S-shaped curve, or more precisely as linear on a logit scale. If the rise of the new form does not follow an S-shaped curve, such a model will not fit well, which would necessitate more complex models with their associated complications. Other statistical models that would not require monotonicity in the increase of the new form are an active area of research but are not currently widely used. S-shaped curves are in fact the single most common type of trajectory in competitive changes (e.g., Blythe and Croft 2012, but see skepticism towards sigmoidal curves in diffusion e.g., Denison 2003), so that logistic regression should be a suitable technique. Note however that non-monotonic changes (e.g., inverse U-shaped failing changes) do not in principle fall outside of the domain of the CRH. It would simply predict that non-monotonic developments should be mirrored to the same degree in all grammatical contexts.

I can see two reasons for recording fluctuations in the rate of change. On the one hand, they may result from accidental contamination of the data. This is likely, in particular, in studies based on corpora whose texts allow poor control over factors such as genre, dialect, and other social dimensions. Indeed, scholars have warned against the “religio-juridico-belletristico-commercial nature […] [of] high-style written language” (Joseph and Janda 2003: 141) with respect to the CRH. If any one of these factors is not held constant during the transitional period, a monotonic change may be recorded as non-monotonic. For example, if the corpus over-represents time 1 with informal texts closely reflecting spoken language, such as letters or dialogues, but time 2 with formal texts involving prescriptive pressures or formulaic language, such as prayers or legal documents, then a change might seemingly slow down or even revert. Similarly, imprecise manuscript dating, discrepancies between original and manuscript dates, scribal intrusion and updating, or translation effects may introduce oscillations in the recorded frequency of the new form if these distortions occur more often at some time points than at others.

For example, Pintzuk (1996) examines the transition of Old English finite verbs from final to medial position with clause type (main vs. subordinate) as the contextual variable. She provides evidence in favor of the CRH since both clause types move in parallel on the whole. However, the development first shows an increase in early Old English, then a dip in late Old English and then a rise again towards Middle English. It is plausible that this non-monotonic trajectory is caused by data contaminations typical of medieval documents, such as fluctuating text types or dialects, imprecise dating, translation effects, etc.

On the other hand, non-monotonic developments in the data are not necessarily accidental. They can also reflect real changes within the language system, in the evaluation of a changing form, or in how texts are transmitted (cf. Amato et al. 2018).

A prime example is Ellegård’s (1953) study on the regulation of do-support in the history of English across several clause type contexts, which also forms the basis for the primary case study in Kroch (1989). His data set shows a curious anomaly in c. 1575, when a number of contexts appear to shift their rates of change markedly. Several scholars have modeled, replicated and acknowledged the reality of this inflection point (e.g., Ecay 2015; Vulanović and Baayen 2006; Warner 2005). The rise of English do-support is thus likely influenced by countervening language-internal or cultural shifts during the transitional period, which complicates the determination of one global rate of change parameter.

2.1.6 Complete transition

The CRH can be tested most rigorously with a syntactic change that is attested from beginning to end. Since any stage of a grammatical development can be influenced by unforeseen factors, omitting some intervals of the transitional period may result in incorrect conclusions about the potential constancy of a contextual effect.

Simonenko et al. (2019), for instance, investigate the rate of change in the grammar-theoretically related processes of the rise of expletives, the emergence of new verbal endings and the loss of pro-drop in Medieval French. They reject the CRH because the rise of expletives appears to proceed more slowly than the other two developments. However, their data set does not encompass the termination of the change in the expletive context. If additional data was collected from the time the change goes to completion (perhaps c. 1700), it would be quite conceivable that the rates of change converge in all contexts after all.

2.1.7 Adequate statistics

Finally, the CRH should be evaluated statistically by comparing models with and without the time-context interaction term. Random variables should be controlled for. This is the current state-of-the-art procedure.

A problematic study in this respect is Wallenberg (2016), who studies a decline in relative clause extraposition in different languages as the context category. He provides neither summaries of models with all relevant variables considered (grammatical function and weight), nor standard errors for the coefficient estimates, nor an actual test of the CRH as “models fit across all of the language data sets will not converge” so that his proposal of “a crosslinguistic ‘constant rate effect’” (Wallenberg 2016: e243, e238) must be regarded as unproven.

2.2 Desiderata for a new study

A test of the CRH should meet the following requirements: (i) The contexts must be grammatical since the CRH does not make claims about the diffusion of a syntactic pattern in non-grammatical contexts, like socio-linguistic categories, or with respect to random effects, like individual lexical items. Grammatical contexts should be deduced from a syntactic theory. (ii) The data set should be large in order to increase the power to detect even small divergences in the rate of change parameter. (iii) The competing linguistic forms must be largely identifiable by their form. Competition between the innovative and conservative forms should (iv) result in a complete replacement change, (v) proceed, on the whole, on a monotonic trajectory, and (vi) be traced with relevant examples essentially throughout the entire transitional period. Finally, (vii) the CRH should be evaluated with proper statistical techniques.

I am not aware of any studies that meet all of these criteria. The next section aims to close this gap by providing an improved investigation of the CRH.

3 A new test of the CRH

I will now present a study testing the CRH that meets the desiderata outlined in the last section.

3.1 Material: change in late modern English possessive have

The empirical material for this study concerns finite possessive have in late Modern American English (c. 1800–today). The term ‘possessive have’ is here defined by complementation pattern, as any instance of a finite form of the lexeme have co-occurring with a nominal object, have + DP. As such, the structure contrasts with other complementation patterns of have, perfect auxiliary, have + past participle (have done something), causative have, have + non-finite clause with an overt subject (have someone do something), its passive variant (have something done), modal have, have + to-infinitival clause (have to do something), etc. Semantic considerations regarding the denotation of the DP do not play a role for my definition.^[9]

The verb has changed its realization in several syntactic contexts. The diachronic variation is illustrated below. Example (1) exemplifies the conservative, example (2) the innovative, use of possessive have.

(1)

conservative variant:

“have you not still some water left in the sprinkling can!” “To be sure I have,” Uncle Wiggily whispered back.

(from: Star Tribune Journal, Minneapolis, July 11, 1916, p. 18)

(2)

innovative variant:

“Perhaps she is lonely. Do you not still have your sewin’ circle?” “We do, but Abigail says Loretta will not come”

(from: Marti Talbott’s Marblestone Mansion, Book 6, published 2013, p. 70)

In (1), not negates the proposition directly (have not), while in (2) negation requires do-support (do not have). Questions are formed by inverting have and the subject in (1) (have you), but by inserting the dummy auxiliary do in (2) (do you have). Example (1) places possessive have in front of a VP-adverb (have still); example (2) places it after such an item (still have). Finally, (1) forms an elliptical answer with have (I have), whereas (2), once again, uses the word do (We do).^[10]

The change in possessive have can be analyzed as follows. Modern English makes a fairly sharp distinction between two kinds of verbs, commonly called auxiliaries versus lexical main verbs. I will model the two types as two different word class categories and call the former I (“inflection”) and the latter V (“verb”) (following Chomsky 1986). There is an ordering restriction such that I occurs earlier in the clause than V. I formalize this in terms of a hierarchical constituent structure, where I dominates V, so that I is “higher” and V is “lower”.

These assumptions account for differential behavior of auxiliaries and lexical main verbs with respect to negation, inversion, VP-ellipsis (the first three of the so-called NICE properties, Huddleston 1976) as well as the placement of VP-adjuncts (e.g., Pollock 1989). The negator not is sandwiched between higher I and lower V. It follows that auxiliaries must be placed before not (3a) whereas lexical verbs are ungrammatical in this position (3b). Instead, they require do-supported negation, (3c), which, conversely, is not a grammatical option for auxiliaries, (3d).

(3)

a. Mary [_I may] not [_V study] for her exam.

b. *Mary studies not for her exam. (V before not)

c. Mary does not study for her exam.

d. *Mary (does) not may study for her exam. (not before I)

Inversion is a word order pattern that results from targeting material exclusively found in higher I and placing it in front of the subject. Therefore, auxiliaries can invert with the subject, (4a), whereas full lexical verbs cannot do so, (4b). Rather, do-support is needed in the absence of another auxiliary, (4c), but cannot be employed if another auxiliary is present, (4d).

(4)

a. [_I Was] his theory [_V accepted]?

b. * Became his theory accepted? (inversion of V)

c. Did his theory become accepted?

d. * (Did) his theory was accepted? (non-inversion of I)

Adverbs are typically adjoined to VP. Such VP-adjuncts are thus placed in front of finite transitive verbs, (5a), and cannot occur post-verbally before a verbal complement, (5b). However, this diagnostic is not as stringent as the negation or inversion tests for the distinction between auxiliaries and lexical verbs. While VP-adjuncts typically occur after auxiliaries, they are also acceptable in front of them, (5c). The former option, however, is considerably more common.

(5)

a. They [_I have] probably [_V visited] Spain.

b. * They have visited probably Spain. (VP-adjunct after V)

c. ^ok They probably have visited Spain. (VP-adjunct before I, rare)

VP-ellipsis, constructions involving an incomplete target sentence without a verb phrase reconstructed on a preceding, complete antecedent or source clause, requires the presence of higher I whereas lower V must be a part of the omitted material. As a consequence, VP-ellipsis structures can be licensed by an auxiliary, (6a), but not by a lexical main verb, (6b). If a source clause does not include an auxiliary, VP-ellipsis requires the dummy auxiliary do, (6c). In contrast, if a source clause involves an auxiliary, the target clause is degraded if it uses do-support instead of the contextually salient antecedent auxiliary (6d).

(6)

You [_I were] [_V reading] a book.

And they were _, too./Or were you _?/But I was not _.

You read a book.

* And they read _, too./* Or did you read _?/* But I did not read _. (V survives ellipsis)

You read a book.

And they did _, too./Or did you _?/But I did not _.

You were reading a book.

^?? And they did _, too./^?? Or did you _?/^?? But I did not _.

(I does not survive ellipsis)

The four patterns differentiating lexical verbs and auxiliaries are exactly the four contexts in which late Modern American English possessive have has changed its realization. This is unlikely to be an accident. Hence, one should characterize the difference between lexical verbs versus auxiliaries and the change in possessive have in closely-related terms. Specifically, conservative possessive have should be grouped together with Modern English auxiliaries as an element of category I. Innovative possessive have should be grouped together with Modern English lexical main verbs as an item of category V. The change can then be understood as dual representation of possessive have as both I and V with replacement of the former with the latter during the transitional period. This is sketched in (7).

(7)

The present case study thus meets the first desideratum listed in the previous section: A grammar-theoretical model identifies a single linguistic change, the reassignment of the category of possessive have from I to V, with realization in four distinct environments, negation, inversion, VP-adjunction and VP-ellipsis. The CRH thus predicts that the innovation in possessive have will spread at identical rates of change in all four contexts.

Note that the relatedness of the contexts depends on the correctness of the aspect of the grammatical theory from which they are deduced. While it is impossible to prove that any particular formalism articulating a theory must be correct, I would like to point out that the analysis presented here is extremely consensual as far as such matters go in linguistics. This lends further credence to the claim that the change in possessive have falls under the domain of the CRH.

3.2 Corpus basis and data collection

All data for this study was taken from the Corpus of Historical American English (COHA) (Davies 2010). It consists of c. 385 million words in c. 115,000 electronic files containing largely standard American English written texts. The corpus architecture allowed coding every token automatically for its source text, approximate year of composition and one of four generic genre categories (for more information on the corpus structure, see Davies 2012). Henceforth, I will cite all examples from COHA by file name and year.

I collected instances of possessive have from COHA in the four contexts, i.e. negation, inversion, VP-adjunct and VP-ellipsis. The negation data was assembled with queries consisting of a form of have followed by negation for the conservative variant, or do, negation and have for the innovative variant. Both kinds of queries were followed by elements targeting an object, such as the, for instance, had not the versus did not have the. Subsequently, I manually deleted precision errors. The final negation data consists of 33,074 tokens. The near minimal pair in (8) exemplifies the kind of sentences included in the data set.

(8)

conservative variant:

he had not the power to express an emotion

(BrewstersMillions 1902)

innovative variant:

the board did not have the power to modify a decree

(Denver 1995)

The inversion data was obtained for two separate environments, direct questions and negative inversion. Direct questions were retrieved with queries that placed have in front of an element targeting a subject for the conservative variant and do in front of a targeted subject followed by have for the innovative variant. Simplified examples of such queries are: wh-adverb have pronoun the versus wh-adverb do pronoun have the. As before, false hits were removed by hand. This yielded 9,319 tokens of direct questions, such as (9).

(9)

conservative variant:

Then why have you the money?

(ConstantImage 1960)

innovative variant:

Why do they have the best jazz in the world?

(Harpers 1965)

Instances of negative inversion were found with searches for several negative constituents immediately followed by have for the conservative variant or do and the word have up to 9 words to its right^[11] for the innovative variant. The following queries illustrate: neither have versus neither do (+ have after 0–9 words). I removed incorrect examples. This procedure added 1,040 negative inversion clauses, illustrated in (10). The combined data set for inversion thus includes 10,359 tokens.

(10)

conservative variant:

Never has a lover more grace than when he deplores the pangs he suffers

(LastDuelInSpain 1832)

innovative variant:

Never did a man have a more devoted adherent in his wooing than did I

(HeartHappyHollow 1904)

The VP-adjunct data set was likewise collected for two subsets, VP-adjoined adverbs and subject-dependent floating quantifiers. I investigated the adverb phrases still, already and no longer. These items showed a low probability of pre-auxiliary placement (5.4%, 5.2% and 2.3% of all occurrences, respectively) and may thus reliably diagnose the VP-boundary. Search queries placed these adjuncts after the verb for the conservative variant, e.g., have already the, and before the verb for the innovative variant, e.g., already have the. False hits were removed manually, which left 7,720 tokens, (11).

(11)

conservative variant:

You have still two children

(FalseShame 1815)

innovative variant:

They still had three children

(LizzyGlenn 1859)

The floating quantifier data set was compiled by placing the items all, both and each after have in queries for the conservative variant and before have in queries for the innovative variant, e.g., have each the versus each have the. I deleted all cases in which these words did not function as floating quantifiers with possessive have. An additional 594 instances were found in this way, (12). The combined VP-adjunct data set has a size of 8,314 tokens.

(12)

conservative variant:

the red limit and the violet limit have both the same luminous intensity

(TreatiseOnForces 1845)

innovative variant:

Arcturus and Capella both have the same magnitude

(ExploringDistant 1956)

Finally, verb phrase ellipsis structures were retrieved by searching for have or do in conjunction with tags targeting subjects and punctuation signs while simultaneously forcing a form of have to collocate up to 9 words to the left. The former string functioned as a proxy for a verb phrase ellipsis structure whereas the collocation was meant to target finite possessive have in a preceding source clause (e.g., My parents have a lot of friends. At least I think they have / do ). A simplified illustration of such a search query is have pronoun ,(+ have preceding 0–9 words) versus do pronoun ,(+ have preceding 0–9 words). Irrelevant hits were removed. The verb phrase ellipsis data contains 2,332 observations, (13).

(13)

conservative variant:

I have no military reputation, for if I had _, it would doubtless be forever ruined

(YaleRev 1913)

innovative variant:

I wish I had a skateboard. If I did _, I’d join them.

(Esquire 1999)

The final data set is quite large with a size of 54,079 tokens, meeting the second requirement for a test of the CRH. Furthermore, the innovative and conservative forms were identified by their form, as illustrated by distinct search queries for each. Adverb placement is a slight exception since an estimated 2.3–5.4% of the retrieved hits may instantiate a parse that does not diagnose the targeted variant. On the whole, however, the third requirement for the old and new forms to be unambiguously identifiable by form hold in this case study as well.

3.3 The data set

I will mention some noteworthy aspects regarding the final data set. Figure 1 presents a graphical representation of the data distribution for the negation context. The graph plots the proportion of innovative do-supported negation on the Y-axis against time on the X-axis. There is enough material for 198 data points, every year from 1810 to 2009 except for 1812 and 1813. The size of a data point is proportional to the overall number of tokens for that year. The graph includes a fitted line from a simple logistic model regressing the innovative form against time only as a guide for the eye illustrating the degree to which the data points conform to or diverge from a monotonic progression of the change.

Figure 1:

Distribution of possessive have data (negation context).

Innovative do not have completely replaces conservative have not and such substitution is likewise apparent in the other contexts. This satisfies the fourth desideratum, i.e., a replacement change.

The next desideratum, monotonicity, is slightly more complicated. There is an obvious S-shaped curve for the spread of the innovative variant of possessive have. Graphs for the other contexts reveal similar sigmoidal trajectories.

However, there are some outlier years between the late 1840s and c. 1890 with a greater-than-expected number of innovative forms also visible in the inversion and ellipsis contexts. These divergences can be explained as a result of register fluctuations in the corpus. COHA predominantly includes texts of standard American English, which is typically used by the upper, educated social classes. However, linguistic forms from other varieties intrude into this register. Fictional texts may quote or parody discourses that are not actually representative of the grammar of the authors, but of one of their literary characters’ manner of speaking.

A) The innovative variant of possessive have is often recorded in 19th century fictional, direct speeches of lower-class characters (Warner 1995: 547). This intrusion of innovative possessive have from low-prestige varieties into high-register writings substantially elevates the overall frequency of the innovative variant in the fiction genre during the early stages of the change. The data distribution is compatible with a change ‘from below’ (e.g., Labov 2007: 346).

To illustrate, the innovative negative possessive have construction in (14a), uttered by a slave hunter in Uncle Tom’s Cabin, also shows stigmatized negative concord (don’t never have no). Example (14b), uttered by a slave named ‘Cato,’ not only includes an innovative possessive have question but also distinct features of African-American Vernacular English, such as fortition of inter-dental fricatives to alveolar stops (e.g., this, the, thanks as dis, de, tanks), the reduction of final consonant clusters (e.g., and, last, expect as an’, las’, spec), and the realization of standard intervocalic v as b (heaven as hebben) (AAVE phonological features number 5, 1 and 12 in Rickford 1999: 4–5).

(14)

They don’t never have no doubt o’ your meaning, Tom.

(UncleTomsCabin 1852)

I is loss fum de cumpny, but dis is de ferry, and I spec dey’ll soon come. But didn’t we have a good time las’ night in Buffalo? Dem dar Buffalo gals make my heart flutter, dat dey did. But, tanks be to de Lord, I is got religion. I got it las’ night in de meetin.’ Before I got religion, I was a great sinner; […]. But now I is a conwerted man; I is bound for hebben;

(Escape 1861)

B) The conservative variant of possessive have is often found in direct speeches of late 20th century or 21st century historical fiction, where it likely serves as a depiction of a literary character’s upper socio-economic class or of a text’s ancient setting. These intrusions lower the overall frequency of the innovative variant in the fiction genre during the late stages of the change.

The following 21st century examples illustrate this effect. The conservative negative possessive have structure in (15a) is produced by a fictional upper-class mademoiselle set in 1840s New Orleans. The conservative direct possessive have question in (15b) portrays old-fashioned parlance imagined as typical of the early 19th century regency period in England.

(15)

My mother has not the will – but you know her trials. I must wed without delay

(RoguesSalute 2007)

“I can not feel a heartbeat.” Phaedra’s voice was surprisingly calm given the terror rising in her throat. “Have you a doctor close at hand, Mr. Kemble?” “Yes, yes, just round the corner.”

(TemptedAllNight 2009)

The influence of register fluctuations on the measurement of the change in the realization of possessive have can be related to the two reasons for deviations from monotonicity discussed in Section 2.1.5. Accidental data contamination due to inconstant inclusion of different text types might be instantiated here as an over-representation of fictional texts including direct speech in some years. Indeed, COHA’s proportion of fictional texts drops from c. 55% in 1850 to c. 40% by 1950. Real developments in the language system or culture that are adequately reflected in the textual record might manifest as shifts in the production of fiction, such as a decline in slave narratives after the abolition of slavery, or the emergence of the modern period drama.

Non-fiction texts document the change in a “purer” way in the sense that they consistently record Standard American English whereas fictional texts allow a glimpse into the wider use of the competing forms in more variable or marked styles. The imitations of, first, vernacular and, later, archaic uses of possessive have in fiction should raise and lower the base probability with which the innovative variant is encountered in fiction during the change’s early and late phase, respectively. This should result in a difference in the rate of change such that the change seemingly progresses more slowly in fiction, and more rapidly in non-fiction.

Importantly, the innovative form gains ground steadily independently of genre. Despite the register fluctuations, I therefore conclude that the change meets the fifth requirement of a largely monotonic increase as well.^[12]

Lastly, the change is attested by and large in its entirety, innovative possessive have being virtually non-existent before the 1820s and having reached a probability of ∼95% in the 2000s. The absolute end point of the change is not recorded in the data. This is true for all contexts except for VP-adjunction, where the earliest stages of the change are not recorded instead and the change has been at ceiling since the 1990s. The inception of the change may not be identified accurately since the corpus contains fewer documents for the early 19th century than later periods making it harder to pick up early instances of the innovative variant. Nevertheless, the data meets the sixth condition, complete attestation, as well since the corpus samples texts from at least the largest part of the transitional period.

4 Results

The diachronic variation in possessive have will be analyzed with mixed-effects logistic regression models (for accessible introductions, see e.g., Gries 2015; Johnson 2009). The model I propose includes the following variables:

The ‘Year’ predictor estimates the effect of time. It is a continuous variable based on the text dates provided by COHA. The variable has been standardized to z-scores.^[13]
The ‘Genre’ variable lets the intercept of the regression line vary between fiction and non-fiction (=COHA categories news, magazine, non-fiction).
The ‘Year:Genre’ interaction term allows for divergence in the slopes of fictional and non-fictional texts.
The ‘Context’ factor quantifies the difference between the grammatical contexts.
The ‘Year:Context’ interaction allows each context to change at different rates.
A random ‘Text’ variable controls for differences between individual texts.

The resulting model is presented in Table 1.

Table 1:

Mixed-effects models for the change in possessive have.

	Estimate^c	Std. error	z-value	p
Fixed effects^a,b
Intercept	0.18778	0.03684	5.098	<0.001^***
Year (non-fic)	2.84853	0.04929	57.790	<0.001^***
Genre (non-fic → fic)	−0.21400	0.04307	−4.969	<0.001^***
Year:Genre (non-fic → fic)	−0.55738	0.05350	−10.419	<0.001^***
Context(neg → inv)	−0.14799	0.03646	−4.059	<0.001^***
Context(neg → vpa)	2.66721	0.06281	42.466	<0.001^***
Context(neg → vpe)	−0.43928	0.06765	−6.493	<0.001^***
Year:Context(neg → inv)	0.01200	0.04782	0.251	0.802
Year:Context(neg → vpa)	−0.09863	0.06601	−1.494	0.135
Year:Context(neg → vpe)	0.05975	0.09484	0.630	0.529

Random effect

Text, N = 9,706
Variance of random intercepts: 1.181 (95%-confidence interval [1.068–1.294])

^aEffect on use of innovative variant. ^bfic – fiction, non-fic – non-fiction, neg – negation, inv – inversion, vpa – VP-adjuncts, vpe – VP-ellipsis. ^cAll estimates based on z-scores of the ‘Year’ variable, ***highly significant.

When the ‘Year’ predictor is transformed back on its original scale, the rate of change of 0.0552 log-odds per year (95%-confidence interval [0.0533–0.0571]) in the default genre, non-fictional texts, would imply that the innovative variant progresses from 1 to 99% of use in 167 years. The ‘Genre’ coefficient suggests that fictional texts are slightly more conservative than non-fictional texts. The rate of change for fictional texts, back-transformed on the ‘Year:Genre’ interaction, is only 0.0552–0.0108 = 0.0444 (95%-confidence interval [0.0423–0.0464]), predicting a time span of 207 years for a spread from 1 to 99% of use. This finding is compatible with the effect of register fluctuations proposed in Section 3.3. The ‘Context’ variable lists the differences between negation and the other contexts. VP-adjuncts have a substantially higher base probability of occurring with the innovative variant than the others. All of these variables are significant predictors for the realization of possessive have in this model. The results remain unchanged if another context is chosen as the reference category. The random ‘Text’ variable reports 9,706 individual texts aggregating the data, and a variance of the varying intercepts of 1.181. This suggests moderate clustering within texts (ICC = 26.4%). The model performs well, displaying a good model fit (Pseudo-R²_marginal = 0.635, Pseudo-R²_conditional = 0.731) and high classification precision (C-index = 0.941, classification accuracy = 86.6% vs. baseline: 55.2%). Therefore, this study also meets the seventh desideratum for a test of the CRH, an adequate statistical model.

I can at last check for identity in the rates of change in the four contexts. The ‘Year:Context’ interaction is non-significant for any pair of negation and a second context. This is true if any other context is made the default. The data is thus consistent with a scenario in which the true difference in the rate of change between the grammatical contexts is zero.

The state-of-the-art evaluation of the CRH uses a likelihood ratio test (LRT) comparing a full model with a term for the interaction between the ‘Year’ and ‘Context’ variables (as above) to a reduced model without this interaction term. Table 2 presents the result.

Table 2:

Analysis of deviance table and LRT for reduced and full possessive have models.

Model	Df	Deviance	Difference Df	Difference deviance	p
1	8	40,021.21
2	11	40,018.26	3	2.95	0.3985

Model 1: (reduced model)

Have ∼ Year + Genre + Year:Genre + Context + (1 | Text)

Model 2: (full model)

Have ∼ Year + Genre + Year:Genre + Context + Year:Context + (1 | Text)

The addition of the ‘Year:Context’ interaction term does not significantly improve the model fit (assuming a chi-square distribution of the LRT statistic). The LRT thus provides no justification for its inclusion in the model.

Figure 2 plots the rise of innovative possessive have for each of the four linguistic contexts based on independent mixed-effects models, thus further highlighting the great similarity in the rates of change between them. The range between the highest and the lowest point estimates of the ‘Year’ coefficients returned by these models indicates that the time needed for the change to proceed from 1 to 99% should vary by 20 years in non-fiction and 18 years in fiction. To repeat, this effect size was not returned as significant with the sample size of this study and it bears repeating that the paradigm of null hypothesis significance testing cannot establish that an effect does not exist. However, assuming that this magnitude is in fact realistic, how could one assess whether it would be meaningful? An interpretation of this effect size cannot be fully objective but must rely on some reference frame within the research context. My view is that divergences of 20 or 18 years should be regarded as small because there are many complexities in measuring linguistic changes that could plausibly create noise of this magnitude, such as uneven distribution of tokens across contexts by different authors, especially where those are paired with uncontrolled social variables or sub-genres, editorial or prescriptive changes of texts, or dependencies of grammatical contexts on other factors.

Figure 2:

Data points, point estimates and 95% confidence intervals for the rise in innovative possessive have from independent mixed-effects regressions for each context.

In summary, the evidence adduced from the change in the realization of possessive have is fully compatible with the CRH.

5 Discussion

The findings of this paper have implications for several research questions in historical syntax. The most important consideration is this: Previous affirmations of the CRH could quite plausibly be rejected on the basis of small or low-quality data, inexplicit analyses or other complications (see 2.1.1–2.1.7). The present study avoids such problems by identifying and abiding by seven essential methodological desiderata for a test of the CRH. Even though the method of null hypothesis significance testing cannot determine how likely the data are to emerge if there is an effect of time–context interaction, the compatibility of the data with the absence of such an effect should lead to a strengthening of one’s confidence in the correctness of the CRH. In fact, it may be time to abandon the special status ascribed to the CRH as prima facie less plausible than its alternative and instead to treat it as a fallback position in the absence of evidence like any other null hypothesis. Once the CRH is viewed more widely as an ordinary null hypothesis, linguists should provide explanations when grammatical contexts do not develop at identical rates (language–internal interactions, cultural shifts etc.) rather than presuppose that rates of change may and will vary by default.

Another contribution of this study comes from the demonstration of a visibly S-shaped propagation of a linguistic innovation (see e.g., Figure 1). Such S-shaped patterns support models of diffusion that incorporate some kind of advantage of an innovative form such as certain cognitive constraints (selection) (e.g., Newberry et al. 2017) over random or purely interactional processes such as social diffusion (drift) (e.g., Blythe et al. 2009). This leads to the question of what such an advantage or “fitness” of a form may be. Perhaps the general problem can be approached from specific cases like the one discussed here. My own idea is that the advantage of an innovative string like don’t have the may lie in reduced syntactic ambiguity over a competitor string such as haven’t the. Other causes for the change have been proposed. Warner (1995) links the innovation to a re-set parameter after the loss of thou and the rise of do-support, which also resulted in other, roughly contemporaneous changes in the auxiliary system, such as the emergence of is being or the disappearance of elided verb phrases headed by be. Rohrbacher (1999: 196) believes that the change was motivated by the fact that the conservative position of possessive have does not usually assign theta-roles. Therefore, possessive have may have begun to cluster better with verbs like own or want than be or will. One could also point to traditional ideas of regularization or analogical pressure from semantically similar verbs (I own a house is to I don’t own a house as I have a house is to X). Moreover, the timing of the VP-adjunct context, changing about half a century earlier than the other environments (see Figure 2), mirrors very closely Early Modern English verb placement facts, where the loss of post-verbal VP-adjuncts likewise precedes the rise of do-support (e.g., Haeberli and Ihsane 2016). This cyclic behavior is curious and may be suggestive of a causal influence of the former on the latter development (some related discussion in Roberts 1985). Trudgill et al. (2002) argue that the adoption of do-support with have in American English is linked to a relatively low number of dynamic uses of have (e.g., preference for take a look over have a look in that variety). One causal explanation that is not easily compatible with the findings of this paper is the ‘Rich Agreement Hypothesis’ (e.g., Koeneman and Zeijlstra 2014 and references therein). It states that English verbs are placed structurally low as a consequence of the loss of verbal inflections. Possessive have did not undergo any notable changes in its inflections close to or during the time it came to appear structurally low. More work is needed to establish the specific advantage of innovative possessive have and the general nature of selection pressures giving rise to S-shaped diffusions in linguistic changes.

There are numerous further lines of inquiry that might be stimulated by the results of this paper. Some grammatical frameworks may be better equipped to deal with the specifics of the change in possessive have than others. The VP-adjunct context in particular might be predicted to be a context that should change in tandem with the others by some models of English clause structure but not by others. For instance, Culicover (2008) might find it difficult to accommodate the VP-adjunct facts. One could control for additional determinants that may influence the realization of possessive have and check if they affect the outcome for the CRH. Relevant additional factors include, for instance, semantic distinctions between concrete and abstract possession or between dynamic and stative situations (e.g., Lorenz 2016; Tagliamonte 2003). The discussion of the genre fluctuations (see Section 3.3) could serve as an impetus to investigate the life cycle of competitors in greater detail. It might be quite common for a conservative form to acquire formal, archaic or otherwise marked connotations towards the end of a change. It might be worthwhile to attempt a fresh reflection on the relation between lexical diffusion and the CRH. Since most syntactic changes concern abstract patterns instantiated by concrete lexical items, they may involve some degree of lexical diffusion. It seems important for those cases to come with explicit assumptions and analysis techniques. All of these questions seem to me worthy of pursuit in the future.

6 Conclusion

This study subjected the CRH to a new falsification attempt using an innovation in American English possessive have. It should, in my estimation, count as the most demanding one ever to have done so. The result showed that the data is compatible with the CRH. It would therefore be rational to update one’s prior belief about the CRH and move towards accepting it as an ordinary null hypothesis.

Corresponding author: Richard Zimmermann, The University of Manchester, Manchester, UK, E-mail: richard.zimmermann@manchester.ac.uk

Acknowledgements

This work has now been evolving for ten years. Originally starting out as a side-project, it became more important when I spent some time at UPenn studying under Tony Kroch. Alas, Tony has not lived to see this paper finalized. If he was still around, he would undoubtedly have something witty to say about it. He is missed. Since this study would have been entirely impossible without his academic foundations, he first and foremost deserves acknowledgment. Thank you! For expressions of my gratitude to linguists who have influenced my thinking up to 2017, see the related chapter in my PhD dissertation. Thanks are due to three reviewers for a previously rejected version of this paper. Among those, Aaron Ecay deserves special mention. I would also like to say thanks to Beatrice Santorini, David Willis and Eric Haeberli for their input at that time. I am grateful to three reviewers of CLLT for their useful feedback, especially to Henri Kauhanen. Finally, I would like to thank Stefanie Wulff, among others, for her patience.

References

Amato, Roberta, Lucas Lacasa, Albert Díaz-Guilera & Andrea Baronchellid. 2018. The dynamics of norm change in the cultural evolution of language. Proceedings of the National Academy of Sciences 115(33). 8260–8265.10.1073/pnas.1721059115Search in Google Scholar

Bailey, Charles-James. 1973. Variation and linguistic theory. Washington, D.C.: Center for Applied Linguistics.Search in Google Scholar

Blythe, Richard, Gareth Baxter, William Croft & Alan McKane. 2009. Modeling language change: An evaluation of Trudgill’s theory of the emergence of New Zealand English. Language Variation and Change 21. 257–296.10.1017/S095439450999010XSearch in Google Scholar

Blythe, Richard A. & William Croft. 2012. S-curves and the mechanism of propagation in language change. Language 88(2). 269–304.10.1353/lan.2012.0027Search in Google Scholar

Chomsky, Noam. 1986. Barriers. Cambridge, Massachusetts: MIT Press.Search in Google Scholar

Corley, Kerry. 2014. The constant rate hypothesis in syntactic change: Empirical fact or lies, damned lies and statistics? Cambridge: University of Cambridge BA thesis.Search in Google Scholar

Culicover, Peter. 2008. The rise and fall of constructions and the history of English do-support. Journal of Germanic Linguistics 20(1). 1–52.10.1017/S1470542708000019Search in Google Scholar

Davies, Mark. 2010. The corpus of historical American English: 400 million words, 1810-2009. http://corpus.byu.edu/coha/ (accessed 10 September 2016).Search in Google Scholar

Davies, Mark. 2012. Expanding horizons in historical linguistics with the 400 million word corpus of historical American English. Corpora 7. 121–157.10.3366/cor.2012.0024Search in Google Scholar

De Smet, Hendrik, Frauke D’hoedt, Lauren Fonteyn & Kristel Van Goethem. 2018. The changing functions of competing forms: Attraction and differentiation. Cognitive Linguistics 29(2). 197–234.10.1515/cog-2016-0025Search in Google Scholar

Denison, David. 2003. Log(Ist)Ic and simplistic S-curves. In Hickey Raymond (ed.), Motives for language change, 54–70. Cambridge: Cambridge University Press.10.1017/CBO9780511486937.005Search in Google Scholar

Ecay, Aaron. 2015. A multi-step analysis of the evolution of English do-support. Philadelphia: University of Pennsylvania dissertation.Search in Google Scholar

Ellegård, Alvar. 1953. The auxiliary do: The establishment and regulation of its use in English. Gothenburg studies in English. Stockholm: Almqvist & Wiksell.Search in Google Scholar

Feltgen, Quentin. 2017. Statistical physics of language evolution: The grammaticalization phenomenon. Paris: L’Université de Recherche Paris Sciences et Lettres dissertation.Search in Google Scholar

Fruehwald, Joseph, Jonathan Gress-Wright & Joel Wallenberg. 2013. Phonological rule change: The constant rate effect. In Seda Kan, Claire Moore-Cantwell & Robert Staubs (eds.), NELS 40: Proceedings of the 40th Annual Meeting of the North East Linguistic Society, 219–230. Cambridge, Massachusetts: GLSA Publications.Search in Google Scholar

Gardner, Matt H., Derek Denis, Marisa Brook & Sali A. Tagliamonte. 2020. Be like and the constant rate effect: From the bottom to the top of the S-curve. English Language and Linguistics (First View). 1–44.10.1017/S1360674320000076Search in Google Scholar

Gries, Stefan Th. & Anatol Stefanowitsch. 2004. Extending collostructional analysis: A corpus-based perspective on alternations. International Journal of Corpus Linguistics 9(1). 97–129.10.1075/ijcl.9.1.06griSearch in Google Scholar

Gries, Stefan Th. 2015. The most under-used statistical method in corpus linguistics: Multi-level (and mixed-effects) models. Corpora 10(1). 95–125.10.3366/cor.2015.0068Search in Google Scholar

Haeberli, Eric & Tabea Ihsane. 2016. Revisiting the loss of verb movement in the history of English. Natural Language & Linguistic Theory 34(2). 497–542.10.1007/s11049-015-9312-xSearch in Google Scholar

Haegeman, Liliane. 1989. Be going to and will: A pragmatic account. Journal of Linguistics 25. 291–317.10.1017/S0022226700014110Search in Google Scholar

Huddleston, Rodney. 1976. Some theoretical issues in the description of the English verb. Lingua 40(4). 331–383.10.1016/0024-3841(76)90084-XSearch in Google Scholar

Johnson, Daniel E. 2009. Getting off the goldvarb standard: Introducing rbrul for mixed-effects variable rule analysis. Language and Linguistics Compass 3(1). 359–383.10.1111/j.1749-818X.2008.00108.xSearch in Google Scholar

Joseph, Brian D. & Richard D. Janda. 2003. On language, change, and language change – Or, of history, linguistics, and historical linguistics. In Joseph, Brian D. & Richard D. Janda (eds.), The handbook of historical linguistics, 3–180. Oxford: Blackwell.10.1111/b.9781405127479.2004.00002.xSearch in Google Scholar

Kastovsky, Dieter. 2012. Linguistic levels: Syntax. In Bergs Alexander & Laurel J. Brinton (eds.), English historical linguistics: An international handbook, vol. 1, 148–164. Berlin: De Gruyter.Search in Google Scholar

Kauhanen, Henri & George Walkden. 2018. Deriving the constant rate effect. Natural Language & Linguistic Theory 36(2). 483–521.10.1007/s11049-017-9380-1Search in Google Scholar

Kauhanen, Henri & George Walkden. 2019. When is a constant rate truly constant? A Monte Carlo power analysis of the logistic operationalization of constant rate effects. Paper presented at the Symposium on Representations, Usage and Social Embedding in Language Change (RUSE). University of Manchester UK, 20-21 August.Search in Google Scholar

Koeneman, Olaf & Hedde Zeijlstra. 2014. The rich agreement hypothesis rehabilitated. Linguistic Inquiry 45(4). 571–615.10.1162/LING_a_00167Search in Google Scholar

Kroch, Anthony. 1989. Reflexes of grammar in patterns of language change. Language Variation and Change 1(3). 199–244.10.1017/S0954394500000168Search in Google Scholar

Kroch, Anthony. 1994. Morphosyntactic variation. In Beals Katie (ed.), Proceedings of the Thirtieth Annual Meeting of the Chicago Linguistics Society, 180–201. Chicago: Chicago Linguistics Society.Search in Google Scholar

Labov, William. 2001. Principles of language change volume 2: Social factors. Oxford: Blackwell.Search in Google Scholar

Labov, William. 2007. Transmission and diffusion. Language 83(2). 344–387.10.1353/lan.2007.0082Search in Google Scholar

Langacker, Ronald. 1987. Foundations of cognitive grammar. Volume 1: Theoretical prerequisites. Stanford, CA: Stanford University Press.Search in Google Scholar

Lorenz, David. 2016. Form does not follow function, but variation does: The origin and early usage of possessive have got in English. English Language and Linguistics 20(3). 487–510.10.1017/S1360674316000332Search in Google Scholar

Mufwene, Salikoko. 2009. The evolution of language: Hints from creoles and pidgins. In Minett James & William S-Y Wang (eds.), Language evolution and the brain, 1–33. Hong Kong: City University of Hong Kong Press.Search in Google Scholar

Myler, Neil. 2016. Building and interpreting possession sentences. Cambridge, Massachusetts: MIT Press.10.7551/mitpress/9780262034913.001.0001Search in Google Scholar

Nevalainen, Terttu & Helena Raumolin-Brunberg. 2003. Historical sociolinguistics: Language change in Tudor & Stuart England. London: Pearson Educational.Search in Google Scholar

Newberry, Mitchell G., Christopher A. Ahern, Robin Clark & Joshua B. Plotkin. 2017. Detecting evolutionary forces in language change. Nature 551. 223–226.10.1038/nature24455Search in Google Scholar

Ogura, Mieko. 1993. The development of periphrastic do in English: A case of lexical diffusion in syntax. Diachronica 10(1). 51–85.10.1075/dia.10.1.04oguSearch in Google Scholar

Ogura, Mieko & William S.-Y. Wang. 1996. Snowball effect in lexical diffusion: The development of –S in the third person singular present indicative in English. In Britton Derek (ed.), English historical linguistics 1994: Papers from the 8th International Conference on English Historical Linguistics, 119–141. Philadelphia: John Benjamins.10.1075/cilt.135.10oguSearch in Google Scholar

Paolillo, John C. 2011. Independence claims in linguistics. Language Variation and Change 23(2). 257–274.10.1017/S0954394511000081Search in Google Scholar

Pintzuk, Susan. 1996. Variation and change in Old English clause structure. Language Variation and Change 7(2). 229–260.10.1017/S0954394500001009Search in Google Scholar

Pintzuk, Susan. 2003. Variationist approaches to syntactic change. In Joseph Brian D & Richard D. Janda (eds.), The handbook of historical linguistics, 509–528. Oxford: Blackwell.10.1002/9780470756393.ch15Search in Google Scholar

Pollock, Jean-Yves. 1989. Verb movement, universal grammar, and the structure of IP. Linguistic Inquiry 20(3). 365–424.Search in Google Scholar

Postma, Gertjan. 2017. Modelling transient states in language change. In Mathieu Eric & Robert Truswell (eds.), Micro-change and macro-change in diachronic syntax, 75–93. Oxford: Oxford University Press.10.1093/oso/9780198747840.003.0006Search in Google Scholar

Rickford, John. 1999. African American vernacular English: Features and use, evolution, and educational implications. Oxford: Blackwell.Search in Google Scholar

Ritt, Nikolaus. 1995. Language change as evolution: Looking for linguistic genes. Vienna English Working Papers 4(1). 43–56.Search in Google Scholar

Roberts, Ian G. 1985. Agreement parameters and the development of English modal auxiliaries. Natural Language & Linguistic Theory 3. 21–58.10.1007/BF00205413Search in Google Scholar

Roberts, Ian G. 2007. Diachronic syntax. Oxford: Oxford University Press.Search in Google Scholar

Roeper, Thomas. 1999. Universal bilingualism. Bilingualism: Language and Cognition 2(3). 169–186.10.1017/S1366728999000310Search in Google Scholar

Rohrbacher, Bernhard. 1999. Morphology-driven syntax: A theory of V-to-I raising and pro-drop. Los Angeles: Sage.10.1075/la.15Search in Google Scholar

Santorini, Beatrice. 1993. The rate of phrase structure change in the history of Yiddish. Language Variation and Change 5. 257–283.10.1017/S0954394500001502Search in Google Scholar

Simonenko, Alexandra, Crabbé Benoit & Sophie Prévost. 2019. Agreement syncretization and the loss of null subjects: Quantificational models for medieval French. Language Variation and Change 31. 275–301.10.1017/S0954394519000188Search in Google Scholar

Tagliamone, Sali. 2003. Every place has a different toll: Determinants of grammatical variation in cross-variety perspective. In Rohdenburg Günter & Britta Mondorf (eds.), Determinants of grammatical variation in English, 531–554. Berlin & New York: Mouton de Gruyter.10.1515/9783110900019.531Search in Google Scholar

Tagliamone, Sali. 2013. Roots of English: Exploring the history of dialects. Cambridge: Cambridge University Press.Search in Google Scholar

Taylor, Ann & Susan Pintzuk. 2012. Verb order, object position and information status in Old English. York Papers in Linguistics 2. 29–52.Search in Google Scholar

Traugott, Elizabeth C. & Graeme Trousdale. 2010. Gradience, gadualness and grammaticalization: How do they intersect? In Traugott, Elizabeth C. & Graeme Trousdale (eds.), Gradience, gradualness and grammaticalization, 19–44. Philadelphia: John Benjamins.10.1075/tsl.90.04traSearch in Google Scholar

Trudgill, Peter, Terttu Nevalainen & Ilse Wischer. 2002. Dynamic have in north American and British Isles English. English Language and Linguistics 6(1). 1–15.10.1017/S1360674302001016Search in Google Scholar

Vulanović, Relja & Harald R. Baayen. 2006. Fitting the development of periphrastic do in all sentence types. In Grzybek Peter & Reinhard Köhler (eds.), Exact methods in the study of language and text: Dedicated to Gabriel Altmann on the Occasion of his 75th Birthday, 679–688. Berlin: Walter De Gruyter.10.1515/9783110894219.679Search in Google Scholar

Wallenberg, Joel. 2016. Extraposition is disappearing. Language 92(4). e237–e256.10.1353/lan.2016.0079Search in Google Scholar

Warner, Anthony. 2005. Why do dove: Evidence for register variation in early modern English negatives. Language Variation and Change 17(3). 257–280.10.1017/S0954394505050106Search in Google Scholar

Zehentner, Eva. 2019. Competition in language change: The rise of the English dative alternation. Berlin: De Gruyter.10.1515/9783110633856Search in Google Scholar

Received: 2021-04-28

Accepted: 2022-05-24

Published Online: 2022-06-20

Published in Print: 2023-10-26

This work is licensed under the Creative Commons Attribution 4.0 International License.

Articles in the same Issue

https://doi.org/10.1515/cllt-2021-0038

Keywords for this article

big data; constant rate hypothesis; corpus linguistics; diachronic syntax; late Modern English; mixed-effects logistic regression; possessive have; syntactic change

Creative Commons

BY 4.0