Good enough for Galton, and much more: commentary on “Replication and methodological robustness in quantitative typology” by Becker and Guzmán Naranjo

Chundra A. Cathcart

doi:10.1515/lingty-2025-0030

Published/Copyright: July 25, 2025

From the journal Linguistic Typology, Volume 29, Issue 3

In recent years, the language sciences have adopted and refined philosophies and tools for data analysis employed in related fields like evolutionary anthropology (McElreath 2016) and disciplines like biology where analogous phenomena (e.g., trait evolution; Garamszegi 2014) are studied. These methodological developments have not been without controversy. In particular, the perceived encroachment upon linguistic topics by tools associated with other disciplines has led to a degree of tension within the language sciences. The conflict is often framed as one between traditional methods (e.g., stratified sampling) and phylogenetic comparative methods (PCMs; for a recent revival of this debate see e.g. Haspelmath 2020), the latter of which can be challenging and computationally intensive to implement, not to mention black box-like.

However, if computational difficulty is the main drawback of PCMs, then the key issues of this debate may be misconstrued. PCMs as we use them today are usually implemented in a Bayesian framework, which estimates posterior distributions of parameters (i.e., estimands that we do not observe and wish to infer) via iterative sampling-based procedures like Markov chain Monte Carlo (MCMC), when no analytic (i.e., paper and pencil) solution is available (for an alternative framework, see Blei et al. 2017). Frequentist statistical methods (e.g., chi-squared tests, classical regression) generally operate within a Maximum Likelihood (ML) framework and are faster to implement, but many of these models are restrictive in terms of what they allow to be analyzed and make arcane assumptions that are poorly understood. Despite their computationally intensive nature, Bayesian methods are more flexible, giving more freedom to practitioners in terms of model building. Yet the “costly/complex” (Bayesian) versus “cheap/simple” (ML) dichotomy cuts across less and more “traditional” families of models. One can find fast, ML implementations of PCMs (Harmon 2019), just as one finds Bayesian t-tests and Bayesian regression (Kruschke 2014).
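
As a minimal sketch of what sampling-based estimation involves (it is not taken from the target article), the following Python snippet infers the proportion of languages with some feature from invented data: 7 of 20 sampled languages. A flat prior is assumed, so a closed-form Beta posterior happens to exist and serves as a check on the Metropolis sampler. The function names, step size, and burn-in length are all illustrative assumptions.

```python
import math
import random


def log_posterior(theta, k, n):
    """Unnormalized log posterior of a proportion under a flat prior (assumed)."""
    if not 0.0 < theta < 1.0:
        return -math.inf
    return k * math.log(theta) + (n - k) * math.log(1.0 - theta)


def metropolis(k, n, n_samples=20000, step=0.1, seed=1):
    """Random-walk Metropolis sampler; step size and seed are arbitrary choices."""
    rng = random.Random(seed)
    theta = 0.5
    current = log_posterior(theta, k, n)
    samples = []
    for _ in range(n_samples):
        proposal = theta + rng.gauss(0.0, step)
        candidate = log_posterior(proposal, k, n)
        # Accept with probability min(1, posterior ratio).
        if rng.random() < math.exp(min(0.0, candidate - current)):
            theta, current = proposal, candidate
        samples.append(theta)
    return samples


# Invented data: 7 of 20 sampled languages show the feature.
samples = metropolis(k=7, n=20)
kept = samples[2000:]  # discard an (arbitrary) burn-in period
mcmc_mean = sum(kept) / len(kept)

# With a flat prior the posterior is Beta(k + 1, n - k + 1), so the mean is known.
analytic_mean = (7 + 1) / (20 + 2)

print(f"MCMC estimate of posterior mean: {mcmc_mean:.3f}")
print(f"Analytic posterior mean:         {analytic_mean:.3f}")
```

In a model with phylogenetic and spatial terms no such closed form is available, which is where samplers of this general kind (in practice far more efficient ones) become necessary.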

The authors of the target article (Becker and Guzmán Naranjo 2025) use a version of Bayesian regression that accounts for phylogenetic and spatial non-independence in a highly flexible manner. As such, this methodology is a continuation of a more familiar one (regression), but it incorporates properties of PCMs and involves a more complex parameterization and model-fitting procedure than standard regression models (even more so than mixed-effects models implemented in R packages like lme4; Bates et al. 2015). In general, I believe that the target article makes the benefits of this approach very clear. Certainly, this type of modeling strategy should be part of the methodological arsenal of anyone wishing to establish associations between linguistic variables while accounting for phylogenetic and spatial non-independence. I wish, however, to highlight how the freedom provided by this framework forces users to make a number of non-trivial decisions, and to emphasize that, broadly speaking, PCMs allow practitioners to treat phylogeny not simply as a nuisance factor that must be controlled for, but as a means of shedding light on the complex scenarios underlying the associations we detect synchronically.

The flexibility of Bayesian methods is one of their major draws, and can be both beneficial and challenging. While for many frequentist methods, key assumptions are built into the model, Bayesian practitioners must make many assumptions themselves over the course of model building. Even the most standard-seeming assumptions may not be trivial. When we construct a simple model with a small number of parameters, what we may see as a fairly neutral decision can also be construed as an active choice between restrictiveness and relaxation (e.g., Lemey et al. 2010). Other choices in model building may embody highly divergent understandings of the processes giving rise to the data. One ostensibly trivial decision I wish to highlight is the authors’ use of phylogenetic regression to control for phylogenetic effects. Given the scope of the paper, the authors understandably avoid technical details, but more specifically, what they assume is that linguistic features evolve according to Brownian motion (which like the spatial model they employ is a type of Gaussian process, albeit with a different kernel, i.e., the function determining the covariance between languages). Brownian motion is just one of a large suite of models of phylogenetic trait evolution (Blomberg et al. 2020; Gill et al. 2017; Landis et al. 2013), not all of which are Gaussian processes, and not all of which can be combined with a regression model in a straightforward manner. The assumptions of Brownian motion are fairly simple – the covariance between two taxa in a phylogeny with respect to some trait value is proportional to their shared history (i.e., the displacement in time between the root of the tree and the most recent common ancestor of the two taxa). 
Under Brownian motion, the variance in a value’s displacement is proportional to the time elapsed during displacement, but the expected displacement is zero – Brownian motion does not accommodate long-term preferences or biases in change, unlike the mean-reverting Ornstein-Uhlenbeck process, a type of Gaussian process that can be included in a regression model using a kernel function similar to what the authors use to model spatial effects (Butler and King 2004; Ringen et al. 2021). As a mean-reverting process, the Ornstein-Uhlenbeck model assumes that while trait values can exhibit random walk-like behavior (as in Brownian motion), they are drawn back to some optimal value according to a parameter representing strength of selection. While it may be challenging to find exact linguistic analogs for all parameters in this process, it is conceptually not unreasonable to posit that some linguistic features may exhibit more mean-reverting behavior than others. Is it necessary to investigate this possibility when controlling for statistical biases? Perhaps not, although it is impossible to be certain that the Brownian assumption is entirely unproblematic, at least without more detailed investigation, including simulation studies. My point here is not that researchers should test all possible evolutionary scenarios – this would be unfeasible – but that even fairly trivial-seeming choices may not be so, and this issue underscores the fact that multiple modeling approaches are needed to understand the conditions under which certain associations between features obtain.
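
The difference between the two covariance structures can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the target article's implementation: the three-language tree is invented, the function names are hypothetical, and the Ornstein-Uhlenbeck kernel shown is the common stationary form, in which covariance decays exponentially with the patristic distance between two languages.

```python
import math


def bm_covariance(shared_time, sigma2=1.0):
    """Brownian motion: covariance is proportional to shared history."""
    return [[sigma2 * s for s in row] for row in shared_time]


def ou_covariance(shared_time, alpha, sigma2=1.0):
    """Stationary Ornstein-Uhlenbeck kernel (an assumed parameterization).

    alpha is the strength of attraction to the optimum; covariance decays
    exponentially with patristic distance between tips.
    """
    n = len(shared_time)
    stationary_var = sigma2 / (2.0 * alpha)
    cov = []
    for i in range(n):
        row = []
        for j in range(n):
            # Patristic distance: path length between tips i and j on the tree.
            patristic = (shared_time[i][i] + shared_time[j][j]
                         - 2.0 * shared_time[i][j])
            row.append(stationary_var * math.exp(-alpha * patristic))
        cov.append(row)
    return cov


# Invented tree of depth 1.0: languages A and B share 0.6 units of history
# after the root; language C branches off at the root.
shared_time = [
    [1.0, 0.6, 0.0],
    [0.6, 1.0, 0.0],
    [0.0, 0.0, 1.0],
]

bm = bm_covariance(shared_time)
ou = ou_covariance(shared_time, alpha=2.0)

for name, matrix in (("Brownian motion", bm), ("Ornstein-Uhlenbeck", ou)):
    print(name)
    for row in matrix:
        print("  " + "  ".join(f"{value:.3f}" for value in row))
```

Under Brownian motion the variance of each tip grows with tree depth, whereas under the stationary Ornstein-Uhlenbeck kernel it is capped at sigma2 / (2 * alpha). Larger values of alpha erode the covariance between related languages more quickly.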

With diachronic change being as complex as it is, it is worth emphasizing that PCMs, of which phylogenetic regression is just one example, offer us more than just a means of accounting for Galton’s problem, that is, the issue of phylogenetic non-independence (Narroll 1961). Indeed, many authors using more complex phylogenetic models are content to see their results as demonstrating a relationship between multiple variables while accounting for phylogenetic history (Jäger and Wahle 2021) and to leave matters there. But there is more to be explored beyond simply detecting whether there is evidence for an association – we can ask questions regarding the dynamics that led to said association (Cathcart et al. 2020; Craevschi et al. 2025; Sheehan et al. 2023). For instance, if we detect an inverse relationship between complexity in different domains of morphosyntax using phylogenetic methods, we can investigate whether one domain simplifies before the other, or whether simplification is simultaneous in both domains. Similar models probe whether linguistic migration precedes putatively environmentally driven linguistic changes (Hartmann et al. 2024). Phylogenetic models that are more explicit with respect to their assumptions about the nature of change might additionally be able to shed light on evolutionary scenarios between which regression-based approaches are unable to distinguish. And while most of the phylogenetic approaches in the literature do not explicitly model contact, they may indirectly absorb information regarding areality in an interpretable fashion (e.g., if rates of change on a branch are higher than expected). At the end of the day, it is likely impossible to incorporate all relevant dimensions of linguistic change into a single model; hence, a multi-pronged approach is needed to fully understand the processes that have given rise to what we observe synchronically. The approach outlined in the target article is an important puzzle piece, but one of many.

Corresponding author: Chundra A. Cathcart [ˌʧ͡ʌndɹə ˈkæθkɑɹt], Institute for the Interdisciplinary Study of Language Evolution, University of Zurich, Zürich, Switzerland, E-mail: chundra.cathcart@uzh.ch

References

Bates, D., M. Mächler, B. Bolker & S. Walker. 2015. Fitting linear mixed-effects models using lme4. Journal of Statistical Software 67(1). 1–48. https://doi.org/10.18637/jss.v067.i01.

Becker, L. & M. Guzmán Naranjo. 2025. Replication and methodological robustness in quantitative typology. Linguistic Typology 29(3). 463–505. https://doi.org/10.1515/lingty-2023-0076.

Blei, D. M., A. Kucukelbir & J. D. McAuliffe. 2017. Variational inference: A review for statisticians. Journal of the American Statistical Association 112(518). 859–877. https://doi.org/10.1080/01621459.2017.1285773.

Blomberg, S. P., S. I. Rathnayake & C. M. Moreau. 2020. Beyond Brownian motion and the Ornstein-Uhlenbeck process: Stochastic diffusion models for the evolution of quantitative characters. The American Naturalist 195(2). 145–165. https://doi.org/10.1086/706339.

Butler, M. A. & A. A. King. 2004. Phylogenetic comparative analysis: A modeling approach for adaptive evolution. The American Naturalist 164(6). 683–695. https://doi.org/10.2307/3473229.

Cathcart, C. A., A. Hölzl, G. Jäger, P. Widmer & B. Bickel. 2020. Numeral classifiers and number marking in Indo-Iranian: A phylogenetic approach. Language Dynamics and Change 1(aop). 1–53. https://doi.org/10.1163/22105832-bja10013.

Craevschi, A., S. Babinski & C. Cathcart. 2025. Semantics drives analogical change in Germanic strong verb paradigms: A phylogenetic study. https://arxiv.org/html/2502.17670v1 (accessed 24 February 2025).

Garamszegi, L. Z. (ed.). 2014. Modern phylogenetic comparative methods and their application in evolutionary biology: Concepts and practice. Heidelberg, New York, Dordrecht, London: Springer. https://doi.org/10.1007/978-3-662-43550-2.

Gill, M. S., L. S. Tung Ho, G. Baele, P. Lemey & M. A. Suchard. 2017. A relaxed directional random walk model for phylogenetic trait evolution. Systematic Biology 66(3). 299–319. https://doi.org/10.1093/sysbio/syw093.

Harmon, L. 2019. Phylogenetic comparative methods. https://lukejharmon.github.io/pcm/ (accessed 14 October 2022). https://doi.org/10.32942/OSF.IO/E3XNR.

Hartmann, F., S. G. Roberts, P. Valdes & R. Grollemund. 2024. Investigating environmental effects on phonology using diachronic models. Evolutionary Human Sciences 6. e8. https://doi.org/10.1017/ehs.2023.33.

Haspelmath, M. 2020. Some issues with the correlated-evolution method for testing causal hypotheses in comparative linguistics. Diversity Linguistics Comment. https://doi.org/10.58079/nsvs (accessed 3 February 2025).

Jäger, G. & J. Wahle. 2021. Phylogenetic typology. Frontiers in Psychology 12. 682132. https://doi.org/10.3389/fpsyg.2021.682132.

Kruschke, J. 2014. Doing Bayesian data analysis: A tutorial with R, JAGS, and Stan. Amsterdam: Academic Press. https://doi.org/10.1016/B978-0-12-405888-0.00008-8.

Landis, M. J., J. G. Schraiber & M. Liang. 2013. Phylogenetic analysis using Lévy processes: Finding jumps in the evolution of continuous traits. Systematic Biology 62(2). 193–204. https://doi.org/10.1093/sysbio/sys086.

Lemey, P., A. Rambaut, J. J. Welch & M. A. Suchard. 2010. Phylogeography takes a relaxed random walk in continuous space and time. Molecular Biology and Evolution 27(8). 1877–1885. https://doi.org/10.1093/molbev/msq067.

McElreath, R. 2016. Statistical rethinking: A Bayesian course with examples in R and Stan. Boca Raton: CRC Press.

Narroll, R. 1961. Two solutions to Galton’s problem. Philosophy of Science 28. 15–29. https://doi.org/10.1086/287778.

Ringen, E., J. S. Martin & A. Jaeggi. 2021. Novel phylogenetic methods reveal that resource-use intensification drives the evolution of “complex” societies. https://ecoevorxiv.org/repository/object/4119/download/8181/ (accessed 18 May 2021). https://doi.org/10.32942/OSF.IO/WFP95.

Sheehan, O., J. Watts, R. D. Gray, J. Bulbulia, S. Claessens, E. J. Ringen & Q. D. Atkinson. 2023. Coevolution of religious and political authority in Austronesian societies. Nature Human Behaviour 7(1). 38–45. https://doi.org/10.1038/s41562-022-01471-y.

Published Online: 2025-07-25

Published in Print: 2025-10-27

This work is licensed under the Creative Commons Attribution 4.0 International License.
