In recent years, the language sciences have adopted and refined philosophies and tools for data analysis employed in related fields like evolutionary anthropology (McElreath 2016) and disciplines like biology where analogous phenomena (e.g., trait evolution; Garamszegi 2014) are studied. These methodological developments have not been without controversy. In particular, the perceived encroachment upon linguistic topics by tools associated with other disciplines has led to a degree of tension within the language sciences. The conflict is often framed as one between traditional methods (e.g., stratified sampling) and phylogenetic comparative methods (PCMs; for a recent revival of this debate see e.g. Haspelmath 2020), the latter of which can be challenging and computationally intensive to implement, not to mention black box-like.
However, if computational difficulty is the main drawback of PCMs, then the key issues of this debate may be misconstrued. PCMs as we use them today are usually implemented in a Bayesian framework, which estimates posterior distributions of parameters (i.e., estimands that we do not observe and wish to infer) via iterative sampling-based procedures like Markov chain Monte Carlo (MCMC), when no analytic (i.e., paper and pencil) solution is available (for an alternative framework, see Blei et al. 2017). Frequentist statistical methods (e.g., chi-squared tests, classical regression) generally operate within a Maximum Likelihood (ML) framework and are faster to implement, but many of these models are restrictive in terms of what they allow to be analyzed and make arcane assumptions that are poorly understood. Despite their computationally intensive nature, Bayesian methods are more flexible, giving more freedom to practitioners in terms of model building. Yet the “costly/complex” (Bayesian) versus “cheap/simple” (ML) dichotomy cuts across less and more “traditional” families of models. One can find fast, ML implementations of PCMs (Harmon 2019), just as one finds Bayesian t-tests and Bayesian regression (Kruschke 2014).
The target article’s (Becker and Guzmán Naranjo 2025) authors use a version of Bayesian regression that accounts for phylogenetic and spatial non-independence in a highly flexible manner. As such, this methodology is a continuation of a more familiar one (regression), but it incorporates properties of PCMs and involves a more complex parameterization and model-fitting procedure than standard regression models (even more so than mixed-effects models implemented in R packages like lme4, Bates et al. 2015). In general, I believe that the target article makes very clear the benefits of this approach. Certainly, this type of modeling strategy should be part of the methodological arsenal of anyone wishing to establish associations between linguistic variables while accounting for phylogenetic and spatial non-independence. I wish, however, to highlight how the freedom provided by this framework forces users to make a number of non-trivial decisions, and to emphasize that broadly speaking, PCMs allow practitioners to treat phylogeny not simply as a nuisance factor which must be controlled for, but as a means of shedding light on the complex scenarios underlying the associations we detect synchronically.
The flexibility of Bayesian methods is one of their major draws, and can be both beneficial and challenging. While for many frequentist methods, key assumptions are built into the model, Bayesian practitioners must make many assumptions themselves over the course of model building. Even the most standard-seeming assumptions may not be trivial. When we construct a simple model with a small number of parameters, what we may see as a fairly neutral decision can also be construed as an active choice between restrictiveness and relaxation (e.g., Lemey et al. 2010). Other choices in model building may embody highly divergent understandings of the processes giving rise to the data. One ostensibly trivial decision I wish to highlight is the authors’ use of phylogenetic regression to control for phylogenetic effects. Given the scope of the paper, the authors understandably avoid technical details, but more specifically, what they assume is that linguistic features evolve according to Brownian motion (which like the spatial model they employ is a type of Gaussian process, albeit with a different kernel, i.e., the function determining the covariance between languages). Brownian motion is just one of a large suite of models of phylogenetic trait evolution (Blomberg et al. 2020; Gill et al. 2017; Landis et al. 2013), not all of which are Gaussian processes, and not all of which can be combined with a regression model in a straightforward manner. The assumptions of Brownian motion are fairly simple – the covariance between two taxa in a phylogeny with respect to some trait value is proportional to their shared history (i.e., the displacement in time between the root of the tree and the most recent common ancestor of the two taxa). Under Brownian motion, the variance in a value’s displacement is proportional to the time elapsed during displacement, but the expected displacement is zero – Brownian motion does not accommodate long-term preferences or biases in change, unlike the mean-reverting Ornstein-Uhlenbeck process, a type of Gaussian process that can be included in a regression model using a kernel function similar to what the authors use to model spatial effects (Butler and King 2004; Ringen et al. 2021). As a mean-reverting process, the Ornstein-Uhlenbeck model assumes that while trait values can exhibit random walk-like behavior (as in Brownian motion), they are drawn back to some optimal value according to a parameter representing strength of selection. While it may be challenging to find exact linguistic analogs for all parameters in this process, it is conceptually not unreasonable to posit that some linguistic features may exhibit more mean-reverting behavior than others. Is it necessary to investigate this possibility when controlling for statistical biases? Perhaps not, although it is impossible to be certain that the Brownian assumption is entirely unproblematic, at least without more detailed investigation, including simulation studies. My point here is not that researchers should test all possible evolutionary scenarios – this would be unfeasible – but that even fairly trivial-seeming choices may not be so, and this issue underscores the fact that multiple modeling approaches are needed to understand the conditions under which certain associations between features obtain.
With diachronic change being as complex as it is, it is worth emphasizing that PCMs, of which phylogenetic regression is just one example, offer us more than just a means of accounting for Galton’s problem, that is, the issue of phylogenetic non-independence (Narroll 1961). Indeed, many authors using more complex phylogenetic models are content to see their results as demonstrating a relationship between multiple variables while accounting for phylogenetic history (Jäger and Wahle 2021) and to leave matters there. But there is more to be explored beyond simply detecting whether there is evidence for an association – we can ask questions regarding the dynamics that led to said association (Cathcart et al. 2020; Craevschi et al. 2025; Sheehan et al. 2023). For instance, if we detect an inverse relationship between complexity in different domains of morphosyntax using phylogenetic methods, we can investigate whether one domain simplifies before the other, or whether simplification is simultaneous in both domains. Similar models probe whether linguistic migration precedes putatively environmentally driven linguistic changes (Hartmann et al. 2024). Phylogenetic models that are more explicit with respect to their assumptions about the nature of change might additionally be able to shed light on evolutionary scenarios between which regression-based approaches are unable to distinguish. And while most of the phylogenetic approaches in the literature do not explicitly model contact, they may indirectly absorb information regarding areality in an interpretable fashion (e.g., if rates of change on a branch are higher than expected). At the end of the day, it is likely impossible to incorporate all relevant dimensions of linguistic change into a single model; hence, a multi-pronged approach is needed to fully understand the processes that have given rise to what we observe synchronically. The approach outlined in the target article is an important puzzle piece, but one of many.
References
Bates, D., M. Mächler, B. Bolker & S. Walker. 2015. Fitting linear mixed-effects models using lme4. Journal of Statistical Software 67(1). 1–48. https://doi.org/10.18637/jss.v067.i01.Search in Google Scholar
Becker, Laura & M. Guzmán Naranjo. 2025. Replication and methodological robustness in quantitative typology. Linguistic Typology. https://doi.org/10.1515/lingty-2023-0076.Search in Google Scholar
Blei, D. M., A. Kucukelbir & J. D. McAuliffe. 2017. Variational inference: A review for statisticians. Journal of the American Statistical Association 112(518). 859–877. https://doi.org/10.1080/01621459.2017.1285773.Search in Google Scholar
Blomberg, S. P., S. I. Rathnayake & C. M. Moreau. 2020. Beyond Brownian motion and the Ornstein-Uhlenbeck process: Stochastic diffusion models for the evolution of quantitative characters. The American Naturalist 195(2). 145–165. https://doi.org/10.1086/706339.Search in Google Scholar
Butler, M. A. & A. A. King. 2004. Phylogenetic comparative analysis: A modeling approach for adaptive evolution. The American Naturalist 164(6). 683–695. https://doi.org/10.2307/3473229.Search in Google Scholar
Cathcart, C. A., A. Hölzl, G. Jäger, P. Widmer & B. Bickel. 2020. Numeral classifiers and number marking in Indo-Iranian: A phylogenetic approach. Language Dynamics and Change 1(aop). 1–53.10.1163/22105832-bja10013Search in Google Scholar
Craevschi, A., S. Babinski & C. Cathcart. 2025. Semantics drives analogical change in Germanic strong verb paradigms: A phylogenetic study. https://arxiv.org/html/2502.17670v1 (accessed 24 February 2025).Search in Google Scholar
Garamszegi, L. Z. (ed.). 2014. Modern phylogenetic comparative methods and their application in evolutionary biology: Concepts and practice. Heidelberg, New York, Dordrecht, London: Springer.10.1007/978-3-662-43550-2Search in Google Scholar
Gill, M. S., L. S. Tung Ho, G. Baele, P. Lemey & M. A. Suchard. 2017. A relaxed directional random walk model for phylogenetic trait evolution. Systematic Biology 66(3). 299–319. https://doi.org/10.1093/sysbio/syw093.Search in Google Scholar
Harmon, L. 2019. Phylogenetic comparative methods. https://lukejharmon.github.io/pcm/ (accessed 14 October 2022).10.32942/OSF.IO/E3XNRSearch in Google Scholar
Hartmann, F., S. G. Roberts, P. Valdes & R. Grollemund. 2024. Investigating environmental effects on phonology using diachronic models. Evolutionary Human Sciences 6. e8. https://doi.org/10.1017/ehs.2023.33.Search in Google Scholar
Haspelmath, M. 2020. Some issues with the correlated-evolution method for testing causal hypotheses in comparative linguistics. Diversity Linguistics Comment. https://doi.org/10.58079/nsvs (accessed 3 February 2025).Search in Google Scholar
Jäger, G. & J. Wahle. 2021. Phylogenetic typology. Frontiers in Psychology 12. 682132. https://doi.org/10.3389/fpsyg.2021.682132.Search in Google Scholar
Kruschke, J. 2014. Doing Bayesian data analysis: A tutorial with R, JAGS, and Stan. Amsterdam: Academic Press.10.1016/B978-0-12-405888-0.00008-8Search in Google Scholar
Landis, M. J., J. G. Schraiber & M. Liang. 2013. Phylogenetic analysis using Lévy processes: Finding jumps in the evolution of continuous traits. Systematic Biology 62(2). 193–204. https://doi.org/10.1093/sysbio/sys086.Search in Google Scholar
Lemey, P., A. Rambaut, J. J. Welch & M. A. Suchard. 2010. Phylogeography takes a relaxed random walk in continuous space and time. Molecular Biology and Evolution 27(8). 1877–1885. https://doi.org/10.1093/molbev/msq067.Search in Google Scholar
McElreath, R. 2016. Statistical rethinking: A Bayesian course with examples in R and Stan. Boca Raton: CRC Press.Search in Google Scholar
Narroll, R. 1961. Two solutions to Galton’s problem. Philosophy of Science 28. 15–29.10.1086/287778Search in Google Scholar
Ringen, E., J. S. Martin & A. Jaeggi. 2021. Novel phylogenetic methods reveal that resource-use intensification drives the evolution of “complex” societies. https://ecoevorxiv.org/repository/object/4119/download/8181/ (accessed 18 May 2021).10.32942/OSF.IO/WFP95Search in Google Scholar
Sheehan, O., J. Watts, R. D. Gray, J. Bulbulia, S. Claessens, E. J. Ringen & Q. D. Atkinson. 2023. Coevolution of religious and political authority in Austronesian societies. Nature Human Behaviour 7(1). 38–45. https://doi.org/10.1038/s41562-022-01471-y.Search in Google Scholar
© 2025 the author(s), published by De Gruyter, Berlin/Boston
This work is licensed under the Creative Commons Attribution 4.0 International License.