Abstract
In 2019, Eliud Kipchoge ran a sub-two hour marathon wearing Nike’s Alphafly shoes. Despite being the fastest marathon time ever recorded, it wasn’t officially recognized as race conditions were tightly controlled to maximize his success. Besides, Kipchoge’s use of Alphafly shoes was controversial, with some experts claiming that they might have provided an unfair competitive advantage. In this work, we assess the potential influence of advanced footwear technology and the likelihood of a sub-two hour marathon in official races, by studying the evolution of running top performances from 2001 to 2019 for long distances ranging from 10 km to marathon. The analysis is performed using extreme value theory, a field of statistics dealing with analysis of rare events. We find a significant evidence of performance-enhancement effect with a 10% increase of the probability that a new world record for marathon-men discipline is set in 2021. However, results suggest that achieving a sub-two hour marathon in an official race in 2021 is still very unlikely, and exceeds 10% probability only by 2025.
Acknowledgement
We acknowledge Harry Spearing for sharing code that provided good inspiration to our work.
-
Author contribution: All the authors have accepted responsibility for the entire content of this submitted manuscript and approved submission.
-
Research funding: The authors received no specific funding for this work.
-
Conflict of interest statement: The authors declare no conflicts of interest regarding this article.
Appendix A: Theory and model
A.1 Extremes for identically distributed variables
Extreme value theory (EVT) is a branch of statistics which studies the tails of probability distributions. It was first developed for block maxima (Gumbel 1958) analysis, but the Peaks Over Threshold (POT) method (Davison and Smith 1990) is often preferred, as it uses all the most extreme data, rather than just the maxima, typically leading to more efficient inference. Let X be a random variable with distribution function F, if there exist random sequences a n , b n > 0 such that
as n → ∞ is a non-degenerate limiting distribution, then for a large enough threshold u we can use the approximation
where σ u = σ + ξ(u − μ) > 0, a+ = max(a, 0). If ξ < 0 then x must lie in the interval [0, x H ], where x H = u − σ u /ξ is the upper limit of the distribution, whereas if ξ ≥ 0, x can take any positive value. The limit distribution H u , called Generalized Pareto distribution (GPD) motivates an approximation for large u, giving a model for the distribution of the exceedances above such threshold, regardless of the distribution F.
Given a large enough sample of n independent identically distributed (IID) observations, in the POT approach a threshold u is carefully chosen, and exceedances can be used to estimate the parameters of the GPD. Threshold choice can be rather subjective and case-dependent, and is subject to a bias-variance trade-off. In this paper we base our choice on graphical diagnostics; however, other alternative methods might also be suitable; see Scarrott and MacDonald (2012) for a detailed review of these techniques.
It is remarkable that the rate of the frequency of exceedances above the threshold u can be derived in a fashion that gives way to a more complete perspective of exceedances modelling, using point process models. Let X i be IID random variables with distribution function F, we define
where
where
Therefore we can construct a model for extreme tails with two components: a model for the number of exceedances, given by (4), which is Poisson distributed with mean
Consider the sequence of point processes on
where the scaling 1/(n + 1) in the first coordinate ensures that the time axis is continuous on (0, 1), and the sequences a
n
, b
n
are defined in (1). More precisely, on regions of the form [0, 1] × (u, ∞), where u is large enough such that (2) approximately holds, we have that P
n
→ P as n → ∞, where P is a non-homogeneous Poisson Process. Consequently, the integrated measure Λ of P on
and its intensity function is
with x > u and 0 < t ≤ 1. For statistical inference we assume that for large enough n, P
n
∼ P is a good approximation. The scaling coefficients a
n
, b
n
, can be absorbed into the intensity function, so we work directly with the series
A.2 Extremes of non-stationary sequences
The extreme value models derived so far are built on the assumption of IID variables. However, in our work, non-stationarity data arise due to the improvement of racing conditions over time, and the potential Vaporfly shoes. Therefore, we relax the identically distributed assumption by introducing a time-dependent structure, while keeping independence assumption. Indeed, the time variation for parameters θ(t) = {μ(t), σ(t), ξ(t)} will translate into a time-dependent rate of exceedances, and distribution of such exceedances. Under this covariate structure, the intensity of the non-homogeneous Poisson process P will be
Now, in the general case where we have n points
and the full likelihood is
The parameters
where σ u (t) = σ(t) + ξ(t){u − μ(t)}.
A.3 Model
For most disciplines (and specially for marathon-men) a linear dependence on time for the scale parameter of the GP distribution of the exceedances was best suited in AIC terms. The following parametrisation was used to incorporate such structural time dependence.
where d ∈ D is the superscript denoting discipline d, y(t) is the year corresponding to time t,
A.4 Expected running times of next new world record
As derived in Spearing et al. (2021), the expected new world record time for discipline d at year y will be
where
A.5 Probability of breaking a world record in a given year
Let
Therefore, let
where
A.6 Time until next world record is set
Let T(d) be the random variable describing the waiting time until a new world record is set for an discipline e, if we define t
y
= y − 2020, the probability
We can further estimate the expected waiting time until the world record is broken for any discipline e, which has the following expression
where Pr(R y ) is the probability that the world record is broken at year y, as described in (19).
A.7 Adjusting for AFT effect
Let x > u be a running-time recorded during year y > 2018, when Vaporfly and other shoes with technological advances are widely used in official races. We denote by x c the corrected or equivalent time of x if such shoes were not used. Its expression can be derived as in Spearing et al. (2021), obtaining
where
A.8 Breaking the 2 h marathon
Let Pr(B2 = y) be the probability the 2-h marathon being broken in a given year y, it follows from (19) that
where 2h ≔ −7200 and marM refers to the marathon-men discipline. Additionally, we could compute the cumulative probability of achieving a sub 2-h marathon before year y, which follows from (20)
Appendix B: Model estimates
Parameter estimates (with 95% confidence intervals) for the model.
Discipline |
|
|
β (d) |
---|---|---|---|
Marathon-men | 30.63 (30.40,33.95) | −7591 (−7594,−7584) | 12.51 (12.43,12.61) |
Marathon-women | 101.95 (101.81,105.62) | −8418 (−8420,−8403) | 7.83 (7.73,8.00) |
Half marathon-men | 16.15 (16.01,16.57) | −3577 (−3578,−3574) | 3.64 (3.61,3.68) |
Half marathon-women | 39.17 (39.98,40.92) | −4112 (−4115,−4105) | 11.66(11.62,11.74) |
10 km-men | 10.64 (10.49,11.00) | −1647 (−1647,−1645) | 1.02 (0.99,1.06) |
10 km-women | 17.74 (17.58,18.23) | −1863 (−1865,−1860) | 2.12 (2.09,2.18) |
δ ( d ) | γ ( d ) | ξ | |
3.77 (3.73,3.84) | 12.60 (9.54,19.46) | −0.251 (−0.248,-0.255) | |
0.36 (0.33,0.50) | 51.42 (50.08,60.12) | ||
0.85 (0.83,0.88) | 0.70 (−2.39,2.86) | ||
2.59 (2.57,2.63) | 14.64(13.09,18.34) | ||
0.29 (0.28,0.32) | 6.44 (5.79,8.29) | ||
0.63 (0.61,0.67) | 13.74 (11.83,15.54) |
World records as in 2019 and AFT-corrected times for records recorded before 2018.
Discipline | World record 2019 | AFT-corrected world record 2019 |
---|---|---|
Marathon-men | 02:01:39 | 02:01:48 (−3 s,+8 s) |
Marathon-women | 02:14:04 | 02:14:17 (−2 s,+12 s) |
Half marathon-men | 00:58:01 | – |
Half marathon-women | 01:04:51 | 01:05:08 (−3 s,+5 s) |
10 km-men | 00:26:38 | – |
10 km-women | 00:29:43 | – |
Appendix C: Model checking

Mean residual life plots, with 95% confidence intervals. The red dashed line indicates the threshold used in our analysis.

Mean residual life plots, with 95% confidence intervals. The red dashed line indicates the threshold used in our analysis.

Mean residual life plots, with 95% confidence intervals. The red dashed line indicates the threshold used in our analysis.

Diagnostic QQ plot for the model. The plot displays the log of the quantiles of the transformed observations for all disciplines, against the quantiles of a unit exponential distribution, with 95% confidence intervals.

Estimated expected (black circles) and observed (red crosses) exceedances above the threshold u d with 95% confidence intervals (black dashes).

Estimated expected (black circles) and observed (red crosses) exceedances above the threshold u d with 95% confidence intervals (black dashes).

Estimated expected (black circles) and observed (red crosses) exceedances above the threshold u d with 95% confidence intervals (black dashes).
References
Angus, S. D. 2019. “A Statistical Timetable for the Sub–2-hour Marathon.” Medicine & Science in Sports & Exercise 51 (7): 1460–6. https://doi.org/10.1249/mss.0000000000001928.Suche in Google Scholar
Barnes, K. R., and A. E. Kilding. 2019. “A Randomized Crossover Study Investigating the Running Economy of Highly-Trained Male and Female Distance Runners in Marathon Racing Shoes versus Track Spikes.” Sports Medicine 49 (2): 331–42. https://doi.org/10.1007/s40279-018-1012-3.Suche in Google Scholar
Bermon, S., F. Garrandes, A. Szabo, I. Berkovics, and P. E. Adami. 2021. “Effect of Advanced Shoe Technology on the Evolution of Road Race Times in Male and Female Elite Runners.” Frontiers in Sports and Active Living 3: 46. https://doi.org/10.3389/fspor.2021.653173.Suche in Google Scholar
Blest, D. C. 1996. “Lower Bounds for Athletic Performance.” Journal of the Royal Statistical Society. Series D (The Statistician) 45 (2): 243–53. https://doi.org/10.2307/2988413.Suche in Google Scholar
Coles, S. 2001. An Introduction to Statistical Modeling of Extreme Values. London: Springer.10.1007/978-1-4471-3675-0Suche in Google Scholar
Davison, A. C., and R. L. Smith. 1990. “Models for Exceedances over High Thresholds.” Journal of the Royal Statistical Society: Series B 52 (3): 393–425. https://doi.org/10.1111/j.2517-6161.1990.tb01796.x.Suche in Google Scholar
Einmahl, J. J., J. H. J. Einmahl, and L. d. Haan. 2019. “Limits to Human Life Span through Extreme Value Theory.” Journal of the American Statistical Association 114 (527): 1075–80. https://doi.org/10.1080/01621459.2018.1537912.Suche in Google Scholar
Einmahl, J. H. J., and J. R. Magnus. 2008. “Records in Athletics through Extreme-Value Theory.” Journal of the American Statistical Association 103 (484): 1382–91. https://doi.org/10.1198/016214508000000698.Suche in Google Scholar
Guinness, J., D. Bhattacharya, J. Chen, M. Chen, and A. Loh. 2020. “An Observational Study of the Effect of Nike Vaporfly Shoes on Marathon Performance.” arXiv:2002.06105 [stat].Suche in Google Scholar
Gumbel, E. J. 1958. Statistics of Extremes. New York: Columbia University Press.10.7312/gumb92958Suche in Google Scholar
Holmes, T., R. Huggett, and A. Westerling. 2008. “Statistical Analysis of Large Wildfires.” In The Economics of Forest Disturbances: Wildfires, Storms, and Invasive Species, 59–77. Dordrecht: Springer.10.1007/978-1-4020-4370-3_4Suche in Google Scholar
Hoogkamer, W., S. Kipp, J. H. Frank, E. M. Farina, G. Luo, and R. Kram. 2018. “A Comparison of the Energetic Cost of Running in Marathon Racing Shoes.” Sports Medicine 48 (4): 1009–19. https://doi.org/10.1007/s40279-017-0811-2.Suche in Google Scholar
Hunter, S. K., M. J. Joyner, and A. M. Jones. 2015. “The Two-Hour Marathon: What’s the Equivalent for Women?” Journal of Applied Physiology 118 (10): 1321–3. https://doi.org/10.1152/japplphysiol.00852.2014.Suche in Google Scholar
IAAF. 2011. Iaaf to Continue to Recognise Existing Women’s Road-Running Records. Also available at https://www.worldathletics.org/news/undefined/iaaf-to-continue-to-recognise-existing-womens.Suche in Google Scholar
Joyner, M. J., J. R. Ruiz, and A. Lucia. 2011. “The Two-Hour Marathon: Who and When?” Journal of Applied Physiology 110 (1): 275–7. https://doi.org/10.1152/japplphysiol.00563.2010.Suche in Google Scholar
Katz, R. W., M. B. Parlange, and P. Naveau. 2002. “Statistics of Extremes in Hydrology.” Advances in Water Resources 25 (8): 1287–304. https://doi.org/10.1016/s0309-1708(02)00056-8.Suche in Google Scholar
Kipp, S., R. Kram, and W. Hoogkamer. 2019. “Extrapolating Metabolic Savings in Running: Implications for Performance Predictions.” Frontiers in Physiology 10: 79. https://doi.org/10.3389/fphys.2019.00079.Suche in Google Scholar
NASA. 2020. Goddard Institute for Space Studies. Also available at https://climate.nasa.gov/vital-signs/global-temperature/.Suche in Google Scholar
Quealy, K., and J. Katz. 2019. Nike’s Fastest Shoes May Give Runners an Even Bigger Advantage than We Thought. Also available at https://www.nytimes.com/interactive/2019/12/13/upshot/nike-vaporfly-nextpercent-shoe-estimates.html.Suche in Google Scholar
Robinson, M. E., and J. A. Tawn. 1995. “Statistics for Exceptional Athletics Records.” Journal of the Royal Statistical Society. Series C (Applied Statistics) 44 (4): 499–511. https://doi.org/10.2307/2986141.Suche in Google Scholar
Rodrigues, L., M. Gomes, and D. Pestana. 2011. “Statistics of Extremes in Athletics.” Revstat Statistical Journal 9 (2): 127–53.Suche in Google Scholar
Scarrott, C., and A. MacDonald. 2012. “A Review of Extreme Value Threshold Estimation and Uncertainty Quantification.” Revstat Statistical Journal 10 (1): 33–60.Suche in Google Scholar
Senefeld, J. W., M. H. Haischer, A. M. Jones, C. C. Wiggins, R. Beilfuss, M. J. Joyner, and S. K. Hunter. 2021. “Technological Advances in Elite Marathon Performance.” Journal of Applied Physiology 130 (6): 2002–8. https://doi.org/10.1152/japplphysiol.00002.2021.Suche in Google Scholar
Spearing, H., J. Tawn, D. Irons, T. Paulden, and G. Bennett. 2021. “Ranking, and Other Properties, of Elite Swimmers Using Extreme Value Theory.” Journal of the Royal Statistical Society: Series A 184 (1): 368–95. https://doi.org/10.1111/rssa.12628.Suche in Google Scholar
Stephenson, A., and J. Tawn. 2013. “Determining the Best Track Performances of All Time Using a Conceptual Population Model for Athletics Records.” Journal of Quantitative Analysis in Sports 9 (1): 67–76. https://doi.org/10.1515/jqas-2012-0047.Suche in Google Scholar
Strand, M., and D. Boes. 1998. “Modeling Road Racing Times of Competitive Recreational Runners Using Extreme Value Theory.” The American Statistician 52 (3): 205–10. https://doi.org/10.1080/00031305.1998.10480564.Suche in Google Scholar
Thomson, N. 2017. Do Nike’s Zoom Vaporfly 4% Marathon Shoes Actually Make You Run Faster? | WIRED. Also available at https://www.wired.com/story/do-nike-zoom-vaporfly-make-you-run-faster/.Suche in Google Scholar
Tucker, R., and J. Santos-Concejero. 2017. “The Unlikeliness of an Imminent Sub-2-hour Marathon: Historical Trends of the Gender Gap in Running Events.” International Journal of Sports Physiology and Performance 12 (8): 1017–22. https://doi.org/10.1123/ijspp.2016-0634.Suche in Google Scholar
World Athletics 2020. Technical Rules. Also available at https://hmg-prod.s3.amazonaws.com/files/c2-1-technical-rules-amended-on-31-january-2020-1580483189.pdf.Suche in Google Scholar
© 2022 Walter de Gruyter GmbH, Berlin/Boston
Artikel in diesem Heft
- Frontmatter
- Research Articles
- G-Elo: generalization of the Elo algorithm by modeling the discretized margin of victory
- Evaluating the performance of elite level volleyball players
- Review
- Optical tracking in team sports
- Research Article
- MSE-optimal K-factor of the Elo rating system for round-robin tournament
- Influence of advanced footwear technology on sub-2 hour marathon and other top running performances
Artikel in diesem Heft
- Frontmatter
- Research Articles
- G-Elo: generalization of the Elo algorithm by modeling the discretized margin of victory
- Evaluating the performance of elite level volleyball players
- Review
- Optical tracking in team sports
- Research Article
- MSE-optimal K-factor of the Elo rating system for round-robin tournament
- Influence of advanced footwear technology on sub-2 hour marathon and other top running performances