Influence of advanced footwear technology on sub-2 hour marathon and other top running performances

Andreu Arderiu; Raphaël de Fondeville

doi:10.1515/jqas-2021-0043

Article

Influence of advanced footwear technology on sub-2 hour marathon and other top running performances

Andreu Arderiu and Raphaël de Fondeville

Published/Copyright: March 14, 2022

Published by

Become an author with De Gruyter Brill

Submit Manuscript Author Information

From the journal Journal of Quantitative Analysis in Sports Volume 18 Issue 1

Abstract

In 2019, Eliud Kipchoge ran a sub-two hour marathon wearing Nike’s Alphafly shoes. Despite being the fastest marathon time ever recorded, it wasn’t officially recognized as race conditions were tightly controlled to maximize his success. Besides, Kipchoge’s use of Alphafly shoes was controversial, with some experts claiming that they might have provided an unfair competitive advantage. In this work, we assess the potential influence of advanced footwear technology and the likelihood of a sub-two hour marathon in official races, by studying the evolution of running top performances from 2001 to 2019 for long distances ranging from 10 km to marathon. The analysis is performed using extreme value theory, a field of statistics dealing with analysis of rare events. We find a significant evidence of performance-enhancement effect with a 10% increase of the probability that a new world record for marathon-men discipline is set in 2021. However, results suggest that achieving a sub-two hour marathon in an official race in 2021 is still very unlikely, and exceeds 10% probability only by 2025.

Keywords: athletics performance; footwear technology; running time records; statistical analysis; vaporfly

Corresponding author: Andreu Arderiu, Department of Mathematics, EPFL, Lausanne, Switzerland, E-mail: andreu.arderiu@alumni.epfl.ch

Acknowledgement

We acknowledge Harry Spearing for sharing code that provided good inspiration to our work.

Author contribution: All the authors have accepted responsibility for the entire content of this submitted manuscript and approved submission.
Research funding: The authors received no specific funding for this work.
Conflict of interest statement: The authors declare no conflicts of interest regarding this article.

Appendix A: Theory and model

A.1 Extremes for identically distributed variables

Extreme value theory (EVT) is a branch of statistics which studies the tails of probability distributions. It was first developed for block maxima (Gumbel 1958) analysis, but the Peaks Over Threshold (POT) method (Davison and Smith 1990) is often preferred, as it uses all the most extreme data, rather than just the maxima, typically leading to more efficient inference. Let X be a random variable with distribution function F, if there exist random sequences a_n, b_n > 0 such that

(1) n 1 − F ( a n x + b n ) → − log G ( x )

as n → ∞ is a non-degenerate limiting distribution, then for a large enough threshold u we can use the approximation

(2) Pr ( X > x | X > u ) ≈ H u ( x ) = 1 − 1 + ξ { ( x − u ) / σ u } − 1 / ξ ξ ≠ 0 , 1 − exp { − ( x − μ ) / σ u } , ξ = 0 , x ∈ R ,

where σ_u = σ + ξ(u − μ) > 0, a₊ = max(a, 0). If ξ < 0 then x must lie in the interval [0, x_H], where x_H = u − σ_u/ξ is the upper limit of the distribution, whereas if ξ ≥ 0, x can take any positive value. The limit distribution H_u, called Generalized Pareto distribution (GPD) motivates an approximation for large u, giving a model for the distribution of the exceedances above such threshold, regardless of the distribution F.

Given a large enough sample of n independent identically distributed (IID) observations, in the POT approach a threshold u is carefully chosen, and exceedances can be used to estimate the parameters of the GPD. Threshold choice can be rather subjective and case-dependent, and is subject to a bias-variance trade-off. In this paper we base our choice on graphical diagnostics; however, other alternative methods might also be suitable; see Scarrott and MacDonald (2012) for a detailed review of these techniques.

It is remarkable that the rate of the frequency of exceedances above the threshold u can be derived in a fashion that gives way to a more complete perspective of exceedances modelling, using point process models. Let X_i be IID random variables with distribution function F, we define

(3) N n ( x ) = ∑ i = 1 n 1 ( X i > a n x + b n ) ,

where 1 ( A ) is an indicator whether the event A occurs. It follows that N_n(x) ∼ Binomial(n, 1 − F(a_nx + b_n)) with mean n 1 − F ( a n x + b n ) , and using the classical Poisson limit of the binomial distribution,

(4) N n ( x ) → N ( x ) ∼ Poisson ( λ ) ,

where λ = { 1 + ξ ( x − μ ) / σ } + − 1 / ξ .

Therefore we can construct a model for extreme tails with two components: a model for the number of exceedances, given by (4), which is Poisson distributed with mean λ = { 1 + ξ ( x − μ ) / σ } + − 1 / ξ , and a model for the distribution of the exceedances, which is GPD distributed, following H_u(x).

Consider the sequence of point processes on R 2 (Coles 2001)

(5) P n = i n + 1 , X i − b n a n : i = 1 , … , n ,

where the scaling 1/(n + 1) in the first coordinate ensures that the time axis is continuous on (0, 1), and the sequences a_n, b_n are defined in (1). More precisely, on regions of the form [0, 1] × (u, ∞), where u is large enough such that (2) approximately holds, we have that P_n → P as n → ∞, where P is a non-homogeneous Poisson Process. Consequently, the integrated measure Λ of P on A 1 , u = [ 0,1 ] × ( u , ∞ ) is given by

(6) Λ ( A 1 , u ) = 1 + ξ u − μ σ + − 1 / ξ ,

and its intensity function is

(7) λ ( t , x ) = 1 σ 1 + ξ x − μ σ + − 1 / ξ − 1 = λ ( x ) ,

with x > u and 0 < t ≤ 1. For statistical inference we assume that for large enough n, P_n ∼ P is a good approximation. The scaling coefficients a_n, b_n, can be absorbed into the intensity function, so we work directly with the series i n + 1 , X i : i = 1 , … , n . Therefore, for a region of the form A 1 , u = [ 0,1 ] × ( u , ∞ ) , containing n points x = ( t 1 , x 1 ) , … , ( t n , x n ) , the likelihood for the parameters θ = (μ, σ, ξ) is

(8) L ( θ ; x ) = exp − Λ ( A 1 , u ) ∏ i = 1 n λ ( x i ) .

A.2 Extremes of non-stationary sequences

The extreme value models derived so far are built on the assumption of IID variables. However, in our work, non-stationarity data arise due to the improvement of racing conditions over time, and the potential Vaporfly shoes. Therefore, we relax the identically distributed assumption by introducing a time-dependent structure, while keeping independence assumption. Indeed, the time variation for parameters θ(t) = {μ(t), σ(t), ξ(t)} will translate into a time-dependent rate of exceedances, and distribution of such exceedances. Under this covariate structure, the intensity of the non-homogeneous Poisson process P will be

(9) λ ( t , x ) = 1 σ ( t ) 1 + ξ ( t ) x − μ ( t ) σ ( t ) + − 1 / ξ ( t ) − 1 .

Now, in the general case where we have n points x = ( t 1 , x 1 ) , … , ( t n , x n ) in the region A T , u = [ 0 , T ] × ( u , ∞ ) , the integrated intensity becomes

(10) Λ ( A T , u ) = ∫ 0 T 1 + ξ ( t ) x − μ ( t ) σ ( t ) + − 1 / ξ ( t ) d t ,

and the full likelihood is

(11) L { θ ( t ) ; x } = exp Λ ( A T , u ) ∏ i = 1 n λ ( t i , x i ) .

The parameters θ ( t ) = ( μ ( t ) , σ ( t ) , ξ ( t ) ) are estimated by maximizing (11), and with such estimates, for a given time t, predictions about the number of exceedances can be made by integrating (9). The excess distribution at time t will be given by

(12) Pr X t > x | X t > u = 1 − H u ( x , t ) = 1 + ξ ( t ) x − u σ u ( t ) + − 1 ξ ( t ) ,

where σ_u(t) = σ(t) + ξ(t){u − μ(t)}.

A.3 Model

For most disciplines (and specially for marathon-men) a linear dependence on time for the scale parameter of the GP distribution of the exceedances was best suited in AIC terms. The following parametrisation was used to incorporate such structural time dependence.

(13) ξ ( d ) ( t ) = ξ

(14) μ ( d ) ( t ) = μ 0 ( d ) + β ( d ) y ( t ) + γ ( d ) 1 { y ( t ) ≥ 2018 }

σ ( d ) ( t ) = σ 0 ( d ) + ξ ( d ) β ( d ) y ( t )

(15) + ξ ( d ) γ ( d ) 1 { y ( t ) ≥ 2018 } + δ y ( t )

where d ∈ D is the superscript denoting discipline d, y(t) is the year corresponding to time t, ξ ( d ) , μ 0 ( d ) ∈ R , σ 0 ( d ) ∈ R + are the shape, location, and scale parameter of the Poisson process, β ∈ R controls the linear trend in σ^(d)(t) and μ^(d)(t), γ ( d ) ∈ R represents Vaporfly shoes effect, 1 is the indicator function, and 2018 is the year when the shoes started to be widely used in official races. Note that this parametrisation enforces the GPD scale parameter for exceedances above u_d to change linearly with time.

(16) σ u ( d ) ( t ) = σ ( d ) ( t ) + ξ ( d ) u d − μ ( d ) ( t ) = σ 0 ( d ) + ξ ( d ) ( u d − μ 0 ( d ) ) + δ y ( t ) ≔ σ u ( d ) + δ y ( t )

A.4 Expected running times of next new world record

As derived in Spearing et al. (2021), the expected new world record time for discipline d at year y will be

(17) E X y * ( d ) = ∫ r d x H , e x d H r d ( d ) ( x , y ) d x d x = r d + σ r d ( d ) ( y ) 1 − ξ , if ξ < 1 ,

where σ r d ( d ) ( y ) = σ 0 ( d ) + ξ r d − μ 0 ( d ) + δ y ( t ) , X y * ( d ) is the random variable denoting the running-time of a new world record for discipline e, set in year y, and r_d is the current (2019) world record of discipline d, so that r_d = max(X^(d)), with X^(d) the set of all observations for discipline d.

A.5 Probability of breaking a world record in a given year

Let N y ( d ) be the number of exceedances of the threshold u_d for discipline d during year y, it is Poisson distributed with mean

(18) Λ ( d ) ( A y , u ) = 1 + ξ u d − μ ( d ) ( y ) σ ( d ) ( y ) + − 1 / ξ .

Therefore, let X 1 : N y ( d ) ( d ) = X i ( d ) , i = 1 , … , N y ( d ) , where X i ( d ) ∼ i i d H u ( d ) ( y ) , if we denote by Pr ( R y ( d ) ) the probability that a world record for discipline d is set in year y,

(19) Pr ( R y ( d ) ) = 1 − exp − Λ ( d ) ( A y , u ) H ̄ u ( d ) ( r d , y ) ,

where H ̄ u ( d ) ( r d , y ) ≔ 1 − H u ( d ) ( r d , y ) .

A.6 Time until next world record is set

Let T^(d) be the random variable describing the waiting time until a new world record is set for an discipline e, if we define t_y = y − 2020, the probability F T ( d ) ( t y ) = Pr ( T ( d ) < t y ) that a world record for discipline e is set before some year y is

(20) F T ( d ) ( t y ) = 1 − exp − ∑ k = 2020 y − 1 Λ ( d ) ( A k , u ) H ̄ u ( d ) ( r d , k ) .

We can further estimate the expected waiting time until the world record is broken for any discipline e, which has the following expression

(21) E T ( d ) = Pr ( R 2020 ) + ∑ t = 2 ∞ Pr ( R 2019 + t ) ∑ k = 1 t − 1 1 − Pr ( R 2019 + k ) ,

where Pr(R_y) is the probability that the world record is broken at year y, as described in (19).

A.7 Adjusting for AFT effect

Let x > u be a running-time recorded during year y > 2018, when Vaporfly and other shoes with technological advances are widely used in official races. We denote by x_c the corrected or equivalent time of x if such shoes were not used. Its expression can be derived as in Spearing et al. (2021), obtaining

(22) x c = u d + σ C , u ( d ) ( y ) ξ Λ ( d ) ( A y , u ) H ̄ u ( d ) ( x , y ) Λ C ( d ) ( A y , u ) − 1 ,

where Λ C ( d ) ( A y , u ) has the form of Λ ( d ) ( A y , u ) but with the corrected parameters

(23) μ C ( d ) ( y ) = μ 0 ( d ) + β y ,

(24) σ C ( d ) ( y ) = σ 0 ( d ) + ξ β y + δ y ,

(25) σ C , u ( d ) ( y ) = σ C ( d ) ( y ) + ξ u d − μ C ( d ) ( y ) .

A.8 Breaking the 2 h marathon

Let Pr(B2 = y) be the probability the 2-h marathon being broken in a given year y, it follows from (19) that

(26) Pr ( B 2 = y ) = 1 − exp − Λ ( mar M ) ( A y , u ) H ̄ u ( mar M ) ( 2 h , y ) ,

where 2h ≔ −7200 and marM refers to the marathon-men discipline. Additionally, we could compute the cumulative probability of achieving a sub 2-h marathon before year y, which follows from (20)

(27) Pr ( B 2 < y ) = 1 − exp − ∑ k = 2020 y − 1 Λ ( mar M ) ( A k , u ) H ̄ u ( mar M ) ( 2 h , k ) .

Appendix B: Model estimates

Table 4 and 5.

Table 4:

Parameter estimates (with 95% confidence intervals) for the model.

Discipline	σ 0 ( d )	μ 0 ( d )	β ^(d)
Marathon-men	30.63 (30.40,33.95)	−7591 (−7594,−7584)	12.51 (12.43,12.61)
Marathon-women	101.95 (101.81,105.62)	−8418 (−8420,−8403)	7.83 (7.73,8.00)
Half marathon-men	16.15 (16.01,16.57)	−3577 (−3578,−3574)	3.64 (3.61,3.68)
Half marathon-women	39.17 (39.98,40.92)	−4112 (−4115,−4105)	11.66(11.62,11.74)
10 km-men	10.64 (10.49,11.00)	−1647 (−1647,−1645)	1.02 (0.99,1.06)
10 km-women	17.74 (17.58,18.23)	−1863 (−1865,−1860)	2.12 (2.09,2.18)
	δ ^( d )	γ ^( d )	ξ
	3.77 (3.73,3.84)	12.60 (9.54,19.46)	−0.251 (−0.248,-0.255)
	0.36 (0.33,0.50)	51.42 (50.08,60.12)
	0.85 (0.83,0.88)	0.70 (−2.39,2.86)
	2.59 (2.57,2.63)	14.64(13.09,18.34)
	0.29 (0.28,0.32)	6.44 (5.79,8.29)
	0.63 (0.61,0.67)	13.74 (11.83,15.54)

Table 5:

World records as in 2019 and AFT-corrected times for records recorded before 2018.

Discipline	World record 2019	AFT-corrected world record 2019
Marathon-men	02:01:39	02:01:48 (−3 s,+8 s)
Marathon-women	02:14:04	02:14:17 (−2 s,+12 s)
Half marathon-men	00:58:01	–
Half marathon-women	01:04:51	01:05:08 (−3 s,+5 s)
10 km-men	00:26:38	–
10 km-women	00:29:43	–

Appendix C: Model checking

Figures 5 –11.

Figure 5:

Mean residual life plots, with 95% confidence intervals. The red dashed line indicates the threshold used in our analysis.

Figure 6:

Mean residual life plots, with 95% confidence intervals. The red dashed line indicates the threshold used in our analysis.

Figure 7:

Mean residual life plots, with 95% confidence intervals. The red dashed line indicates the threshold used in our analysis.

Figure 8:

Diagnostic QQ plot for the model. The plot displays the log of the quantiles of the transformed observations for all disciplines, against the quantiles of a unit exponential distribution, with 95% confidence intervals.

Figure 9:

Estimated expected (black circles) and observed (red crosses) exceedances above the threshold u_d with 95% confidence intervals (black dashes).

Figure 10:

Estimated expected (black circles) and observed (red crosses) exceedances above the threshold u_d with 95% confidence intervals (black dashes).

Figure 11:

Estimated expected (black circles) and observed (red crosses) exceedances above the threshold u_d with 95% confidence intervals (black dashes).

References

Angus, S. D. 2019. “A Statistical Timetable for the Sub–2-hour Marathon.” Medicine & Science in Sports & Exercise 51 (7): 1460–6. https://doi.org/10.1249/mss.0000000000001928.Search in Google Scholar

Barnes, K. R., and A. E. Kilding. 2019. “A Randomized Crossover Study Investigating the Running Economy of Highly-Trained Male and Female Distance Runners in Marathon Racing Shoes versus Track Spikes.” Sports Medicine 49 (2): 331–42. https://doi.org/10.1007/s40279-018-1012-3.Search in Google Scholar

Bermon, S., F. Garrandes, A. Szabo, I. Berkovics, and P. E. Adami. 2021. “Effect of Advanced Shoe Technology on the Evolution of Road Race Times in Male and Female Elite Runners.” Frontiers in Sports and Active Living 3: 46. https://doi.org/10.3389/fspor.2021.653173.Search in Google Scholar

Blest, D. C. 1996. “Lower Bounds for Athletic Performance.” Journal of the Royal Statistical Society. Series D (The Statistician) 45 (2): 243–53. https://doi.org/10.2307/2988413.Search in Google Scholar

Coles, S. 2001. An Introduction to Statistical Modeling of Extreme Values. London: Springer.10.1007/978-1-4471-3675-0Search in Google Scholar

Davison, A. C., and R. L. Smith. 1990. “Models for Exceedances over High Thresholds.” Journal of the Royal Statistical Society: Series B 52 (3): 393–425. https://doi.org/10.1111/j.2517-6161.1990.tb01796.x.Search in Google Scholar

Einmahl, J. J., J. H. J. Einmahl, and L. d. Haan. 2019. “Limits to Human Life Span through Extreme Value Theory.” Journal of the American Statistical Association 114 (527): 1075–80. https://doi.org/10.1080/01621459.2018.1537912.Search in Google Scholar

Einmahl, J. H. J., and J. R. Magnus. 2008. “Records in Athletics through Extreme-Value Theory.” Journal of the American Statistical Association 103 (484): 1382–91. https://doi.org/10.1198/016214508000000698.Search in Google Scholar

Guinness, J., D. Bhattacharya, J. Chen, M. Chen, and A. Loh. 2020. “An Observational Study of the Effect of Nike Vaporfly Shoes on Marathon Performance.” arXiv:2002.06105 [stat].Search in Google Scholar

Gumbel, E. J. 1958. Statistics of Extremes. New York: Columbia University Press.10.7312/gumb92958Search in Google Scholar

Holmes, T., R. Huggett, and A. Westerling. 2008. “Statistical Analysis of Large Wildfires.” In The Economics of Forest Disturbances: Wildfires, Storms, and Invasive Species, 59–77. Dordrecht: Springer.10.1007/978-1-4020-4370-3_4Search in Google Scholar

Hoogkamer, W., S. Kipp, J. H. Frank, E. M. Farina, G. Luo, and R. Kram. 2018. “A Comparison of the Energetic Cost of Running in Marathon Racing Shoes.” Sports Medicine 48 (4): 1009–19. https://doi.org/10.1007/s40279-017-0811-2.Search in Google Scholar

Hunter, S. K., M. J. Joyner, and A. M. Jones. 2015. “The Two-Hour Marathon: What’s the Equivalent for Women?” Journal of Applied Physiology 118 (10): 1321–3. https://doi.org/10.1152/japplphysiol.00852.2014.Search in Google Scholar

IAAF. 2011. Iaaf to Continue to Recognise Existing Women’s Road-Running Records. Also available at https://www.worldathletics.org/news/undefined/iaaf-to-continue-to-recognise-existing-womens.Search in Google Scholar

Joyner, M. J., J. R. Ruiz, and A. Lucia. 2011. “The Two-Hour Marathon: Who and When?” Journal of Applied Physiology 110 (1): 275–7. https://doi.org/10.1152/japplphysiol.00563.2010.Search in Google Scholar

Katz, R. W., M. B. Parlange, and P. Naveau. 2002. “Statistics of Extremes in Hydrology.” Advances in Water Resources 25 (8): 1287–304. https://doi.org/10.1016/s0309-1708(02)00056-8.Search in Google Scholar

Kipp, S., R. Kram, and W. Hoogkamer. 2019. “Extrapolating Metabolic Savings in Running: Implications for Performance Predictions.” Frontiers in Physiology 10: 79. https://doi.org/10.3389/fphys.2019.00079.Search in Google Scholar

NASA. 2020. Goddard Institute for Space Studies. Also available at https://climate.nasa.gov/vital-signs/global-temperature/.Search in Google Scholar

Quealy, K., and J. Katz. 2019. Nike’s Fastest Shoes May Give Runners an Even Bigger Advantage than We Thought. Also available at https://www.nytimes.com/interactive/2019/12/13/upshot/nike-vaporfly-nextpercent-shoe-estimates.html.Search in Google Scholar

Robinson, M. E., and J. A. Tawn. 1995. “Statistics for Exceptional Athletics Records.” Journal of the Royal Statistical Society. Series C (Applied Statistics) 44 (4): 499–511. https://doi.org/10.2307/2986141.Search in Google Scholar

Rodrigues, L., M. Gomes, and D. Pestana. 2011. “Statistics of Extremes in Athletics.” Revstat Statistical Journal 9 (2): 127–53.Search in Google Scholar

Scarrott, C., and A. MacDonald. 2012. “A Review of Extreme Value Threshold Estimation and Uncertainty Quantification.” Revstat Statistical Journal 10 (1): 33–60.Search in Google Scholar

Senefeld, J. W., M. H. Haischer, A. M. Jones, C. C. Wiggins, R. Beilfuss, M. J. Joyner, and S. K. Hunter. 2021. “Technological Advances in Elite Marathon Performance.” Journal of Applied Physiology 130 (6): 2002–8. https://doi.org/10.1152/japplphysiol.00002.2021.Search in Google Scholar

Spearing, H., J. Tawn, D. Irons, T. Paulden, and G. Bennett. 2021. “Ranking, and Other Properties, of Elite Swimmers Using Extreme Value Theory.” Journal of the Royal Statistical Society: Series A 184 (1): 368–95. https://doi.org/10.1111/rssa.12628.Search in Google Scholar

Stephenson, A., and J. Tawn. 2013. “Determining the Best Track Performances of All Time Using a Conceptual Population Model for Athletics Records.” Journal of Quantitative Analysis in Sports 9 (1): 67–76. https://doi.org/10.1515/jqas-2012-0047.Search in Google Scholar

Strand, M., and D. Boes. 1998. “Modeling Road Racing Times of Competitive Recreational Runners Using Extreme Value Theory.” The American Statistician 52 (3): 205–10. https://doi.org/10.1080/00031305.1998.10480564.Search in Google Scholar

Thomson, N. 2017. Do Nike’s Zoom Vaporfly 4% Marathon Shoes Actually Make You Run Faster? | WIRED. Also available at https://www.wired.com/story/do-nike-zoom-vaporfly-make-you-run-faster/.Search in Google Scholar

Tucker, R., and J. Santos-Concejero. 2017. “The Unlikeliness of an Imminent Sub-2-hour Marathon: Historical Trends of the Gender Gap in Running Events.” International Journal of Sports Physiology and Performance 12 (8): 1017–22. https://doi.org/10.1123/ijspp.2016-0634.Search in Google Scholar

World Athletics 2020. Technical Rules. Also available at https://hmg-prod.s3.amazonaws.com/files/c2-1-technical-rules-amended-on-31-january-2020-1580483189.pdf.Search in Google Scholar

Received: 2021-05-17

Revised: 2022-02-11

Accepted: 2022-02-11

Published Online: 2022-03-14

Published in Print: 2022-03-26

You are currently not able to access this content.

Articles in the same Issue

https://doi.org/10.1515/jqas-2021-0043

Keywords for this article

athletics performance; footwear technology; running time records; statistical analysis; vaporfly