Startseite Influence of advanced footwear technology on sub-2 hour marathon and other top running performances
Artikel
Lizenziert
Nicht lizenziert Erfordert eine Authentifizierung

Influence of advanced footwear technology on sub-2 hour marathon and other top running performances

  • Andreu Arderiu EMAIL logo und Raphaël de Fondeville
Veröffentlicht/Copyright: 14. März 2022
Veröffentlichen auch Sie bei De Gruyter Brill

Abstract

In 2019, Eliud Kipchoge ran a sub-two hour marathon wearing Nike’s Alphafly shoes. Despite being the fastest marathon time ever recorded, it wasn’t officially recognized as race conditions were tightly controlled to maximize his success. Besides, Kipchoge’s use of Alphafly shoes was controversial, with some experts claiming that they might have provided an unfair competitive advantage. In this work, we assess the potential influence of advanced footwear technology and the likelihood of a sub-two hour marathon in official races, by studying the evolution of running top performances from 2001 to 2019 for long distances ranging from 10 km to marathon. The analysis is performed using extreme value theory, a field of statistics dealing with analysis of rare events. We find a significant evidence of performance-enhancement effect with a 10% increase of the probability that a new world record for marathon-men discipline is set in 2021. However, results suggest that achieving a sub-two hour marathon in an official race in 2021 is still very unlikely, and exceeds 10% probability only by 2025.


Corresponding author: Andreu Arderiu, Department of Mathematics, EPFL, Lausanne, Switzerland, E-mail:

Acknowledgement

We acknowledge Harry Spearing for sharing code that provided good inspiration to our work.

  1. Author contribution: All the authors have accepted responsibility for the entire content of this submitted manuscript and approved submission.

  2. Research funding: The authors received no specific funding for this work.

  3. Conflict of interest statement: The authors declare no conflicts of interest regarding this article.

Appendix A: Theory and model

A.1 Extremes for identically distributed variables

Extreme value theory (EVT) is a branch of statistics which studies the tails of probability distributions. It was first developed for block maxima (Gumbel 1958) analysis, but the Peaks Over Threshold (POT) method (Davison and Smith 1990) is often preferred, as it uses all the most extreme data, rather than just the maxima, typically leading to more efficient inference. Let X be a random variable with distribution function F, if there exist random sequences a n , b n > 0 such that

(1) n 1 F ( a n x + b n ) log G ( x )

as n → ∞ is a non-degenerate limiting distribution, then for a large enough threshold u we can use the approximation

(2) Pr ( X > x | X > u ) H u ( x ) = 1 1 + ξ { ( x u ) / σ u } 1 / ξ ξ 0 , 1 exp { ( x μ ) / σ u } , ξ = 0 , x R ,

where σ u = σ + ξ(uμ) > 0, a+ = max(a, 0). If ξ < 0 then x must lie in the interval [0, x H ], where x H = uσ u /ξ is the upper limit of the distribution, whereas if ξ ≥ 0, x can take any positive value. The limit distribution H u , called Generalized Pareto distribution (GPD) motivates an approximation for large u, giving a model for the distribution of the exceedances above such threshold, regardless of the distribution F.

Given a large enough sample of n independent identically distributed (IID) observations, in the POT approach a threshold u is carefully chosen, and exceedances can be used to estimate the parameters of the GPD. Threshold choice can be rather subjective and case-dependent, and is subject to a bias-variance trade-off. In this paper we base our choice on graphical diagnostics; however, other alternative methods might also be suitable; see Scarrott and MacDonald (2012) for a detailed review of these techniques.

It is remarkable that the rate of the frequency of exceedances above the threshold u can be derived in a fashion that gives way to a more complete perspective of exceedances modelling, using point process models. Let X i be IID random variables with distribution function F, we define

(3) N n ( x ) = i = 1 n 1 ( X i > a n x + b n ) ,

where 1 ( A ) is an indicator whether the event A occurs. It follows that N n (x) ∼ Binomial(n, 1 − F(a n x + b n )) with mean n 1 F ( a n x + b n ) , and using the classical Poisson limit of the binomial distribution,

(4) N n ( x ) N ( x ) Poisson ( λ ) ,

where λ = { 1 + ξ ( x μ ) / σ } + 1 / ξ .

Therefore we can construct a model for extreme tails with two components: a model for the number of exceedances, given by (4), which is Poisson distributed with mean λ = { 1 + ξ ( x μ ) / σ } + 1 / ξ , and a model for the distribution of the exceedances, which is GPD distributed, following H u (x).

Consider the sequence of point processes on R 2 (Coles 2001)

(5) P n = i n + 1 , X i b n a n : i = 1 , , n ,

where the scaling 1/(n + 1) in the first coordinate ensures that the time axis is continuous on (0, 1), and the sequences a n , b n are defined in (1). More precisely, on regions of the form [0, 1] × (u, ∞), where u is large enough such that (2) approximately holds, we have that P n P as n → ∞, where P is a non-homogeneous Poisson Process. Consequently, the integrated measure Λ of P on A 1 , u = [ 0,1 ] × ( u , ) is given by

(6) Λ ( A 1 , u ) = 1 + ξ u μ σ + 1 / ξ ,

and its intensity function is

(7) λ ( t , x ) = 1 σ 1 + ξ x μ σ + 1 / ξ 1 = λ ( x ) ,

with x > u and 0 < t ≤ 1. For statistical inference we assume that for large enough n, P n P is a good approximation. The scaling coefficients a n , b n , can be absorbed into the intensity function, so we work directly with the series i n + 1 , X i : i = 1 , , n . Therefore, for a region of the form A 1 , u = [ 0,1 ] × ( u , ) , containing n points x = ( t 1 , x 1 ) , , ( t n , x n ) , the likelihood for the parameters θ = (μ, σ, ξ) is

(8) L ( θ ; x ) = exp Λ ( A 1 , u ) i = 1 n λ ( x i ) .

A.2 Extremes of non-stationary sequences

The extreme value models derived so far are built on the assumption of IID variables. However, in our work, non-stationarity data arise due to the improvement of racing conditions over time, and the potential Vaporfly shoes. Therefore, we relax the identically distributed assumption by introducing a time-dependent structure, while keeping independence assumption. Indeed, the time variation for parameters θ(t) = {μ(t), σ(t), ξ(t)} will translate into a time-dependent rate of exceedances, and distribution of such exceedances. Under this covariate structure, the intensity of the non-homogeneous Poisson process P will be

(9) λ ( t , x ) = 1 σ ( t ) 1 + ξ ( t ) x μ ( t ) σ ( t ) + 1 / ξ ( t ) 1 .

Now, in the general case where we have n points x = ( t 1 , x 1 ) , , ( t n , x n ) in the region A T , u = [ 0 , T ] × ( u , ) , the integrated intensity becomes

(10) Λ ( A T , u ) = 0 T 1 + ξ ( t ) x μ ( t ) σ ( t ) + 1 / ξ ( t ) d t ,

and the full likelihood is

(11) L { θ ( t ) ; x } = exp Λ ( A T , u ) i = 1 n λ ( t i , x i ) .

The parameters θ ( t ) = ( μ ( t ) , σ ( t ) , ξ ( t ) ) are estimated by maximizing (11), and with such estimates, for a given time t, predictions about the number of exceedances can be made by integrating (9). The excess distribution at time t will be given by

(12) Pr X t > x | X t > u = 1 H u ( x , t ) = 1 + ξ ( t ) x u σ u ( t ) + 1 ξ ( t ) ,

where σ u (t) = σ(t) + ξ(t){uμ(t)}.

A.3 Model

For most disciplines (and specially for marathon-men) a linear dependence on time for the scale parameter of the GP distribution of the exceedances was best suited in AIC terms. The following parametrisation was used to incorporate such structural time dependence.

(13) ξ ( d ) ( t ) = ξ
(14) μ ( d ) ( t ) = μ 0 ( d ) + β ( d ) y ( t ) + γ ( d ) 1 { y ( t ) 2018 }
σ ( d ) ( t ) = σ 0 ( d ) + ξ ( d ) β ( d ) y ( t )
(15) + ξ ( d ) γ ( d ) 1 { y ( t ) 2018 } + δ y ( t )

where d ∈ D is the superscript denoting discipline d, y(t) is the year corresponding to time t, ξ ( d ) , μ 0 ( d ) R , σ 0 ( d ) R + are the shape, location, and scale parameter of the Poisson process, β R controls the linear trend in σ(d)(t) and μ(d)(t), γ ( d ) R represents Vaporfly shoes effect, 1 is the indicator function, and 2018 is the year when the shoes started to be widely used in official races. Note that this parametrisation enforces the GPD scale parameter for exceedances above u d to change linearly with time.

(16) σ u ( d ) ( t ) = σ ( d ) ( t ) + ξ ( d ) u d μ ( d ) ( t ) = σ 0 ( d ) + ξ ( d ) ( u d μ 0 ( d ) ) + δ y ( t ) σ u ( d ) + δ y ( t )

A.4 Expected running times of next new world record

As derived in Spearing et al. (2021), the expected new world record time for discipline d at year y will be

(17) E X y * ( d ) = r d x H , e x d H r d ( d ) ( x , y ) d x d x = r d + σ r d ( d ) ( y ) 1 ξ , if ξ < 1 ,

where σ r d ( d ) ( y ) = σ 0 ( d ) + ξ r d μ 0 ( d ) + δ y ( t ) , X y * ( d ) is the random variable denoting the running-time of a new world record for discipline e, set in year y, and r d is the current (2019) world record of discipline d, so that r d = max(X (d) ), with X (d) the set of all observations for discipline d.

A.5 Probability of breaking a world record in a given year

Let N y ( d ) be the number of exceedances of the threshold u d for discipline d during year y, it is Poisson distributed with mean

(18) Λ ( d ) ( A y , u ) = 1 + ξ u d μ ( d ) ( y ) σ ( d ) ( y ) + 1 / ξ .

Therefore, let X 1 : N y ( d ) ( d ) = X i ( d ) , i = 1 , , N y ( d ) , where X i ( d ) i i d H u ( d ) ( y ) , if we denote by Pr ( R y ( d ) ) the probability that a world record for discipline d is set in year y,

(19) Pr ( R y ( d ) ) = 1 exp Λ ( d ) ( A y , u ) H ̄ u ( d ) ( r d , y ) ,

where H ̄ u ( d ) ( r d , y ) 1 H u ( d ) ( r d , y ) .

A.6 Time until next world record is set

Let T(d) be the random variable describing the waiting time until a new world record is set for an discipline e, if we define t y = y − 2020, the probability F T ( d ) ( t y ) = Pr ( T ( d ) < t y ) that a world record for discipline e is set before some year y is

(20) F T ( d ) ( t y ) = 1 exp k = 2020 y 1 Λ ( d ) ( A k , u ) H ̄ u ( d ) ( r d , k ) .

We can further estimate the expected waiting time until the world record is broken for any discipline e, which has the following expression

(21) E T ( d ) = Pr ( R 2020 ) + t = 2 Pr ( R 2019 + t ) k = 1 t 1 1 Pr ( R 2019 + k ) ,

where Pr(R y ) is the probability that the world record is broken at year y, as described in (19).

A.7 Adjusting for AFT effect

Let x > u be a running-time recorded during year y > 2018, when Vaporfly and other shoes with technological advances are widely used in official races. We denote by x c the corrected or equivalent time of x if such shoes were not used. Its expression can be derived as in Spearing et al. (2021), obtaining

(22) x c = u d + σ C , u ( d ) ( y ) ξ Λ ( d ) ( A y , u ) H ̄ u ( d ) ( x , y ) Λ C ( d ) ( A y , u ) 1 ,

where Λ C ( d ) ( A y , u ) has the form of Λ ( d ) ( A y , u ) but with the corrected parameters

(23) μ C ( d ) ( y ) = μ 0 ( d ) + β y ,
(24) σ C ( d ) ( y ) = σ 0 ( d ) + ξ β y + δ y ,
(25) σ C , u ( d ) ( y ) = σ C ( d ) ( y ) + ξ u d μ C ( d ) ( y ) .

A.8 Breaking the 2 h marathon

Let Pr(B2 = y) be the probability the 2-h marathon being broken in a given year y, it follows from (19) that

(26) Pr ( B 2 = y ) = 1 exp Λ ( mar M ) ( A y , u ) H ̄ u ( mar M ) ( 2 h , y ) ,

where 2h ≔ −7200 and marM refers to the marathon-men discipline. Additionally, we could compute the cumulative probability of achieving a sub 2-h marathon before year y, which follows from (20)

(27) Pr ( B 2 < y ) = 1 exp k = 2020 y 1 Λ ( mar M ) ( A k , u ) H ̄ u ( mar M ) ( 2 h , k ) .

Appendix B: Model estimates

Table 4 and 5.

Table 4:

Parameter estimates (with 95% confidence intervals) for the model.

Discipline σ 0 ( d ) μ 0 ( d ) β (d)
Marathon-men 30.63 (30.40,33.95) −7591 (−7594,−7584) 12.51 (12.43,12.61)
Marathon-women 101.95 (101.81,105.62) −8418 (−8420,−8403) 7.83 (7.73,8.00)
Half marathon-men 16.15 (16.01,16.57) −3577 (−3578,−3574) 3.64 (3.61,3.68)
Half marathon-women 39.17 (39.98,40.92) −4112 (−4115,−4105) 11.66(11.62,11.74)
10 km-men 10.64 (10.49,11.00) −1647 (−1647,−1645) 1.02 (0.99,1.06)
10 km-women 17.74 (17.58,18.23) −1863 (−1865,−1860) 2.12 (2.09,2.18)
δ ( d ) γ ( d ) ξ
3.77 (3.73,3.84) 12.60 (9.54,19.46) −0.251 (−0.248,-0.255)
0.36 (0.33,0.50) 51.42 (50.08,60.12)
0.85 (0.83,0.88) 0.70 (−2.39,2.86)
2.59 (2.57,2.63) 14.64(13.09,18.34)
0.29 (0.28,0.32) 6.44 (5.79,8.29)
0.63 (0.61,0.67) 13.74 (11.83,15.54)
Table 5:

World records as in 2019 and AFT-corrected times for records recorded before 2018.

Discipline World record 2019 AFT-corrected world record 2019
Marathon-men 02:01:39 02:01:48 (−3 s,+8 s)
Marathon-women 02:14:04 02:14:17 (−2 s,+12 s)
Half marathon-men 00:58:01
Half marathon-women 01:04:51 01:05:08 (−3 s,+5 s)
10 km-men 00:26:38
10 km-women 00:29:43

Appendix C: Model checking

Figures 511.

Figure 5: 
Mean residual life plots, with 95% confidence intervals. The red dashed line indicates the threshold used in our analysis.
Figure 5:

Mean residual life plots, with 95% confidence intervals. The red dashed line indicates the threshold used in our analysis.

Figure 6: 
Mean residual life plots, with 95% confidence intervals. The red dashed line indicates the threshold used in our analysis.
Figure 6:

Mean residual life plots, with 95% confidence intervals. The red dashed line indicates the threshold used in our analysis.

Figure 7: 
Mean residual life plots, with 95% confidence intervals. The red dashed line indicates the threshold used in our analysis.
Figure 7:

Mean residual life plots, with 95% confidence intervals. The red dashed line indicates the threshold used in our analysis.

Figure 8: 
Diagnostic QQ plot for the model. The plot displays the log of the quantiles of the transformed observations for all disciplines, against the quantiles of a unit exponential distribution, with 95% confidence intervals.
Figure 8:

Diagnostic QQ plot for the model. The plot displays the log of the quantiles of the transformed observations for all disciplines, against the quantiles of a unit exponential distribution, with 95% confidence intervals.

Figure 9: 
Estimated expected (black circles) and observed (red crosses) exceedances above the threshold u
d
 with 95% confidence intervals (black dashes).
Figure 9:

Estimated expected (black circles) and observed (red crosses) exceedances above the threshold u d with 95% confidence intervals (black dashes).

Figure 10: 
Estimated expected (black circles) and observed (red crosses) exceedances above the threshold u
d
 with 95% confidence intervals (black dashes).
Figure 10:

Estimated expected (black circles) and observed (red crosses) exceedances above the threshold u d with 95% confidence intervals (black dashes).

Figure 11: 
Estimated expected (black circles) and observed (red crosses) exceedances above the threshold u
d
 with 95% confidence intervals (black dashes).
Figure 11:

Estimated expected (black circles) and observed (red crosses) exceedances above the threshold u d with 95% confidence intervals (black dashes).

References

Angus, S. D. 2019. “A Statistical Timetable for the Sub–2-hour Marathon.” Medicine & Science in Sports & Exercise 51 (7): 1460–6. https://doi.org/10.1249/mss.0000000000001928.Suche in Google Scholar

Barnes, K. R., and A. E. Kilding. 2019. “A Randomized Crossover Study Investigating the Running Economy of Highly-Trained Male and Female Distance Runners in Marathon Racing Shoes versus Track Spikes.” Sports Medicine 49 (2): 331–42. https://doi.org/10.1007/s40279-018-1012-3.Suche in Google Scholar

Bermon, S., F. Garrandes, A. Szabo, I. Berkovics, and P. E. Adami. 2021. “Effect of Advanced Shoe Technology on the Evolution of Road Race Times in Male and Female Elite Runners.” Frontiers in Sports and Active Living 3: 46. https://doi.org/10.3389/fspor.2021.653173.Suche in Google Scholar

Blest, D. C. 1996. “Lower Bounds for Athletic Performance.” Journal of the Royal Statistical Society. Series D (The Statistician) 45 (2): 243–53. https://doi.org/10.2307/2988413.Suche in Google Scholar

Coles, S. 2001. An Introduction to Statistical Modeling of Extreme Values. London: Springer.10.1007/978-1-4471-3675-0Suche in Google Scholar

Davison, A. C., and R. L. Smith. 1990. “Models for Exceedances over High Thresholds.” Journal of the Royal Statistical Society: Series B 52 (3): 393–425. https://doi.org/10.1111/j.2517-6161.1990.tb01796.x.Suche in Google Scholar

Einmahl, J. J., J. H. J. Einmahl, and L. d. Haan. 2019. “Limits to Human Life Span through Extreme Value Theory.” Journal of the American Statistical Association 114 (527): 1075–80. https://doi.org/10.1080/01621459.2018.1537912.Suche in Google Scholar

Einmahl, J. H. J., and J. R. Magnus. 2008. “Records in Athletics through Extreme-Value Theory.” Journal of the American Statistical Association 103 (484): 1382–91. https://doi.org/10.1198/016214508000000698.Suche in Google Scholar

Guinness, J., D. Bhattacharya, J. Chen, M. Chen, and A. Loh. 2020. “An Observational Study of the Effect of Nike Vaporfly Shoes on Marathon Performance.” arXiv:2002.06105 [stat].Suche in Google Scholar

Gumbel, E. J. 1958. Statistics of Extremes. New York: Columbia University Press.10.7312/gumb92958Suche in Google Scholar

Holmes, T., R. Huggett, and A. Westerling. 2008. “Statistical Analysis of Large Wildfires.” In The Economics of Forest Disturbances: Wildfires, Storms, and Invasive Species, 59–77. Dordrecht: Springer.10.1007/978-1-4020-4370-3_4Suche in Google Scholar

Hoogkamer, W., S. Kipp, J. H. Frank, E. M. Farina, G. Luo, and R. Kram. 2018. “A Comparison of the Energetic Cost of Running in Marathon Racing Shoes.” Sports Medicine 48 (4): 1009–19. https://doi.org/10.1007/s40279-017-0811-2.Suche in Google Scholar

Hunter, S. K., M. J. Joyner, and A. M. Jones. 2015. “The Two-Hour Marathon: What’s the Equivalent for Women?” Journal of Applied Physiology 118 (10): 1321–3. https://doi.org/10.1152/japplphysiol.00852.2014.Suche in Google Scholar

IAAF. 2011. Iaaf to Continue to Recognise Existing Women’s Road-Running Records. Also available at https://www.worldathletics.org/news/undefined/iaaf-to-continue-to-recognise-existing-womens.Suche in Google Scholar

Joyner, M. J., J. R. Ruiz, and A. Lucia. 2011. “The Two-Hour Marathon: Who and When?” Journal of Applied Physiology 110 (1): 275–7. https://doi.org/10.1152/japplphysiol.00563.2010.Suche in Google Scholar

Katz, R. W., M. B. Parlange, and P. Naveau. 2002. “Statistics of Extremes in Hydrology.” Advances in Water Resources 25 (8): 1287–304. https://doi.org/10.1016/s0309-1708(02)00056-8.Suche in Google Scholar

Kipp, S., R. Kram, and W. Hoogkamer. 2019. “Extrapolating Metabolic Savings in Running: Implications for Performance Predictions.” Frontiers in Physiology 10: 79. https://doi.org/10.3389/fphys.2019.00079.Suche in Google Scholar

NASA. 2020. Goddard Institute for Space Studies. Also available at https://climate.nasa.gov/vital-signs/global-temperature/.Suche in Google Scholar

Quealy, K., and J. Katz. 2019. Nike’s Fastest Shoes May Give Runners an Even Bigger Advantage than We Thought. Also available at https://www.nytimes.com/interactive/2019/12/13/upshot/nike-vaporfly-nextpercent-shoe-estimates.html.Suche in Google Scholar

Robinson, M. E., and J. A. Tawn. 1995. “Statistics for Exceptional Athletics Records.” Journal of the Royal Statistical Society. Series C (Applied Statistics) 44 (4): 499–511. https://doi.org/10.2307/2986141.Suche in Google Scholar

Rodrigues, L., M. Gomes, and D. Pestana. 2011. “Statistics of Extremes in Athletics.” Revstat Statistical Journal 9 (2): 127–53.Suche in Google Scholar

Scarrott, C., and A. MacDonald. 2012. “A Review of Extreme Value Threshold Estimation and Uncertainty Quantification.” Revstat Statistical Journal 10 (1): 33–60.Suche in Google Scholar

Senefeld, J. W., M. H. Haischer, A. M. Jones, C. C. Wiggins, R. Beilfuss, M. J. Joyner, and S. K. Hunter. 2021. “Technological Advances in Elite Marathon Performance.” Journal of Applied Physiology 130 (6): 2002–8. https://doi.org/10.1152/japplphysiol.00002.2021.Suche in Google Scholar

Spearing, H., J. Tawn, D. Irons, T. Paulden, and G. Bennett. 2021. “Ranking, and Other Properties, of Elite Swimmers Using Extreme Value Theory.” Journal of the Royal Statistical Society: Series A 184 (1): 368–95. https://doi.org/10.1111/rssa.12628.Suche in Google Scholar

Stephenson, A., and J. Tawn. 2013. “Determining the Best Track Performances of All Time Using a Conceptual Population Model for Athletics Records.” Journal of Quantitative Analysis in Sports 9 (1): 67–76. https://doi.org/10.1515/jqas-2012-0047.Suche in Google Scholar

Strand, M., and D. Boes. 1998. “Modeling Road Racing Times of Competitive Recreational Runners Using Extreme Value Theory.” The American Statistician 52 (3): 205–10. https://doi.org/10.1080/00031305.1998.10480564.Suche in Google Scholar

Thomson, N. 2017. Do Nike’s Zoom Vaporfly 4% Marathon Shoes Actually Make You Run Faster? | WIRED. Also available at https://www.wired.com/story/do-nike-zoom-vaporfly-make-you-run-faster/.Suche in Google Scholar

Tucker, R., and J. Santos-Concejero. 2017. “The Unlikeliness of an Imminent Sub-2-hour Marathon: Historical Trends of the Gender Gap in Running Events.” International Journal of Sports Physiology and Performance 12 (8): 1017–22. https://doi.org/10.1123/ijspp.2016-0634.Suche in Google Scholar

World Athletics 2020. Technical Rules. Also available at https://hmg-prod.s3.amazonaws.com/files/c2-1-technical-rules-amended-on-31-january-2020-1580483189.pdf.Suche in Google Scholar

Received: 2021-05-17
Revised: 2022-02-11
Accepted: 2022-02-11
Published Online: 2022-03-14
Published in Print: 2022-03-26

© 2022 Walter de Gruyter GmbH, Berlin/Boston

Heruntergeladen am 10.9.2025 von https://www.degruyterbrill.com/document/doi/10.1515/jqas-2021-0043/html
Button zum nach oben scrollen