Startseite Improving strength of schedule metrics in sports scheduling
Artikel
Lizenziert
Nicht lizenziert Erfordert eine Authentifizierung

Improving strength of schedule metrics in sports scheduling

  • Adam DeHollander ORCID logo EMAIL logo und Mark Karwan ORCID logo
Veröffentlicht/Copyright: 8. Juli 2025
Veröffentlichen auch Sie bei De Gruyter Brill

Abstract

This paper studies sports Strength of Schedule (SoS) metrics and their utility for decision-makers in creating season schedules. We identify limitations of the widely used Bowl Championship Series (BCS) SoS metric, including infrequent rank changes and reduced variance, which limit its effectiveness in assessing true schedule difficulty. In response, we introduce two novel metrics: Probabilistic Random Walk Length (PRWL) SoS and Customized-Distributional (CD) SoS. The PRWL SoS leverages random walks on a graph to provide a comprehensive measure incorporating the difficulty of a team’s opponents schedules. The CD SoS offers a customizable approach, incorporating team-specific goals and preferences, for a more tailored assessment of schedule strength. Additionally, we propose a SoS variability metric to evaluate the variability of SoS among teams, promoting schedule balance. Through a detailed case study involving a large sports conference, we demonstrate the practical application and impact of our proposed metrics. The case study illustrates the superiority of the proposed scheduling format over the existing one, prompting the conference to adopt a new schedule format and eliminate their previous divisional structure. Our contributions provide more accurate tools for evaluating schedule difficulty, enhancing schedule balance, and optimizing sports schedules, ultimately promoting fairness and competitiveness in sports leagues.


Corresponding author: Adam DeHollander, Department of Industrial and Systems Engineering, University at Buffalo, Buffalo, NY, 14260, USA, E-mail: 

Acknowledgments

The authors would like to thank the sports conference for their financial support and for providing the data necessary for this study.

  1. Research ethics: This study did not involve human participants or animal subjects and solely utilized simulation methods. Therefore, ethical approval was not required.

  2. Informed consent: As no human participants were involved in this simulation study, informed consent was not applicable.

  3. Author contributions: Adam DeHollander: Conceptualization, Methodology, Data Analysis, Writing – Original Draft. Mark Karwan: Supervision, Project Administration, Funding Acquisition, Conceptualization, Methodology, Writing – Review & Editing.

  4. Use of Large Language Models, AI and Machine Learning Tools: A large language model (LLM) was used solely to assist with wording and language refinement in portions of the manuscript. The model was not used for generating original content, conducting analyses, or contributing to the conceptual or methodological aspects of the work.

  5. Conflict of interest: The authors declare no conflicts of interest related to this study.

  6. Research funding: This research was supported by funding from a sports conference. The development of the Strength of Schedule (SoS) metrics was conducted independently from this funding; the funding facilitated the application of these metrics to the conference’s data.

  7. Data availability: The data generated and analyzed during this study are available from the corresponding author upon reasonable request.

References

Angelini, G., Candila, V., and De Angelis, L. (2022). Weighted Elo rating for tennis match predictions. Eur. J. Oper. Res. 297: 120–132, https://doi.org/10.1016/j.ejor.2021.04.011.Suche in Google Scholar

Bouzarth, E.L., Cromer, A.W., Fravel, W.J., Grannan, B.C., and Hutson, K.R. (2020). Dynamically scheduling NFL games to reduce strength of schedule variability. J. Sports Anal. 6: 281–293, https://doi.org/10.3233/jsa-200428.Suche in Google Scholar

Bouzarth, E.L., Grannan, B.C., Harris, J.M., and Hutson, K.R. (2022). Scheduling the valley baseball league. INFORMS J. Appl. Anal. 52: 189–197, https://doi.org/10.1287/inte.2021.1076.Suche in Google Scholar

Bowman, R.A., Harmon, O., and Ashman, T. (2023). Schedule inequity in the national basketball association. J. Sports Anal. 9: 61–76, https://doi.org/10.3233/jsa-220629.Suche in Google Scholar

Burioni, R. and Cassi, D. (2005). Random walks on graphs: ideas, techniques and results. J. Phys. A: Math. Gen. 38: R45, https://doi.org/10.1088/0305-4470/38/8/r01.Suche in Google Scholar

Chebotarev, P.Y. (1994). Aggregation of preferences by the generalized row sum method. Math. Soc. Sci. 27: 293–320, https://doi.org/10.1016/0165-4896(93)00740-l.Suche in Google Scholar

Csató, L. (2013). Ranking by pairwise comparisons for Swiss-system tournaments. Cent. Eur. J. Oper. Res. 21: 783–803, https://doi.org/10.1007/s10100-012-0261-8.Suche in Google Scholar

Csató, L. (2017). On the ranking of a Swiss system chess team tournament. Ann. Oper. Res. 254: 17–36, https://doi.org/10.1007/s10479-017-2440-4.Suche in Google Scholar

Csató, L., Devriesere, K., Goossens, D., Gyimesi, A., Lambers, R., and Spieksma, F. (2025). Ranking matters: does the new format select the best teams for the knockout phase in the UEFA champions league? arXiv preprint arXiv:2503.13569, https://doi.org/10.48550/arXiv.2503.13569 Suche in Google Scholar

Devlin, S. and Treloar, T. (2018). A network diffusion ranking family that includes the methods of Markov, Massey, and Colley. J. Quant. Anal. Sports 14: 91–101, https://doi.org/10.1515/jqas-2017-0098.Suche in Google Scholar

Devriesere, K., Csató, L., and Goossens, D. (2025). Tournament design: a review from an operational research perspective. Eur. J. Oper. Res. 324: 1–21, https://doi.org/10.1016/j.ejor.2024.10.044.Suche in Google Scholar

Dolf, M. and Teehan, P. (2015). Reducing the carbon footprint of spectator and team travel at the University of British Columbia’s varsity sports events. Sport Manag. Rev. 18: 244–255, https://doi.org/10.1016/j.smr.2014.06.003.Suche in Google Scholar

Elo, A.E. (1978). The rating of chessplayers: past and present. Arco Publishing, New York.Suche in Google Scholar

Fearnhead, P. and Taylor, B.M. (2010). Calculating strength of schedule, and choosing teams for March Madness. Am. Statistician 64: 108–115, https://doi.org/10.1198/tast.2010.09161.Suche in Google Scholar

Ferrand, Y.B., Magazine, M.J., Rao, U.S., and Glass, T.F. (2018). Managing responsiveness in the emergency department: comparing dynamic priority queue with fast track. J. Oper. Manag. 58–59: 15–26, https://doi.org/10.1016/j.jom.2018.03.001.Suche in Google Scholar

FIFA (2018). Revision of the FIFA/coca-cola world ranking, Available at: https://digitalhub.fifa.com/m/f99da4f73212220/original/edbm045h0udbwkqew35a-pdf.pdf (Accessed 21 March 2025).Suche in Google Scholar

Gomes de Pinho Zanco, D., Szczecinski, L., Kuhn, E.V., and Seara, R. (2024). Stochastic analysis of the Elo rating algorithm in round-robin tournaments. Digit. Signal Process. 145: 104313, https://doi.org/10.1016/j.dsp.2023.104313.Suche in Google Scholar

González-Díaz, J., Hendrickx, R., and Lohmann, E. (2014). Paired comparisons analysis: an axiomatic approach to ranking methods. Soc. Choice Welfare 42: 139–169, https://doi.org/10.1007/s00355-013-0726-2.Suche in Google Scholar

Goossens, D., Yi, X., and Van Bulck, D. (2020). Fairness trade-offs in sports timetabling. In: Ley, C. and Dominicy, Y. (Eds.). Science meets sports: when statistics are more than numbers. Cambridge Scholars, Newcastle upon Tyne, UK, pp. 213–244.Suche in Google Scholar

Hvattum, L.M. and Arntzen, H. (2010). Using ELO ratings for match result prediction in association football. Int. J. Forecast. 26: 460–470, https://doi.org/10.1016/j.ijforecast.2009.10.002.Suche in Google Scholar

Karwan, M., Kurt, M., Pandey, N.K., and Cunningham, K. (2015). Alleviating competitive imbalance in NFL schedules: an integer-programming approach. In: 9th annual MIT sloan sports analytics conference. Boston, MA.Suche in Google Scholar

Keener, J.P. (1993). The Perron–Frobenius theorem and the ranking of football teams. SIAM Rev. 35: 80–93, https://doi.org/10.1137/1035004.Suche in Google Scholar

Kim, J. and Kim, S. (2024). Evaluating aerial duel ability of football players using height-adjusted Elo rating model. Int. J. Perform. Anal. Sport: 1–14, https://doi.org/10.1080/24748668.2024.2420458.Suche in Google Scholar

Lee, Y.H. and Fort, R. (2023). Division play and outcome uncertainty in sports leagues. J. Sports Econ. 24: 639–663, https://doi.org/10.1177/15270025221148995.Suche in Google Scholar

Lee, Y.H., Kim, Y., and Kim, S. (2019). Competitive balance with unbalanced schedules. J. Quant. Anal. Sports 15: 239–260, https://doi.org/10.1515/jqas-2017-0100.Suche in Google Scholar

Leiva Bertrán, F. (2025). Ranking in incomplete tournaments: the generalized win percentage method, efficiency, and NCAA football. J. Sports Econ. 26: 3–34, https://doi.org/10.1177/15270025241268738.Suche in Google Scholar

Lenten, L.J. (2015) Measurement of competitive balance in conference and divisional tournament design, J. Sports Econ. 16: 3–25, https://doi.org/10.1177/1527002512471538.Suche in Google Scholar

Uyar, B. and Surdam, D. (2013). Searching for on-field parity: evidence from national football league scheduling during 1991–2006. J. Sports Econ. 14: 479–497, https://doi.org/10.1177/1527002512438901.Suche in Google Scholar

Vaziri, B., Dabadghao, S., Yih, Y., and Morin, T.L. (2018). Properties of sports ranking methods. J. Oper. Res. Soc. 69: 776–787, https://doi.org/10.1057/s41274-017-0266-8.Suche in Google Scholar

Xia, F., Sun, K., Yu, S., Aziz, A., Wan, L., Pan, S., and Liu, H. (2021). Graph learning: a survey. IEEE Trans. Artif. Intell. 2: 109–127, https://doi.org/10.1109/tai.2021.3076021.Suche in Google Scholar

Received: 2024-12-09
Accepted: 2025-06-17
Published Online: 2025-07-08

© 2025 Walter de Gruyter GmbH, Berlin/Boston

Heruntergeladen am 17.10.2025 von https://www.degruyterbrill.com/document/doi/10.1515/jqas-2024-0171/html?lang=de
Button zum nach oben scrollen