Abstract
In this work we develop a new algorithm for rating of teams (or players) in one-on-one games by exploiting the observed difference of the game-points (such as goals), also known as a margin of victory (MOV). Our objective is to obtain the Elo-style algorithm whose operation is simple to implement and to understand intuitively. This is done in three steps: first, we define the probabilistic model between the teams’ skills and the discretized MOV variable: this generalizes the model underpinning the Elo algorithm, where the MOV variable is discretized into three categories (win/loss/draw). Second, with the formal probabilistic model at hand, the optimization required by the maximum likelihood rule is implemented via stochastic gradient; this yields simple online equations for the rating updates which are identical in their general form to those characteristic of the Elo algorithm: the main difference lies in the way the scores and the expected scores are defined. Third, we propose a simple method to estimate the coefficients of the model, and thus define the operation of the algorithm; it is done in a closed form using the historical data so the algorithm is tailored to the sport of interest and the coefficients defining its operation are determined in entirely transparent manner. The alternative, optimization-based strategy to find the coefficients is also presented. We show numerical examples based on the results of the association football of the English Premier League and the American football of the National Football League.
-
Author contribution: The author has accepted responsibility for the entire content of this submitted manuscript and approved submission.
-
Research funding: None declared.
-
Conflict of interest statement: The author declares no conflicts of interest regarding this article.
References
Agresti, A. 1992. “Analysis of Ordinal Paired Comparison Data.” Journal of the Royal Statistical Society: Series C (Applied Statistics) 41: 287–97. https://doi.org/10.2307/2347562.Search in Google Scholar
Agresti, A. 2013. Categortical Data Analysis. Hoboken, New Jersey: John Wiley & Sons.Search in Google Scholar
Arntzen, H., and L. M. Hvattum. 2021. “Predicting Match Outcomes in Association Football Using Team Ratings and Player Ratings.” Statistical Modelling 21: 449–70. https://doi.org/10.1177/1471082x20929881.Search in Google Scholar
Boshnakov, G., T. Kharrat, and I. G. McHale. 2017. “A Bivariate Weibull Count Model for Forecasting Association Football Scores.” International Journal of Forecasting 33: 458–66. https://doi.org/10.1016/j.ijforecast.2016.11.006.Search in Google Scholar
Caron, F., and A. Doucet. 2012. “Efficient Bayesian Inference for Generalized Bradley–Terry Models.” Journal of Computational & Graphical Statistics 21: 174–96. https://doi.org/10.1080/10618600.2012.638220.Search in Google Scholar
Constantinou, A. C., and N. E. Fenton. 2012. “Solving the Problem of Inadequate Scoring Rules for Assessing Probabilistic Football Forecast Models.” Journal of Quantitative Analysis in Sports 8. https://doi.org/10.1515/1559-0410.1418.Search in Google Scholar
Davidson, R. R. 1970. “On Extending the Bradley-Terry Model to Accommodate Ties in Paired Comparison Experiments.” Journal of the American Statistical Association 65: 317–28. https://doi.org/10.1080/01621459.1970.10481082.Search in Google Scholar
Dittrich, R., B. Francis, R. Hatzinger, and W. Katzenbeisser. 2007. “A Paired Comparison Approach for the Analysis of Sets of Likert-Scale Responses.” Statistical Modelling 7: 3–28. https://doi.org/10.1177/1471082x0600700102.Search in Google Scholar
Dixon, M. J., and S. G. Coles. 1997. “Modelling Association Football Scores and Inefficiencies in the Football Betting Market.” Journal of the Royal Statistical Society: Series C (Applied Statistics) 46: 265–80. https://doi.org/10.1111/1467-9876.00065.Search in Google Scholar
Elo, A. E. 1978. The Rating of Chessplayers, Past and Present. New York: Arco Publishing Inc.Search in Google Scholar
Fahrmeir, L., and G. Tutz. 1994. “Dynamic Stochastic Models for Time-dependent Ordered Paired Comparison Systems.” Journal of the American Statistical Association 89: 1438–49. https://doi.org/10.1080/01621459.1994.10476882.Search in Google Scholar
FIFA. 2018. Revision of the FIFA/Coca-Cola World Ranking. Also available at https://digitalhub.fifa.com/m/f99da4f73212220/original/edbm045h0udbwkqew35a-pdf.pdf.Search in Google Scholar
FiveThirtyEight. 2020. How our NFL Predictions Work. Also available at https://fivethirtyeight.com/methodology/how-our-nfl-predictions-work/.Search in Google Scholar
Football-data.co.uk. 2019. Historical Football Results and Betting Odds Data. Also available at https://www.football-data.co.uk/data.php.Search in Google Scholar
Gelman, A., J. Hwang, and A. Vehtari. 2014. “Understanding predictive information criteria for Bayesian models.” Statistics and Computing 24: 997–1016. https://doi.org/10.1007/s11222-013-9416-2.Search in Google Scholar
Glickman, M. E. 1999. “Parameter Estimation in Large Dynamic Paired Comparison Experiments.” Journal of the Royal Statistical Society: Series C (Applied Statistics) 48: 377–94. https://doi.org/10.1111/1467-9876.00159.Search in Google Scholar
Goddard, J. 2005. “Regression Models for Forecasting Goals and Match Results in Association Football.” International Journal of Forecasting 21: 331–40. https://doi.org/10.1016/j.ijforecast.2004.08.002.Search in Google Scholar
Gramacy, R., S. Jensen, and M. Taddy. 2013. “Estimating Player Contribution in Hockey with Regularized Logistic Regression.” Journal of Quantitative Analysis in Sports 9: 97–111. https://doi.org/10.1515/jqas-2012-0001.Search in Google Scholar
Held, L., and R. Vollnhals. 2005. “Dynamic Rating of European Football Teams.” IMA Journal of Management Mathematics 16: 121–30. https://doi.org/10.1093/imaman/dpi004.Search in Google Scholar
Herbrich, R., and T. Graepel. 2006. “TrueSkill(TM): A Bayesian Skill Rating System.” In Technical report. Also available at https://www.microsoft.com/en-us/research/publication/trueskilltm-a-bayesian-skill-rating-system-2/.10.7551/mitpress/7503.003.0076Search in Google Scholar
Hvattum, L. M., and H. Arntzen. 2010. “Using Elo Ratings for Match Result Prediction in Association Football.” International Journal of Forecasting 26: 460–70. https://doi.org/10.1016/j.ijforecast.2009.10.002.Search in Google Scholar
Hvattum, L. M., and G. Gelade. 2021. “Comparing Bottom-Up and Top-Down Ratings for Individual Soccer Players.” International Journal of Computer Science in Sport 20: 23–42. https://doi.org/10.2478/ijcss-2021-0002.Search in Google Scholar
Ingram, M. 2021. “How to Extend Elo: a Bayesian Perspective.” Journal of Quantitative Analysis in Sports 17: 203–19. https://doi.org/10.1515/jqas-2020-0066.Search in Google Scholar
Karlis, D., and I. Ntzoufras. 2008. “Bayesian Modelling of Football Outcomes: Using the Skellam’s Distribution for the Goal Difference.” IMA Journal of Management Mathematics 20: 133–45. https://doi.org/10.1093/imaman/dpn026.Search in Google Scholar
Király, F. J., and Z. Qian. 2017. “Modelling Competitive Sports: Bradley-Terry-Elo Models for Supervised and On-Line Learning of Paired Competition Outcomes.” In arXiv E-Prints. arXiv:1701.08055.Search in Google Scholar
Knorr-Held, L. 2000. “Dynamic Rating of Sports Teams.” Journal of the Royal Statistical Society. Series D (The Statistician) 49: 261–76. https://doi.org/10.1111/1467-9884.00236.Search in Google Scholar
Kovalchik, S. 2020. “Extension of the Elo Rating System to Margin of Victory.” International Journal of Forecasting 36: 1329–41. https://doi.org/10.1016/j.ijforecast.2020.01.006.Search in Google Scholar
Kovalchik, S. A. 2016. “Searching for the GOAT of Tennis Win Prediction.” Journal of Quantitative Analysis in Sports 12: 127–38. https://doi.org/10.1515/jqas-2015-0059.Search in Google Scholar
Lasek, J., and M. Gagolewski. 2020. “Interpretable Sports Team Rating Models Based on the Gradient Descent Algorithm.” International Journal of Forecasting 37: 1061–71.10.1016/j.ijforecast.2020.11.008Search in Google Scholar
Maher, M. J. 1982. “Modelling Association Football Scores.” Statistica Neerlandica 36: 109–18. https://doi.org/10.1111/j.1467-9574.1982.tb00782.x.Search in Google Scholar
Manderson, A. A., K. Murray, and B. A. Turlach. 2018. “Dynamic Bayesian Forecasting of AFL Match Results Using the Skellam Distribution.” Australian & New Zealand Journal of Statistics 60: 174–87. https://doi.org/10.1111/anzs.12225.Search in Google Scholar
Massey, K. 1997. “Statistical Models Applied to the Rating of Sports Teams,” Technical report.Search in Google Scholar
Pro Football reference. 2021. Pro Football & NFL History. Also available at https://www.pro-football-reference.com/years/.Search in Google Scholar
Rao, P. V., and L. L. Kupper. 1967. “Ties in Paired-Comparison Experiments: A Generalization of the Bradley-Terry Model.” Journal of the American Statistical Association 62: 194–204. https://doi.org/10.1080/01621459.1967.10482901.Search in Google Scholar
Sinclair, C. D. 1982. “Glim for Preference.” In GLIM 82: Proceedings of the International Conference on Generalised Linear Models, edited by Gilchrist, R., pp. 164–78. New York: Springer New York.10.1007/978-1-4612-5771-4_16Search in Google Scholar
Szczecinski, L., and A. Djebbi. 2020. “Understanding Draws in Elo Rating Algorithm.” Journal of Quantitative Analysis in Sports 16: 211–20. https://doi.org/10.1515/jqas-2019-0102.Search in Google Scholar
Wolf, S., M. Schmitt, and B. Schuller. 2020. “A Football Player Rating System.” Journal of Sports Analytics 6: 243–57.10.3233/JSA-200411Search in Google Scholar
© 2022 Walter de Gruyter GmbH, Berlin/Boston
Articles in the same Issue
- Frontmatter
- Research Articles
- G-Elo: generalization of the Elo algorithm by modeling the discretized margin of victory
- Evaluating the performance of elite level volleyball players
- Review
- Optical tracking in team sports
- Research Article
- MSE-optimal K-factor of the Elo rating system for round-robin tournament
- Influence of advanced footwear technology on sub-2 hour marathon and other top running performances
Articles in the same Issue
- Frontmatter
- Research Articles
- G-Elo: generalization of the Elo algorithm by modeling the discretized margin of victory
- Evaluating the performance of elite level volleyball players
- Review
- Optical tracking in team sports
- Research Article
- MSE-optimal K-factor of the Elo rating system for round-robin tournament
- Influence of advanced footwear technology on sub-2 hour marathon and other top running performances