Startseite Expected hypothetical completion probability
Artikel
Lizenziert
Nicht lizenziert Erfordert eine Authentifizierung

Expected hypothetical completion probability

  • Sameer K. Deshpande und Katherine Evans EMAIL logo
Veröffentlicht/Copyright: 16. November 2019
Veröffentlichen auch Sie bei De Gruyter Brill

Abstract

Using high-resolution player tracking data made available by the National Football League (NFL) for their 2019 Big Data Bowl competition, we introduce the Expected Hypothetical Completion Probability (EHCP), a objective framework for evaluating plays. At the heart of EHCP is the question “on a given passing play, did the quarterback throw the pass to the receiver who was most likely to catch it?” To answer this question, we first built a Bayesian non-parametric catch probability model that automatically accounts for complex interactions between inputs like the receiver’s speed and distances to the ball and nearest defender. While building such a model is, in principle, straightforward, using it to reason about a hypothetical pass is challenging because many of the model inputs corresponding to a hypothetical are necessarily unobserved. To wit, it is impossible to observe how close an un-targeted receiver would be to his nearest defender had the pass been thrown to him instead of the receiver who was actually targeted. To overcome this fundamental difficulty, we propose imputing the unobservable inputs and averaging our model predictions across these imputations to derive EHCP. In this way, EHCP can track how the completion probability evolves for each receiver over the course of a play in a way that accounts for the uncertainty about missing inputs.

References

Burke, B. 2019. “Deepqb: deep learning with player tracking to quantify quarterback decision making and performance.” In Proceedings of the 2019 MIT Sloan Sports Analytics Conference. http://www.sloansportsconference.com/wp-content/uploads/2019/02/DeepQB.pdf.Suche in Google Scholar

Carpenter, B., A. Gelman, M. D. Hoffman, D. Lee, B. Goodrich, M. Betancourt, M. Brubaker, J. Guo, P. Li, and A. Riddell. 2017. “Stan: a probabilistic programing language.” Journal of Statistical Software 76(1):1–32.10.18637/jss.v076.i01Suche in Google Scholar

Cervone, D., A. D’Amour, L. Bornn, and K. Goldsberry. 2014. “Pointwise: predicting points and valuing decisions in real time with NBA optical tracking data.” In Proceedings of the 2014 MIT Sloan Sports Analytics Conference. http://www.sloansportsconference.com/wp-content/uploads/2018/09/cervone_ssac_2014.pdf.Suche in Google Scholar

Cervone, D., A. D’Amour, L. Bornn, and K. Goldsberry. 2016. “A multiresolution stochastic process model for predicting basketball possession outcomes.” Journal of the American Statistical Association 111(514):585–599.10.1080/01621459.2016.1141685Suche in Google Scholar

Chipman, H. A., E. I. George, and R. E. McCulloch. 2010. “BART: Bayesian additive regression trees.” The Annals of Applied Statistics 4(1):266–298.10.1214/09-AOAS285Suche in Google Scholar

Franks, A., A. Miller, L. Bornn, and K. Goldsberry. 2015. “Counterpoints: advanced defensive metrics for NBA basketball.” In Proceedings of the 2015 MIT Sloan Sports Analytics Conference. http://www.sloansportsconference.com/wp-content/uploads/2015/02/SSAC15-RP-Finalist-Counterpoints2.pdf.Suche in Google Scholar

Gelman, A., A. Jakulin, M. G. Pittau, and Y.-S. Su. 2008. “A weakly informative default prior distribution for logistic regression.” Annals of Applied Statistics 2(4):1360–1383.10.1214/08-AOAS191Suche in Google Scholar

Horowitz, M., R. Yurko, and S. Ventura. 2019. nflscrapR: compiling the NFL play-by-play API for easy use in R. R package version 1.8.1.Suche in Google Scholar

Linero, A. R. 2017. “A review of tree-based Bayesian methods.” Communications for Statistical Applications and Methods 24(6):543–559.10.29220/CSAM.2017.24.6.543Suche in Google Scholar

Linero, A. R. 2018. “Bayesian regression trees for high-dimensional prediction and variable selection.” Journal of the American Statistical Association 113(522):626–636.10.1080/01621459.2016.1264957Suche in Google Scholar

McCulloch, R., R. Sparapani, R. Gramacy, C. Spanbauer, and M. Pratola. 2018. BART: Bayesian Additive Regression Trees. R package version 2.1.10.1002/9781118445112.stat08251Suche in Google Scholar

Miller, A. and L. Bornn. 2017. “Possession sketches: mapping NBA strategies.” In Proceedings of the 2017 MIT Sloan Sports Analytics Conference. http://www.sloansportsconference.com/wp-content/uploads/2017/02/1624.pdf.Suche in Google Scholar

NFL Next Gen Stats Team. 2018. “Next gen stats introduction to completion probability.” http://www.nfl.com/news/story/0ap3000000964655/article/next-gen-stats-introduction-to-completion-probability.Suche in Google Scholar

Stan Development Team. 2018. RStan: the R interface to Stan. R package version 2.17.3.10.2478/msd-2018-0003Suche in Google Scholar

Published Online: 2019-11-16
Published in Print: 2020-06-25

©2019 Walter de Gruyter GmbH, Berlin/Boston

Heruntergeladen am 1.10.2025 von https://www.degruyterbrill.com/document/doi/10.1515/jqas-2019-0050/html
Button zum nach oben scrollen