Home Multi-agent statistically discriminative sub-trajectory mining and an application to NBA basketball
Article
Licensed
Unlicensed Requires Authentication

Multi-agent statistically discriminative sub-trajectory mining and an application to NBA basketball

  • Rory Paul Bunker ORCID logo EMAIL logo , Vo Nguyen Le Duy , Yasuo Tabei , Ichiro Takeuchi and Keisuke Fujii ORCID logo
Published/Copyright: September 23, 2024
Become an author with De Gruyter Brill

Abstract

Improvements in tracking technology through optical and computer vision systems have enabled a greater understanding of the movement-based behaviour of multiple agents, including in team sports. In this study, a multi-agent statistically discriminative sub-trajectory mining (MA-Stat-DSM) method is proposed that takes a set of binary-labelled agent trajectory matrices as input and incorporates Hausdorff distance to identify sub-matrices that statistically significantly discriminate between the two groups of labelled trajectory matrices. Utilizing 2015/16 SportVU NBA tracking data, agent trajectory matrices representing attacks consisting of the trajectories of five agents (the ball, shooter, last passer, shooter defender, and last passer defender), were truncated to correspond to the time interval following the receipt of the ball by the last passer, and labelled as effective or ineffective based on a definition of attack effectiveness that we devise in the current study. After identifying appropriate parameters for MA-Stat-DSM by iteratively applying it to all matches involving the two top- and two bottom-placed teams from the 2015/16 NBA season, the method was then applied to selected matches and could identify and visualize the portions of plays, e.g., involving passing, on-, and/or off-the-ball movements, which were most relevant in rendering attacks effective or ineffective.


Corresponding author: Rory Paul Bunker, Nagoya University, Nagoya, Japan, E-mail:

Award Identifier / Grant number: 19H04941

Award Identifier / Grant number: 20H04075

Award Identifier / Grant number: JPMJPR20CA

  1. Research ethics: Not applicable.

  2. Author contributions: The authors have accepted responsibility for the entire content of this manuscript and approved its submission.

  3. Competing interests: No competing interests to declare.

  4. Research funding: This research was partly funded by JSPS (Grants 19H04941, 20H04075) and JST PRESTO (Grant JPMJPR20CA).

  5. Data availability: The raw data was sourced from https://github.com/neilmj/BasketballData/tree/master/2016.NBA.Raw.SportVU.Game.Logs. The code is available from https://github.com/rorybunker/ma-stat-dsm.

Code

The MA-Stat-DSM code is available on GitHub: https://github.com/rorybunker/ma-stat-dsm

Statistical testing & pruning properties of (MA)-Stat-DSM

The statistical testing and pruning properties of MA-Stat-DSM are essentially analogous to the original Stat-DSM (Le Vo et al. 2020). Therefore, in this subsection (and in Figure 2 in the main text), we provide a brief explanation of the statistical testing and pruning properties of (MA-)Stat-DSM and refer the reader to (Le Vo et al. 2020) for full details.

Stat-DSM (MA-Stat-DSM) represents sub-trajectories (sub-matrices) in the form of a tree, which is pruned to remove sub-trajectories (sub-matrices) that are guaranteed to not be discriminative (this pruning criterion is shown in line 18 of the MA-Stat-DSM algorithm pseudo-code in Algorithm 1). Stat-DSM (MA-Stat-DSM) uses Fisher’s Exact Test (FET) (Agresti 1992; Fisher 1922) to determine the statistical significance of a sub-trajectory (sub-matrix) using a contingency table with the number of trajectories (matrices) that, respectively, contain and do not contain sub-trajectories (sub-matrices) within a distance of ɛ. A correction for multiple-testing bias is also incorporated, which is necessary due to the calculation of p-values for a large number of trajectories (trajectory matrices), and is conducted using the Westfall-Young (WY) method (Terada et al. 2013; Westfall and Young 1993). Sub-trajectories (sub-matrices) are only identified as SSD if their p-value is less than their adjusted significance level, δ, which is, in turn, less than α = 0.05. The dataset labels are permuted B = 1,000 times as part of this procedure. The pruning and WY methods are applied simultaneously to reduce complexity (Step 1, Figure 2).

References

Agresti, A. (1992). A survey of exact inference for contingency tables. Stat. Sci. 7: 131–153, https://doi.org/10.1214/ss/1177011454.Search in Google Scholar

Ai, S., Na, J., De Silva, V., and Caine, M. (2021). A novel methodology for automating spatio-temporal data classification in basketball using active learning. In: 2021 IEEE 2nd international conference on pattern recognition and machine learning (PRML), IEEE, pp. 39–45.10.1109/PRML52754.2021.9520715Search in Google Scholar

Alcorn, M.A. and Nguyen, A. (2021). baller2vec: a multi-entity transformer for multi-agent spatiotemporal modeling. arXiv preprint arXiv:2102.03291.Search in Google Scholar

Bunker, R., Fujii, K., Hanada, H., and Takeuchi, I. (2021). Supervised sequential pattern mining of event sequences in sport to identify important patterns of play: an application to rugby union. PLOS One 16: e0256329, https://doi.org/10.1371/journal.pone.0256329.Search in Google Scholar PubMed PubMed Central

Cao, Y., Zhu, J., and Gao, F. (2016). An algorithm for mining moving flock patterns from pedestrian trajectories. In: Web technologies and applications: APWeb 2016 workshops, WDMA, GAP, and SDMA, Suzhou, China, september 23-25, 2016, proceedings. Springer, pp. 310–321.10.1007/978-3-319-45835-9_27Search in Google Scholar

Carling, C., Bloomfield, J., Nelsen, L., and Reilly, T. (2008). The role of motion analysis in elite soccer: contemporary performance measurement techniques and work rate data. Sports Med. 38: 839–862, https://doi.org/10.2165/00007256-200838100-00004.Search in Google Scholar PubMed

Chen, C.-H., Liu, T.-L., Wang, Y.-S., Chu, H.-K., Tang, N.C., and Liao, H.-Y.M. (2015). Spatio-temporal learning of basketball offensive strategies. In: Proceedings of the 23rd ACM international conference on multimedia, pp. 1123–1126.10.1145/2733373.2806297Search in Google Scholar

Facchinetti, T., Metulini, R., and Zuccolotto, P. (2023). Filtering active moments in basketball games using data from players tracking systems. Ann. Oper. Res. 325: 521–538, https://doi.org/10.1007/s10479-021-04391-8.Search in Google Scholar

Fisher, R.A. (1922). On the interpretation of χ 2 from contingency tables, and the calculation of p. J. R. Stat. Soc. 85: 87–94, https://doi.org/10.2307/2340521.Search in Google Scholar

Fujii, K., Yokoyama, K., Koyama, T., Rikukawa, A., Yamada, H., and Yamamoto, Y. (2016). Resilient help to switch and overlap hierarchical subsystems in a small human group. Sci. Rep. 6: 23911, https://doi.org/10.1038/srep23911.Search in Google Scholar PubMed PubMed Central

Fujii, K., Inaba, Y., and Kawahara, Y. (2017). Koopman spectral kernels for comparing complex dynamics: application to multiagent sport plays. In: European conference on machine learning and knowledge discovery in databases (ECML-PKDD’17). Springer, pp. 127–139.10.1007/978-3-319-71273-4_11Search in Google Scholar

Fujii, K., Kawasaki, T., Inaba, Y., and Kawahara, Y. (2018). Prediction and classification in equation-free collective motion dynamics. PLoS Comput. Biol. 14: e1006545, https://doi.org/10.1371/journal.pcbi.1006545.Search in Google Scholar PubMed PubMed Central

Fujii, K., Takeishi, N., Kawahara, Y., and Takeda, K. (2020). Policy learning with partial observation and mechanical constraints for multi-person modeling. arXiv preprint arXiv:2007.03155.Search in Google Scholar

Goes, F., Meerhoff, L., Bueno, M., Rodrigues, D., Moura, F., Brink, M., Elferink-Gemser, M., Knobbe, A., Cunha, S., Torres, R., et al.. (2021). Unlocking the potential of big data to support tactical performance analysis in professional soccer: a systematic review. Eur. J. Sport Sci. 21: 481–496, https://doi.org/10.1080/17461391.2020.1747552.Search in Google Scholar PubMed

Gudmundsson, J. and Horton, M. (2017). Spatio-temporal analysis of team sports. ACM Comput. Surv. 50: 1–34, https://doi.org/10.1145/3054132.Search in Google Scholar

Herrera, F., Ventura, S., Bello, R., Cornelis, C., Zafra, A., Sánchez-Tarragó, D., and Vluymans, S. (2016). Multiple instance learning: foundations and algorithms. Springer, Cham.10.1007/978-3-319-47759-6Search in Google Scholar

Hochreiter, S. and Schmidhuber, J. (1997). Long short-term memory. Neural Comput. 9: 1735–1780, https://doi.org/10.1162/neco.1997.9.8.1735.Search in Google Scholar PubMed

Hrovat, G., Fister, I.Jr, Yermak, K., Stiglic, G., and Fister, I. (2015). Interestingness measure for mining sequential patterns in sports. J. Intell. Fuzzy Syst. 29: 1981–1994, https://doi.org/10.3233/ifs-151676.Search in Google Scholar

Hughes, M.D. and Bartlett, R.M. (2002). The use of performance indicators in performance analysis. J. Sports Sci. 20: 739–754, https://doi.org/10.1080/026404102320675602.Search in Google Scholar PubMed

Huttenlocher, D.P., Klanderman, G.A., and Rucklidge, W.J. (1993). Comparing images using the hausdorff distance. IEEE Trans. Pattern Anal. Mach. Intell. 15: 850–863, https://doi.org/10.1109/34.232073.Search in Google Scholar

Karcher, C. and Buchheit, M. (2014). On-court demands of elite handball, with special reference to playing positions. Sports Med. 44: 797–814, https://doi.org/10.1007/s40279-014-0164-z.Search in Google Scholar PubMed

Le, H.M., Yue, Y., Carr, P., and Lucey, P. (2017). Coordinated multi-agent imitation learning. In: International conference on machine learning. PMLR, pp. 1995–2003.Search in Google Scholar

Le Vo, D.N., Sakuma, T., Ishiyama, T., Toda, H., Arai, K., Karasuyama, M., Okubo, Y., Sunaga, M., Hanada, H., Tabei, Y., et al.. (2020). Stat-DSM: statistically discriminative sub-trajectory mining with multiple testing correction. IEEE Trans. Knowl. Data Eng. 34: 1477–1488.10.1109/TKDE.2020.2994344Search in Google Scholar

Li, Z., Han, J., Ji, M., Tang, L.-A., Yu, Y., Ding, B., Lee, J.-G., and Kays, R. (2011). Movemine: mining moving object data for discovery of animal movement patterns. ACM Trans. Intell. Syst. Technol. 2: 1–32, https://doi.org/10.1145/1989734.1989741.Search in Google Scholar

Lucey, P., Bialkowski, A., Carr, P., Yue, Y., and Matthews, I. (2014). How to get an open shot: analyzing team movement in basketball using tracking data. In: Proceedings of the 8th annual MIT SLOAN sports analytics conference.Search in Google Scholar

Mazimpaka, J.D. and Timpf, S. (2016). Trajectory data mining: a review of methods and applications. J. Spat. Inf. Sci. 2016: 61–99, https://doi.org/10.5311/josis.2016.13.263.Search in Google Scholar

McIntyre, A., Brooks, J., Guttag, J., and Wiens, J. (2016). Recognizing and analyzing ball screen defense in the NBA. In: Proceedings of the MIT sloan sports analytics conference, Boston, MA, USA, pp. 11–12.Search in Google Scholar

McQueen, A., Wiens, J., and Guttag, J. (2014). Automatically recognizing on-ball screens. In: 2014 MIT sloan sports analytics conference.Search in Google Scholar

Metulini, R., Manisera, M., and Zuccolotto, P. (2018). Modelling the dynamic pattern of surface area in basketball and its effects on team performance. J. Quant. Anal. Sports 14: 117–130, https://doi.org/10.1515/jqas-2018-0041.Search in Google Scholar

Papadimitriou, C.H. and Steiglitz, K. (1982). Combinatorial optimization, 24. Prentice Hall, Englewood Cliffs, NJ.Search in Google Scholar

Papalexakis, E. and Pelechrinis, K. (2018). thoops: a multi-aspect analytical framework for spatio-temporal basketball data. In: Proceedings of the 27th ACM international conference on information and knowledge management, pp. 2223–2232.10.1145/3269206.3272002Search in Google Scholar

Raabe, D., Nabben, R., and Memmert, D. (2023). Graph representations for the analysis of multi-agent spatiotemporal sports data. Appl. Intell. 53: 3783–3803, https://doi.org/10.1007/s10489-022-03631-z.Search in Google Scholar

Rein, R. and Memmert, D. (2016). Big data and tactical analysis in elite soccer: future challenges and opportunities for sports science. SpringerPlus 5: 1–13, https://doi.org/10.1186/s40064-016-3108-2.Search in Google Scholar PubMed PubMed Central

Sampaio, J., McGarry, T., Calleja-González, J., Jiménez Sáiz, S., Schelling i del Alcázar, X., and Balciunas, M. (2015). Exploring game performance in the National Basketball Association using player tracking data. PLOS One 10: e0132894, https://doi.org/10.1371/journal.pone.0132894.Search in Google Scholar PubMed PubMed Central

Shah, R. and Romijnders, R. (2016). Applying deep learning to basketball trajectories. arXiv preprint arXiv:1608.03793.Search in Google Scholar

Sicilia, A., Pelechrinis, K., and Goldsberry, K. (2019). Deephoops: evaluating micro-actions in basketball using deep feature representations of spatio-temporal data. In: Proceedings of the 25th ACM SIGKDD international conference on knowledge discovery & data mining, pp. 2096–2104.10.1145/3292500.3330719Search in Google Scholar

Skinner, B. and Guy, S.J. (2015). A method for using player tracking data in basketball to learn player skills and predict team performance. PLOS One 10: e0136393, https://doi.org/10.1371/journal.pone.0136393.Search in Google Scholar PubMed PubMed Central

Su, H., Liu, S., Zheng, B., Zhou, X., and Zheng, K. (2020). A survey of trajectory distance measures and performance evaluation. VLDB J. 29: 3–32, https://doi.org/10.1007/s00778-019-00574-9.Search in Google Scholar

Taha, A.A. and Hanbury, A. (2015). An efficient algorithm for calculating the exact Hausdorff distance. IEEE Trans. Pattern Anal. Mach. Intell. 37: 2153–2163, https://doi.org/10.1109/tpami.2015.2408351.Search in Google Scholar PubMed

Terada, A., Tsuda, K., and Sese, J. (2013). Fast Westfall-Young permutation procedure for combinatorial regulation discovery. In: 2013 IEEE international conference on bioinformatics and biomedicine. IEEE, pp. 153–158.10.1109/BIBM.2013.6732479Search in Google Scholar

Terner, Z. and Franks, A. (2021). Modeling player and team performance in basketball. Annu. Rev. Stat. Appl. 8: 1–23, https://doi.org/10.1146/annurev-statistics-040720-015536.Search in Google Scholar

Tian, C., De Silva, V., Caine, M., and Swanson, S. (2019). Use of machine learning to automate the identification of basketball strategies using whole team player tracking data. Appl. Sci. 10: 24, https://doi.org/10.3390/app10010024.Search in Google Scholar

Wang, K.-C. and Zemel, R. (2016). Classifying NBA offensive plays using neural networks. In: Proceedings of MIT sloan sports analytics conference, Vol. 4.Search in Google Scholar

Westfall, P.H. and Young, S.S. (1993). Resampling-based multiple testing: examples and methods for p-value adjustment, 279. John Wiley & Sons, New York.Search in Google Scholar

Yoon, Y., Hwang, H., Choi, Y., Joo, M., Oh, H., Park, I., Lee, K.-H., and Hwang, J.-H. (2019). Analyzing basketball movements and pass relationships using realtime object tracking techniques based on deep learning. IEEE Access 7: 56564–56576, https://doi.org/10.1109/access.2019.2913953.Search in Google Scholar

Zhao, Y., Shang, S., Wang, Y., Zheng, B., Nguyen, Q.V.H., and Zheng, K. (2018a). Rest: a reference-based framework for spatio-temporal trajectory compression. In: Proceedings of the 24th ACM SIGKDD international conference on knowledge discovery & data mining, pp. 2797–2806.10.1145/3219819.3220030Search in Google Scholar

Zhao, Y., Yang, R., Chevalier, G., Shah, R.C., and Romijnders, R. (2018b). Applying deep bidirectional lstm and mixture density network for basketball trajectory prediction. Optik 158: 266–272, https://doi.org/10.1016/j.ijleo.2017.12.038.Search in Google Scholar

Zheng, Y. (2015). Trajectory data mining: an overview. ACM Trans. Intell. Syst. Technol. 6: 1–41, https://doi.org/10.1145/2743025.Search in Google Scholar

Ziyi, Z., Takeda, K., and Fujii, K. (2022). Cooperative play classification in team sports via semi-supervised learning. Int. J. Comput. Sci. Sport 21: 111–121, https://doi.org/10.2478/ijcss-2022-0006.Search in Google Scholar

Ziyi, Z., Bunker, R., Takeda, K., and Fujii, K. (2023). Multi-agent deep-learning based comparative analysis of team sport trajectories. IEEE Access 11: 43305–43315, https://doi.org/10.1109/access.2023.3269287.Search in Google Scholar

Zuccolotto, P., Sandri, M., and Manisera, M. (2021). Spatial performance indicators and graphs in basketball. Soc. Indic. Res. 156: 725–738, https://doi.org/10.1007/s11205-019-02237-2.Search in Google Scholar

Received: 2023-04-20
Accepted: 2024-07-29
Published Online: 2024-09-23

© 2024 Walter de Gruyter GmbH, Berlin/Boston

Downloaded on 7.10.2025 from https://www.degruyterbrill.com/document/doi/10.1515/jqas-2023-0039/html?recommended=sidebar
Scroll to top button