Abstract
Prior clustering approaches of soccer players have employed a variety of methods based on various data categories, but none of them have focused on clustering by career paths characterized through a time series analysis of yearly performance quality. Therefore, this study aims to propose a methodology how a career path can be represented as a time series of a player’s seasonal qualities and then be clustered with players that have a similar career path. The underlying data focuses on soccer players from the five largest European soccer nations (Big-5). This allows for the identification of different types of career paths of players and the investigation of significant disparities between career paths among the Big-5 nations. In line with our proposed methodological approach, we identified and interpreted 13 different clusters of player career paths. These range from the cluster with the highest player quality scores to the pattern comprising players with the weakest scores. Further, the detected clusters show significant differences regarding variables of soccer players’ early career phase in adolescence (e.g., age of debut in professional soccer, years spent in a youth academy). The presented approach might represent a first step for stakeholders in soccer to get an objective insight in players’ career by utilizing mainly freely available data sources.
-
Research ethics: The local Institutional Review Board deemed the study exempt from review.
-
Author contributions: The authors have accepted responsibility for the entire content of this manuscript and approved its submission.
-
Competing interests: The authors state no conflict of interest.
-
Research funding: None declared.
-
Data availability: The raw data can be obtained on request from the corresponding author.
References
Akhanli, S.E. and Hennig, C. (2017). Some issues in distance construction for football players performance data. Ann. Data Sci. Ser. A 2: 29–45.Search in Google Scholar
Akhanli, S.E. and Hennig, C. (2023). Clustering of football players based on performance data and aggregated clustering validity indexes. J. Quant. Anal. Sports 19: 103–123, https://doi.org/10.1515/jqas-2022-0037.Search in Google Scholar
Amaratunga, D. and Cabrera, J. (2001). Analysis of data from viral dna microchips. J. Am. Stat. Assoc. 96: 1161–1170, https://doi.org/10.1198/016214501753381814.Search in Google Scholar
Baker, J., Cobley, S., and Fraser‐Thomas, J. (2009). What do we know about early sport specialization? Not much!. High Abil. Stud. 20: 77–89, https://doi.org/10.1080/13598130902860507.Search in Google Scholar
Bellman, R. and Kalaba, R. (1959). On adaptive control processes. IRE Trans. Autom. Control 4: 1–9, https://doi.org/10.1109/tac.1959.1104847.Search in Google Scholar
Bianco, T., Malo, S., and Orlick, T. (1999). Sport injury and illness: elite skiers describe their experiences. Res. Q. Exerc. Sport 70: 157–169, https://doi.org/10.1080/02701367.1999.10608033.Search in Google Scholar PubMed
Brewer, B.W., Van Raalte, J.L., and Linder, D.E. (1993). Athletic identity: hercules’ muscles or achilles heel? Int. J. Sport Psychol. 24: 237–254.10.1037/t15488-000Search in Google Scholar
Brown, M.K. and Rabiner, L. (1982) Dynamic time warping for isolated word recognition based on ordered graph searching techniques. In: ICASSP ’82. IEEE international conference on acoustics, speech, and signal processing, Vol. 7, pp. 1255–1258.10.1109/ICASSP.1982.1171695Search in Google Scholar
Caliński, T. (2014). Dendrogram. John Wiley & Sons, Ltd, Hoboken, New Jersey, US.10.1002/9781118445112.stat05624Search in Google Scholar
Carpels, T., Scobie, N., Macfarlane, N.G., and Kemi, O.J. (2021). Youth-to-senior transition in elite european club soccer. Int. J. Exerc. Sci 14: 1192–1203.Search in Google Scholar
Collins, D., MacNamara, Á., and McCarthy, N. (2016). Super champions, champions, and almosts: important differences and commonalities on the rocky road. Front. Psychol. 6: 1–11, https://doi.org/10.3389/fpsyg.2015.02009.Search in Google Scholar PubMed PubMed Central
Curran, G.M., Landes, S.J., McBain, S.A., Pyne, J.M., Smith, J.D., Fernandez, M.E., Chambers, D.A., and Mittman, B.S. (2022). Reflections on 10 years of effectiveness-implementation hybrid studies. Front. Health Serv. 2: 1–10, https://doi.org/10.3389/frhs.2022.1053496.Search in Google Scholar PubMed PubMed Central
Côté, J., Salmela, J.H., Baria, A., and Russell, S.J. (1993). Organizing and interpreting unstructured qualitative data. Sport Psychol. 7: 127–137, https://doi.org/10.1123/tsp.7.2.127.Search in Google Scholar
D’Urso, P., Giovanni, L.D., and Vitale, V. (2023). A robust method for clustering football players with mixed attributes. Ann. Oper. Res. 325: 9–36, https://doi.org/10.1007/s10479-022-04558-x.Search in Google Scholar
Güllich, A. and Emrich, E. (2014). Considering long-term sustainability in the development of world class success. Eur. J. Sport Sci. 14: S383–S397, https://doi.org/10.1080/17461391.2012.706320.Search in Google Scholar PubMed
Kaufman, L. and Rousseeuw, P. (1990). Finding Groups in data: an introduction to cluster analysis. John Wiley & Sons, Ltd, Hoboken, New Jersey, US.10.1002/9780470316801Search in Google Scholar
Lanwehr, R., Honsel, M., and Wilms, R. (2021). The evaluation of quality – a comparison of methods based on youth academies in German professional football. GIO. J. Appl. Organ. Psychol. 52: 25–35, https://doi.org/10.1007/s11612-021-00556-y.Search in Google Scholar
MacNamara, Á., Button, A., and Collins, D. (2010). The role of psychological characteristics in facilitating the pathway to elite performance part 1: identifying mental skills and behaviors. Sport Psychol. 24: 52–73, https://doi.org/10.1123/tsp.24.1.52.Search in Google Scholar
Madhulatha, T.S. (2012). An overview on clustering methods. IOSR J. Eng. 2: 719–725, https://doi.org/10.9790/3021-0204719725.Search in Google Scholar
Monteiro, R., Monteiro, D., Nunes, C., Torregrossa, M., and Travassos, B. (2020). Identification of key career indicators in Portuguese football players. Int. J. Sports Sci. Coach. 15: 533–541, https://doi.org/10.1177/1747954120923198.Search in Google Scholar
Nasco, S.A. and Webb, W.M. (2006). Toward an expanded measure of athletic identity: the inclusion of public and private dimensions. J. Sport Exerc. Psychol. 28: 434–453, https://doi.org/10.1123/jsep.28.4.434.Search in Google Scholar
Paule-Koba, A.L. (2019). Identifying athlete’s majors and career aspirations: the next step in clustering research. J. Athl. Dev. Exp. 1: 8–15, https://doi.org/10.25035/jade.01.01.02.Search in Google Scholar
Roca, A. and Ford, P.R. (2020). Decision-making practice during coaching sessions in elite youth football across european countries. Sci. Med. Footb. 4: 263–268, https://doi.org/10.1080/24733938.2020.1755051.Search in Google Scholar
Scikit-Learn (2022). Quantile transform, Available at: https://scikit-learn.org/stable/modules/generated/sklearn.preprocessing.quantile_transform.html (Accessed 27 Nov 2022).Search in Google Scholar
Shelly, Z., Burch, R., Tian, W., Strawderman, L., Piroli, A., and Bichey, C. (2020). Using k-means clustering to create training groups for elite american football student-athletes based on game demands. Int. J. Kinesiol. Sports Sci. 8: 47–63, https://doi.org/10.7575//aiac.ijkss.v.8n.2p.47.Search in Google Scholar
UEFA (2022). Country coefficients, Available at: https://uefa.com/nationalassociations/uefarankings/country (Accessed 27 Nov 2022).Search in Google Scholar
Williams, G.G. and MacNamara, Á. (2022). Challenge is in the eye of the beholder: exploring young athlete’s experience of challenges on the talent pathway. Int. J. Sports Sci. 40: 1078–1087, https://doi.org/10.1080/02640414.2022.2047503.Search in Google Scholar PubMed
© 2024 Walter de Gruyter GmbH, Berlin/Boston
Articles in the same Issue
- Frontmatter
- Research Articles
- Estimating positional plus-minus in the NBA
- No cheering in the background? Individual performance in professional darts during COVID-19
- Spatial roles in hockey special teams
- Career path clustering of elite soccer players among European Big-5 nations utilizing Dynamic Time Warping
- The strategic jump-the order effect on winning “The Final Three” in long jump competitions
Articles in the same Issue
- Frontmatter
- Research Articles
- Estimating positional plus-minus in the NBA
- No cheering in the background? Individual performance in professional darts during COVID-19
- Spatial roles in hockey special teams
- Career path clustering of elite soccer players among European Big-5 nations utilizing Dynamic Time Warping
- The strategic jump-the order effect on winning “The Final Three” in long jump competitions