Generalised additive models for location, scale, and shape in sports science: a systematic review

Main Article Content

Lucas F. Rosa
https://orcid.org/0000-0002-3263-4089
Andréa C. Konrath
https://orcid.org/0000-0002-3742-5032
Luiz R. Nakamura
https://orcid.org/0000-0002-7312-2717

Abstract

Sport has increasingly evolved into an interdisciplinary research field, where statistical modelling plays a central role in performance analysis, injury prediction, tactical optimisation, and audience behaviour. Although traditional methods like linear regression and generalised linear models are frequently employed, their inability to handle complex data structures has led to the expanding usage of more flexible approaches. In this context, generalised additive models for location, scale and shape (GAMLSS) are one of the most flexible statistical frameworks currently available, enabling the simultaneous modelling of multiple distributional parameters. Hence, in this paper, we perform a detailed systematic review exploring the application of GAMLSS in sports science, drawing on peer-reviewed articles. The results show a variety of applications, such as the development of reference growth curves, athlete performance modelling, match-fixing detection, and forecasting of sport-related consumer behaviour. GAMLSS have proven especially useful in contexts where traditional models are inadequate, offering enhanced flexibility in capturing distributional nuances. Nonetheless, opportunities remain to integrate GAMLSS with machine learning techniques and to extend their use across underexplored domains in sport. This review contributes to the field by outlining current trends, highlighting methodological strengths, and identifying promising directions for future research.

Article Details

How to Cite
Rosa, L. F., Konrath, A. C., & Nakamura, L. R. (2026). Generalised additive models for location, scale, and shape in sports science: a systematic review. Brazilian Journal of Biometrics, 44(2), e-44946. https://doi.org/10.28951/bjb.v44i2.946
Section
Articles

References

1. Abalo-Núñez, R, Gutiérrez-Sánchez, A, Pérez, M. C. I. & Vernetta-Santana, M. Injury prediction in aerobic gymnastics based on anthropometric variables. Science & Sports 33, 228–236 (2018).

2. Adam, T, Mayr, A & Kneib, T. Gradient boosting in Markov-switching generalized additive models for location, scale and shape. Econometrics and Statistics 22, 3–16 (2022).

3. Almarashi, A. M., Daniyal, M & Jamal, F. A novel comparative study of NNAR approach with linear stochastic time series models in predicting tennis player’s performance. BMC Sports Science, Medicine and Rehabilitation 16, 28 (2024).

4. Aria, M & Cuccurullo, C. bibliometrix: an R-tool for comprehensive science mapping analysis. Journal of Informetrics 11, 959–975 (2017).

5. Ayodeji, I & Abiodun, O. The effect of rivalry on sport performance: a case study of Cristiano Ronaldo and Lionel Messi. Sports Innovation Journal 5, 61–77 (2024).

6. Cerqueira, P. H. R., Nakamura, L. R., Pescim, R. R. & Leandro, R. A. Investigating the underlying causal network on European football teams. Journal of Data Science 15, 293–312 (2017).

7. Datson, N, Weston, M, Drust, B, Atkinson, G, Lolli, L & Gregson, W. Reference values for performance test outcomes relevant to English female soccer players. Science and Medicine in Football 6, 589–596 (2022).

8. De Araújo, F. J. M., Guerra, R. R. & Peña-Ramírez, F. A. The Burr XII quantile regression for salary-performance models with applications in the sports economy. Computational and Applied Mathematics 41, 282 (2022).

9. Exel, J & Dabnichki, P. Precision sports science: what is next for data analytics for athlete performance and well-being optimization? Applied Sciences 14, 3361 (2024).

10. Ferenhof, H. A. & Fernandes, R. F. Demystifying literature review in the AI era: updating the SSF method with generative artificial intelligence support. Biblios 88, e003 (2025).

11. Ferenhof, H. A. & Fernandes, R. F. Demystifying the literature review as basis for scientificwriting: SSF method. Revista ACB 21, 550–563 (2016).

12. Ferrari, S & Cribari-Neto, F. Beta regression for modelling rates and proportions. Journal of Applied Statistics 31, 799–815 (2004).

13. Gifford, M & Bayrak, T. A predictive analytics model for forecasting outcomes in the National Football League games using decision tree and logistic regression. Data Analytics Journal 8, 100296 (2023).

14. Hastie, T & Tibshirani, R. Generalized additive models. Statistical Science 1, 297–310 (1986).

15. Hester, D, Hunter, G, Shuleva, K & Kekes-Sabo, T. Bridging the gap – practical application: review and evaluation of relative strength handicapping models. National Strength and Conditioning Association Journal 12, 54–57 (1990).

16. Hofner, B, Mayr, A & Schmid, M. gamboostLSS: an R package for model building and variable selection in the GAMLSS Framework. Journal of Statistical Software 74, 1–31 (2016).

17. Huebner, M & Cole, T. J. Ranking performances of Olympic-style weightlifters adjusted for body mass on the same scale for both sexes: a novel approach. Journal of Sports Sciences 42, 2124–2130 (2024).

18. Jain, P. K., Quamer, W & Pamula, R. Sports result prediction using data mining techniques in comparison with base line model. Opsearch 58, 54–70 (2021).

19. Jia, H, Wan, B, Bu, T, Luo, Y, Ma, W, Huang, S, Gang, L, Deng, W & Liu, Z. Chinese physical fitness standard for campus football players: a pilot study of 765 children aged 9 to 11. Frontiers in Physiology 13, 1023910 (2022).

20. Lolli, L, Gregson, W, Bonanno, D, Kuitunen, S & Di Salvo, V. Age-related reference intervals for physical performance test outcomes relevant to male youth Middle Eastern football players. International Journal of Sports Physiology and Performance 18, 1283–1295 (2023).

21. Makaruk, H, Starzak, M, Tarkowski, P, Sadowski, J & Winchester, J. The effects of resistance training on sport-specific performance of elite athletes: a systematic review with meta-analysis. Journal of Human Kinetics 15, 135–155 (2024).

22. Markopoulou, C, Papageorgiou, G & Tjortjis, C. Diverse machine learning for forecasting goal-scoring likelihood in elite football leagues. Machine Learning & Knowledge Extraction 6, 1762–1781 (2024).

23. Mello, J. B., Nobre, G. C., Gaya, A. R., Duncan, M, Nevill, A & Gaya, A. Growth charts of Brazilian youngs: 20-years data of 95,000 children and adolescents from “Projeto Esporte Brasil”. Ciência & Saúde Coletiva 29, e06412023 (2024).

24. Nakamura, L. R., Cerqueira, P. H. R., Ramires, T. G., Pescim, R. R., Rigby, R. A. & Stasinopoulos, D. M. A new continuous distribution on the unit interval applied to modelling the points ratio of football teams. Journal of Applied Statistics 46, 416–431 (2019).

25. Nelder, J. A. & Wedderburn, R. W. M. Generalized linear models. Journal of the Royal Statistical Society Series A: General 135, 370–384 (1972).

26. Ötting, M, Langrock, R & Deutscher, C. Integrating multiple data sources in match-fixing warning systems. Statistical Modelling 18, 483–504 (2018).

27. Page, M. J. et al. The PRISMA 2020 statement: an updated guideline for reporting systematic reviews. BMJ 372, 71 (2021).

28. Pala, L. O. O., Nakamura, L. R., Sabe, E. M. & Ramires, T. G. Agricultural insurance and climate variability: analyzing soybean yield risks in Brazil using distributional regression models. The Geneva Papers on Risk and Insurance – Issues and Practice, Forthcoming (2026).

29. Pantzalis, V. C. & Tjortjis, C. Sports analytics for football league table and player performance prediction in Proceedings of the 11th International Conference on Information, Intelligence, Systems and Applications (IISA) 1-8 (2020).

30. Patel, P. A., Gopali, R, Reddy, A & Patel, K. K. Baseball-related ocular injuries in the United States: a 10-year analysis. Expert Review of Ophthalmology 17, 145–151 (2022).

31. Podrigalo, L. V., Volodchenko, A. A., Rovnaya, O. A., Podavalenko, O. V. & Grynova, T. I. The prediction of success in kickboxing based on the analysis of morphofunctional, physiological, biomechanical and psychophysiological indicators. Physical Education of Students 22, 51–56 (2018).

32. R Core Team. R: A Language and Environment for Statistical Computing R Foundation for Statistical Computing (Vienna, Austria, 2025). https://www.R-project.org/.

33. Ramires, T. G., Nakamura, L. R., Righetto, A. J., Carvalho, R. J., Vieira, L. A. & Pereira, C. A. B. Comparison between highly complex location models and GAMLSS. Entropy 23, 469 (2021).

34. Rigby, R. A. & Stasinopoulos, D. M. Generalized additive models for location, scale and shape. Journal of the Royal Statistical Society Series C: Applied Statistics 54, 507–554 (2005).

35. Rigby, R. A., Stasinopoulos, M. D., Heller, G. Z. & De Bastiani, F. Distributions for Modeling Location, Scale, and Shape (CRC Press, Boca Raton, FL, 2019).

36. Roczniok, R, Stanula, A, Maszczyk, A, Mostowik, A, Kowalczyk, M, Fidos-Czuba, O & Zaja¸c, A. Physiological, physical and on-ice performance criteria for selection of elite ice hockey teams. Biology of Sport 33, 43–48 (2016).

37. Roquim, F. V., Lima, R. R., Nakamura, L. R., Ramires, T. G. & Blanco, Y. A. C. Dealing with highly kurtotic count data with excess zeroes: comparing different treatments in the control of dairy cattle gastrointestinal parasites. Brazilian Journal of Biometrics 41, 412–423 (2023).

38. Ruf, L, Altmann, S, Kloss, C & Härtel, S. Normative reference centiles for sprint performance in high-level youth soccer players: the need to consider biological maturity. Pediatric Exercise Science 36, 192–200 (2024).

39. Sarlis, V, Chatziilias, V, Tjortjis, C & Mandalidis, D. A data science approach analysing the impact of injuries on basketball player and team performance. Information Systems 99, 101750 (2021).

40. Silva-Alonso, T, Iglesias-Pérez, M. C. & García-Soidán, J. L. Percentile curves and reference values for 2000-m rowing ergometer performance time in international rowers aged 14-70 years. Journal of Human Sport & Exercise 13, 731–742 (2018).

41. Stasinopoulos, M. D., Rigby, R. A. & De Bastiani, F. GAMLSS: a distributional regression approach. Statistical Modelling 18, 248–273 (2018).

42. Staunton, C. A., Romann, M, Björklund, G & Born, D. Diving into a pool of data: using principal component analysis to optimize performance prediction in women’s short-course swimming. Journal of Sports Science 42, 519–526 (2024).

43. Swindell, H. W., Marcille, M. L., Trofa, D. P., Paulino, F. E., Desai, N. N., Lynch, T. S., Ahmad, C. S. & Popkin, C. A. An analysis of sports specialization in NCAA Division I collegiate athletics. Orthopaedic Journal of Sports Medicine 7, 2325967118821179 (2019).

44. Vanhelst, J, Le Cunuder, A, Léger, L, Duclos, M, Mercier, D & Carré, F. Sport participation, weight status, and physical fitness in French adolescents. European Journal of Pediatrics 183, 5213–5221 (2024).

45. Wang, Z, Su, L., Agudamu, Gong, T, Bu, T & Zhang, Y. Factors driving FIFA world cup 2022 viewership ratings in mainland China: marketing outlooks for FIFA world cup 2026. Frontiers in Sports and Active Living 5, 1282898 (2024).

46. Wei, S. Deep learning algorithms for sports data analysis and prediction in sports competitions. Applied Mathematics and Nonlinear Sciences 9, 1–14 (2024).

47. Wheatcroft, E. Forecasting football matches by predicting match statistics. Journal of Sports Analytics 7, 77–97 (2021).

48. Yao, Y, Wan, B, Long, B, Bu, T & Zhang, Y. In quest of China sports lottery development path to common prosperity in 2035. Plos One 19, e0297629 (2024).

49. Yue, J. C., Chou, E. P., Hsieh, M. H. & Hsiao, L. C. A study of forecasting tennis matches via the Glicko model. Plos One 17, e0266838 (2022).

50. Zotero. Zotero (Version 6.0) Corporation for Digital Scholarship. Available at: https://www.zotero.org/.

Accessed: 22 Jul 2025. 2024.

Similar Articles

<< < 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 > >> 

You may also start an advanced similarity search for this article.