Research on Olympic Performance Prediction System Based on Multi-source Data Fusion
DOI:
https://doi.org/10.54097/ypv7e705Keywords:
random forest, advantage score, Olympic medal distribution, great coach effect.Abstract
This paper proposes an Olympic performance prediction system based on multi-source data fusion. First, through data cleaning and feature engineering, we constructed a dataset containing features such as athlete scale, number of events, and host country identification, and introduced an advantage score to quantify national competitiveness in different events. Random forest algorithm was used to predict the medal distribution of the 2028 Olympics, and a linear regression model was employed to quantitatively analyze the impact of elite coaches on medal acquisition. Experimental results show that the prediction system achieved an R² value of 0.716 on the test set, outperforming other machine learning models. Meanwhile, the study found that the coaching factor has a significant contribution to improving competitive performance, providing important reference for countries to optimize their coaching configuration.
Downloads
References
[1] SAJADI S M, BAGHAIE S, REZAEI R. Optimizing sports development: Identifying and prioritizing key indicators for professional and competitive sports [J]. World Development, 2024, 180: 106651.
[2] MCCULLOUGH B P, ORR M, KELLISON T. Sport ecology: Conceptualizing an emerging subdiscipline within sport management [J]. Journal of Sport Management, 2020, 34 (6): 509 - 520.
[3] GREENWELL T C, DANZEY-BUSSELL L A, SHONK D J. Managing sport events [M]. Human Kinetics, 2024.
[4] EL-MAGHRABI Y, SHARIF M. Game Changers or Game Predictors? Big Data Analytics in Sports for Performance Enhancement and Fan Engagement [J]. Journal of Contemporary Healthcare Analytics, 2022, 6 (6): 19 - 39.
[5] ROSSI A. Predictive models in sport science: multi-dimensional analysis of football training and injury prediction [J]. 2017.
[6] HODSON T O. Root mean square error (RMSE) or mean absolute error (MAE): When to use them or not [J]. Geoscientific Model Development Discussions, 2022, 2022: 1 - 10.
[7] CHICCO D, WARRENS M J, JURMAN G. The coefficient of determination R-squared is more informative than SMAPE, MAE, MAPE, MSE and RMSE in regression analysis evaluation [J]. Peerj computer science, 2021,7: e623.
[8] ROBESON S M, WILLMOTT C J. Decomposition of the mean absolute error (MAE) into systematic and unsystematic components [J]. PloS one, 2023, 18 (2): e279774.
[9] NASON A E. The Contribution of Domestic and International Conflict in Renaissance Italy to the Sport of Fencing [J]. 2023.
[10] MARQUES OLIVEIRA F, COSTA TELLES S, CARLOS NERY L, et al. Beach Volleyball Management in Brazil [J]. 2023.
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Highlights in Science, Engineering and Technology

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.







