Olympic Medal Count Prediction Model Based on Xgboost
DOI:
https://doi.org/10.54097/m30t9y42Keywords:
Olympic Games, Olympic Medal Prediction, XGBoost, RMSE.Abstract
Accurately predicting Olympic medal counts for each country is of significant value for shaping sports policy, optimizing resource allocation, and advancing the sports industry. This study presents a predictive model based on the Extreme Gradient Boosting (XGBoost) algorithm to estimate the number of gold, silver, and bronze medals awarded to each country in the Olympics. The validity of the model is demonstrated by predicting the medal counts for the 2024 Olympics. Relevant data from 1896 to 2024 were collected and preprocessed. Five key features were extracted for each Olympic edition: the host country, the number of gold, silver, and bronze medals, the total medal count, the number of athletes per country, the gender distribution of athletes, and the sports events in which each country participated. Data from 1896 to 2012 were used for training, while data from the 2016 and 2020 Olympics were used as the test set. The model was trained using the XGBoost algorithm, and optimization was performed by minimizing the root mean square error (RMSE). Four features for the 2024 Olympics—host country, number of participating athletes, gender distribution of athletes, and the sports events contested—were used to predict the medal counts for each country. By comparing the predicted results with the actual data, the RMSE values for gold, silver, bronze, and total medals were calculated to be 0.6462, 0.5547, 0.2965, and 0.3922, respectively. These results validate the exceptional performance of the XGBoost model in predicting Olympic medal counts, providing effective and forward-looking strategic insights for the optimization of sports resource allocation and the setting of competitive goals across countries.
Downloads
References
[1] Dai L, Zhang S, Shi X Z. Design of a prediction model based on ARIMA-Random Forest algorithm[C]// International Conference on Digital Analysis and Processing, Intelligent Computation (DAPIC). Incheon, Korea, Republic of: IEEE, 2025: 820–824.
[2] Ran Z X, Ou H H, Qin X D, et al. Research on Olympic medal prediction based on the FCCL-Transformer model[C]// International Conference on Digital Analysis and Processing, Intelligent Computation (DAPIC). Incheon, Korea, Republic of: IEEE, 2025: 915–922.
[3] Yang Y K, Feng W. Research on Olympic medal prediction based on the LSTM-GWO-DeepForest model[C]// International Conference on Digital Analysis and Processing, Intelligent Computation (DAPIC). Incheon, Korea, Republic of: IEEE, 2025: 844–849.
[4] Yang H, Li X Y, Li S X. Prediction and analysis of Olympic medals based on Probit regression and Tobit model[C]// International Conference on Digital Analysis and Processing, Intelligent Computation (DAPIC). Incheon, Korea, Republic of: IEEE, 2025: 839–843.
[5] Asha V, Vasumathi M T, Chowdhury D, et al. Comparative Evaluation of Random Forest, XGBoost, and SVC Models for Fire Detection[C]. Proceedings of the Second International Conference on Self Sustainable Artificial Intelligence Systems (ICSSAS-2024). Erode, India: IEEE, 2024: 470-475.
[6] He A Q, Xiong W, Bai Y L. XGBoost-based Voltage Stability Margin Prediction for Power Systems[C]. IEEE International Conference on Sensors, Electronics and Computer Engineering (ICSECE). Jinzhou, China: IEEE, 2023: 1225-1228.
[7] Bi Z B, Fu C X, Zhu J Y, et al. Control Ore Processing Quality Based on Xgboost Machine Learning Algorithm[C]. Asirope Conference on Electronics, Data Processing and Informatics (ACEDPI). Prague, Czech Republic: IEEE, 2023: 177-180.
[8] Wang Z A, Wang D Z, Li Y M. Intelligent Prediction of Short-Term Load of Distribution Network Based on XGBoost Algorithm[C]. Chinese Control and Decision Conference (CCDC). Xi'an, China: IEEE, 2024: 4617-4621.
[9] Dharani Nivash A, Ammal Dhanalakshmi M. Identification of Cyber Bullying Using XGBoost Compared to Random Forest Classifier to Improve Accuracy[C]. Chinese Control and Decision Conference (CCDC). Nitte, India: IEEE, 2025: 856-859.
[10] Sheng C, Yu H Z. An Optimized Prediction Algorithm Based on XGBoost[C]. International Conference on Networking and Network Applications (NaNA). Urumqi, China: IEEE, 2022: 442-447.
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Highlights in Science, Engineering and Technology

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.