Research on gold medal prediction model based on multi-algorithm fusion

Authors

  • Zhiyuan Xiao
  • Ziyang Ma
  • Junjie Wang
  • Wei Liu

DOI:

https://doi.org/10.54097/qcs3ft45

Keywords:

SHAP method, Ridge regression model, home field advantage effect, Logistic regression.

Abstract

In this study, a prediction framework based on multi-source historical data is proposed, and feature engineering and regularization techniques are used to construct interpretable machine learning models. After screening key features through Pearson correlation analysis, ridge regression and SHAP prediction models are built respectively, and the main effect correction parameter is innovatively introduced to enhance the model generalization ability. In the process of feature space construction, dynamic weight indicators are generated based on time-series data, and high-dimensional feature dimensionality reduction is accomplished through logistic regression, ultimately forming a predictor system containing four core dimensions. At the algorithmic level, this study compares the performance of L2 regularized ridge regression and game theory-based SHAP prediction model in prediction. And the experiments are parameter tuned, and the results show that the regularized model is effective in suppressing overfitting, and its mean square error (0.281) is reduced by 71.4% compared with the SHAP benchmark model. By constructing a dual assessment system, both the prediction accuracy (R²>0.95) and the quantitative resolution of feature contribution are ensured. In terms of feature importance analysis, the SHAP value calculation reveals the nonlinear relationship of each dimension, and the probability prediction module realizes the interval estimation of the probability of event occurrence through logistic regression integration. The framework exhibits strong robustness on the validation set. The hybrid modeling approach proposed in this study provides a new technical path for the time-series prediction problem, and its modular design can be extended to other prediction scenarios that require a balance between accuracy and interpretability.

Downloads

Download data is not yet available.

References

[1] Yuan, J. A preliminary study of Olympic gold medal prediction model in the era of big data: Taking the results of the Athletics World Championships as an example [J]. Sports Science and Technology Literature Bulletin, 2021, 29 (06): 132 - 134.

[2] Schlembach, C., Schmidt, S. L., & Schreyer, D. et al. Forecasting the Olympic medal distribution: A socioeconomic machine learning model [J]. Technological Forecasting and Social Change, 2022, 175: 121314.

[3] Liao, L., Zhao, Z., Li, Z., et al. Logistic regression and SHAP analysis for modeling and validation of femoral head necrosis after internal fixation of femoral neck fracture [J/OL]. Chinese Tissue Engineering Research, 2025 - 03 - 03.

[4] Xue, Y., & Yang, W. Influencing factors and suggestions for home field advantage in the Winter Olympics [J]. Chinese Sports Coach, 2022, 30 (01): 28 - 30+63. DOI: 10.16784/j.cnki.csc.2022.01.004.

[5] Yang, Y., & Zhu, F. Research on logistics demand forecasting in Guangxi based on ridge regression model [J]. Logistics Technology, 1 – 9.

[6] Tian, H., He, Y., Wang, M., et al. Medal prediction and participation strategy of Chinese athletes in the 2022 Beijing Winter Olympics: Analysis based on the effect of home field advantage in the Olympic Games [J]. Sports Science, 2021, 41 (02): 313 + 22.

[7] Feng, D. An analytical study of home field advantage in the 2002-2018 Winter Olympics [C] // Chinese Society of Sports Science. Compilation of Abstracts of Papers from the 11th National Sports Science Conference. Beijing Sport University; 2019: 3.

[8] Zhang, W.-F. Statistical characterization of Spearman's parsimonious correlation coefficient and Gini gamma correlation coefficient [D]. Guangdong University of Technology, 2020.

[9] Wu, J., Jing, B., Jing, J., et al. Study on MRI denoising based on nonlocal mean and linear least mean square error estimation [J]. China Medical Devices, 2025, 40 (02): 35 - 39+66.

[10] Yan, S., & Zhao, Y. Intercontinental distribution and trend prediction of track and field medals in the 27th to 31st Olympic Games [J]. Journal of Xichang College (Natural Science Edition), 2019, 33 (03): 68 - 74.

Downloads

Published

26-08-2025

How to Cite

Xiao, Z., Ma, Z., Wang, J., & Liu, W. (2025). Research on gold medal prediction model based on multi-algorithm fusion. Highlights in Science, Engineering and Technology, 152, 133-145. https://doi.org/10.54097/qcs3ft45