A Medal Prediction Model Based on XGBoost-Logistic Regression and Quantification of the “Great Coach” Effect

Authors

  • Yisheng Gao
  • Yixiao Mou
  • Shengdi Xu

DOI:

https://doi.org/10.54097/t2cfqx21

Keywords:

XGBoost, Medal Prediction, Ridge Regression, CUSUM, “great coach”.

Abstract

The paper selected sports that have consistently appeared in the Olympic Games since 2000, created a feature system, considering factors such as historical data, the elite athletes, gender ratios, and hosting country in the games. By comparing the performance of different models in this input feature the XGBoost performed exceptionally well in forecasting with its highest R² value which indicates that it is good at explaining the fluctuations in the data and the lowest MSE value which reflect that it has less variance in its predicted values. Furthermore, paper aimed to quantify the contribution of "great coach" on the number of medals won. Using the coaching information, the paper reconstructe the feature system and applied Ridge Regression to assess the impact of coaches. The effect of Coach Lang Ping on the Chinese and American women’s volleyball teams was quantified as 6.432 and 6.913, respectively. Similarly, Coach Béla Károlyi contribution to Romanian and American gymnastics medals was 4.6123 and 4.3333, respectively. Then employed the CUSUM method to identify key breakpoints in the medal sequence and identified 3 countries (Italy and United States) that could benefit from employing “great coaches.” The results highlighted the host country and elite athletes as particularly influential factors, offering valuable insights for national Olympic committees in their decision-making processes.

Downloads

Download data is not yet available.

References

[1] Scelles N, Andreff W, Bonnal L, Andreff M, & Favard Forecasting national medal totals at the Summer Olympic Games reconsidered [J]. Social Science Quarterly, (2020). 101 (2). 697 – 711.

[2] SHI Huimin, ZHANG Dongying, & ZHANG Yonghui. Can Olympic Medals Be Predicted?Based on the Interpretable Machine Learning Perspective [J]. Journal of Shanghai University of Sport, (2024). (04) 26 - 36.

[3] KUANG Hong-da, XU Li-ping, & Ll Lin-ying. China Sport Science and Technology [J]. China Sport Science and Technology, (2018). (01) 57 - 63+128.

[4] SONG Zhigang, JAO Fangqian & WU Xiangwei. Study on the Characteristics of China's Awards in the Tokyo Olympics andthe lmplications for the Preparation of the New Olympic Cycle[J]. Journal of Beijing Sport University (2024). (07), 146 - 156.

[5] TIAN Hui, HE Yiman, WANG Min, Ll Juan, YU Peiyang, Ql Shunhong, & TlAN Ye. Medal Forecasts and Competition Strategies for Chinese Athletes in theBeijing 2022 Winter Olympic Games-Based on Olympic Home Advantage [J]. China Sport Science. (2021). (02). 3 - 13+22.

[6] SHUI Yingyi, ZHANG Qi, LI Gen, ZHANG Shihao & Wu Shang. A Review of Research on Social Network Influence Prediction Based on Multi-Class Features [J]. Frontiers of Data & Computing (2025) (01), 2 - 18.

[7] Zhang Y, & Chen L.A Study on Forecasting the Default Risk of Bond Based on XGboost Algorithm and Over-Sampling Method [J]. Theoretical Economics Letters, (2021).11, 258 - 267.

[8] Waberi A.D, Mwangi R.W. and Rimiru R.M. Advancing Type II Diabetes Predictions with a Hybrid LSTM-XGBoost Approach [J]. Journal of Data Analysis and Information Processing, (2024)12, 163 - 188.

[9] Khalaf, G Improving the Ordinary Least Squares Estimator by Ridge Regression [J]. Open Access Library Journal, (2022).9: e8738.

[10] Degang Li. Baidu Index Change Point Analysis Based on CUSUM Method [J]. Statistics and Applications, 2024, 13 (2), 521 - 527.

Downloads

Published

26-08-2025

How to Cite

Gao, Y., Mou, Y., & Xu, S. (2025). A Medal Prediction Model Based on XGBoost-Logistic Regression and Quantification of the “Great Coach” Effect. Highlights in Science, Engineering and Technology, 152, 111-121. https://doi.org/10.54097/t2cfqx21