A Study on the Prediction of Olympic Medal Distribution Based on Regression Analysis and ARIMA Models

Authors

  • Yanhao Zhang
  • Tianyu Zhang
  • Daorui Wan

DOI:

https://doi.org/10.54097/1a9aax71

Keywords:

Olympic medal predictions, regression analysis, ARIMA model, random forest, confidence interval.

Abstract

This study aims to construct a predictive model to forecast the distribution of medals at the 2028 Summer Olympics in Los Angeles. Using regression analysis and ARIMA models, we addressed the issues of predicting the number of medals and analyzing the trends in the progress or regression of medals won by each country. First, we employed a linear regression model, incorporating historical medal data and the number of athletes, to predict the number of gold medals and total medals for each country, and calculated the 95% confidence interval. Second, we utilized ARIMA models to analyze the historical trends in medal counts for each country. By integrating feature fusion and machine learning classifiers, we predicted which countries might experience improvements or declines in future Olympics. Additionally, we employed a random forest model to predict the probability of countries that have never won medals securing their first medals. The research findings provide strategic support for national Olympic committees and offer new methodological references for Olympic medal predictions.

Downloads

Download data is not yet available.

References

[1] Zhang Chupei. Digital Intelligence Empowering the Olympics: Practical Experience and Future Prospects of the 2024 Paris Olympics [C]//Shaanxi Sports Science Society, Shaanxi Student Sports Association. Proceedings of the 4th Shaanxi Sports Science Conference (Abstracts) - Comprehensive Sports (Poster Session). Harbin Sport University, 2025: 17. DOI: 10.26914/c.cnkihy.2025.016666.

[2] Tang Haifeng, Zhao Lunan, Yi Chao, et al. Research on the Trend of China's Medal Performance and Event Distribution Characteristics in Past Summer Olympics [J]. Journal of Anhui Sports Science, 2018, 39 (03): 35 - 39.

[3] Wan Jinjing. A Dynamic Uncertainty Quantification Fusion Framework Based on Quantile Regression and Conformal Prediction [J]. Statistics and Management, 2025, 40 (06): 23 - 36. DOI: 10.16722/j.issn.1674 - 537x.2025.06.011.

[4] Yang Qinwei. Multiple Linear Regression Model for Predicting Performance in the 2020 Olympics [J]. Practical Electronics, 2018, (Z2): 121 - 123. DOI: 10.16589/j.cnki.cn11-3571/tn.2018.z2.058.

[5] Luo Junrong. Application of ARIMA Model in GDP Forecasting for Lanzhou City [J]. China Electronic Commerce, 2025, 31 (11): 7 - 9.

[6] Zhen Yan, Kang Jintao, Zhao Xiaoming, et al. Logging Interpretation Method for Diagenetic Facies of Tight Sandstone Based on Improved SMOTE and Random Forest Algorithm [J/OL]. Journal of Southwest Petroleum University (Natural Science Edition), 1 – 13 [2025 - 07 - 04]. http: // kns. cnki. net/ kcms/ detail/ 51. 1718.TE.20250701.1607.002.html.

[7] Zhang Yuhua. Prediction of China's Medal Count in the 31st Olympics Based on Dynamic Linear Regression Model [J]. Journal of Henan Normal University (Natural Science Edition), 2013,41 (02): 24 - 26+60. DOI: 10.16366/j.cnki.1000 - 2367.2013.02.003.

Downloads

Published

26-08-2025

How to Cite

Zhang, Y., Zhang, T., & Wan, D. (2025). A Study on the Prediction of Olympic Medal Distribution Based on Regression Analysis and ARIMA Models. Highlights in Science, Engineering and Technology, 152, 171-178. https://doi.org/10.54097/1a9aax71