Research on Tennis Match Outcome Prediction Based on Multi-Algorithm Integration and Bayesian Analysis

Authors

  • Kangqi Yu
  • Jingjing Liu
  • Xiaole Meng

DOI:

https://doi.org/10.54097/08qdyh39

Keywords:

GA-XGBoost, LightGBM, Bayesian Change Point Analysis, Machine Learning, Tennis Momentum Prediction.

Abstract

The intense competition in the men's singles final of the 2023 Wimbledon Championships highlighted the dynamic and unpredictable nature of tennis matches. Inspired by this observation, this study aims to quantify and analyze momentum shifts in tennis matches and explore their impact on match outcomes. We compare multiple machines learning algorithms, including MLP, Logistic Regression, XGBoost, and Naive Bayes, ultimately selecting the XGBoost model for its superior ability to handle nonlinear relationships and high-dimensional data. Furthermore, we optimize the XGBoost model with a Genetic Algorithm (GA) to better capture match dynamics and employ LightGBM to predict fluctuations during matches. Additionally, Bayesian Change Point Analysis is used to detect key fluctuation points. Through data preprocessing, feature extraction (covering technical, tactical, psychological, and physical factors), and model training and validation, our findings reveal that momentum plays a significant role in determining match outcomes. This study presents a novel analytical framework, offering new perspectives and methods for understanding and predicting tennis match results.

Downloads

Download data is not yet available.

References

[1] Gilovich T, Vallone R, Tversky A. The hot hand in basketball: On the misperception of random sequences [J]. Cognitive psychology, 1985, 17 (3): 295 - 314.

[2] Taylor J, Demick A. A multidimensional model of momentum in sports [J]. Journal of Applied Sport Psychology, 1994, 6 (1): 51 - 70.

[3] Del Corral J, Prieto-Rodríguez J. Are differences in ranks good predictors for Grand Slam tennis matches? [J]. International Journal of Forecasting, 2010, 26 (3): 551 - 563.

[4] Rosker J, Majcen Rosker Z. Skill level in tennis serve return is related to adaptability in visual search behavior [J]. Frontiers in Psychology, 2021, 12: 689378.

[5] Zhai W, Wang Y. Lasso Ridge based XGBoost and Deep_LSTM Help Tennis Players Perform Better [J]. arXiv preprint arXiv:2405.07030, 2024.

[6] Almarashi A M, Daniyal M, Jamal F. A novel comparative study of NNAR approach with linear stochastic time series models in predicting tennis player's performance [J]. BMC Sports Science, Medicine and Rehabilitation, 2024, 16 (1): 28.

[7] Rastogi D, Johri P, Tiwari V, et al. multi-class classification of brain tumour magnetic resonance images using multi-branch network with inception block and five-fold cross validation deep learning framework [J]. Biomedical Signal Processing and Control, 2024, 88: 105602.

[8] Joe H, Kim H G. Multi-label classification with XGBoost for metabolic pathway prediction [J]. BMC bioinformatics, 2024, 25 (1): 52.

[9] Linganathan S, Singamsetty P. Genetic algorithm to the bi-objective multiple travelling salesman problem [J]. Alexandria Engineering Journal, 2024, 90: 98 - 111.

[10] Dai H, Zhou Y, Liu H, et al. XGBoost-based prediction of on-site acceleration response spectra with multi-feature inputs from P-wave arrivals [J]. Soil Dynamics and Earthquake Engineering, 2024, 178: 108503.

[11] Yang D, Liu Z, Wang Y, et al. Adaptive multi-channel Bayesian Graph Neural Network [J]. Neurocomputing, 2024, 575: 127260.

[12] Ribeiro Jr E, Neave N, Marsili B K, et al. Prenatal androgenization (2D: 4D) predictions of tennis match‐play success in junior players: A search for physiological explanations [J]. American Journal of Human Biology, 2024, 36 (1): e23979.

[13] Borderias M, Crespo M, Martínez-Gallego R, et al. Comparison of the game structure and point ending during Grand Slam women's doubles tennis [J]. International Journal of Sports Science & Coaching, 2023, 18 (4): 1248 - 1255.

Downloads

Published

05-07-2025

How to Cite

Yu, K., Liu, J., & Meng, X. (2025). Research on Tennis Match Outcome Prediction Based on Multi-Algorithm Integration and Bayesian Analysis. Highlights in Science, Engineering and Technology, 145, 255-267. https://doi.org/10.54097/08qdyh39