Research of Influence Factors that Possibly Lead to Cardiovascular Disease using Machine Learning

Authors

  • Huailang Peng

DOI:

https://doi.org/10.54097/q5245c60

Keywords:

Cardiovascular disease; logistic regression; naive Bayes; random forest; influencing factors.

Abstract

While previous studies have studied and demonstrated that the incidence of cardiovascular disease (CVD) is related to various factors such as hypertension, high cholesterol, smoking, diabetes, obesity, lifestyle, pregnancy and so on, there still exist many unidentified factors that are valuable to be researched on. This research tries to apply three classical machine learning algorithms to deal with the data from the Kaggle website. The dataset was compiled by Alphiree from the online Cardiovascular Diseases Risk Prediction Dataset. This dataset cited data from 2021 Behavioral Risk Factor Surveillance System (BRFSS). This study uses and processes the 5,523 records collected as data from the BRFSS in 2021 from World Health Organization (WHO). It is concluded that the BMI, Weight, Age Category, Height, Green Vegetables Consumption, Fruit Consumption and FriedPotato Consumption have relatively strong relationships with the development of CVD, while General Health, Checkup, Exercise, Skin Cancer, Other Cancer, Depression, Diabetes, Arthritis, Sex, Smoking History and Alcohol Consumption have relatively weak relationships with having a CVD. This result provides some new perspectives to study the pathogenesis and treatment of CVD and point to the way for further research afterward.

Downloads

Download data is not yet available.

References

[1] Hu Liangyuan, Bian Liu, Yan Li. Ranking sociodemographic, health behavior, prevention, and environmental factors in predicting neighborhood cardiovascular health: a Bayesian machine learning approach. Preventive medicine, 2020, 141: 106240.

[2] Fuster-Parra Pilar, et al. Bayesian network modeling: A case study of an epidemiologic system analysis of cardiovascular risk. Computer methods and programs in biomedicine, 2016, 126: 128-142.

[3] Dunn, Andrea L, et al. Reduction in cardiovascular disease risk factors: 6-month results from ProjectActive. Preventive medicine, 1997, 26: 883-892.

[4] Benschop Laura, Johannes J. Duvekot, et al. Future risk of cardiovascular disease risk factors and events in women after a hypertensive disorder of pregnancy. Heart, 2019, 105: 1273-1278.

[5] Peltola Tomi, et al. Hierarchical Bayesian Survival Analysis and Projective Covariate Selection in Cardiovascular Event Risk Prediction. BMA, 2014, 79-88.

[6] Chekouo Thierry, Sandra E. Safo. Bayesian integrative analysis and prediction with application to atherosclerosis cardiovascular disease. Biostatistics, 2023, 24: 124-139.

[7] Elsayad Alaa M, Mahmoud Fakhr. Diagnosis of cardiovascular diseases with bayesian classifiers. J. Comput. Sci., 2015, 11: 274-282.

[8] Miranda Eka, et al. Detection of cardiovascular disease risk's level for adults using naive Bayes classifier. Healthcare informatics research, 2016, 22: 196-205.

[9] Ambrish G, et al. Logistic regression technique for prediction of cardiovascular disease. Global Transitions Proceedings, 2022, 3: 127-130.

[10] Yang Li, et al. Study of cardiovascular disease prediction model based on random forest in eastern China. Scientific reports, 2020, 10: 5245.

Downloads

Published

25-02-2025

How to Cite

Peng, H. (2025). Research of Influence Factors that Possibly Lead to Cardiovascular Disease using Machine Learning. Highlights in Science, Engineering and Technology, 128, 198-203. https://doi.org/10.54097/q5245c60