Research of Prediction Diabetes Risk Using Logistic Regression Models

Authors

  • Gujie Li

DOI:

https://doi.org/10.54097/3bqp5w39

Keywords:

Diagnosing diabetes, HbA1c, BMI, Diabetes prediction, Logistic Regression.

Abstract

Diabetes has become a major global health concern, with its prevalence steadily rising. Early prediction and prevention are crucial to reducing the disease burden. This study explored the relationship between HbA1c level, age, smoking history, BMI, and diabetes risk by analyzing electronic health records (EHR) of more than 10,000 individuals and logistic regression models. By establishing a prediction model, the impact of these variables on the likelihood of an individual developing diabetes was evaluated. The results showed that the overall accuracy of the logistic regression model reached 89%, and the ROC-AUC score was as high as 0.9624, showing excellent discrimination between diabetic and non-diabetic cases. Among them, HbA1c level (coefficient 2.49), blood glucose concentration (coefficient 1.3), and age (coefficient 1.16) were confirmed to be key predictors for diabetes diagnosis, especially HbA1c level (coefficient 2.49) was the most influential factor. The study also discussed the potential limitations of the model performance and future improvement directions.

Downloads

Download data is not yet available.

References

[1] Davidson M. B., Schriger D. L. Effect of age and race/ethnicity on HbA1c levels in people without known diabetes mellitus: Implications for the diagnosis of diabetes. Diabetes Research and Clinical Practice, 2010, 87(3): 415-421.

[2] Eliasson B. Cigarette smoking and diabetes. Progress in Cardiovascular Diseases, 2003, 45(5): 405-413.

[3] Narayan K. V., Boyle J. P., Thompson T. J., Gregg E. W., Williamson D. F. Effect of BMI on lifetime risk for diabetes in the US. Diabetes Care, 2007, 30(6): 1562-1566.

[4] Gregg E. W., Cadwell B. L., Cheng Y. J., Cowie C. C., Williams D. E., Geiss L., et al. Trends in the prevalence and ratio of diagnosed to undiagnosed diabetes according to obesity levels in the U.S. Diabetes Care, 2004, 27: 2806-2812.

Downloads

Published

23-05-2025

How to Cite

Li, G. (2025). Research of Prediction Diabetes Risk Using Logistic Regression Models. Highlights in Science, Engineering and Technology, 140, 38-41. https://doi.org/10.54097/3bqp5w39