High-Risk Population Identification Based on Machine Learning
DOI:
https://doi.org/10.54097/1c76hk30Keywords:
Digital Health, High-Risk Populations Identification, SVM, Random Forest, XGBoost, SMOTE.Abstract
This research centers on identifying high-risk populations with poor digital health capabilities among older adults in Beijing in the post-pandemic context. The goal is to identify high-risk populations precisely by comparing various machine learning algorithms, offering support for public health policymaking.The study adopts the social ecosystem theory to develop an indicator framework, collects data through a mix of online and offline approaches, and performs empirical analysis using SVM, Random Forest, and XGBOOST classification algorithms with L-SMOTE interpolation resampling to compare their effectiveness in identifying high-risk populations. Results indicate that the SMOTE-resampled SVM algorithm outperforms in accuracy, recall, and F1 score, while the L-SMOTE-resampled XGBOOST algorithm also demonstrates solid performance. Thus, when considering recall, precision, and F1 score comprehensively, the SMOTE-resampled SVM algorithm provides the best prediction results for high-risk populations.
Downloads
References
[1] Qin Chaohui, Huang Wenhao. "Digital Health Technologies in Age-Friendly Services: Current Supply and Development Strategies" [J]. Soft Science of Health, 2023, 37(10): 22-27.
[2] Peng Yanxia, Gao Yunfei, Yong Jingjing, et al. "An Analysis of Digital Health Technology Anxiety and Nursing Strategies for Community Elderly" [J]. Chinese Journal of Nursing, 2023, 58(11): 1345-1351.
[3] Huang Xinhui. "The Use of Electronic Health Records to Bridge the Digital Divide in Elderly Healthcare" [J]. Information Systems Engineering, 2023, (02): 10-13.
[4] Si Ge. "Internet Use and Its Impact on the Lives of Older Adults: Coping Strategies" [J]. Gerontology Research, 2021, 9(09): 69-78.
[5] Sun Qihu, Guan Yutong. "Research on the Digital Divide in Older Adults During the COVID-19 Pandemic: A Case Study of 'Health Code' Usage Challenges" [J]. Journal of Anhui University of Science and Technology (Social Science Edition), 2022, 24(02): 81-86.
[6] Wang Xueqing, Liu Shuang, Li Qiuyan, et al. "Classification of Tunnel Rock Mass with SVM Based on K-Fold Cross-Validation" [J]. Mining and Metallurgical Engineering, 2021, 41(06): 126-128+133.
[7] Yang Heya, Xing Wenshuo, Chen Cong, et al. "Open-Circuit Fault Detection in Modular Multilevel Converters Using a Random Forest Binary Classifier" [J]. Proceedings of the CSEE, 2023, 43(10): 3916-3928.
[8] Wang Qi, Xiong Shalina, Zhan Rou, et al. "XGBoost-Based Financial Fraud Detection for Imbalanced Datasets" [J]. Computer Era, 2023, (12): 59-63.
[9] Song Wanpeng, Yang Ming, Jin Yuanyuan. "Analysis of Prediction Models Using SMOTE Sampling and KNN Algorithm" [J]. Application of Integrated Circuits, 2023, 40(04): 76-78.
[10] Chen Hailong, Yang Chang, Du Mei, et al. "Credit Risk Prediction Using LightGBM Enhanced with Boundary-Adaptive SMOTE and FocalLoss Function" [J]. Computer Applications, 2022, 42(07): 2256-2264.
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Highlights in Science, Engineering and Technology

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.







