Research On Regression of High-Dimensional Redundant Data Based on Elastic Net and Gaussian Process
DOI:
https://doi.org/10.54097/0gg29650Keywords:
High-dimensional data, Elastic net, Gaussian process, Regression model, Feature selection.Abstract
In modern scientific research and engineering practice, managing high-dimensional and redundant datasets has become a prevalent challenge. Such datasets often contain a substantial amount of noise and irrelevant information, which significantly complicates traditional regression modeling approaches. Consequently, the development of a regression model that can effectively tackle high-dimensional redundant data, thereby extracting valuable information from complex datasets and making precise predictions, is of paramount importance. To tackle this challenge, this paper introduces an innovative regression model. The method integrates L1 and L2 regularization techniques to diminish redundant information within high-dimensional data, thereby enhancing the model's generalization capabilities. Simultaneously, the elastic net approach balances the model's complexity with noise sensitivity when handling high-dimensional data, preventing overfitting and underfitting, and thus improving prediction accuracy. The model parameters are determined using the maximum likelihood estimation algorithm. Simulation experiments and actual data analysis demonstrate that the proposed method surpasses benchmark methods in performance, showcasing its robust competitiveness.
Downloads
References
[1] Robert T .Regression Shrinkage and Selection Via the Lasso[J].Journal of the Royal Statistical Society: Series B (Methodological),2018,58(1):267-288.
[2] Friedman J ,Hastie T ,Tibshirani R .Regularization Paths for Generalized Linear Models via Coordinate Descent[J].Journal of Statistical Software,2010,33(1):21-22.
[3] Li Lijun, Zhang Haiqing, Li Daiwei, et al. An Improved ReliefF Feature Selection Algorithm Based on Redundancy Analysis [J]. Software Engineering, 2023, 26(11):48-51.
[4] Zhang Jing, Cao Feng, Dong Yuying, et al. Feature Selection Algorithm Based on Mutual Information and Genetic Algorithm [J]. Journal of Shanxi University (Natural Science Edition), 2024, 47(01):1-8.
[5] Zheng Jia, Li Shuyou. Research on the Maximum Likelihood Estimation Method for Parameters of Nonlinear Regression Models Based on Uncertainty Theory[J]. Journal of Liaoning University of Technology (Natural Science Edition), 2023, 43(01): 24-28.
[6] Hu Guiping. A Case Study on Linearizing Nonlinear Regression Analysis Models[J]. Mathematics, Physics, and Chemistry Learning, 2024(20):9-14.
[7] Wei Dong. Non-Nuclear Nonlinear Regression Machine[D]. Xinjiang: Xinjiang University, 2023.
[8] Zhang Tengfei, Zhang Yudi, Ma Fumin. High-Dimensional Hybrid Data Feature Selection Algorithm Based on Improved Neighborhood Space[J]. Control and Decision, 2024, 39(3):929-938.
[9] Wang Zhenfei, Yuan Peiyao, Cao Zhongya, et al. Feature Selection Algorithm for High-Dimensional Imbalanced Data[J]. Journal of Mini-Micro Systems, 2024, 45(8):1839-1846.
[10] Yang Xuan. Feature Selection for High-Dimensional Time Series Data Based on Two Types of Metric Functions[D]. Shaanxi: Chang'an University, 2023.
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Highlights in Science, Engineering and Technology

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.







