Research On Regression of High-Dimensional Redundant Data Based on Elastic Net and Gaussian Process

Authors

  • Changsheng Zheng
  • Xinyang Miao

DOI:

https://doi.org/10.54097/0gg29650

Keywords:

High-dimensional data, Elastic net, Gaussian process, Regression model, Feature selection.

Abstract

In modern scientific research and engineering practice, managing high-dimensional and redundant datasets has become a prevalent challenge. Such datasets often contain a substantial amount of noise and irrelevant information, which significantly complicates traditional regression modeling approaches. Consequently, the development of a regression model that can effectively tackle high-dimensional redundant data, thereby extracting valuable information from complex datasets and making precise predictions, is of paramount importance. To tackle this challenge, this paper introduces an innovative regression model. The method integrates L1 and L2 regularization techniques to diminish redundant information within high-dimensional data, thereby enhancing the model's generalization capabilities. Simultaneously, the elastic net approach balances the model's complexity with noise sensitivity when handling high-dimensional data, preventing overfitting and underfitting, and thus improving prediction accuracy. The model parameters are determined using the maximum likelihood estimation algorithm. Simulation experiments and actual data analysis demonstrate that the proposed method surpasses benchmark methods in performance, showcasing its robust competitiveness.

Downloads

Download data is not yet available.

References

[1] Robert T .Regression Shrinkage and Selection Via the Lasso[J].Journal of the Royal Statistical Society: Series B (Methodological),2018,58(1):267-288.

[2] Friedman J ,Hastie T ,Tibshirani R .Regularization Paths for Generalized Linear Models via Coordinate Descent[J].Journal of Statistical Software,2010,33(1):21-22.

[3] Li Lijun, Zhang Haiqing, Li Daiwei, et al. An Improved ReliefF Feature Selection Algorithm Based on Redundancy Analysis [J]. Software Engineering, 2023, 26(11):48-51.

[4] Zhang Jing, Cao Feng, Dong Yuying, et al. Feature Selection Algorithm Based on Mutual Information and Genetic Algorithm [J]. Journal of Shanxi University (Natural Science Edition), 2024, 47(01):1-8.

[5] Zheng Jia, Li Shuyou. Research on the Maximum Likelihood Estimation Method for Parameters of Nonlinear Regression Models Based on Uncertainty Theory[J]. Journal of Liaoning University of Technology (Natural Science Edition), 2023, 43(01): 24-28.

[6] Hu Guiping. A Case Study on Linearizing Nonlinear Regression Analysis Models[J]. Mathematics, Physics, and Chemistry Learning, 2024(20):9-14.

[7] Wei Dong. Non-Nuclear Nonlinear Regression Machine[D]. Xinjiang: Xinjiang University, 2023.

[8] Zhang Tengfei, Zhang Yudi, Ma Fumin. High-Dimensional Hybrid Data Feature Selection Algorithm Based on Improved Neighborhood Space[J]. Control and Decision, 2024, 39(3):929-938.

[9] Wang Zhenfei, Yuan Peiyao, Cao Zhongya, et al. Feature Selection Algorithm for High-Dimensional Imbalanced Data[J]. Journal of Mini-Micro Systems, 2024, 45(8):1839-1846.

[10] Yang Xuan. Feature Selection for High-Dimensional Time Series Data Based on Two Types of Metric Functions[D]. Shaanxi: Chang'an University, 2023.

Downloads

Published

18-05-2025

How to Cite

Zheng, C., & Miao, X. (2025). Research On Regression of High-Dimensional Redundant Data Based on Elastic Net and Gaussian Process. Highlights in Science, Engineering and Technology, 142, 400-411. https://doi.org/10.54097/0gg29650