LSTM Ensemble Learning Model with Additive Gaussian Process Priors
DOI:
https://doi.org/10.54097/wfvzd034Keywords:
LSTM neural network, Gaussian process, Ensemble learning.Abstract
Deep learning technology is one of the key research directions in the field of machine learning, especially when dealing with high-dimensional and non-linear data prediction tasks, but it can not avoid the problem of overfitting and underfitting of prediction models. Taking LSTM neural network as an example, this paper proposes an ensemble learning model based on additive Gaussian process priors. On the one hand, the proposed method uses bootstrap technology to realize the randomization of neural network model, so that the network can capture effective information of predictor variables from multiple perspectives. On the other hand, this method takes the set of neural network models after randomization as a new input variable, and uses Gaussian process additive model to integrate the results of different models. By designing orthogonal additive kernel, the marginal effect and interaction effect of LSTM neural network are measured. In addition, the proposed method can quantitatively estimate the uncertainty of the forecast results. Simulation experiment and actual data analysis show that the proposed method is more competitive than some classical regression models.
Downloads
References
[1] Fan, J., & Li, R. (2020). Statistical Learning with High-Dimensional Data. Annual Review of Statistics and Its Application, 7, 149-176.
[2] Fan, J., Li, Q., & Wang, Y. (2020). Estimation of High Dimensional Mean Regression in the Absence of Symmetry and Light Tail Assumptions. Journal of the Royal Statistical Society: Series B, 82(2), 321-354.
[3] Gu, S., Kelly, B., & Xiu, D. (2020). Empirical asset pricing via machine learning. The Review of Financial Studies, 33(5), 2223-2273.
[4] Lim, B., & Zohren, S. (2021). Time-series forecasting with deep learning: a survey. Philosophical Transactions of the Royal Society A, 379(2194), 20200209.
[5] Fischer, T., & Krauss, C. (2018). Deep learning with long short-term memory networks for financial market predictions. European Journal of Operational Research, 270(2), 654-669.
[6] Zhang, A., Lipton, Z. C., Li, M., & Smola, A. J. (2021). Dive into deep learning. arXiv preprint arXiv:2106.11342.
[7] Bianchi, D., Büchner, M., & Tamoni, A. (2021). Bond risk premiums with machine learning. The Review of Financial Studies, 34(2), 1046-1089.
[8] Molnar, C., König, G., Herbinger, J., Freiesleben, T., Dandl, S., Scholbeck, C. A., ... & Bischl, B. (2022). General pitfalls of model-agnostic interpretation methods for machine learning models. Nature Machine Intelligence, 4(6), 541-547.
[9] Gu, S., Kelly, B., & Xiu, D. (2021). Autoencoder asset pricing models. Journal of Econometrics, 222(1), 429-450.
[10] Gu, S., Kelly, B., & Xiu, D. (2023). "Machine Learning in Asset Pricing: A Methodological Survey." The Review of Financial Studies, 36(11), 4640-4694.
[11] Bu, F., Chen, W., & Qian, P. (2022). Deep learning in asset pricing: overfitting and risk decomposition. International Journal of Financial Engineering, 9(2), 2250011.
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Highlights in Science, Engineering and Technology

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.







