LSTM Ensemble Learning Model with Additive Gaussian Process Priors

Zhuo Sun; Yishu Wang; Hanlin Yin

doi:10.54097/wfvzd034

Keywords:

LSTM neural network, Gaussian process, Ensemble learning.

Abstract

Deep learning is a key research direction in machine learning, particularly for high-dimensional, non-linear prediction tasks, but deep prediction models remain prone to overfitting and underfitting. Taking the LSTM neural network as an example, this paper proposes an ensemble learning model based on additive Gaussian process priors. First, the proposed method uses the bootstrap to randomize the neural network model, so that the networks capture effective information about the predictor variables from multiple perspectives. Second, the method takes the set of randomized neural network models as new input variables and uses a Gaussian process additive model to integrate the results of the different models. An orthogonal additive kernel is designed to measure the marginal and interaction effects of the LSTM neural networks. In addition, the proposed method quantitatively estimates the uncertainty of its forecasts. Simulation experiments and real-data analysis show that the proposed method compares favorably with some classical regression models.
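The two-stage idea in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation and rests on several assumptions: a ridge regressor stands in for each LSTM to keep the example short, a first-order sum of one-dimensional RBF kernels stands in for the orthogonal additive kernel (so interaction effects are not modeled), the Gaussian process has a zero prior mean with fixed hyperparameters, and the data are synthetic. All function and parameter names (`fit_bootstrap_ensemble`, `additive_kernel`, `gp_predict`, `noise_var`, `lengthscale`) are hypothetical.

```python
import numpy as np

def rbf_1d(a, b, lengthscale=1.0):
    """Squared-exponential kernel between two 1-D arrays."""
    d = a[:, None] - b[None, :]
    return np.exp(-0.5 * (d / lengthscale) ** 2)

def additive_kernel(Z1, Z2, lengthscale=1.0):
    """First-order additive kernel: one 1-D RBF kernel per base model, summed.
    Assumption: a simplification, not the paper's orthogonal additive kernel."""
    return sum(rbf_1d(Z1[:, j], Z2[:, j], lengthscale) for j in range(Z1.shape[1]))

def fit_bootstrap_ensemble(X, y, n_models, rng, ridge=1e-2):
    """Fit one ridge regressor per bootstrap resample.
    Assumption: each ridge model is a stand-in for a bootstrap-trained LSTM."""
    n, p = X.shape
    weights = []
    for _ in range(n_models):
        idx = rng.integers(0, n, size=n)  # draw n rows with replacement
        Xb, yb = X[idx], y[idx]
        weights.append(np.linalg.solve(Xb.T @ Xb + ridge * np.eye(p), Xb.T @ yb))
    return np.column_stack(weights)  # shape (p, n_models)

def gp_predict(Z_train, y_train, Z_test, noise_var=0.1, lengthscale=1.0):
    """Zero-mean GP regression: posterior mean and latent variance at Z_test."""
    K = additive_kernel(Z_train, Z_train, lengthscale) + noise_var * np.eye(len(y_train))
    Ks = additive_kernel(Z_test, Z_train, lengthscale)
    Kss_diag = np.diag(additive_kernel(Z_test, Z_test, lengthscale))
    L = np.linalg.cholesky(K)
    alpha = np.linalg.solve(L.T, np.linalg.solve(L, y_train))
    v = np.linalg.solve(L, Ks.T)
    return Ks @ alpha, Kss_diag - np.sum(v ** 2, axis=0)

# Synthetic data (assumption: illustrative only, not from the paper).
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 3))
y = np.sin(X[:, 0]) + 0.5 * X[:, 1] + 0.1 * rng.normal(size=200)
X_train, y_train, X_test = X[:150], y[:150], X[150:]

# Stage 1: randomized base learners via the bootstrap.
W = fit_bootstrap_ensemble(X_train, y_train, n_models=5, rng=rng)

# Stage 2: base-model outputs become the GP's input variables.
Z_train, Z_test = X_train @ W, X_test @ W
mean, var = gp_predict(Z_train, y_train, Z_test)
```

Here `mean` is the combined forecast and `var` is the posterior variance of the latent function, which gives a simple form of the uncertainty estimate the abstract mentions. Because the kernel is a sum over base models, the posterior mean also decomposes into one additive component per model, which is what makes per-model marginal effects readable in this kind of construction.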
