Research on Bone Joint Cancer miRNA Recognition Algorithm Based on Isomap and Improved Bagging
DOI:
https://doi.org/10.54097/1facz207Keywords:
Genetic Disease Diagnosis, miRNA Recognition, Manifold Learning, Improved Bagging.Abstract
With the widespread adoption of intelligent diagnosis for genetic diseases in the medical field, miRNA sequence-based diagnostic methods have gradually become a focal point of contemporary biological research. Against this backdrop, this paper innovatively proposes a genomic recognition algorithm for bone joint cancer based on manifold dimensionality reduction and the improved Bagging framework. First, the high-dimensional miRNA sequences are assumed to lie in a low-dimensional manifold space, and feature extraction is performed using the Isomap algorithm. This step not only effectively resolves the "curse of dimensionality" caused by high-dimensional data but also successfully preserves key information for prediction. Second, considering that single models are prone to overfitting or underfitting, an ensemble learning framework is introduced for the prediction task. To prevent model redundancy, an improved Bagging algorithm based on mutual information selection is proposed. This algorithm dynamically adjusts the combination of sub-models, effectively reducing redundancy and improving prediction accuracy. Finally, empirical analysis on a bone joint cancer dataset demonstrates that the proposed method outperforms traditional baseline methods, showing greater competitiveness and superiority.
Downloads
References
[1] Meola N, Gennarino V A, Banfi S. microRNAs and genetic diseases [J]. Pathogenetics, 2009, 2: 1-14.
[2] Rani V, Sengar R S. Biogenesis and mechanisms of microRNA‐mediated gene regulation [J]. Biotechnology and bioengineering, 2022, 119(3): 685-692.
[3] Hill M, Tran N. miRNA interplay: mechanisms and consequences in cancer [J]. Disease models & mechanisms, 2021, 14(4): dmm047662.
[4] Braig Z V. Personalized medicine: From diagnostic to adaptive [J]. biomedical journal, 2022, 45(1): 132-142.
[5] Aliferis C, Simon G. Overfitting, underfitting and general model overconfidence and under-performance pitfalls and best practices in machine learning and AI [J]. Artificial intelligence and machine learning in health care and medical sciences: Best practices and pitfalls, 2024: 477-524.
[6] Reddy G T, Reddy M P K, Lakshmanna K, et al. Analysis of dimensionality reduction techniques on big data [J]. Ieee Access, 2020, 8: 54776-54788.
[7] Salam M A, Azar A T, Elgendy M S, et al. The effect of different dimensionality reduction techniques on machine learning overfitting problem [J]. Int. J. Adv. Comput. Sci. Appl, 2021, 12(4): 641-655.
[8] Taguchi Y H. Unsupervised feature extraction applied to bioinformatics: A PCA based and TD based approach [M]. Springer Nature, 2024.
[9] Yousaf M, Shakoor Khan M S, Ullah S. An Extended-Isomap for high-dimensional data accuracy and efficiency: a comprehensive survey [J]. Multimedia Tools and Applications, 2024: 1-52.
[10] Mienye I D, Sun Y. A survey of ensemble learning: Concepts, algorithms, applications, and prospects [J]. IEEE Access, 2022, 10: 99129-99149.
[11] Mohammadi S, Narimani Z, Ashouri M, et al. Ensemble learning from ensemble docking: Revisiting the optimum ensemble size problem [J]. Scientific reports, 2022, 12(1): 410.
[12] Cao Y, Geddes T A, Yang J Y H, et al. Ensemble deep learning in bioinformatics [J]. Nature Machine Intelligence, 2020, 2(9): 500-508.
[13] Hernández-Roig H A, Aguilera-Morillo M C, Lillo R E. Functional modeling of high-dimensional data: a manifold learning approach [J]. Mathematics, 2021, 9(4): 406.
[14] Gao L, Wu W. Relevance assignation feature selection method based on mutual information for machine learning [J]. Knowledge-Based Systems, 2020, 209: 106439.
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Highlights in Science, Engineering and Technology

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.







