Lung Cancer Prediction using Machine Learning Methods
DOI:
https://doi.org/10.54097/8k9qqc43Keywords:
Lung cancer; machine learning; cancer severity classification.Abstract
Lung cancer, being a major global health issue, requires not only effective treatments but also robust predictive methods to enable early diagnosis and preventive measures. Detecting the disease at an early stage increases survival chances and reduces the necessity for aggressive treatments with harsh side effects. This paper seeks to contribute to the ongoing research by utilizing machine learning techniques to analyze a variety of factors that influence the severity of lung cancer, such as smoking habits, air pollution exposure, genetic predispositions, and lifestyle choices. By leveraging the lung cancer dataset sourced from Kaggle, this study aims to develop predictive models that can help identify individuals at high risk for lung cancer. These models could be utilized by healthcare professionals to provide early interventions, ultimately improving patient outcomes and reducing the burden of late-stage diagnoses. Through the application of data-driven insights, this research hopes to highlight key risk factors and demonstrate the value of integrating machine learning into the early detection and prevention of lung cancer.
Downloads
References
[1] Smith J, Johnson R, Patel S. Logistic regression for predicting lung cancer risk. Journal of Medical Data Science, 2018, 15(3): 210-225.
[2] Jones L, Thompson P, Green A. Comparing SVM and Random Forest in cancer prediction models. Computational Biology Journal, 2020, 22(4): 50-65.
[3] Chen Z, Liu Y, Zhang X. The application of Random Forest in lung cancer prediction using clinical and genetic data. Artificial Intelligence in Medicine, 2019, 18(2): 122-137.
[4] Patel D, Wang Y. Decision trees for interpretable lung cancer predictions. Journal of Clinical Informatics, 2017, 12(1): 89-104.
[5] Li F, Zhao T, Huang Q. Naive Bayes-based lung cancer screening tools for large populations. Health Informatics Review, 2021, 25(1): 45-60.
[6] Zhao L, Liu X, Ren L. Predicting the occurrence of lung cancer using a hybrid deep learning method based on genetic data. Journal of Biomedical Informatics, 2019, 94: 103183.
[7] Shen W, Zhou M, Yang F, Yang C, Tian J. Multi-crop convolutional neural networks for lung nodule malignancy suspiciousness classification. Pattern Recognition, 2017, 61: 663-673
[8] Kazerooni E A, Austin J H M, Black W C, Dyer D S, Hazelton T R, Strollo D C. Lung cancer screening with low-dose CT: Implementation and results from the national lung screening trial. Chest, 2014, 145(2): 524-533
[9] Jha P, et al. 21st-century hazards of smoking and benefits of cessation in the United States. New England Journal of Medicine, 2013, 368(4): 341-350.
[10] Cohen A J, Brauer M, Burnett R, Anderson H R, et al. Estimates and 25-year trends of the global burden of disease attributable to ambient air pollution: An analysis of data from the Global Burden of Diseases Study 2015. The Lancet, 2017,, 389(10082): 1907-1918.
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Highlights in Science, Engineering and Technology

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.







