Paper Title

ENHANCED LUNG CANCER PREDICTION USING INTEGRATIVE RANDOM FOREST AND XGBOOST APPROACHES

Authors

S.AGNES JOSHY , N.SOUNTHARIYAA

Keywords

Early Prediction, Lung Cancer, Random Forest ,XG Boost, Machine Learning Models.

Abstract

Lung cancer remains one of the leading causes of mortality worldwide, making early and accurate detection critical for improving patient outcomes. This study focuses on developing an enhanced lung cancer prediction model using machine learning techniques. We evaluated 10 machine learning algorithms, including Logistic Regression, Decision Tree, K-Nearest Neighbour, Naive Bayes, Support Vector Classifier, Random Forest, XG Boost, Multi-layer Perceptron, and Gradient Boosting, implemented through the Scikit-learn library in Python. To ensure a robust evaluation, stratified K-Fold cross-validation was employed for all models. Following the evaluation, Random Forest and XG Boost emerged as the top-performing models based on their high accuracy, precision, and F1-scores. These models were optimized using grid search hyperparameter tuning, and a voting classifier was constructed to combine their predictions, significantly improving the overall prediction performance. The final model achieved an accuracy of 98.33%, outperforming individual models. Feature engineering played a critical role, with highly correlated variables such as ANXIETY and YELLOW_FINGERS being combined into a new feature to enhance prediction accuracy. The model was tested with new patient data, demonstrating its practical utility in predicting lung cancer risk based on clinical attributes. These results underscore the potential of integrating machine learning models, particularly Random Forest and XG Boost, to create more reliable and accurate lung cancer prediction tools. Future enhancements could involve incorporating medical image processing techniques, such as analyzing CT scans, to further improve the model's ability to detect early signs of lung cancer and broaden its applicability in clinical settings.

How To Cite

"ENHANCED LUNG CANCER PREDICTION USING INTEGRATIVE RANDOM FOREST AND XGBOOST APPROACHES", IJNRD - INTERNATIONAL JOURNAL OF NOVEL RESEARCH AND DEVELOPMENT (www.IJNRD.org), ISSN:2456-4184, Vol.9, Issue 10, page no.c337-c349, October-2024, Available :https://ijnrd.org/papers/IJNRD2410245.pdf

Issue

Volume 9 Issue 10, October-2024

Pages : c337-c349

Other Publication Details

Paper Reg. ID: IJNRD_301476

Published Paper Id: IJNRD2410245

Downloads: 00020

Research Area: Science and Technology

Country: TIRUNELVELI, TAMILNADU, India

Published Paper PDF: https://ijnrd.org/papers/IJNRD2410245

Published Paper URL: https://ijnrd.org/viewpaperforall?paper=IJNRD2410245

About Publisher

ISSN: 2456-4184 | IMPACT FACTOR: 8.76 Calculated By Google Scholar | ESTD YEAR: 2016

An International Scholarly Open Access Journal, Peer-Reviewed, Refereed Journal Impact Factor 8.76 Calculate by Google Scholar and Semantic Scholar | AI-Powered Research Tool, Multidisciplinary, Monthly, Multilanguage Journal Indexing in All Major Database & Metadata, Citation Generator

Publisher: IJNRD (IJ Publication) Janvi Wave

Article Preview

academia
publon
sematicscholar
googlescholar
scholar9
maceadmic
Microsoft_Academic_Search_Logo
elsevier
researchgate
ssrn
mendeley
Zenodo
orcid
sitecreex