0% found this document useful (0 votes)
36 views3 pages

Lung Cancer Detection Using Ensemble Techniques

This paper implements a system for enhancing the detection of lung cancer through an ensemble approach, which amalgamates the predictive outputs generated by three distinct convolutional neural networks (CNNs): ResNet50, EfficientNet, and InceptionNet.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
36 views3 pages

Lung Cancer Detection Using Ensemble Techniques

This paper implements a system for enhancing the detection of lung cancer through an ensemble approach, which amalgamates the predictive outputs generated by three distinct convolutional neural networks (CNNs): ResNet50, EfficientNet, and InceptionNet.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 3

Volume 9, Issue 4, April – 2024 International Journal of Innovative Science and Research Technology

ISSN No:-2456-2165 https://doi.org/10.38124/ijisrt/IJISRT24APR1516

Lung Cancer Detection using Ensemble Techniques


1
Piyush Choudhari; 2Yash Soniminde; 3Anubhav Sharma; 4Prisha Shah; 5Amish Faye; 6Nita J. Mahale
Department of Artificial Intelligence And Data Science, DY Patil College Of Engineering, Akurdi, Pune, Maharashtra, India

Abstract:- This paper implements a system for enhancing of medical imaging data of 1400+ images. Since the
the detection of lung cancer through an ensemble dataset is large, it was necessary to properly group them,
approach, which amalgamates the predictive outputs which is why it was systematically divided into three
generated by three distinct convolutional neural networks distinctive directories: train, test, and validation.
(CNNs): ResNet50, EfficientNet, and InceptionNet. Particularly, the train directory included 70% of the
Leveraging the diverse architectural features and images, whereas the test and validation directories
learning capabilities of these CNNs, the ensemble method contained 20% and 10%, accordingly, for the sake of later
aims to synergistically fuse their individual predictions to robust model evaluation.
achieve heightened accuracy and robustness in  After the data collection was complete, a detailed data
identifying potential lung cancer manifestations. cleaning phase was performed. Thus, every image was
classified into one of four classes: normal,
Keywords:- Lung Cancer Detection; CNN; Ensemble adenocarcinoma, large cell carcinoma, and squamous cell
Techniques; Resnet50; VGG16; Inceptionnet. carcinoma. Such type of categorization was essential for
the following stages of training and validation. Moreover,
I. INTRODUCTION each inappropriate or unusable image was found and
eliminated from the dataset to ensure its quality and
This paper introduces a methodology to enhance lung purity.
cancer detection by integrating predictions from ResNet50,  With a clean and well-organized dataset in place, the next
EfficientNet, and InceptionNet convolutional neural step was data preprocessing. This phase involved a series
networks. Leveraging the architectural features of these of essential transformations to prepare the images for
models, the ensemble approach averages their outputs, model training. Initially, the images were relabeled and
aiming for heightened accuracy and robustness in identifying indexed according to their respective classes to facilitate
potential lung cancer manifestations. Through evaluation, efficient data handling. Subsequently, they underwent
this study demonstrates the accuracy of the proposed rescaling and resizing to a standardized dimension of 224
ensemble method to be 90.2%, offering a promising avenue by 224 pixels, ensuring uniformity across the dataset. To
for advancing clinical diagnosis and patient outcomes in further enhance model generalization and robustness,
health management. A system has been proposed to various data augmentation techniques were applied,
streamline the operational efficiency of organizations, including horizontal flipping, contrast adjustment, and
researchers, and medical professionals by implementing grayscale conversion.
automated processes. This system entails the development of  Following data preprocessing, the focus shifted to model
application programming interfaces (APIs) to facilitate loading and initialization. Pre-trained convolutional
seamless interaction with the model and databases. Its core neural networks (CNNs) such as ResNet50, EfficientNet,
functionality involves the classification of CT-Scans in large- and InceptionNet were selected for their well-established
scale batches, followed by the systematic storage of the architectures and superior performance in image
processed data within the database infrastructure. classification tasks. These models were loaded along with
their pre-trained weights, allowing them to leverage the
 Goal and Objectives knowledge gained from extensive training on large-scale
image datasets.
 Implementing a model to classify CT-Scan images of  The subsequent training phase involved feeding the
lungs as cancerous or non cancerous. preprocessed images into the input layers of the CNNs.
 Delivering the model to the end user in a cost effective The models were trained with non-trainable weights,
and quick way. allowing them to learn and extract meaningful features
 Diagnosing patients and detecting early signs of lung from the input data over multiple epochs. A total of 15
cancer to encourage early intervention. epochs were chosen to balance between model
convergence and computational efficiency.
II. MODEL TRAINING  Upon completion of individual model training, an
ensemble method was employed to combine the
 The data collection process started with grouping the CT- predictive outputs of the three CNNs. This ensemble
scan images of lungs, which formed the most critical part model leveraged the collective intelligence and diverse
in the training of the following models. The source of the perspectives of multiple CNN architectures, leading to
images, the well-established Kaggle dataset, the Chest improved prediction accuracy and robustness.
CT-Scan Images Dataset, represents a thorough collection

IJISRT24APR1516 www.ijisrt.com 3322


Volume 9, Issue 4, April – 2024 International Journal of Innovative Science and Research Technology
ISSN No:-2456-2165 https://doi.org/10.38124/ijisrt/IJISRT24APR1516

 Model evaluation was conducted using test data.  Model was saved to disk. APIs were created to interact
Precision, recall, F1 score were calculated and rate of type with the model and the database.
1 errors and type 2 errors were noted.

Fig 1: Model

III. RESULTS

Fig 2: Predicted Lables

 Overall Accuracy = 90.2%


 Overall Sensitivity = 89.5%

IJISRT24APR1516 www.ijisrt.com 3323


Volume 9, Issue 4, April – 2024 International Journal of Innovative Science and Research Technology
ISSN No:-2456-2165 https://doi.org/10.38124/ijisrt/IJISRT24APR1516

IV. MODEL TRAINING WORKFLOW REFERENCES

 APIs [1]. Nageswaran S, Arunkumar G, Bisht AK, Mewada S,


Kumar JNVRS, Jawarneh M, Asenso E. Lung Cancer
 Using the implemented APIs, we have also created a way Classification and Prediction Using Machine Learning
for data to be exchanged seamlessly between multiple and Image Processing. Biomed Res Int. 2022 Aug
processes, using this to effectively interface with both the 22;2022:1755460. doi: 10.1155/2022/1755460.
model and the database. Retraction in: Biomed Res Int. 2024 Jan
 As an outcome of this interphase, we have also set up an 9;2024:9851527. PMID: 36046454; PMCID:
endpoint at which a user can send his or her CT-scan PMC9424001.
images in binary format. After submission, the system [2]. B. S, P. R and A. B, "Lung Cancer Detection using
triggers the classification and sends back the results in a Machine Learning," 2022 International Conference on
well-structured JSON format. These include all the Applied Artificial Intelligence and Computing
classification outcomes and other related details of the (ICAAIC), Salem, India, 2022, pp. 539-543, doi:
respective scans and the patients offering them for 10.1109/ICAAIC53929.2022.9793061.
classification. After that stage, the classified data is stored [3]. https://my.clevelandclinic.org/health/diseases/4375-
systematically in the database, complete with its names lung-cancer
and other related details . [4]. https://www.cancer.org/cancer/types/lung-
 APIs are built with interoperability in mind, facilitating cancer/about/what-is.html
seamless integration with existing software systems and [5]. https://www.cdc.gov/cancer/lung/basic_info/index.ht
workflows commonly utilized in medical and research ml
settings.
 Users receive prompt feedback on their submitted CT-
scans, enabling them to make informed decisions rapidly
based on the classification outcomes.

V. CONCLUSION

In conclusion, this paper presents a methodology for


enhancing lung cancer detection through the integration of
predictions from ResNet50, EfficientNet, and InceptionNet
convolutional neural networks. By leveraging the
architectural features of these models and employing an
ensemble approach to average their outputs, we were able to
achieve good accuracy and stability in identifying potential
lung cancer diagnosis.

Furthermore, a proposed system is outlined to


streamline operational efficiency for organizations,
researchers, and medical professionals. This system is
designed to automate processes through the development of
application programming interfaces (APIs), enabling
seamless interaction with the model and databases. Its core
functionality revolves around the classification of CT-Scans
in large-scale batches, followed by the systematic storage of
processed data within the database infrastructure. By
integrating efficient workflow management, this system
presents a significant step forward in optimizing healthcare
processes and facilitating timely and accurate diagnosis.

IJISRT24APR1516 www.ijisrt.com 3324

You might also like