0% found this document useful (0 votes)

8 views

Aihc Report

Uploaded by

Hasan Patel

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

8 views

Aihc Report

Uploaded by

Hasan Patel

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 13

Smt.

Indira Gandhi College of Engineering

Affiliated to University of Mumbai
Department of CSE(AIML)
(2024–2025)

LUNG CANCER DETECTION AND

CLASSIFICATION

1. Hasan Patel

2. Sandesh Patil

3. Jay Pardeshi

GUIDE

PROF. VENKAT PATIL

INDEX

1. ABSTRACT

2. INTRODUCTION

3. LITERATURE REVIEW

4. METHODOLOGY

5. RESULTS

6. CONCLUSION
ABSTRACT

Lung cancer remains a significant public health challenge, being one

of the leading causes of cancer-related mortality worldwide. Early
detection is crucial for improving patient outcomes and survival
rates. This project aims to develop an automated lung cancer
detection and classification system using deep learning techniques,
specifically convolutional neural networks (CNNs). A dataset of
lung CT scans was utilized to train the model, which distinguishes
between cancerous and non-cancerous images. Through data
augmentation and the application of a pre-trained model, the system
achieved high accuracy in classification tasks. Evaluation metrics
such as accuracy, precision, recall, and F1-score were employed to
assess model performance. The results indicate that the proposed
method can significantly aid in early diagnosis and provide a
reliable tool for healthcare professionals, ultimately contributing to
better patient management and treatment strategies.
Introduction

Background: Lung cancer is one of the most prevalent and deadly

forms of cancer globally, accounting for a significant number of
cancer-related deaths each year. The disease often goes undetected
until it reaches advanced stages, leading to poor prognosis and
limited treatment options. Traditional diagnostic methods, such as X-
rays and biopsies, can be invasive and may lack the sensitivity
required for early detection. As a result, there is a pressing need for
more effective and reliable diagnostic tools.

Objective: This project aims to develop a machine learning-based

system for the detection and classification of lung cancer using deep
learning techniques. By leveraging convolutional neural networks
(CNNs), the project seeks to automate the analysis of lung CT scans,
facilitating the differentiation between cancerous and non-cancerous
images. The ultimate goal is to enhance diagnostic accuracy and
provide timely support for healthcare professionals.

Motivation: The motivation behind this project stems from the urgent
need for improved diagnostic methods in lung cancer detection. As
the medical field increasingly embraces artificial intelligence, the
potential to harness deep learning for medical image analysis
represents a significant advancement in patient care. This project
aspires to contribute to early diagnosis, ultimately improving patient
outcomes and paving the way for more effective treatment strategies.
Literature Review

Lung cancer detection and classification have evolved significantly

over the past few decades, particularly with the advent of medical
imaging technologies and machine learning techniques. Traditional
diagnostic methods, such as chest X-rays and computed tomography
(CT) scans, have been the cornerstone of lung cancer diagnosis.
However, these methods often face challenges related to sensitivity
and specificity, leading to the need for enhanced diagnostic
approaches.

Traditional Diagnostic Methods: Historically, lung cancer

detection relied heavily on imaging techniques and invasive
procedures like biopsies. X-rays have been widely used due to their
availability and cost-effectiveness, but they are limited in
sensitivity, particularly in early-stage cancers. CT scans provide
higher resolution images and have become the standard for lung
cancer screening; however, interpreting these images can be
subjective and requires significant expertise. As a result, there is a
growing interest in leveraging AI to improve diagnostic accuracy
and reduce human error.

Machine Learning and Deep Learning in Medical Imaging:

Recent advancements in artificial intelligence have enabled the
development of machine learning models capable of analyzing
medical images with remarkable accuracy. In particular, deep learning
techniques, especially convolutional neural networks (CNNs), have
shown great promise in image classification tasks. Studies have
demonstrated that CNNs can effectively identify and classify lung
nodules in CT scans, often outperforming traditional methods. For
instance, a study by Ardila et al. (2019) utilized deep learning
algorithms to analyze chest X-rays, achieving higher accuracy than
radiologists in detecting lung cancer.
Dataset Utilization: The success of deep learning models relies
significantly on the quality and quantity of data available for training.
Several publicly available datasets, such as the LIDC-IDRI (Lung
Image Database Consortium Image Database Resource Initiative) and
NSCLC (Non-Small Cell Lung Cancer) datasets, have been
instrumental in training and validating AI models. These datasets
contain annotated images that allow researchers to develop robust
models that generalize well to new cases. The use of data
augmentation techniques has also been widely studied to enhance
model performance by artificially increasing the dataset size and
variability.

Comparative Studies: Various studies have compared the

performance of AI-driven diagnostic systems with that of human
experts. For example, a systematic review by Ghafoor et al. (2020)
highlighted that AI models could achieve performance levels
comparable to, or even exceeding, that of experienced radiologists in
detecting lung cancer from imaging data. These findings underscore
the potential of AI as a complementary tool in clinical settings,
enhancing the efficiency and accuracy of lung cancer diagnoses.

Challenges and Limitations: Despite the promising advancements,

several challenges remain in integrating AI into clinical practice.
Issues such as data privacy, the need for standardized protocols, and
the interpretability of AI models pose significant barriers.
Furthermore, the reliance on large, annotated datasets necessitates
collaboration between researchers and medical institutions to create
robust training resources.
Methodology
The methodology for this project consists of several key steps,
including dataset selection, data preprocessing, model development,
training, and evaluation. Each of these steps is crucial for building an
effective lung cancer detection and classification system using deep
learning techniques.

1. Dataset Description

For this project, the LIDC-IDRI (Lung Image Database

Consortium Image Database Resource Initiative) dataset was
utilized. This dataset contains a comprehensive collection of
annotated lung CT scans, including images labeled for the presence of
nodules and various types of lung cancer. It consists of a diverse
range of cases, ensuring representation across different cancer stages
and types. The dataset allows for both training and testing of the deep
learning model, providing a solid foundation for developing a robust
classification system.

2. Data Preprocessing

Data preprocessing is essential to ensure the model's effectiveness and

performance. The following preprocessing steps were implemented:

 Data Cleaning: Images were inspected for any corruption or

anomalies, and invalid images were removed from the dataset.
 Image Resizing: All CT scan images were resized to a standard
dimension (e.g., 224x224 pixels) to maintain consistency and
compatibility with the CNN model input.
 Normalization: Pixel values were normalized to a range of [0,
1] to improve convergence during training. This process helps
the model learn more effectively by ensuring that input values
are scaled similarly.
 Data Augmentation: To enhance the model's generalization
capabilities, data augmentation techniques were employed. This
included random rotations, shifts, flips, and zooms to artificially
increase the dataset's size and variability. These transformations
help prevent overfitting by providing the model with diverse
training examples.

3. Model Architecture

The deep learning model developed for this project is based on a

convolutional neural network (CNN) architecture. The model consists
of several layers, including:

 Convolutional Layers: These layers extract features from the

input images by applying various filters. Multiple convolutional
layers were stacked to learn complex patterns in the images.
 Activation Functions: The ReLU (Rectified Linear Unit)
activation function was used to introduce non-linearity, allowing
the model to learn more complex representations.
 Pooling Layers: Max pooling layers were added after
convolutional layers to down-sample feature maps, reducing
dimensionality while retaining essential information.
 Fully Connected Layers: After several convolutional and
pooling layers, the output was flattened and passed through fully
connected layers, culminating in a softmax layer that provides
probabilities for each class (cancerous and non-cancerous).

4. Training Process

The model was trained using the following parameters:

 Split of Data: The dataset was divided into training, validation,

and test sets, typically using a split ratio of 70% for training,
15% for validation, and 15% for testing.
 Batch Size: A batch size of 32 was chosen to balance memory
usage and training efficiency.
 Epochs: The model was trained for a predetermined number of
epochs (e.g., 50), allowing sufficient time for the model to learn
from the training data while monitoring performance on the
validation set.
 Loss Function: The categorical cross-entropy loss function was
used to measure the model's performance during training.
 Optimizer: The Adam optimizer was employed for efficient
training, allowing for adaptive learning rates.

5. Evaluation Metrics

To evaluate the performance of the trained model, the following

metrics were used:

 Accuracy: The proportion of correctly classified images out of

the total number of images in the test set.
 Precision: The ratio of true positive predictions to the total
predicted positives, indicating the model's ability to avoid false
positives.
 Recall: The ratio of true positive predictions to the total actual
positives, reflecting the model's ability to identify all relevant
instances.
 F1-Score: The harmonic mean of precision and recall, providing
a balance between the two metrics.

The evaluation process involved analyzing the model's performance

on the test set to ensure its generalization capability. A confusion
matrix was also generated to visualize the model's classification
performance and identify areas for improvement.
Results
Conclusion
In this project, machine learning algorithms were successfully
implemented to detect and classify lung cancer based on clinical
data. The Random Forest and XGBoost models provided the best
results, showing that both categorical and numerical features are
crucial in identifying lung cancer patients. Further improvements
can be made by utilizing larger datasets, integrating deep learning
models, or refining the feature engineering process.This study
demonstrates that AI and ML can be effective tools in supporting
early lung cancer detection, potentially aiding in saving lives
through timely diagnosis.

Future Scope
Incorporating more sophisticated deep learning techniques like
convolutional neural networks (CNNs) for image-based detection,
combined with clinical data.Utilizing additional medical datasets to
generalize the model further and improve its
performance.Implementing a real-time prediction system integrated
into clinical workflows to assist doctors in lung cancer screening.

(English (United States) ) Ethical Hacking Full Course - Learn Ethical Hacking in 10 Hours - Ethical Hacking Tutorial - Edureka (DownSub - Com)
No ratings yet
(English (United States) ) Ethical Hacking Full Course - Learn Ethical Hacking in 10 Hours - Ethical Hacking Tutorial - Edureka (DownSub - Com)
536 pages
Two Pointers
No ratings yet
Two Pointers
5 pages
Deep Learning Techniques For Lung Cancer Recogniti
No ratings yet
Deep Learning Techniques For Lung Cancer Recogniti
7 pages
Teja - Technical Seminar Presentation
No ratings yet
Teja - Technical Seminar Presentation
28 pages
A Novel Method To Detect Lung Cancer Using Deep Learning
No ratings yet
A Novel Method To Detect Lung Cancer Using Deep Learning
9 pages
Industrial Training Report
No ratings yet
Industrial Training Report
14 pages
DOI_FINAL
No ratings yet
DOI_FINAL
10 pages
8
No ratings yet
8
12 pages
10 1109@iccsp48568 2020 9182258
No ratings yet
10 1109@iccsp48568 2020 9182258
4 pages
PPT_minor[1]
No ratings yet
PPT_minor[1]
21 pages
Enhanced_Lung_Cancer_Detection_from_CT_Scans__Leveraging_Deep_Learning_for_Precise_Detection
No ratings yet
Enhanced_Lung_Cancer_Detection_from_CT_Scans__Leveraging_Deep_Learning_for_Precise_Detection
5 pages
Artificial intelligence
No ratings yet
Artificial intelligence
31 pages
A CAD System for Lung Cancer Detection Using Hybri
No ratings yet
A CAD System for Lung Cancer Detection Using Hybri
20 pages
Hybrid model detection and classification of lung cancer
No ratings yet
Hybrid model detection and classification of lung cancer
11 pages
Lung Cancer
No ratings yet
Lung Cancer
13 pages
Ijarcce 2023 12709
No ratings yet
Ijarcce 2023 12709
9 pages
2020_9470 defense
No ratings yet
2020_9470 defense
14 pages
Final Book
No ratings yet
Final Book
95 pages
Final Edition 1
No ratings yet
Final Edition 1
90 pages
ffffffffffffffffffffff
No ratings yet
ffffffffffffffffffffff
25 pages
Lung cancer detection_Research Paper-2
No ratings yet
Lung cancer detection_Research Paper-2
9 pages
Mukherjee 2020
No ratings yet
Mukherjee 2020
5 pages
Proposal 2 (AI)
No ratings yet
Proposal 2 (AI)
2 pages
Diagnostic Modelling For Lung Cancer Detection and Classification From Computed Tomography Using Machine Learning
No ratings yet
Diagnostic Modelling For Lung Cancer Detection and Classification From Computed Tomography Using Machine Learning
7 pages
Deep Learning Method For Lung Cancer Identification and Classification
No ratings yet
Deep Learning Method For Lung Cancer Identification and Classification
10 pages
Lungs - Front Page
No ratings yet
Lungs - Front Page
7 pages
Article 1
No ratings yet
Article 1
4 pages
Lung Cancer Cnn
No ratings yet
Lung Cancer Cnn
14 pages
Lung cancer detection using Ml
No ratings yet
Lung cancer detection using Ml
2 pages
Lung Cancer Detection Using Deep Learning and Explainable Methods
No ratings yet
Lung Cancer Detection Using Deep Learning and Explainable Methods
4 pages
IEEE Camera Ready Paper
No ratings yet
IEEE Camera Ready Paper
7 pages
PA Research Papers
No ratings yet
PA Research Papers
5 pages
Poc 3-1 All Units Notes
No ratings yet
Poc 3-1 All Units Notes
10 pages
Batch_11_journal_paper[1][2]
No ratings yet
Batch_11_journal_paper[1][2]
15 pages
Batch 11 Journal Paper
No ratings yet
Batch 11 Journal Paper
16 pages
Mini Project Doc
No ratings yet
Mini Project Doc
51 pages
Cancers 14 03856 v3
No ratings yet
Cancers 14 03856 v3
11 pages
Lung Tumor Classification and Detection From CT
No ratings yet
Lung Tumor Classification and Detection From CT
6 pages
MAjor Project Report
No ratings yet
MAjor Project Report
27 pages
1 s2.0 S2772941923000212 Main
No ratings yet
1 s2.0 S2772941923000212 Main
10 pages
Deep Learning and Machine Learning Algorithms to Predict Lung Cancer
No ratings yet
Deep Learning and Machine Learning Algorithms to Predict Lung Cancer
5 pages
Lung Cancer Detection Model Using Deep Learning Te
No ratings yet
Lung Cancer Detection Model Using Deep Learning Te
17 pages
Graduation Project Paper
No ratings yet
Graduation Project Paper
8 pages
Lung Cancer Detection - Full Ppt
No ratings yet
Lung Cancer Detection - Full Ppt
34 pages
Lung Cancer Prediction Literatur Survey
No ratings yet
Lung Cancer Prediction Literatur Survey
7 pages
IJRAR22B3053
No ratings yet
IJRAR22B3053
18 pages
Newppt Ai Sic
No ratings yet
Newppt Ai Sic
11 pages
1-s2.0-S2590123024017006-main (1)
No ratings yet
1-s2.0-S2590123024017006-main (1)
17 pages
Lung Cancer Detection CNN Abstract
No ratings yet
Lung Cancer Detection CNN Abstract
3 pages
AI Lab Case Study Report
No ratings yet
AI Lab Case Study Report
15 pages
Lung Cancer (CT) 2024
No ratings yet
Lung Cancer (CT) 2024
9 pages
CSE499A Draft
No ratings yet
CSE499A Draft
6 pages
Lung Cancer Detection System Using Image Processin
No ratings yet
Lung Cancer Detection System Using Image Processin
9 pages
CSE720 Lung Cancer Classification From Histopathological Images Using Deep (1)
No ratings yet
CSE720 Lung Cancer Classification From Histopathological Images Using Deep (1)
9 pages
Deep_Learning_Methods_for_Lung_Cancer_Detection_Classification_and_Prediction_-_A_Review
No ratings yet
Deep_Learning_Methods_for_Lung_Cancer_Detection_Classification_and_Prediction_-_A_Review
5 pages
1-s2.0-S2210650224003055-main
No ratings yet
1-s2.0-S2210650224003055-main
15 pages
Research Article: Using Deep Learning For Classification of Lung Nodules On Computed Tomography Images
No ratings yet
Research Article: Using Deep Learning For Classification of Lung Nodules On Computed Tomography Images
8 pages
D15 Final
No ratings yet
D15 Final
25 pages
Lung Cancer Detection by Using Image Processing Approach: IOP Conference Series: Materials Science and Engineering
No ratings yet
Lung Cancer Detection by Using Image Processing Approach: IOP Conference Series: Materials Science and Engineering
4 pages
LungCancer DZK TAB
No ratings yet
LungCancer DZK TAB
6 pages
Zeroth ReviewReport
No ratings yet
Zeroth ReviewReport
5 pages
Augmented Reality Assisted Surgery: Enhancing Surgical Precision through Computer Vision
From Everand
Augmented Reality Assisted Surgery: Enhancing Surgical Precision through Computer Vision
Fouad Sabry
No ratings yet
An Introduction To Wavelet Transform
No ratings yet
An Introduction To Wavelet Transform
80 pages
Multimodal Fusion Research Papers Survey
No ratings yet
Multimodal Fusion Research Papers Survey
1 page
Algo122 Assignment2 JohnsonsAlgorithm
No ratings yet
Algo122 Assignment2 JohnsonsAlgorithm
5 pages
Forecasting of Nonlinear Time Series Using Ann: Sciencedirect
No ratings yet
Forecasting of Nonlinear Time Series Using Ann: Sciencedirect
11 pages
Biomedical Signal Processing Assignment-Week 12
No ratings yet
Biomedical Signal Processing Assignment-Week 12
6 pages
ESB2021 Resit With Solution
No ratings yet
ESB2021 Resit With Solution
9 pages
04 - Probability in AI
No ratings yet
04 - Probability in AI
169 pages
Research Paper
No ratings yet
Research Paper
5 pages
Table of Specifications
No ratings yet
Table of Specifications
5 pages
0412055511MarkovChain
100% (5)
0412055511MarkovChain
508 pages
ECEg 3141 - Laplace Transform
No ratings yet
ECEg 3141 - Laplace Transform
59 pages
Digital Signal Processing
No ratings yet
Digital Signal Processing
163 pages
Simulation Modelsing Part 5
No ratings yet
Simulation Modelsing Part 5
11 pages
9A CH 8 - Quadratic Expressions Topic Test /38: Multiple Choice (8 Marks)
No ratings yet
9A CH 8 - Quadratic Expressions Topic Test /38: Multiple Choice (8 Marks)
4 pages
On The Use of Non-Linear Geostatistical Techniques For Recoverable Reserves Estimation: A Practical Case Study
No ratings yet
On The Use of Non-Linear Geostatistical Techniques For Recoverable Reserves Estimation: A Practical Case Study
11 pages
BT QB
No ratings yet
BT QB
2 pages
Z Request Program Compare
No ratings yet
Z Request Program Compare
18 pages
CS1.1 Discrete RV
No ratings yet
CS1.1 Discrete RV
4 pages
An Analysis of Fuzzy C Means and Logical Average Distance Measure Algorithms Using MRI Brain Images
No ratings yet
An Analysis of Fuzzy C Means and Logical Average Distance Measure Algorithms Using MRI Brain Images
5 pages
Cryptera WP Understanding-RKL To-Launch
No ratings yet
Cryptera WP Understanding-RKL To-Launch
3 pages
Cambridge Lower Secondary Computing - Mock Exam Paper Grade 7 - Copy
100% (1)
Cambridge Lower Secondary Computing - Mock Exam Paper Grade 7 - Copy
5 pages
The Benefits of Predictive Maintenance in Manufact
No ratings yet
The Benefits of Predictive Maintenance in Manufact
9 pages
Buy Ebook Engineering Optimization Methods and Applications 2nd Edition A. Ravindran Cheap Price
100% (5)
Buy Ebook Engineering Optimization Methods and Applications 2nd Edition A. Ravindran Cheap Price
75 pages
05_iai
No ratings yet
05_iai
61 pages
Chapter 01
No ratings yet
Chapter 01
10 pages
Study Material: Free Master Class Series
No ratings yet
Study Material: Free Master Class Series
23 pages
Simulation and Modelling: Chapter Two Simulation Concepts
No ratings yet
Simulation and Modelling: Chapter Two Simulation Concepts
31 pages
TABLE 2.6 Summary of Discrete Compounding Interest Factors. To Find
No ratings yet
TABLE 2.6 Summary of Discrete Compounding Interest Factors. To Find
2 pages

Aihc Report

Uploaded by

Aihc Report

Uploaded by

Smt.

Indira Gandhi College of Engineering

LUNG CANCER DETECTION AND

PROF. VENKAT PATIL

Lung cancer remains a significant public health challenge, being one

Background: Lung cancer is one of the most prevalent and deadly

Objective: This project aims to develop a machine learning-based

Lung cancer detection and classification have evolved significantly

Traditional Diagnostic Methods: Historically, lung cancer

Machine Learning and Deep Learning in Medical Imaging:

Comparative Studies: Various studies have compared the

Challenges and Limitations: Despite the promising advancements,

For this project, the LIDC-IDRI (Lung Image Database

Data preprocessing is essential to ensure the model's effectiveness and

 Data Cleaning: Images were inspected for any corruption or

The deep learning model developed for this project is based on a

 Convolutional Layers: These layers extract features from the

The model was trained using the following parameters:

 Split of Data: The dataset was divided into training, validation,

To evaluate the performance of the trained model, the following

 Accuracy: The proportion of correctly classified images out of

The evaluation process involved analyzing the model's performance

You might also like