A Review of Convolutional Neural Network-Based Computer Aided Lung Nodule Detection System
A Review of Convolutional Neural Network-Based Computer Aided Lung Nodule Detection System
Corresponding Author:
Tole Sutikno
Department of Electrical Engineering, Faculty of Industrial Technology, Universitas Ahmad Dahlan
South Ringroad st., Tamanan, Banguntapan, Bantul, Special Region of Yogyakarta, Indonesia 55191
Email: [email protected]
1. INTRODUCTION
Lung cancer is characterized by uncontrolled cell division in the lungs. Cells divide and replicate
themselves. When damaged cells divide uncontrollably, they become tumors, which eventually hinder organs
from functioning correctly. When compared to other kinds of cancer, lung cancer is by far the deadliest and
most common form of the disease, making it the top cause of death globally [1]–[6]. Lung cancer is the world's
second most common cancer, behind breast cancer. According to World Health Organization (WHO) figures
issued in 2020, lung cancer mortality in Indonesia reached 28,633 in 2019. In 2022, the United States is
expected to have 1,918,030 new cancer cases and 609,360 cancer deaths, with lung cancer, the leading cause
of cancer death, accounting for approximately 350 deaths per day [7]. However, if nodules are found at an
early stage, there is a chance that the human survival rate can be raised.
Lung cancer detection methods using computer-aided detection (CAD) have been developed in recent
years [8]–[16]. Early identification of lung cancer has been shown to both lower mortality rates and increase
the likelihood of survival. Nodules in the lungs that are still relatively tiny can be benign or malignant,
depending on the circumstances, even if the nodules themselves are not cancerous. Lung tissue that is benign
does not experience much growth, however lung tissue that is malignant will develop swiftly and assault the
body, making it extremely hazardous to one's health [17]. The most important goals of computer-aided design
(CAD) are for reliably recognizing images and extracting regions of interest (ROI) from images obtained from
a variety of imaging modalities. These imaging modalities include computed tomography (CT) scans, X-rays,
position emission tomography (PET), and magnetic resonance imaging (MRI) [18]–[27]. CAD systems are
further subdivided into computer-aided detection (commonly abbreviated as CADe) and computer-aided
diagnosis (CADd) (CADx). The CADe system's capabilities are limited to identifying abnormal tissue regions
and pictures, but the CADx system can be used to diagnose a disease by determining the sort of abnormality
present and whether it is malignant [28].
Previously, a substantial amount of study was conducted on the examination of lung nodule
identification using the CADe system. Because of limited resources and enormous volumes of data, this system
is often based on traditional machine learning, as well as linear discriminant analysis, multiple gray-level
thresholding, distance transformation, and support vector machines (SVM) [29]–[33]. However, in recent
years, a number of researchers have created deep learning-based lung detection techniques. One of these ways
is the convolutional neural network (CNN) approach, It has exceptional computer vision performance values,
and increases the CADe system's accuracy and sensitivity [34]. Trends for the years 2019-2022 include deep
learning in the form of CNN as well as the performance of each individual approach. The objective of this
study is to give researchers with an overview of the CNN-based CAD system for the identification of lung
nodules. The review will cover the phases of the CAD system for lung nodule identification in general, the
preprocessing, the segmentation and detection techniques that have been extensively utilized from the
beginning to the present, as well as the approach of lung nodule identification that is now being utilized.
2. LUNG NODULES
Lung nodules are abnormal growths that develop within the lungs. Lung nodules are extremely
frequent. The lung may have one or many nodules. Up to fifty percent of persons who have chest X-rays or CT
scans possess them. Nodules can form in either lung. About 95% of lung nodules are benign (not cancerous).
Infrequently, lung nodules indicate lung cancer. Because small lung nodules rarely cause symptoms, more
testing are required to identify whether or not it is lung cancer.
Analysis of lung nodules is one of the stages that must be taken in order to successfully prevent lung
cancer, which is accomplished through identification and categorization. Dark-level lung nodules are typically
between 3 and 30 millimeters in diameter [34]. In general, these nodules have a diameter of around 3
millimeters. Figure 1 depicts samples of various types of lung nodule categories. The circumscribed, juxta
vascular, juxta pleural, and pleural tails of nodules are represented in Figures 1(a) to 1(d), respectively. In
contrast, juxta vascular nodules are firmly linked to blood vessels, while juxta pleural nodules are located in
the region around the pleura [28]. Circumscribed nodules are not associated with any other tissue structures
and are seen in a dispersed manner throughout the tissue.
The nodule is centrally While placed centrally in the A substantial amount of the The nodule is located at the
positioned in the lung and has lung, the nodule has nodule is connected to the pleural surface and is
no link to the vascular. extensive linkages to pleural surface. connected by a thin surface.
adjacent arteries.
(a) (b) (c) (d)
Figure 1. Categories of lung nodule samples [28], (a) well-circumscribed (b) juxta vascular, (c) juxta pleural,
and (d) juxta pleural tail
A review of convolutional neural network-based computer-aided lung nodule detection system (Sekar Sari)
1046 ISSN: 2252-8938
Figure 2. Acquisition, preprocessing, lung segmentation, nodule detection, and classification are the steps in
the CAD schematic for lung nodule detection
4. PREPROCESSING
Preprocessing is an essential first step in the CT images used for lung detection because the raw CT
images contain a great deal of noise and extraneous data that will hinder the CAD system's ability to detect
lung nodules [34]. This review divides preprocessing into three stages: image smoothing, edge sharpening, and
noise removal. The preprocessing stages will be explained briefly in this section, and the section will conclude
with the possibility of using CNN-based methods to remove noise from lung images.
determining optimal smoothing via degrees of freedom that is relatively robust to the choice of location and a
relatively large number of knots. The natural spline is simply a constrained cubic spline that employs B-spline
basis functions instead of piecewise polynomials. Fractional polynomials, like P-splines and RCS, can be
employed with any generalized linear model to analyze survival data. Although a global (rather than local)
approach, the FP model has the advantage of being a simpler form than the other two possibilities and
incorporating a wider range of functional forms than the normal polynomial family allows. A broader range of
possible dose-response interactions could be accommodated [62], [63]. The mean filter, the Gaussian filter,
anisotropic diffusion, the median filter, the adaptive median filter, conservative smoothing, and the alpha trim
mean filter are other image smoothing methods that can be used to improve low-level distortion in lung images
[37], [53]–[55], [58], [64].
Figure 3. CNN-based methods for removing noise for lung images [68]
A review of convolutional neural network-based computer-aided lung nodule detection system (Sekar Sari)
1050 ISSN: 2252-8938
5. LUNG SEGMENTATION
Lung segmentation is a procedure that is used in lung detection. This process involves isolating lung
nodules from other portions of the lung CT scan image and then further improving the image that is produced
so that detail can be seen [55]. Lung segmentation techniques are broadly classified into four categories:
i) deformable boundary-based techniques; ii) edge-based techniques; and iii) threshold-based techniques, and
iv) registration-based method [28], [74]–[77]. Each of these categories has its own subcategories. Several of
these methods of segmentation each have their own set of benefits and drawbacks. This review divides lung
segmentation into three stages: histogram-based thresholding, connected component analysis, and lung
extraction. The stages will be explained briefly in this section, and the section will conclude with the possibility
of using CNN-based lung segmentation methods.
The labeled connected component in the binarized image is analyzed using connected component
analysis. The pixels in the binary image are divided into multiple linked components based on their pixel
connection. The lung lobes are segmented using the first-level linked components. The designated components'
area and boundary information are utilized to determine the choice. The second level of linked components is
identified, and geometric characteristics are retrieved from each component. The geometric characteristics used
include area, bounding box, eccentricity, equivalent diameter, main axis length, minor axis length, and
perimeter. The information gathered from the components is evaluated to identify whether or not they are
tumors. An overall evaluation of lung nodules is provided in [81]. A lung nodule greater than 5 mm in size has
a significant likelihood of becoming malignant cancer. According to their findings, an eccentric component
with an overall size of 5 mm or greater is likely to be malignant. The findings of these tests are utilized to label
the section as malignant and remove the remainder [82].
A review of convolutional neural network-based computer-aided lung nodule detection system (Sekar Sari)
1052 ISSN: 2252-8938
Ausawalaithong et 2018 JSRT, NIH 247 2048×2048 and Increasing contrast, noise CNN
al. [90] and 1024×1024 removal, image resizing
100 pixels and image normalizing
Anitha and Babu 2019 LIDC and 50 and 0.625 mm Morphological Fissure regions
[37] ELCAP 30 transformation and weiner segmentation
filter
Kasinathan et al. 2019 LIDC-IDRI 850 0.45-0.75 mm Remove the mediastinum ACM
[56] region and thoracic wall
Huang et al. [50] 2019 LIDC-IDRI 888 0.6-5.0 mm Linear mapping FCN
and LUNA16
Ardila et al. [91] 2019 LIDC, LUNA, 1139 8–15 mm CNN CNN
and NLST
The level of accuracy achieved by the SVM relies on the selection of kernel parameters such as C and the radial
basis function (RBF). For the testing, the LIDC-IDRI database was utilized, and in total, 140 new exams were
developed [58]. The Naive Bayes and KNN methods were proposed for the purpose of lung nodule detection
by Nóbrega et al. [57]. The builder for these methods was based on a Gaussian distribution of the probability
density function.
When attempting to detect lung nodules using machine learning techniques, the difficulty of defining
and choosing the attributes of a certain image arises. As the number of photographs in each category rises, the
process of extracting features from images gets increasingly time-consuming and labor-intensive [34] In
addition, over the past few years, methods based on deep learning have been developed for the detection of
lung nodules. The CNN method was developed by a group of researchers specifically for the purpose of
detecting lung nodules. In its most basic form, a CNN is made up of three layers: a convolutional layer (CONV),
a pooling layer, and a fully connected layer (FC). Jiang et al. [1] conducted research on the detection of lung
nodules using the CNN method, modifying it to use the function of rectified linear units in place of the
convolutional layer Rectified Linear Unit (ReLU). When compared to other activation functions on the CNN
method, the process of error training is sped up by using ReLU, which is one of the advantages obtained from
using it. At the pooling layer, operations known as max-pooling and average-pooling are carried out, whereas
the FC layer is made up of four separate channels. assessment of the suggested CNN approach using the LIDC
IDRI database. The CNN method, which consists of several layers, is utilized by Wang et al. [96] in their lung
nodule detection process. The activation function of the convolution layer is Leaky ReLU, the pooling layer
uses means and averages, and the final layer, the FC layer, uses global average pooling. The FC layer
implements a feature mapping strategy that makes use of a 4 by 4 matrix kernel in an effort to cut down on the
number of connected parameters. In addition, the batch normalization layer is implemented to lessen the impact
of overfitting and speed up the convergence of the network [96]. Kasinathan et al. [56] and Li et al. [36] carried
out yet another study for the purpose of detecting lung nodules using the CNN method. The CNN method is in
the process of being developed, and it currently incorporates architectural models such as region-based fully
convolutional networks (RFCN) [97], regional-CNN (R-CNN) [98], faster regional CNN (Faster R-CNN) [50],
ResNet50 [57], and 3D-CNN [99].
Deep learning is crucial for enabling the CNN approach to be employed in the analysis of medical
pictures as technology develops, the detection process gets computationally faster, and the amount of data
accessible increases. This was made possible by the use of the CNN method. The CNN method has a number
of benefits, the most notable of which are an enhanced image detection performance, high flexibility and
adaptability to a wide variety of datasets, and the capacity to be designed automatically and effectively by
making use of black-box operations [34]. According to the findings of previous research, the following are
some advantages of deep learning:
− The performance of the CAD system in detecting nodules in lung cancer may be improved through the
use of techniques from deep learning. Not only does the CAD system detect the presence of lung nodules,
but it also provides information on the location of those nodules and has the ability to categorize detected
nodules as either benign or malignant [97].
− Deep CNN has the potential to increase the sensitivity of the detection of lung nodules by reducing the
value of FPs/scan, thereby lowering the error rate in detection and, of course, improving the quality of
detection [99].
− Deep CNN can be used to detect lung nodules in a variety of dataset sources. For example, it can be used
to detect data from hospital A, and then it can be used to detect data from hospital B. Deep CNN is able
to discover various CT scans of the lungs and categorize them into distinct groups [37], [41], [50], [100].
A review of convolutional neural network-based computer-aided lung nodule detection system (Sekar Sari)
1054 ISSN: 2252-8938
specificity of 97.70%. Additionally, the test had a sensitivity of 85.91%. Han et al. [101] and Boroczky et al.
[102] carried out yet another investigation in which the SVM method was utilized. In this particular
investigation, the classification performed by feature-based SVM was dependent on categorical rules. These
categorical rules took the form of geometric or shape features, intensity features, gradient features, and
eigenvalue-based features. The proposed system was tested on 205 patient cases taken from the openly
accessible online LIDC database. The experimental results obtained a sensitivity of 89.2% at 4.14 FPs/scan,
and the system was found to be successful [101]. The results of the study showed that the genetic algorithm
method developed by Boroczky et al. for extracting features from private data obtained from lung CT scans
(including 52 real nodules and 443 fake ones) was 100% accurate and 56.4% accurate [102].
Jia et al. [103] proposed several three-dimensional methods for lung detection, some of which are
surface shadow display (SSD), volume rendering, maximum intensity rendering (MIP), and minimum intensity
rendering (MIP) (VR). The process of feature extraction uses ROI to more accurately identify suspicious
regions, paying particular attention to their shape, gray value, position, circularity, mean value of gray level,
and smoothness. Also taken into consideration is the mean value of gray level. With a sensitivity of 95% on
FPs/scan of 0.91, the proposed method was able to successfully detect lung image nodules. Tariq and Akram
[55] investigated the use of the neurofuzzy method in the detection of lung nodules. The neurofuzzy
classification can be broken down into two distinct sub-networks: the fuzzy self, which is responsible for
managing the network, and the multilevel, multilayer perception (MLP). In order to generate a pre-
classification vector, the feature vector is used as an input to the fuzzy layer. This pre-classification vector is
then assigned to the MLP as a sample test classification. The fuzzy self-layer network is in charge of locating
nodule pixels and organizing them into groups according to the similarity of the nodules (regardless of whether
or not there are nodules), but with varying membership values. In addition to this, the MLP network will
classify the input vectors that have been applied in order to select candidates from the appropriate category.
The testing was done with a total of one hundred datasets of lung CT images taken from various patients. The
accuracy reached using the strategy that was suggested is 95%. The back propagation neural network method,
which is composed of three layers, was utilized by Talebpour et al. in the process of classifying nodules as
distinct from other objects that are not nodules. There are 22 input neurons in the first layer, five hidden neurons
in the second layer, and one neuron in the output layer of the third layer. The first layer is considered the input
layer. Tan-sigmoid is an internal function that is present in every neuron. The proposed method was put to the
test using the LIDC-IDRI database, and the results showed that it had a sensitivity of 90% on an FP/scan of 10
[104]. Kuruvilla and Gunavathi [65] carried out yet another study, this time making use of the back propagation
neural network. The findings of the study indicated that a sensitivity of 91.4% could be achieved with an
FP/scan value of 30.
A random forest method was proposed by Gong et al. [41] for the purpose of classifying lung nodules
using the LUNA 16 and ANODE 09 databases. A sensitivity value of 79.3% was obtained from testing in both
databases with an FP/scan value of 4, and a sensitivity value of 84.62% was obtained from testing with an
FP/scan value of 2.8. Detecting lung cancer on CT images using the LIDC dataset was the focus of another
study that was carried out by Jayaraj and Sathiamoorthy [54], which utilized the random forest method. The
random forest method relies on a classification that takes into account both the index and the entropy to arrive
at its conclusions. The results obtained using the proposed method had an accuracy of 89.90%, a sensitivity of
90.85%, and a specificity of 88.32%, respectively. Nobrega et al. [57] conducted research on the effectiveness
of deep transfer learning when applied to the classification of lung nodule malignancy tasks. Their goal was to
improve such systems and put them to the test using the LIDC database. The proposed method is a contrast
between deep transfer learning and deep feature learning, and it yields the following results: an area under the
curve (AUC) of 93.1%, a true positive rate (TPR) of 85.38%, an evaluation metrics accuracy (ACC) of 88.41%,
precision of 73.48%, and an F1-score of 78.83%. They found that the deep transfer learning method is an
effective way to take CT images of lung nodules and extract the most important features from those images.
This was the finding of their research.
The CNN classification method is utilized next in the FPs reduction stage. Shin et al. [44] developed
the CNN approach for lung nodule identification, an architecture created with GoogLeNet that comprises of a
convolution layer, three pooling layers, and nine inception layers. GoogLeNet's inception layers are made up
of six convolutional layers and one pooling layer. The system was tested using the ILD dataset, and the findings
indicated a reduced accuracy (79%). To identify lung CT images, Golan et al. [105] suggested a deep CNN
technique utilizing a back-propagation algorithm. CNN is divided into two segments. The first section
comprises of numerous volumetric convolutions, rectified linear units (ReLU), and max-pooling layers. The
second component is a classifier composed of many fully linked, threshold, and softmax layers. The system
was tested using the LIDC dataset and yielded a poor sensitivity result of 78.9% at 20 frames per second.
Anthimopoulos et al. devised and tested CNN for the categorization of ILD patterns. The suggested technique
comprises of five convolutional layers, leaky ReLU activations, a pooling layer, and three dense layers. The
classification performance of lung patterns using the CNN approach is 85.5% [100]. Dou et al. proposed
employing three-dimensional (3-D) CNN to improve automated lung nodule identification from volumetric
CT data. The suggested approach was thoroughly tested in the LUNA16 challenge, yielding a sensitivity result
of 90% with 8 FPs per scan [99]. Tekade and Rajeswari [17] presented lung cancer detection and classification
using deep learning, and the technique is a 3D multipath VGG-like network that is tested on the LIDC,
LUNA16, and Kaggle datasets. The result for lung nodule identification and classification is 95.60% accuracy
and 0.387732 log loss. Kido et al. [98] developed a CAD method for lung anomalies based on CNN and areas
with CNN properties (R-CNN). R-CNN is an object detection system that use a CNN to categorize picture
areas inside an image. R-CNN was trained using marked aberrant lesions, and it indicated abnormal lesion
bounding boxes on the test picture. Their proposed approach has an accuracy of 84.7%. Jiang et al. [1]
suggested an automated detection approach for lung nodules based on a multigroup patch and a deep learning
network. With 15.1 FPs per scan, the CAD system achieved a result sensitivity of 94%. Huang et al. [50]
proposed utilizing deep convolutional neural networks to detect and separate lung nodules in thoracic CT
images in a quick and fully automated manner. The accuracy gained at the false-positive (FP) reduction step,
which is conducted using CNN, is 94.6% with 4 FPs per scan. The average dice coefficient of nodule
segmentation compared to the ground truth is 0.793.
Lung nodule identification using multi-resolution convolutional networks was proposed by Li et al.
[36] for the purpose of chest X-ray radiography. In order to extract the feature, they used patch-based multi-
resolution convolutional networks, and for classification, they used four distinct fusion algorithms. They
employed the JSRT database in order to evaluate their suggested technique. Within this database, they exhibited
an accuracy of 99% while only using 0.2 FPs each scan. Kasinathan et al. [56] proposed utilizing CNN to
automate the process of detecting and classifying three-dimensional lung tumors. The LIDC-IDRI dataset,
which included 850 lung nodule-lesion pictures, was utilized in the evaluation of the suggested model. The
outcome was that the model was accurate 97% of the time. In their study, Shi et al. [106] suggested a CNN
multi-scale feature fusion approach for the identification of lung nodules. The framework for detection is made
up of two parts: the production of region proposals and the minimization of false-positive results. The
architecture of the CNN model is VGG16, and trials performed on the LUNA16 dataset demonstrate that it has
an average sensitivity of 82.62%. Masood et al. suggested an automated technique for the identification of lung
cancer by employing a method known as the improved multidimensional region-based fully convolutional
network (mRFCN). The LIDC dataset was used to train and test their system, and the experiment findings
demonstrate that their system has a sensitivity of 98.1% and an accuracy of 97.91% [97]. Wang et al. [96]
proposed utilizing a raw patch-based CNN for the identification of lung nodules in CT images. On CT images
taken from the LIDC-IDRI dataset, they evaluated the performance of ResNet in comparison to that of many
alternative CNN architectures. As a result, they achieved a high detection sensitivity of 92.8% with 8 FPs per
scan.
of other researchers. Table 1 provides an overview of a variety of datasets taken from several different public
databases, in addition to providing information on these datasets.
In addition to the dataset that was utilized, we also performed an analysis of the CAD system by
employing the methods of preprocessing, segmentation, detection, and FPs reduction. The Table 2 provides a
summary of some of the preprocessing and segmentation techniques that we use. The Gaussian filter and the
Median Filter are the two types of preprocessing procedures that are utilized the most frequently. In order to
analyze all 1018 photos that were taken from the LIDC-IDRI database, Jayaraj et al. utilized Gaussian and
median filters. A gaussian filter is applied in order to reduce noise in lung cancer diagnosis, and a median filter
is employed in order to remove salt and pepper noise, which is a type of very tiny noise that can be found in
CT scans. In the CT picture of the lung, this will result in the smoothing of the image and a reduction in the
speckle noise [54]. Widodo et al. [53], Filho et al. [58], and Sankar et al. [86] utilized the same method in their
works.
The subsequent step in the process carried out by a CAD system is image segmentation. The process
of distinguishing lung nodules from other sections of a CT scan picture of the lung and then enhancing the
image that is produced as a consequence in order to acquire additional detail is referred to as lung segmentation.
The thresholding method is one of the most used approaches to segmentation [1], [55], [58]. Deep learning as
a segmentation approach has developed over time and can take on a variety of forms, depending on the lung
imaging nodule being analyzed. Using a faster convolutional neural network, Huang et al. suggested a
segmentation strategy for recognizing lung CT images [50], and they attained an accuracy of 94.6 f. RNN is a
method that was proposed by Sankar and George [86] to enhance the identification of juxtapleural and
juxtavascular lesions. UNet and CNN are two approaches that were proposed by Shaziya and colleagues for
segmenting lung CT images. When it comes to picture segmentation, the findings of the Thedicee similarity
coefficient (DSC) demonstrate that the UNet approach is 1.27 percent more effective than CNN [87]. In the
TB category, Arora et al. performed segmentation with the help of the UNet, and as a result, they achieved a
DSC value of 0.9680 [88]. As a direct consequence of this, deep learning segmentation strategies such as FCN,
RNN, CNN, and UNet offer a great deal of promise.
The procedure of segmentation is followed by the identification of nodules and the extraction of
features. The purpose of this step is to ascertain whether or not a certain picture is recognized as a nodule.
Using SVM’s and feature extraction with form and texture, Filho et al. [58] devised a method for the
identification of nodules that was both highly accurate and sensitive. Tariq et al. [55] employ neurofuzzy to
identify nodules, and they extract characteristics using vector and intensity. Back propagation neural network
(BPNN) was utilized by Talebpour et al. for the purpose of detecting nodules, in conjunction with geometric
and textural feature extraction [104]. However, machine learning algorithms for detecting nodules and
extracting features have difficulty with the work of defining and selecting the characteristics of a particular
picture, which also causes the task to become more time-consuming. In the process of nodule detection, deep
learning CNNs are utilized extensively. Several studies, such as the ones by Jiang et al. [1] using CNN, Tekade
et al. [17] using CNN with 3D multipath VGG, Wang et al. using CNN (ResNet) [96], and Masood et al. [97]
using RFCN, Dou et al. [99] using 3D-CNN, have achieved an accuracy and sensitivity that is more than 90%.
Performing methods like as feature extraction and false-positive reduction are essential for achieving the best
results possible in lung nodule detection.
Moving forward, there is a pressing need for more study into the development of CAD systems for
the identification of lung nodules. The target is to acquire a detection result that is more accurate as well as a
high sensitivity value that has the potential to lower the value of FP reduction. The following is a list of
significant concepts for potential CAD systems that can be utilized in the future to locate lung nodules:
− Developing new methods of deep learning, such as the CNN method, with the primary goal of enhancing
the performance of lung nodule identification. In addition, the batch normalization layer can be included
in order to lessen the effects of overfitting and speed up the convergence of the network [96].
− Developing a CAD system that is capable of detecting all types of lung nodules with high accuracy and
sensitivity, as well as a low percentage of false-positive results while maintaining these characteristics.
− If the suggested technique is trained and evaluated using a large number of datasets, such as the LIDC-
IDRI and LUNA16 public databases, then it will be able to provide a more comprehensive assessment of
the general and clinical performance of the detection system.
9. CONCLUSION
In this study, we have presented a critical analysis of some of the research that has been done on CAD
systems for lung nodule identification by doing literature investigations and analyzing the results. The research
that has been done on CAD systems for lung nodule identification focuses on the identification of nodules in
the lungs. There have been a few distinct lines of inquiry that have utilized CT scan photos to examine the
efficacy of the suggested method. After providing an overview of the computer-aided diagnosis (CAD) system
for detecting lung nodules, it has been established that the system consists of various steps. These phases
involve preparing or obtaining image data, preprocessing, segmenting, lung nodule detection, and FPs
reduction, which incorporates feature extraction. Other steps include detecting and segmenting lung nodules.
The steps are broken down into their component parts farther down in this article. We discovered that some of
the works on lung nodule detection had better results than others based on parameters such as sensitivity,
specificity, accuracy, and the number of FPs per scan, as well as other parameters, after reviewing a few of the
most well-known works on the subject and evaluating the proposed method with a dataset taken from the
LIDC-IDRI database. This was the conclusion we reached after looking at a few of the most well-known works
on lung nodule detection and evaluating the method. In addition, we have provided an overview of the various
methods that can be used for each of the steps of the process of identifying lung nodules. In recent years, deep
learning algorithms such as CNN have been an increasingly popular means of locating prospective nodules and
extracting attributes. Over the course of the last few years, this strategy has become increasingly common.
Despite the fact that we discovered that some CAD systems achieved outstanding sensitivity with low false-
positive rates, there are still a great many barriers to overcome in order to optimize CAD systems for the
detection of lung cancer. It is expected that a capable CAD system will be able to assist radiologists in the
detection of lung cancer. This is the single most critical thing that it ought to be able to accomplish.
ACKNOWLEDGMENT
The authors gratefully acknowledge Universitas Gadjah Mada and Universitas Ahmad Dahlan for
their financial support of this collaborative work, as well as the Embedded Systems and Power Electronics
Research Group (ESPERG) for their publication assistance.
REFERENCES
[1] H. Jiang, H. Ma, W. Qian, M. Gao, and Y. Li, “An automatic detection system of lung nodule based on multigroup patch-based
deep learning network,” IEEE Journal of Biomedical and Health Informatics, vol. 22, no. 4, pp. 1227–1237, 2018, doi:
10.1109/JBHI.2017.2725903.
[2] T. Uemura and T. Hida, “Durvalumab showed long and durable effects after chemoradiotherapy in stage III non-small cell lung
cancer: Results of the PACIFIC study,” Journal of Thoracic Disease, vol. 10, pp. S1108–S1112, 2018, doi:
10.21037/jtd.2018.03.180.
[3] T. Hou et al., “Alpha thalassemia/intellectual disability X-linked deficiency sensitizes non-small cell lung cancer to immune
checkpoint inhibitors,” Frontiers in Oncology, vol. 10, 2020, doi: 10.3389/fonc.2020.608300.
[4] C. Olingy et al., “CD33 expression on peripheral blood monocytes predicts efficacy of anti-PD-1 immunotherapy against non-small
cell lung cancer,” Frontiers in Immunology, vol. 13, 2022, doi: 10.3389/fimmu.2022.842653.
[5] M. Casás-Selves et al., “Tankyrase and the canonical Wnt pathway protect lung cancer cells from EGFR inhibition,” Cancer
Research, vol. 72, no. 16, pp. 4154–4164, 2012, doi: 10.1158/0008-5472.CAN-11-2848.
[6] Z. Huang et al., “Correlation of cancer stem cell markers and immune cell markers in resected non-small cell lung cancer,” Journal
of Cancer, vol. 8, no. 16, pp. 3190–3197, 2017, doi: 10.7150/jca.20172.
[7] R. L. Siegel, K. D. Miller, H. E. Fuchs, and A. Jemal, “Cancer statistics, 2022,” CA: A Cancer Journal for Clinicians, vol. 72, no.
1, pp. 7–33, 2022, doi: 10.3322/caac.21708.
[8] M. S. AL-Huseiny and A. S. Sajit, “Transfer learning with GoogLeNet for detection of lung cancer,” Indonesian Journal of
Electrical Engineering and Computer Science, vol. 22, no. 2, pp. 470–478, 2020, doi: 10.11591/ijeecs.v22.i2.pp470-478.
[9] R. Pandian, D. N. S. Ravi Kumar, and R. R. Kumar, “Development of algorithm for identification of maligant growth in cancer
using artificial neural network,” International Journal of Electrical and Computer Engineering, vol. 10, no. 6, pp. 5709–5713, 2020,
doi: 10.11591/ijece.v10i6.pp5709-5713.
[10] H. F. Kareem, M. S. AL-Husieny, F. Y. Mohsen, E. A. Khalil, and Z. S. Hassan, “Evaluation of SVM performance in the detection
of lung cancer in marked CT scan dataset,” Indonesian Journal of Electrical Engineering and Computer Science, vol. 21, no. 3, pp.
1731–1738, 2021, doi: 10.11591/ijeecs.v21.i3.pp1731-1738.
[11] R. D. Abdu-Aljabar and O. A. Awad, “Parallel extreme gradient boosting classifier for lung cancer detection,” Indonesian Journal
of Electrical Engineering and Computer Science, vol. 24, no. 3, pp. 1610–1617, 2021, doi: 10.11591/ijeecs.v24.i3.pp1610-1617.
[12] S. Kothari, S. Chiwhane, S. Jain, and M. Baghel, “Cancerous brain tumor detection using hybrid deep learning framework,”
Indonesian Journal of Electrical Engineering and Computer Science, vol. 26, no. 3, pp. 1651–1661, 2022, doi:
10.11591/ijeecs.v26.i3.pp1651-1661.
[13] O. R. Kadhim, H. J. Motlak, and K. K. Abdalla, “Computer-aided diagnostic system kinds and pulmonary nodule detection
efficacy,” International Journal of Electrical and Computer Engineering, vol. 12, no. 5, pp. 4734–4745, 2022, doi:
10.11591/ijece.v12i5.pp4734-4745.
[14] M. F. Abdullah, S. N. Sulaiman, M. K. Osman, N. K. A. Karim, S. Setumin, and I. S. Isa, “A new procedure for lung region
segmentation from computed tomography images,” International Journal of Electrical and Computer Engineering, vol. 12, no. 5,
pp. 4978–4987, 2022, doi: 10.11591/ijece.v12i5.pp4978-4987.
[15] A. A. Abd Al-Ameer, G. A. Hussien, and H. A. Al Ameri, “Lung cancer detection using image processing and deep learning,”
Indonesian Journal of Electrical Engineering and Computer Science, vol. 28, no. 2, pp. 987–993, 2022, doi:
10.11591/ijeecs.v28.i2.pp987-993.
[16] R. S. Shankar, R. S. Chigurupati, P. Voosala, and N. Pilli, “An extensible framework for recurrent breast cancer prognosis using
deep learning techniques,” Indonesian Journal of Electrical Engineering and Computer Science, vol. 29, no. 2, pp. 931–941, 2023,
doi: 10.11591/ijeecs.v29.i2.pp931-941.
[17] R. Tekade and K. Rajeswari, “Lung cancer detection and classification using deep learning,” Proceedings - 2018 4th International
A review of convolutional neural network-based computer-aided lung nodule detection system (Sekar Sari)
1058 ISSN: 2252-8938
Conference on Computing, Communication Control and Automation, ICCUBEA 2018, 2018, doi:
10.1109/ICCUBEA.2018.8697352.
[18] N. E. A. Khalid, M. F. Ismail, M. A. A. B. Manaf, A. F. A. Fadzil, and S. Ibrahim, “MRI brain tumor segmentation: A forthright
image processing approach,” Bulletin of Electrical Engineering and Informatics, vol. 9, no. 3, pp. 1024–1031, 2020, doi:
10.11591/eei.v9i3.2063.
[19] I. Assini, A. Badri, A. Sahel, and A. Baghdad, “Hybrid multiple watermarking technique for securing medical images of modalities
MRI, CT scan, and X-ray,” International Journal of Electrical and Computer Engineering, vol. 10, no. 3, pp. 2349–2356, 2020,
doi: 10.11591/ijece.v10i3.pp2349-2356.
[20] N. Raju, H. B. Anita, and P. Augustine, “Identification of interstitial lung diseases using deep learning,” International Journal of
Electrical and Computer Engineering, vol. 10, no. 6, pp. 6283–6291, 2020, doi: 10.11591/IJECE.V10I6.PP6283-6291.
[21] N. Saha and M. A. Rahaman, “Bone cancer detection using electrical impedance tomography,” Indonesian Journal of Electrical
Engineering and Computer Science, vol. 24, no. 1, pp. 245–252, 2021, doi: 10.11591/ijeecs.v24.i1.pp245-252.
[22] R. Supriyanti, M. Alqaaf, Y. Ramadhani, and H. B. Widodo, “Morphological characteristics of X-ray thorax images of COVID-19
patients using the Bradley thresholding segmentation,” Indonesian Journal of Electrical Engineering and Computer Science, vol.
24, no. 2, pp. 1074–1083, 2021, doi: 10.11591/ijeecs.v24.i2.pp1074-1083.
[23] W. S. Alazawee, Z. H. Naji, and W. T. Ali, “Analyzing and detecting hemorrhagic and ischemic stroke-based on bit plane slicing
and edge detection algorithms,” Indonesian Journal of Electrical Engineering and Computer Science, vol. 25, no. 2, pp. 1003–
1010, 2022, doi: 10.11591/ijeecs.v25.i2.pp1003-1010.
[24] A. Elaraby and A. Taha, “An approach for cross-modality guided quality enhancement of liver image,” International Journal of
Electrical and Computer Engineering, vol. 12, no. 2, pp. 1449–1455, 2022, doi: 10.11591/ijece.v12i2.pp1449-1455.
[25] H. A. Owida, H. S. Migdadi, O. S. M. Hemied, N. F. F. Alshdaifat, S. F. A. Abuowaida, and R. S. Alkhawaldeh, “Deep learning
algorithms to improve COVID-19 classification based on CT images,” Bulletin of Electrical Engineering and Informatics, vol. 11,
no. 5, pp. 2876–2885, 2022, doi: 10.11591/eei.v11i5.3802.
[26] V. Bhavana and H. K. Krishnappa, “Multi-modal image fusion using contourlet and wavelet transforms: a multi-resolution
approach,” Indonesian Journal of Electrical Engineering and Computer Science, vol. 28, no. 2, pp. 762–768, 2022, doi:
10.11591/ijeecs.v28.i2.pp762-768.
[27] T. Asmaria, D. A. Mayasari, A. D. G. Febrananda, N. Nurul, A. J. Rahyussalim, and I. Kartika, “Computed tomography image
analysis for Indonesian total hip arthroplasty designs,” International Journal of Electrical and Computer Engineering, vol. 12, no.
6, pp. 6123–6131, 2022, doi: 10.11591/ijece.v12i6.pp6123-6131.
[28] F. Shaukat, G. Raja, and A. F. Frangi, “Computer-aided detection of lung nodules: A review,” Journal of Medical Imaging, vol. 6,
no. 02, p. 1, 2019, doi: 10.1117/1.jmi.6.2.020901.
[29] S. Saeed and A. Abdullah, “Recognition of brain cancer and cerebrospinal fluid due to the usage of different MRI image by utilizing
support vector machine,” Bulletin of Electrical Engineering and Informatics, vol. 9, no. 2, pp. 619–625, 2020, doi:
10.11591/eei.v9i2.1869.
[30] S. Hartini and Z. Rustam, “The comparison study of kernel KC-means and support vector machines for classifying schizophrenia,”
Telkomnika (Telecommunication Computing Electronics and Control), vol. 18, no. 3, pp. 1643–1649, 2020, doi:
10.12928/TELKOMNIKA.v18i3.14847.
[31] S. A. Taie and W. Ghonaim, “A new model for early diagnosis of alzheimer’s disease based on bat-SVM classifier,” Bulletin of
Electrical Engineering and Informatics, vol. 10, no. 2, pp. 759–766, 2021, doi: 10.11591/eei.v10i2.2714.
[32] O. Y. Ogundepo, I. O. A. Omeiza, and J. P. Oguntoye, “Optimized textural features for mass classification in digital mammography
using a weighted average gravitational search algorithm,” International Journal of Electrical and Computer Engineering, vol. 12,
no. 5, pp. 5001–5013, 2022, doi: 10.11591/ijece.v12i5.pp5001-5013.
[33] B. S. Shukur and M. M. Mijwil, “Involving machine learning techniques in heart disease diagnosis: a performance analysis,”
International Journal of Electrical and Computer Engineering, vol. 13, no. 2, pp. 2177–2185, 2023, doi:
10.11591/ijece.v13i2.pp2177-2185.
[34] W. Cao, R. Wu, G. Cao, and Z. He, “A comprehensive review of computer-aided diagnosis of pulmonary nodules based on
computed tomography scans,” IEEE Access, vol. 8, pp. 154007–154023, 2020, doi: 10.1109/ACCESS.2020.3018666.
[35] V. A. Binson and M. Subramoniam, “Advances in early lung cancer detection: A systematic review,” 2018 International Conference
on Circuits and Systems in Digital Enterprise Technology, ICCSDET 2018, 2018, doi: 10.1109/ICCSDET.2018.8821188.
[36] X. Li et al., “Multi-resolution convolutional networks for chest X-ray radiograph based lung nodule detection,” Artificial
Intelligence in Medicine, vol. 103, 2020, doi: 10.1016/j.artmed.2019.101744.
[37] S. Anitha and T. R. Ganesh Babu, “An efficient method for the detection of oblique fissures from computed tomography images of
lungs,” Journal of Medical Systems, vol. 43, no. 8, 2019, doi: 10.1007/s10916-019-1396-0.
[38] S. G. Armato et al., “The lung image database consortium (LIDC) and image database resource initiative (IDRI): A completed
reference database of lung nodules on CT scans,” Medical Physics, vol. 38, no. 2, pp. 915–931, 2011, doi: 10.1118/1.3528204.
[39] Y. R. Zhao, X. Xie, H. J. De Koning, W. P. Mali, R. Vliegenthart, and M. Oudkerk, “NELSON lung cancer screening study,”
Cancer Imaging, vol. 11, no. SPEC. ISS. A, 2011, doi: 10.1102/1470-7330.2011.9020.
[40] S. G. Armato et al., “The reference image database to evaluate response to therapy in lung cancer (RIDER) project: A resource for
the development of change-analysis software,” Clinical Pharmacology and Therapeutics, vol. 84, no. 4, pp. 448–456, 2008, doi:
10.1038/clpt.2008.161.
[41] J. Gong, J. yu Liu, L. jia Wang, X. wen Sun, B. Zheng, and S. dong Nie, “Automatic detection of pulmonary nodules in CT images
by incorporating 3D tensor filtering with local image feature analysis,” Physica Medica, vol. 46, pp. 124–133, 2018, doi:
10.1016/j.ejmp.2018.01.019.
[42] “NLST datasets,” [Online]. Available: https://cdas.cancer.gov/nlst/.
[43] M. Dolejsi, J. Kybic, M. Polovincak, and S. Tuma, “The lung TIME: Annotated lung nodule dataset and nodule detection
framework,” Medical Imaging 2009: Computer-Aided Diagnosis, vol. 7260, p. 72601U, 2009, doi: 10.1117/12.811645.
[44] H. C. Shin et al., “Deep convolutional neural networks for computer-aided detection: CNN architectures, dataset characteristics and
transfer learning,” IEEE Transactions on Medical Imaging, vol. 35, no. 5, pp. 1285–1298, 2016, doi: 10.1109/TMI.2016.2528162.
[45] L. Kinahan, P. Muzi, M. Bialecki, B. Herman, and B. Coombs, “Data from the ACRIN 6668 trial NSCLC-FDG-PET [Data set].
The cancer imaging archive,” 2019, doi: 10.7937/tcia.2019.30il.
[46] M. Hershman et al., “Impact of interobserver variability in manual segmentation of non‐small cell lung cancer (Nsclc) applying
low‐rank radiomic representation on computed tomography,” Cancers, vol. 13, no. 23, 2021, doi: 10.3390/cancers13235985.
[47] O. Grove et al., “Quantitative computed tomographic descriptors associate tumor shape complexity and intratumor heterogeneity
with prognosis in lung adenocarcinoma,” PLOS ONE, vol. 10, no. 3, p. e0118261, Mar. 2015, doi: 10.1371/journal.pone.0118261.
BIOGRAPHIES OF AUTHORS
Indah Soesanti received the M.Eng. and Ph.D. degrees from the Department of
Electrical Engineering, Universitas Gadjah Mada, Yogyakarta, Indonesia. She is currently a
Lecturer with the Department of Electrical and Information Engineering, Faculty of
Engineering, Universitas Gadjah Mada. Her research interests include artificial intelligence,
optimization, signal processing, image processing, information systems, ICT-based systems,
biomedical engineering, and pattern classification. She can be contacted at
[email protected].
Noor Akhmad Setiawan received his Bachelor and Master degree in Electrical
Engineering from Universitas Gadjah Mada in 1998 and 2003 respectively. He received his
PhD degree in Electrical and Electronics Engineering from Universiti Teknologi
PETRONAS in 2009. He is with the Department of Electrical and Information Engineering
Universitas Gadjah Mada. His research interest includes machine learning, soft computing,
data mining, big data, medical engineering and informatics, and electrical engineering. He is
a member of IEEE, ACM, IRSS, IAENG and IACSIT. He can be contacted by email at
[email protected].
A review of convolutional neural network-based computer-aided lung nodule detection system (Sekar Sari)