1 Department of Computing, Electronics and Mechatronics, Universidad de las Américas Puebla, Sta. Catarina
Martir, San Andrés Cholula 72810, Mexico; [email protected]
2 Department of Immunology, Instituto de Investigaciones Biomédicas, Universidad Nacional Autónoma de
México, Mexico City 04510, Mexico; [email protected]
* Correspondence: [email protected]
Abstract: Breast cancer is one of the leading causes of death for women worldwide, and early
detection can help reduce the death rate. Infrared thermography has gained popularity as a non-
invasive and rapid method for detecting this pathology and can be further enhanced by applying
neural networks to extract spatial and even temporal data derived from breast thermographic images
if they are acquired sequentially. In this study, we evaluated hybrid convolutional-recurrent neural
network (CNN-RNN) models based on five state-of-the-art pre-trained CNN architectures coupled
with three RNNs to discern tumor abnormalities in dynamic breast thermographic images. The
hybrid architecture that achieved the best performance for detecting breast cancer was VGG16-LSTM,
which showed accuracy (ACC), sensitivity (SENS), and specificity (SPEC) of 95.72%, 92.76%, and
98.68%, respectively, with a CPU runtime of 3.9 s. However, the hybrid architecture that showed
the fastest CPU runtime was AlexNet-RNN with 0.61 s, although with lower performance (ACC:
80.59%, SENS: 68.52%, SPEC: 92.76%), but still superior to AlexNet (ACC: 69.41%, SENS: 52.63%,
SPEC: 86.18%) with 0.44 s. Our findings show that hybrid CNN-RNN models outperform stand-alone
CNN models, indicating that temporal data recovery from dynamic breast thermographs is possible
without significantly compromising classifier runtime.
Figure 1. Diagram of the proposed methodology for binary breast cancer classification using hybrid
CNN-RNN-based deep learning models.
2.2. Dataset
The data were acquired from a public dataset from Antonio Pedro University Hospital known as the Database for Mastology Research with Infrared Image (DMR-IR) [41], comprising 267 healthy volunteers and 44 sick volunteers. This database contains thermal breast images that were acquired using static and dynamic protocols; however, in the present work, dynamic images were used to extract the desired temporal features. Thermal images were obtained using the FLIR SC-620 IR camera. This camera possesses an image resolution of 640 × 480 pixels with an image frequency of 30 Hz; the spectral range detected by this camera is 7.5–13 µm, and the temperature range is from −40 °C to 500 °C. As part of the DIT acquisition protocol, a fan was used to apply a thermal stimulus to the volunteer until the thorax temperature reached an average of 30.5 °C. A sequence of frontal images was then taken every 15 s for 5 min. The data were stored in a txt file containing the spatial temperature, in degrees Celsius, of the heat map captured by the thermal camera.
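As an illustration of this data format, the following minimal sketch (assuming each txt file holds a whitespace-separated 480 × 640 matrix of temperatures; the file name is hypothetical) loads one frame with NumPy and rescales it to an 8-bit grayscale image for later processing:

```python
import numpy as np

def load_thermogram(txt_path):
    """Load one frame stored as a whitespace-separated matrix of temperatures (degrees C)
    and min-max normalize it to an 8-bit grayscale image."""
    temps = np.loadtxt(txt_path)                                   # assumed shape: (480, 640)
    scaled = (temps - temps.min()) / (temps.max() - temps.min() + 1e-8)
    return (scaled * 255).astype(np.uint8)

# Hypothetical usage: frame = load_thermogram("volunteer_028/frame_01.txt")
```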
Figure 2. Sample grayscale thermograms from volunteers for a breast study: (a) The image is clear, so it is selected; (b) The image is blurry, so it is not selected; (c) The image contains material (bandaged breast) that covers the study region, so it is not selected.
2.4. Segmentation

A fully automated segmentation was implemented using the U-Net architecture to remove noise from thermographic images, such as necks, stomachs, and armpits, in accordance with the methodology described by Mohamed et al. [22]. The U-Net architecture was employed to reduce the time that would be required to segment each image manually with editing software in all the thermal images. It has been reported that this network can be trained using a limited number of samples [43]. This network is convenient for biomedical data due to the large number of feature channels in its layers, which increases the resolution of the output [44]. We conducted a sample number sweep to train the U-Net and obtain the number of frames needed to segment the breast thermal images properly, starting the model training at 20 images with 5-frame increments. The appropriate number of samples to train U-Net for automatic segmentation of breast thermal images was 40. To segment the training images, we used the open-source software ITK-SNAP (version 3.8) for interactive image visualization and semi-automatic segmentation of medical images to crop only the region of interest (ROI), which, in this case, is the breasts.

U-Net Architecture

The U-Net is a fully convolutional network that automatically segments medical images [45]; it comprises 23 convolutional layers distributed in two network steps. The first part consists of the contracting path, with a repeated convolution of 3 × 3 followed by a rectified linear unit (ReLU) and a max pooling for downsampling of the input. The next step is the expansive path; in this part, there is a concatenation with the feature maps of the contracting path to upsample the signal and crop the original image for the automated segmentation. The U-Net network is shown in Figure 3, where the convolutional layers are used to encode-decode the input data and crop the regions of the images in the trained network.
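A reduced sketch of this contracting/expansive structure in TensorFlow/Keras is shown below; the filter counts, depth, and input size are illustrative assumptions and not the exact configuration trained in this work:

```python
import tensorflow as tf
from tensorflow.keras import layers

def conv_block(x, filters):
    # Two 3x3 convolutions with ReLU, as used along both U-Net paths.
    x = layers.Conv2D(filters, 3, padding="same", activation="relu")(x)
    return layers.Conv2D(filters, 3, padding="same", activation="relu")(x)

def small_unet(input_shape=(224, 224, 1)):
    inputs = layers.Input(shape=input_shape)
    # Contracting path: convolution blocks followed by max pooling (downsampling).
    c1 = conv_block(inputs, 32)
    p1 = layers.MaxPooling2D()(c1)
    c2 = conv_block(p1, 64)
    p2 = layers.MaxPooling2D()(c2)
    b = conv_block(p2, 128)                     # bottleneck
    # Expansive path: upsampling and concatenation with matching feature maps.
    u2 = layers.Conv2DTranspose(64, 2, strides=2, padding="same")(b)
    c3 = conv_block(layers.Concatenate()([u2, c2]), 64)
    u1 = layers.Conv2DTranspose(32, 2, strides=2, padding="same")(c3)
    c4 = conv_block(layers.Concatenate()([u1, c1]), 32)
    # Per-pixel breast/background probability map used to crop the ROI.
    outputs = layers.Conv2D(1, 1, activation="sigmoid")(c4)
    return tf.keras.Model(inputs, outputs)

model = small_unet()
model.compile(optimizer="adam", loss="binary_crossentropy")
```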
Figure 3. U-Net architecture. The contracting path is on the left side of the U-shape, and the expanding path is on the right. The blue boxes represent multi-channel feature maps. The number of channels is indicated on the top of the box. The x-y size is shown at the bottom left edge of the box. An orange arrow indicates each operation.

2.5. Data Augmentation
Once the thermal images were cropped to the ROIs, the data augmentation was applied using four different transformations: a horizontal flip, a 15° rotation, a 30° rotation, and a 15% zoom. Data augmentation was implemented to obtain more samples and to train the hybrid DL model, since the ANNs require a large amount of data to function correctly. In this step, the sequences of images are adjusted to obtain an input tensor of 224 × 224 × 20 per patient. Figure 4 shows an example of the segmentation and data augmentation process for thermal images.
Figure 4. Example of a grayscale thermogram of the volunteer with ID 28: (a) selected image by data cleansing; (b) thermal image segmented using U-Net; (c) data augmentation using the transformations of horizontal flip, rotation 15°, rotation 30°, and zoom 15%.
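A minimal sketch of these four transformations using OpenCV is given below; the rotation and zoom helpers are illustrative assumptions rather than the exact pipeline used:

```python
import cv2

def rotate(img, angle_deg):
    # Rotate about the image center without rescaling.
    h, w = img.shape[:2]
    m = cv2.getRotationMatrix2D((w / 2, h / 2), angle_deg, 1.0)
    return cv2.warpAffine(img, m, (w, h))

def zoom(img, factor=1.15):
    # Enlarge by the zoom factor and center-crop back to the original size.
    h, w = img.shape[:2]
    zoomed = cv2.resize(img, None, fx=factor, fy=factor)
    y0, x0 = (zoomed.shape[0] - h) // 2, (zoomed.shape[1] - w) // 2
    return zoomed[y0:y0 + h, x0:x0 + w]

def augment(img):
    """Return the four augmented versions: horizontal flip, 15 deg and 30 deg rotations, 15% zoom."""
    return [cv2.flip(img, 1), rotate(img, 15), rotate(img, 30), zoom(img, 1.15)]
```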
2.6. Hybrid Deep Learning Model (CNN-RNN)

2.6.1. Convolutional Neural Network

CNNs have been studied to improve the performance of image classification, image recognition, object detection, and other tasks [46]. CNNs are the most widely used networks for visual image classification, as they allow the extraction of information from extensive data, such as images with pixels [47]. Currently, there are different imaging modalities for breast cancer diagnosis, such as mammography, ultrasound, and MRI, and the evaluation of these images is mainly performed with deep learning models such as CNNs [48].
The CNNs are composed of three main layers: the convolutional layer, the pooling
layer, and the fully connected layer. The convolutional layer consists of feature learning.
Once inputs are in the network, they are used to extract local characteristics from the image
at different positions. These convolutions are computed with a kernel to extract several
features depending on the values of these small matrices. The results of these convolutions
are passed into a nonlinear activation function, i.e., sigmoid, rectified linear unit (ReLU),
tanh, or softplus, to obtain a continuous signal [49]. The activation function is an essential
part of neural networks as it allows the output to be nonlinear and continuous, enabling
the training of the model for either classification or logistic regression [50]. The next main
computation is the pooling layer; it extracts features by reducing the dimensions of the
feature maps. The most common pooling operations are the average and the max pooling.
Finally, the fully connected layer connects all the previous values of the feature vector to
apply linear transformations to obtain the product after an activation function. For the
classification, the SoftMax regression is the most used in multiclass probability distribution.
There is also a procedure known as dropout. It consists of inhibiting a certain number of
neurons to retrain the network and ensure robust training. The process of feeding an input
into the neural network to obtain the probability distribution in the output layer is known
as forward propagation. When there is an error in the regression, this value is considered to
retrain the layer in the CNN architecture through the back propagation algorithm. Figure 5
shows the implementation of classifying tissue heterogeneity using CNN architectures. In
this figure, it is possible to visualize all the layers and the fully connected layers to obtain a
binary classification in the thermal images for the BC disease.
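To make these components concrete, the following small Keras classifier is a minimal sketch (the layer sizes are illustrative assumptions and do not correspond to the architectures evaluated in this work):

```python
import tensorflow as tf
from tensorflow.keras import layers, models

cnn = models.Sequential([
    layers.Input(shape=(224, 224, 1)),
    # Convolution + ReLU activation: local feature extraction at different positions.
    layers.Conv2D(16, 3, activation="relu", padding="same"),
    # Max pooling: reduces the dimensions of the feature maps.
    layers.MaxPooling2D(),
    layers.Conv2D(32, 3, activation="relu", padding="same"),
    layers.MaxPooling2D(),
    layers.Flatten(),
    # Fully connected layer with dropout for robust training.
    layers.Dense(64, activation="relu"),
    layers.Dropout(0.5),
    # SoftMax output turning scores into a class probability distribution.
    layers.Dense(2, activation="softmax"),
])
cnn.compile(optimizer="adam", loss="sparse_categorical_crossentropy", metrics=["accuracy"])
```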
Figure 5. CNN model for the binary classification of breast tissue heterogeneity (normal or abnormal) in thermographic images.
For image classification, there are five CNN architectures in the state-of-the-art: Inception-V3, VGG-16, ResNet101, GoogLeNet, and AlexNet [51]. These architectures were used for automated feature extraction in the hybrid DL model and for classification in the single DL model:
• Inception-v3: It consists of a network of 48 layers with approximately 24 million parameters to train. It was developed to improve the performance of the GoogLeNet architecture.
• VGG-16: This network consists of a structure of 16 layers, 13 convolutional layers and 3 fully connected layers. It is more accurate than the AlexNet architecture,
bases were used as feature extractors, with weights frozen during training to leverage
their pre-trained capabilities. In order to integrate the thermographic image sequences,
the input layer of the combined model was adapted to accept a sequence of 20 frames
per sample, each resized to 224 × 224 pixels and normalized to a range of [0, 1]. The
CNNs were implemented as part of a TimeDistributed layer, enabling feature extraction
from each frame independently before passing the extracted features to the RNN layers.
On the output side, the classification task was binary (healthy vs. sick), so the final layer
was modified to include a single dense neuron with a softmax activation function for
probabilistic binary classification. A sequential training process was used to train the CNN
and RNN components of the model. The extracted features from the CNNs were processed
by different types of RNNs, including simple RNNs, GRUs, and LSTM networks, to capture
temporal dependencies in the thermographic image sequences. Each RNN configuration
consisted of two stacked layers, each with 64 units. The first recurrent layer was configured
to return sequences, enabling the second layer to process the full temporal information
of the thermographic data. For the GRU and LSTM models, the forget and update gate
mechanisms allowed for effective learning of long-range dependencies while mitigating the
vanishing gradient problem typically encountered in standard RNNs. All RNNs utilized
the ReLU activation function for hidden units to improve stability during training. The
recurrent networks were trained from scratch, with random weight initialization provided
by TensorFlow using the Glorot uniform initialization method to ensure stable gradient
propagation. Input sequences consisted of 20 thermographic frames per patient, normalized
to a range of [0, 1]. The model was trained using the ADAM optimizer with a softmax
activation function in the last layer. The learning rate was set to 0.001 per iteration, with
a batch size of 16 samples. The number of epochs was set at 30, which ensured model
convergence while avoiding overfitting.
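A simplified sketch of this coupling for one of the combinations (VGG16 + LSTM) is shown below. It assumes grayscale frames replicated to three channels for the pre-trained backbone, and it uses a single-unit sigmoid output, which is the usual Keras equivalent of the single-neuron probabilistic output described above:

```python
import tensorflow as tf
from tensorflow.keras import layers, models

def build_vgg16_lstm(frames=20, size=224):
    # Frozen ImageNet-pretrained VGG16 used only as a per-frame feature extractor.
    backbone = tf.keras.applications.VGG16(include_top=False, weights="imagenet",
                                           input_shape=(size, size, 3), pooling="avg")
    backbone.trainable = False

    model = models.Sequential([
        layers.Input(shape=(frames, size, size, 3)),
        # TimeDistributed applies the CNN independently to each of the 20 frames.
        layers.TimeDistributed(backbone),
        # Two stacked recurrent layers with 64 units; the first returns sequences.
        layers.LSTM(64, activation="relu", return_sequences=True),
        layers.LSTM(64, activation="relu"),
        # Single output unit for the binary healthy/sick decision.
        layers.Dense(1, activation="sigmoid"),
    ])
    model.compile(optimizer=tf.keras.optimizers.Adam(learning_rate=0.001),
                  loss="binary_crossentropy", metrics=["accuracy"])
    return model

# Hypothetical training call with sequences shaped (n_samples, 20, 224, 224, 3):
# model = build_vgg16_lstm(); model.fit(x_train, y_train, batch_size=16, epochs=30)
```

Swapping the two LSTM layers for SimpleRNN or GRU layers would give the other recurrent variants evaluated here.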
To monitor the performance of the models during training, the validation accuracy
was evaluated at the end of each epoch. In this way, we were able to track the model’s
learning progress and detect overfitting potential. In Appendix A, Figures A1–A5 illustrate
the validation accuracy of the different models.
\[ \mathrm{IoU} = \frac{|A \cap B|}{|A \cup B|}, \tag{2} \]
Like the Dice coefficient, its values range from 0 to 1, where higher values indicate
better segmentation accuracy.
The metrics obtained were as follows:
• Average Dice coefficient: 0.9347 ± 0.0138.
• Average Jaccard Index: 0.8776 ± 0.0242.
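As a minimal illustration (assuming binary NumPy masks for the predicted segmentation and the ground truth), both overlap metrics can be computed as:

```python
import numpy as np

def dice_and_iou(pred, truth):
    """Dice coefficient and Jaccard index (IoU) for two binary segmentation masks."""
    pred, truth = pred.astype(bool), truth.astype(bool)
    inter = np.logical_and(pred, truth).sum()
    union = np.logical_or(pred, truth).sum()
    dice = 2 * inter / (pred.sum() + truth.sum() + 1e-8)
    iou = inter / (union + 1e-8)
    return dice, iou
```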
\[ \text{Accuracy} = \frac{TP + TN}{TP + TN + FP + FN}, \tag{3} \]
\[ \text{Sensitivity} = \frac{TP}{TP + FN}, \tag{4} \]
\[ \text{Specificity} = \frac{TN}{TN + FP} \tag{5} \]
where TP is the prediction of a sample for the sick class when the real class is sick, TN is for
a healthy predicted class when the real class is healthy, FP is the prediction of a sick class
when the real is healthy, and FN is for the prediction of a healthy class when the real class
is sick. The metrics were computed from the resultant confusion matrix. A CPU execution
time was also calculated, which shows the time in seconds required to predict a class using
a single and hybrid DL model, considering the input complexity, CNN architecture, and
system requirements.
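A small sketch of these computations from the confusion matrix, using scikit-learn (which is among the libraries listed below; variable names are illustrative), is:

```python
from sklearn.metrics import confusion_matrix

def classification_metrics(y_true, y_pred):
    """Accuracy, sensitivity, and specificity from binary labels (1 = sick, 0 = healthy)."""
    tn, fp, fn, tp = confusion_matrix(y_true, y_pred, labels=[0, 1]).ravel()
    accuracy = (tp + tn) / (tp + tn + fp + fn)
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    return accuracy, sensitivity, specificity
```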
In this methodology, leave-one-out cross-validation (LOOCV) was employed to assess
the model’s performance. For each iteration, the model was trained using all data samples
except one, which was reserved as the test set. This procedure was repeated for every
sample in the dataset, ensuring that each thermal image sequence was used as a test
case once. The performance metrics from all iterations were then averaged to obtain a
comprehensive evaluation of the model’s generalization capability.
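A hedged sketch of this procedure using scikit-learn's LeaveOneOut (assuming NumPy arrays of sequences and labels; build_model is a placeholder for any of the model constructors above) could look like this:

```python
import numpy as np
from sklearn.model_selection import LeaveOneOut

def loocv_accuracy(x, y, build_model, epochs=30, batch_size=16):
    """Train on all sequences but one, test on the held-out sequence, and average."""
    scores = []
    for train_idx, test_idx in LeaveOneOut().split(x):
        model = build_model()                              # fresh model for each fold
        model.fit(x[train_idx], y[train_idx], epochs=epochs,
                  batch_size=batch_size, verbose=0)
        pred = (model.predict(x[test_idx]) > 0.5).astype(int).ravel()
        scores.append(float(pred[0] == y[test_idx][0]))
    return np.mean(scores)                                 # averaged over all folds
```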
RAM: 8 GB
CPU: 1.60 GHz processor, Core-i5, 8th Gen
GPU: Nvidia 1050
Language: Python 3.8
OS: 64-bit Windows
Libraries: NumPy, Pandas, OpenCV, Scikit-learn, TensorFlow
3. Results
The CNN architectures—Inception-v3, VGG16, ResNet101, GoogLeNet, and
AlexNet—were assessed when coupled with RNN, LSTM, and GRU to classify abnor-
malities in the breasts with thermal images in sequence. The DL models were evaluated
through performance metrics, and the validation used was the LOOCV.
The viability of selecting several datasets to acquire more samples was studied, but
none of them included the DIT acquisition approach. In the Visual Lab DMR dataset,
the number of sequences obtained was 38 for each class, resulting from the maximum
number of samples in the class labelled as sick. Moreover, the data were balanced with a
random sampling of 38 sequences from volunteers labelled as the healthy class. Balancing
the data was performed to train the model with the same number of samples from each
class. In addition, the data was increased with the transformations of data augmentation
techniques because of the small number of samples in the training of DL models using a
horizontal flip, a 15◦ rotation, a 30◦ rotation, and a 15% zoom. Table 2 depicts the number of
thermal images in sequence from the healthy and sick classes using filtered and augmented
data, and Table 3 shows the performance from the single and hybrid DL models, whose
metric values are derived from the confusion matrices of each model (see Appendix B,
Figures A6–A10).
Table 2. Number of thermal images acquired after application of the filters and transformations.
                      Healthy    Sick
Data Cleansing           38        38
Data Augmentation       152       152
Table 3. Performance metrics and CPU execution time of the evaluated CNN architectures coupled
with RNNs or classifying with the fully connected layer.
The proposed CNN-RNN binary classifier obtained the highest metrics when VGG16
is used with the LSTM layers, reaching a total of 95.72%, 92.76%, and 98.68% in accuracy,
sensitivity, and specificity, respectively (Table 3). On the other hand, the worst performance
was achieved from the single DL model AlexNet with 69.41%, 52.63%, and 86.18% in
accuracy, sensitivity, and specificity, respectively. According to Table 3, the architecture with
the fastest CPU execution time is the AlexNet, a single CNN, with a time of 0.45 s. However,
its performance metrics are below those of other models, with accuracy, sensitivity, and
specificity of 69.41%, 52.63%, and 86.18%, respectively. On the other hand, the model that
obtained the best performance metrics (VGG16-LSTM) takes almost nine times longer in
CPU execution time than the single AlexNet model.
Additionally, the results show that coupling any of the CNN architectures used in this work into an LSTM-based hybrid DL model increases the performance metrics compared with the single DL model or the remaining RNN cells; moreover, all the hybrid DL models tested achieved better performance than the single DL models (see Table 4).
Table 4. Performance metrics of the model coupled after the CNN architectures. Each value represents
the mean of each model from Table 3.
Figure 6 presents a visual representation of the performance metrics with the different
DL models. Here, we compare the results from various CNN architectures with their
respective RNN cells or the FCL. Figure 7 shows a comparison of the CPU execution time
in seconds of the different models evaluated, either with the deep learning model alone or
coupled to an RNN.
Figure 6. Performance evaluation (accuracy, sensitivity, and specificity) of the different hybrid CNN-RNN architectures to classify the presence or absence of a tumor in breast thermographic images: (a) The independent CNN model; (b) The hybrid CNN-RNN model; (c) The hybrid CNN-LSTM model; (d) The hybrid CNN-GRU model. Inception-V3, VGG16, ResNet101, GoogLeNet, and AlexNet are the five CNN models that are coupled to the three sequential networks (RNN, LSTM, and GRU).
Figure 7. CPU execution time of different coupled CNN-RNN deep learning architectures for breast
cancer classification in images acquired using the DIT acquisition protocol.
The performance metrics calculated by the single CNN models, as well as by CNN models coupled with RNN, LSTM, and GRU, were resampled using the bootstrap method.
This approach allowed for the calculation of confidence intervals for each class. The results
showed a confidence interval of 69.08 to 71.5 for the simple CNN model, while for the
coupled models, the intervals were 72.7 to 80.92, 85.53 to 95.72, and 76.32 to 89.8 for CNN
with RNN, LSTM, and GRU, respectively. An ANOVA (analysis of variance) was performed
to assess whether there were significant differences between the performance metrics of
the models. ANOVA is a statistical method used to compare the means of three or more
groups to determine if at least one group differs significantly from the others. In this case, the p-value was far below the common significance threshold of 0.05, indicating a highly significant difference between the models.
Therefore, we can confidently reject the null hypothesis and conclude that the models have
statistically different performances.
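For illustration, the confidence intervals and the ANOVA can be reproduced along these lines (a sketch assuming per-fold accuracy arrays for each model family; the percentile bootstrap on the mean is one common choice and is an assumption here):

```python
import numpy as np
from scipy.stats import f_oneway

def bootstrap_ci(values, n_boot=10000, alpha=0.05, seed=0):
    """Percentile bootstrap confidence interval for the mean of a metric."""
    rng = np.random.default_rng(seed)
    means = [rng.choice(values, size=len(values), replace=True).mean()
             for _ in range(n_boot)]
    return np.percentile(means, [100 * alpha / 2, 100 * (1 - alpha / 2)])

# Hypothetical per-fold accuracies for each model family (placeholders):
# acc_cnn, acc_rnn, acc_lstm, acc_gru = ...
# print(bootstrap_ci(acc_lstm))
# f_stat, p_value = f_oneway(acc_cnn, acc_rnn, acc_lstm, acc_gru)  # one-way ANOVA
```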
4. Discussion
In this study, breast tissue thermographic image sequences were assessed using a
hybrid DL model to identify abnormalities that may indicate BC disease. The hybrid model
incorporates a CNN to extract spatial features, a RNN to extract temporal features, and a
fully connected layer to determine whether the samples belong to a healthy or sick patient.
The few studies on breast cancer classification that use the dynamic acquisition protocol for thermal imaging with machine learning and deep learning models [11,20,58–60] have shown a lower false negative rate than SIT. In the last decade, neural networks have attracted much attention from researchers due to the increase in computational capabilities
and their application in the detection of complex patterns automatically, as in the case of
thermal imaging [12]. For instance, Ekici and Jawzal [9] developed software to extract
breast features based on bio-data, image analysis, and image statistics. A CNN model opti-
mized using the Bayes algorithm was used to classify the features, resulting in an accuracy
of 98.95%. However, this metric is not adequate since they worked with an unbalanced
database, and the CNN architecture does not provide reproducibility information. In the
study by Cabıoğlu and Oğul [61], it was shown that by performing transfer learning on
the AlexNet architecture, the accuracy for classifying breast thermal images can increase
from 89.5% to 94.3% if the database is balanced. However, there was no segmentation
process in the images, which increases noise caused by non-interest regions [22]. Although
CNNs have gained prominence due to their ability to extract features through pixel-based
pattern recognition [22], RNNs perform more effectively when images are sequenced (over
time-captured images) [22], making them ideal for temporal feature extraction from images
acquired by DIT.
Several studies have reported the use of coupled CNN + RNN networks for the
classification of breast cancer disease in different imaging modalities. Wang et al. [34]
assessed breast histological images using a CNN + GRU model and obtained an accuracy of
86.21%, while a single DL model achieved an accuracy of 80%. A later study conducted by
Srikantamurthy et al. [35] reported that for binary classification of histopathology images
of BC, the single DL model had an accuracy of 98.6%, while the hybrid DL model of CNN-
RNN reached an accuracy of 99.75%. Likewise, Atrey et al. [37] applied hybrid models
(CNN + LSTM) to dual-modality mammography and ultrasound images to improve early
detection of breast cancer, leading to an increase in classification accuracy from 88.73%
to 99.35%.
In this context, this study evaluated the efficiency of coupled deep learning models
based on convolutional and recurrent neural networks for classifying breast cancer disease
in thermal images obtained by the DIT acquisition protocol. Our findings indicate that
coupled models can improve the accuracy of dynamic breast thermographic images, since
the accuracy of the stand-alone CNN model (single CNN) was 70.56%, while CNN + RNN,
CNN + GRU, and CNN + LSTM were 76.84%, 82.23%, and 88.56%, respectively (see Table 4).
The LSTM model performed best when coupled with a pretrained CNN model, which
corresponds to a hybrid VGG16-LSTM architecture. However, it has been reported that
this type of sequential network is computationally expensive when compared to RNNs or
GRUs [35]. Therefore, we have addressed not only the performance metrics for classification
but also the CPU execution time associated with binary classification to compare the
different combinations of coupled models between the pre-trained CNN architectures and
the three sequential architectures (RNN, LSTM, and GRU) (see Figure 6). Thus, the hybrid
VGG16-LSTM architecture, which developed the best performance metrics (Acc = 95.72%,
Sens = 92.76%, Spec = 98.68%), showed a CPU execution time of 3.89 s, making it the
second hybrid architecture that required the most CPU time to complete the classification
process (the ResNet101-LSTM model took 4.13 s) (see Table 3). This result is due to the
higher number of parameters and layers in VGG16 and ResNet101, unlike Inception-v3,
AlexNet, and GoogLeNet [35]. These results are consistent with models that took less time,
such as AlexNet, which had a CPU execution time of 0.44 s. However, the classification
statistics of this stand-alone CNN model are lower (ACC: 69.41%, SENS: 52.63%, SPEC:
86.18%) than other models (see Table 3). This same pre-trained CNN architecture coupled
with the LSTM network improved the classification performance (ACC: 85.53%, SENS:
74.34%, SPEC: 96.71%) as well as CPU execution time (1.16 s). Additionally, the stand-alone
CNN architecture, known as GoogLeNet (ACC: 72.70%, SENS: 55.26%, SPEC: 90.13%),
also demonstrated high classification performance when combined with the sequential
LSTM neural network (ACC: 94.08%, SENS: 90.13%, SPEC: 98.03%), requiring only 0.15 s of
additional CPU execution time over the single CNN model (GoogLeNet). This added time is negligible
when compared to the increase in binary classification performance metrics for determining
whether a breast thermographic image contains a tumor.
A limitation of this study is the relatively small dataset, which comprises only 38 se-
quences per class after balancing. While data augmentation is a commonly used approach
to expand sample size and reduce overfitting [39], medical images present a unique chal-
lenge due to their inherent complexity and variability. As a result of these characteristics,
more advanced techniques are required to ensure that the model can effectively capture
the specific features and variability of medical conditions. One possible solution is to use
deep generative models, such as VAEs, GANs, and DMs, which have shown promise in
generating realistic, diverse images that can improve training by better representing the
underlying distribution of the dataset [62].
In the present study, however, the limitation of the small data set persists, since our
approach involves analyzing sequential thermography images, and the only dataset with
such images (DIT) is the DMR dataset from Visual Lab. In view of restricted access to
patient data and the complexity of collecting thermal imaging data, we were not able to
create a larger dataset. Thus, the current model is not suitable for widespread clinical
application. Nevertheless, with further data collection, this model may contribute to the
early detection of breast cancer by aiding clinicians in identifying areas of concern in
thermal images, along with other diagnostic tools. The integration of this model into
existing clinical workflows is also a critical issue. In spite of the fact that our model has
not yet been applied in clinical settings, we consider it to be a potential supplementary
tool for radiologists and clinicians. It may be possible to provide additional insights into
breast cancer diagnosis by analyzing thermal images alongside other diagnostic methods.
However, it would be necessary to address a number of issues to make the model suitable
for clinical use, including the processing of real-time data, the design of user interfaces, and
the compatibility with existing medical technology.
5. Conclusions
Deep learning plays an important role in detecting complex patterns in medical im-
ages, making them more reliable, accurate, and faster for diagnosing diseases. In this
study, we address the challenge of analyzing sequential thermal images of the breast using
hybrid deep learning models. Unlike static protocols, which capture steady-state images
at a single point in time, our approach benefits from additional information obtained over time
through dynamic acquisition. A comprehensive evaluation of stand-alone and coupled
deep learning models using pre-trained CNN architectures and RNN cells to classify se-
quential thermal breast images revealed that the best architecture for classification was
VGG16 + LSTM. However, other coupled models, such as GoogLeNet and AlexNet with
LSTM, achieved high classification accuracy with a shorter CPU execution time than the more accurate VGG16 + LSTM. The findings suggest that coupled CNN-RNN deep learning
models improve classification performance in thermographic breast images obtained by
dynamic acquisition protocol without significantly affecting the execution time to distin-
guish normal or abnormal breast tissue, making it a promising option for preventative
breast cancer diagnosis with a reasonable time to obtain its result. This suggests that
hybrid deep learning models may be implemented in dynamic breast thermography so
that spatial (with CNN models) and temporal (with sequential models) features can be
extracted for subsequent radiological assessment to determine whether tumor tissue
exists or is absent. It would be interesting to investigate optimizing features extracted from
thermal images in sequence to reduce the computational cost since neural networks require
systems to support model computations, particularly when training.
Author Contributions: Conceptualization, A.M.-S., J.H.E.-R. and I.V.; methodology, A.M.-S. and
J.H.E.-R.; validation, A.M.-S.; investigation, A.M.-S. and J.H.E.-R.; writing—original draft preparation,
A.M.-S. and J.H.E.-R.; writing—review and editing, A.M.-S., J.H.E.-R. and I.V.; supervision, J.H.E.-R.
All authors have read and agreed to the published version of the manuscript.
Funding: This research received no external funding.
Institutional Review Board Statement: Not applicable.
Informed Consent Statement: This article does not contain any studies with human participants or
animals performed by any of the authors.
Data Availability Statement: No new data were created or analyzed in this study. Data sharing is
not applicable to this article.
Acknowledgments: A.M.-S. wishes to acknowledge the support of the National Council of Humani-
ties, Sciences and Technologies (CONAHCyT) of Mexico and Universidad de las Américas Puebla
(UDLAP) for his PhD scholarship.
Appendix A
Validation Accuracy
Comparison of validation accuracy over epochs for various combinations of CNNs and RNNs. The architectures include Inception-V3, VGG16, ResNet101, GoogLeNet, and AlexNet, each coupled with RNN, LSTM, and GRU layers. The graphs illustrate the performance of each model in capturing both spatial and temporal patterns from breast cancer thermographic image sequences, highlighting the differences in convergence rates and overall accuracy.

Figure A1. Validation accuracy of InceptionV3 using (a) single CNN; (b) coupled RNN; (c) coupled LSTM; and (d) coupled GRU.

Figure A2. Validation accuracy of VGG16 using (a) single CNN; (b) coupled RNN; (c) coupled LSTM; and (d) coupled GRU.

Figure A3. Validation accuracy of ResNet101 using (a) single CNN; (b) coupled RNN; (c) coupled LSTM; and (d) coupled GRU.

Figure A4. Validation accuracy of AlexNet using (a) single CNN; (b) coupled RNN; (c) coupled LSTM; and (d) coupled GRU.

Figure A5. Validation accuracy of GoogLeNet using (a) single CNN; (b) coupled RNN; (c) coupled LSTM; and (d) coupled GRU.

Appendix B

Confusion Matrix

Confusion matrices illustrating the performance of various CNN-RNN architectures on the classification of breast cancer thermographic image sequences. The models include combinations of Inception-V3, VGG16, ResNet101, GoogLeNet, and AlexNet with RNN, LSTM, and GRU layers. Each matrix provides a detailed breakdown of true positive, true negative, false positive, and false negative predictions, showcasing the ability of each architecture to correctly classify healthy and diseased cases.

Figure A7. Confusion matrix of VGG16 using (a) single CNN; (b) coupled RNN; (c) coupled LSTM; and (d) coupled GRU.

Figure A8. Confusion matrix of ResNet101 using (a) single CNN; (b) coupled RNN; (c) coupled LSTM; and (d) coupled GRU.

Figure A9. Confusion matrix of GoogLeNet using (a) single CNN; (b) coupled RNN; (c) coupled LSTM; and (d) coupled GRU.

Figure A10. Confusion matrix of AlexNet using (a) single CNN; (b) coupled RNN; (c) coupled LSTM; and (d) coupled GRU.
References
1. GLOBOCAN Cancer Today. Available online: https://gco.iarc.fr/today/en (accessed on 6 August 2024).
2. Singh, D.; Singh, A.K. Role of Image Thermography in Early Breast Cancer Detection-Past, Present and Future. Comput. Methods Programs Biomed. 2020, 183, 105074. [CrossRef] [PubMed]
3. Mahoro, E.; Akhloufi, M.A. Breast Cancer Classification on Thermograms Using Deep CNN and Transformers. Quant. Infrared Thermogr. J. 2024, 21, 30–49. [CrossRef]
4. Gonzalez-Hernandez, J.L.; Recinella, A.N.; Kandlikar, S.G.; Dabydeen, D.; Medeiros, L.; Phatak, P. Technology, Application and Potential of Dynamic Breast Thermography for the Detection of Breast Cancer. Int. J. Heat Mass Transf. 2019, 131, 558–573. [CrossRef]
5. Tsietso, D.; Yahya, A.; Samikannu, R. A Review on Thermal Imaging-Based Breast Cancer Detection Using Deep Learning. Mob. Inf. Syst. 2022, 2022, 8952849. [CrossRef]
6. Rodrigues, A.L.; de Santana, M.A.; Azevedo, W.W.; Bezerra, R.S.; Barbosa, V.A.F.; de Lima, R.C.F.; dos Santos, W.P. Identification of Mammary Lesions in Thermographic Images: Feature Selection Study Using Genetic Algorithms and Particle Swarm Optimization. Res. Biomed. Eng. 2019, 35, 213–222. [CrossRef]
7. Gershenson, M.; Gershenson, J. Dynamic Vascular Imaging Using Active Breast Thermography. Sensors 2023, 23, 3012. [CrossRef] [PubMed]
8. Lozano, A.; Hassanipour, F. Infrared Imaging for Breast Cancer Detection: An Objective Review of Foundational Studies and Its Proper Role in Breast Cancer Screening. Infrared Phys. Technol. 2019, 97, 244–257. [CrossRef]
9. Ekici, S.; Jawzal, H. Breast Cancer Diagnosis Using Thermography and Convolutional Neural Networks. Med. Hypotheses 2020, 137, 109542. [CrossRef] [PubMed]
10. Mashekova, A.; Zhao, Y.; Ng, E.Y.K.; Zarikas, V.; Fok, S.C.; Mukhmetov, O. Early Detection of the Breast Cancer Using Infrared Technology—A Comprehensive Review. Therm. Sci. Eng. Prog. 2022, 27, 101142. [CrossRef]
11. Ohashi, Y.; Uchida, I. Applying Dynamic Thermography in the Diagnosis of Breast Cancer: Techniques for Improving Sensitivity of Breast Thermography. IEEE Trans. Biomed. Eng. 2000, 47, 42–51. [CrossRef]
12. D’Alessandro, G.; Tavakolian, P.; Sfarra, S. A Review of Techniques and Bio-Heat Transfer Models Supporting Infrared Thermal Imaging for Diagnosis of Malignancy. Appl. Sci. 2024, 14, 1603. [CrossRef]
13. Rautela, K.; Kumar, D.; Kumar, V. A Systematic Review on Breast Cancer Detection Using Deep Learning Techniques. Arch. Comput. Methods Eng. 2022, 29, 4599–4629. [CrossRef]
14. Olota, M.; Alsadoon, A.; Alsadoon, O.H.; Dawoud, A.; Prasad, P.W.C.; Islam, R.; Jerew, O.D. Modified Anisotropic Diffusion and Level-Set Segmentation for Breast Cancer. Multimed. Tools Appl. 2024, 83, 13503–13525. [CrossRef]
15. Acharya, U.R.; Ng, E.Y.K.; Tan, J.H.; Sree, S.V. Thermography Based Breast Cancer Detection Using Texture Features and Support Vector Machine. J. Med. Syst. 2012, 36, 1503–1510. [CrossRef] [PubMed]
16. de Santana, M.A.; Pereira, J.M.S.; da Silva, F.L.; de Lima, N.M.; de Sousa, F.N.; de Arruda, G.M.S.; de Lima, R.d.C.F.; de Silva, W.W.A.; dos Santos, W.P. Breast Cancer Diagnosis Based on Mammary Thermography and Extreme Learning Machines. Res. Biomed. Eng. 2018, 34, 45–53. [CrossRef]
17. Gaber, T.; Ismail, G.; Anter, A.; Soliman, M.; Ali, M.; Semary, N.; Hassanien, A.E.; Snasel, V. Thermogram Breast Cancer Prediction Approach Based on Neutrosophic Sets and Fuzzy C-Means Algorithm. In Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS 2015, Milan, Italy, 25–29 August 2015.
18. Sánchez-Ruiz, D.; Olmos-Pineda, I.; Olvera-López, J.A. Automatic Region of Interest Segmentation for Breast Thermogram Image
Classification. Pattern Recognit. Lett. 2020, 135, 72–81. [CrossRef]
19. Kufel, J.; Bargieł-Łączek, K.; Kocot, S.; Koźlik, M.; Bartnikowska, W.; Janik, M.; Czogalik, Ł.; Dudek, P.; Magiera, M.; Lis, A.; et al.
˛ K.; Kocot, S.; Koźlik, M.; Bartnikowska, W.; Janik, M.; Czogalik, Ł.; Dudek, P.; Magiera, M.; Lis, A.; et al.
What Is Machine Learning, Artificial Neural Networks and Deep Learning?—Examples of Practical Applications in Medicine.
Diagnostics 2023, 13, 2582. [CrossRef] [PubMed]
20. Farooq, M.A.; Corcoran, P. Infrared Imaging for Human Thermography and Breast Tumor Classification Using Thermal Images.
In Proceedings of the 2020 31st Irish Signals and Systems Conference, ISSC 2020, Letterkenny, Ireland, 11–12 June 2020.
21. Ensafi, M.; Keyvanpour, M.R.; Shojaedini, S.V. A New Method for Promote the Performance of Deep Learning Paradigm in
Diagnosing Breast Cancer: Improving Role of Fusing Multiple Views of Thermography Images. Health Technol. 2022, 12, 1097–1107.
[CrossRef]
22. Mohamed, E.A.; Rashed, E.A.; Gaber, T.; Karam, O. Deep Learning Model for Fully Automated Breast Cancer Detection System
from Thermograms. PLoS ONE 2022, 17, e0262349. [CrossRef]
23. Jafari, Z.; Karami, E. Breast Cancer Detection in Mammography Images: A CNN-Based Approach with Feature Selection.
Information 2023, 14, 410. [CrossRef]
24. Yadav, S.S.; Jadhav, S.M. Deep Convolutional Neural Network Based Medical Image Classification for Disease Diagnosis. J. Big
Data 2019, 6, 113. [CrossRef]
25. Goncalves, C.B.; Souza, J.R.; Fernandes, H. Classification of Static Infrared Images Using Pre-Trained CNN for Breast Cancer
Detection. In Proceedings of the IEEE Symposium on Computer-Based Medical Systems, Aveiro, Portugal, 7–9 June 2021.
26. Fourcade, A.; Khonsari, R.H. Deep Learning in Medical Image Analysis: A Third Eye for Doctors. J. Stomatol. Oral. Maxillofac.
Surg. 2019, 120, 279–288. [CrossRef] [PubMed]
27. Khandakar, A.; Chowdhury, M.E.H.; Reaz, M.B.I.; Ali, S.H.M.; Kiranyaz, S.; Rahman, T.; Chowdhury, M.H.; Ayari, M.A.; Alfkey,
R.; Bakar, A.A.A.; et al. A Novel Machine Learning Approach for Severity Classification of Diabetic Foot Complications Using
Thermogram Images. Sensors 2022, 22, 4249. [CrossRef] [PubMed]
28. Yoo, H.; Han, S.; Chung, K. Diagnosis Support Model of Cardiomegaly Based on CNN Using ResNet and Explainable Feature
Map. IEEE Access 2021, 9, 55802–55813. [CrossRef]
29. Barnawi, A.; Chhikara, P.; Tekchandani, R.; Kumar, N.; Alzahrani, B. Artificial Intelligence-Enabled Internet of Things-Based
System for COVID-19 Screening Using Aerial Thermal Imaging. Future Gener. Comput. Syst. 2021, 124, 119–132. [CrossRef]
[PubMed]
30. Grigore, M.A.; Neagoe, V.E. A Deep CNN Approach Using Thermal Imagery for Breast Cancer Diagnosis. In Proceedings of the
13th International Conference on Electronics, Computers and Artificial Intelligence, ECAI 2021, Pitesti, Romania, 1–3 July 2021.
31. Li, F.; Liu, M. A Hybrid Convolutional and Recurrent Neural Network for Hippocampus Analysis in Alzheimer’s Disease. J.
Neurosci. Methods 2019, 323, 108–118. [CrossRef] [PubMed]
32. Patil, R.S.; Biradar, N. Automated Mammogram Breast Cancer Detection Using the Optimized Combination of Convolutional
and Recurrent Neural Network. Evol. Intell. 2021, 14, 1459–1474. [CrossRef]
33. Soni, K.M.; Gupta, A.; Jain, T. Supervised Machine Learning Approaches for Breast Cancer Classification and a High Performance
Recurrent Neural Network. In Proceedings of the 3rd International Conference on Inventive Research in Computing Applications,
ICIRCA 2021, Coimbatore, India, 2–4 September 2021.
34. Wang, X.; Ahmad, I.; Javeed, D.; Zaidi, S.A.; Alotaibi, F.M.; Ghoneim, M.E.; Daradkeh, Y.I.; Asghar, J.; Eldin, E.T. Intelligent
Hybrid Deep Learning Model for Breast Cancer Detection. Electronics 2022, 11, 2767. [CrossRef]
35. Srikantamurthy, M.M.; Rallabandi, V.P.S.; Dudekula, D.B.; Natarajan, S.; Park, J. Classification of Benign and Malignant Subtypes
of Breast Cancer Histopathology Imaging Using Hybrid CNN-LSTM Based Transfer Learning. BMC Med. Imaging 2023, 23, 19.
[CrossRef]
36. Ahmad, S.; Ullah, T.; Ahmad, I.; Al-Sharabi, A.; Ullah, K.; Khan, R.A.; Rasheed, S.; Ullah, I.; Uddin, M.N.; Ali, M.S. A Novel
Hybrid Deep Learning Model for Metastatic Cancer Detection. Comput. Intell. Neurosci. 2022, 2022, 8141530. [CrossRef]
37. Atrey, K.; Singh, B.K.; Bodhey, N.K.; Bilas Pachori, R. Mammography and Ultrasound Based Dual Modality Classification of
Breast Cancer Using a Hybrid Deep Learning Approach. Biomed. Signal Process Control 2023, 86, 104919. [CrossRef]
38. Zhao, T.; Fu, C.; Song, W.; Sham, C.W. RGGC-UNet: Accurate Deep Learning Framework for Signet Ring Cell Semantic
Segmentation in Pathological Images. Bioengineering 2024, 11, 16. [CrossRef]
39. Salehi, A.W.; Khan, S.; Gupta, G.; Alabduallah, B.I.; Almjally, A.; Alsolai, H.; Siddiqui, T.; Mellit, A. A Study of CNN and Transfer
Learning in Medical Imaging: Advantages, Challenges, Future Scope. Sustainability 2023, 15, 5930. [CrossRef]
40. Mohammed, F.A.; Tune, K.K.; Assefa, B.G.; Jett, M.; Muhie, S. Medical Image Classifications Using Convolutional Neural
Networks: A Survey of Current Methods and Statistical Modeling of the Literature. Mach. Learn. Knowl. Extr. 2024, 6, 699–735.
[CrossRef]
41. Silva, L.F.; Saade, D.C.M.; Sequeiros, G.O.; Silva, A.C.; Paiva, A.C.; Bravo, R.S.; Conci, A. A New Database for Breast Research
with Infrared Image. J. Med. Imaging Health Inform. 2014, 4, 92–100. [CrossRef]
42. Sánchez-Cauce, R.; Pérez-Martín, J.; Luque, M. Multi-Input Convolutional Neural Network for Breast Cancer Detection Using
Thermal Images and Clinical Data. Comput. Methods Programs Biomed. 2021, 204, 106045. [CrossRef]
43. Siddique, N.; Paheding, S.; Elkin, C.P.; Devabhaktuni, V. U-Net and Its Variants for Medical Image Segmentation: A Review of
Theory and Applications. IEEE Access 2021, 9, 82031–82057. [CrossRef]
44. Du, G.; Cao, X.; Liang, J.; Chen, X.; Zhan, Y. Medical Image Segmentation Based on U-Net: A Review. J. Imaging Sci. Technol. 2020,
64, jist0710. [CrossRef]
45. Ronneberger, O.; Fischer, P.; Brox, T. U-Net: Convolutional Networks for Biomedical Image Segmentation. In Medical Image
Computing and Computer-Assisted Intervention—MICCAI 2015, Proceedings of the 18th International Conference, Munich, Germany,
5–9 October 2015; Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture
Notes in Bioinformatics); Springer: Cham, Switzerland, 2015; Volume 9351.
46. Guo, T.; Dong, J.; Li, H.; Gao, Y. Simple Convolutional Neural Network on Image Classification. In Proceedings of the 2017 IEEE
2nd International Conference on Big Data Analysis, ICBDA 2017, Beijing, China, 10–12 March 2017.
47. Jalloul, R.; Chethan, H.K.; Alkhatib, R. A Review of Machine Learning Techniques for the Classification and Detection of Breast
Cancer from Medical Images. Diagnostics 2023, 13, 2460. [CrossRef] [PubMed]
48. Mahoro, E.; Akhloufi, M.A. Applying Deep Learning for Breast Cancer Detection in Radiology. Curr. Oncol. 2022, 29, 8767–8793.
[CrossRef] [PubMed]
49. Wang, Y.; Li, Y.; Song, Y.; Rong, X. The Influence of the Activation Function in a Convolution Neural Network Model of Facial
Expression Recognition. Appl. Sci. 2020, 10, 1897. [CrossRef]
50. Rasamoelina, A.D.; Adjailia, F.; Sincak, P. Deep Convolutional Neural Network for Robust Facial Emotion Recognition. In
Proceedings of the IEEE International Symposium on INnovations in Intelligent SysTems and Applications, INISTA 2019, Sofia,
Bulgaria, 3–5 July 2019.
51. Oh, H.M.; Lee, H.; Kim, M.Y. Comparing Convolutional Neural Network(CNN) Models for Machine Learning-Based Drone and
Bird Classification of Anti-Drone System. In Proceedings of the International Conference on Control, Automation and Systems,
Jeju, Republic of Korea, 15–18 October 2019.
52. Zhang, H.; Qie, Y. Applying Deep Learning to Medical Imaging: A Review. Appl. Sci. 2023, 13, 10521. [CrossRef]
53. Azizi, S.; Bayat, S.; Yan, P.; Tahmasebi, A.; Kwak, J.T.; Xu, S.; Turkbey, B.; Choyke, P.; Pinto, P.; Wood, B.; et al. Deep Recurrent
Neural Networks for Prostate Cancer Detection: Analysis of Temporal Enhanced Ultrasound. IEEE Trans. Med. Imaging 2018, 37,
2695–2703. [CrossRef]
54. Pan, Q.; Zhang, Y.; Chen, D.; Xu, G. Character-Based Convolutional Grid Neural Network for Breast Cancer Classification. In
Proceedings of the 2017 International Conference on Green Informatics, ICGI 2017, Fuzhou, China, 15–17 August 2017.
55. Fang, W.; Chen, Y.; Xue, Q. Survey on Research of RNN-Based Spatio-Temporal Sequence Prediction Algorithms. J. Big Data 2021,
3, 97–110. [CrossRef]
56. da Queiroz, K.F.F.C.; de Queiroz Júnior, J.R.A.; Dourado, H.; de Lima, R.d.C.F. Automatic Segmentation of Region of Interest for
Breast Thermographic Image Classification. Res. Biomed. Eng. 2023, 39, 199–208. [CrossRef]
57. Rezaei, Z. A Review on Image-Based Approaches for Breast Cancer Detection, Segmentation, and Classification. Expert. Syst.
Appl. 2021, 182, 115204. [CrossRef]
58. De Freitas Oliveira Baffa, M.; Grassano Lattari, L. Convolutional Neural Networks for Static and Dynamic Breast Infrared Imaging
Classification. In Proceedings of the 31st Conference on Graphics, Patterns and Images, SIBGRAPI 2018, Paraná, Brazil, 29
October–1 November 2018.
59. Mambou, S.J.; Maresova, P.; Krejcar, O.; Selamat, A.; Kuca, K. Breast Cancer Detection Using Infrared Thermal Imaging and a
Deep Learning Model. Sensors 2018, 18, 2799. [CrossRef]
60. Chatterjee, S.; Biswas, S.; Majee, A.; Sen, S.; Oliva, D.; Sarkar, R. Breast Cancer Detection from Thermal Images Using a
Grunwald-Letnikov-Aided Dragonfly Algorithm-Based Deep Feature Selection Method. Comput. Biol. Med. 2022, 141, 105027.
[CrossRef]
61. Cabıoğlu, Ç.; Oğul, H. Computer-Aided Breast Cancer Diagnosis from Thermal Images Using Transfer Learning. In Bioinformatics
and Biomedical Engineering, Proceedings of the 8th International Work-Conference, IWBBIO 2020, Granada, Spain, 6–8 May 2020; Lecture
Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics);
Springer: Cham, Switzerland, 2020; Volume 12108, LNBI.
62. Kebaili, A.; Lapuyade-Lahorgue, J.; Ruan, S. Deep Learning Approaches for Data Augmentation in Medical Imaging: A Review. J.
Imaging 2023, 9, 81. [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual
author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to
people or property resulting from any ideas, methods, instructions or products referred to in the content.