smart system conf (1) (1)
from MRI
Iheb ELGHAIEB 1,*, Hiba MZOUGHI 2, Ahmed ZOUINKHI 1 and Naceur ABDELKRIM 1,*
1 University of Gabes, National School of Engineers of Gabes, MACS-lab Systems, St Omar Ibn El Khattab, 6029 Gabes, Tunisia
2 National Engineering School of Sfax, ATMS-Lab, Sfax
*[email protected]
*[email protected]

Abstract— This study proposes a transfer learning technique based on the pretrained weights of popular convolutional neural networks such as VGG16 and EfficientNet-B3. We also used a classical CNN architecture with domain-specific information and architectural alterations to evaluate the performance of the suggested pretrained models. The outcomes of the trials show that the transfer learning method performs better in classifying Alzheimer's disease than the suggested CNN, with EfficientNet-B3 and VGG16 demonstrating the potential of transfer learning to enhance the early detection and treatment of the condition. The suggested models display results with an overall accuracy of 95.70% for CNN, VGG16 and EfficientNet-B3.

Keywords— Alzheimer Disease, MRI Images, Alzheimer's stage classification

I. INTRODUCTION
Alzheimer's disease (AD) is a prevalent kind of dementia that impairs memory and cognition. Alzheimer's disease and other related dementias have raised significant public health issues in North America and Europe. According to estimates, 10% of people over 65 in North America and Europe had Alzheimer's disease in 2019, which comes to about 5.6 million people. With the aging of the population and the anticipated sharp growth in the number of Alzheimer's patients, this figure is predicted to rise. Most of the care required by individuals with Alzheimer's disease is given by friends, family, and other informal caregivers [1]. AD is commonly divided into four phases: Non Demented, Very Mild Demented, Mild Demented, and Moderate Demented.

II. RELATED WORKS
Convolutional Neural Networks (CNNs), one type of deep learning technology, have shown impressive results in a variety of image classification applications, especially in recent years. With only the raw visual input, CNNs can automatically generate hierarchical representations, capturing the discriminative features necessary for precise classification.

The use of deep learning techniques and transfer learning algorithms has led to a major advancement in the field of Alzheimer disease (AD) classification using MRI images in recent years. The potential advantages of these techniques for enhancing AD classification accuracy and early detection have been the subject of numerous investigations. An overview of the key studies on this subject is given in this review of the literature.

Nawaz et al. [2] proposed a real-time approach, based on deep features, for identifying Alzheimer disease stages. The authors use a combination of handcrafted and deep learning-based features extracted from MRI scans to classify different stages of Alzheimer disease. Their approach produces promising results in terms of accuracy and real-time operation.

Bhandari et al. [3] demonstrated how to use segmentation algorithms to analyze brain subregions and predict AD in its early stages. The MRI dataset used in the study includes patients with AD as well as those with normal cognitive function. During the MRI processing, skull stripping and histogram equalization were applied to improve the image quality. The multilevel thresholding technique was then applied to segment specific regions of the MRI, including the white matter, corpus callosum, gray matter, and hippocampus sections. The resulting segmented images were fed into a CNN to predict AD. The proposed framework improved CNN performance by highlighting AD-related brain areas on MRI using multilevel thresholding. However, the automatic thresholding approach relied on strong contrast with the background, and the applied segmentation algorithm was not integrated with the CNN.
VGG-16
The first and second convolutional layers of the VGG-16 model each have 64 feature kernel filters with a 3x3 filter size. If a 224x224 RGB image with a depth of three is placed into these layers, the dimensions become 224x224x64. The output is then sent to a max pooling layer with a stride of two. In the third and fourth convolutional layers, 128 feature kernel filters with a 3x3 filter size are employed. After these layers, a max pooling layer with a stride of 2 is applied, reducing the output to 56x56x128. The fifth, sixth, and seventh layers are convolutional layers with 256 feature maps and a 3x3 kernel size. Following these layers is a max pooling layer with a stride of two.
The eighth through thirteenth layers are two sets of convolutional layers with a 3x3 kernel size. Both sets have 512 kernel filters. Each set is followed by a max pooling layer with a stride of 2. The fourteenth and fifteenth layers are fully connected hidden layers with 4096 units each. Following them is a softmax output layer (sixteenth layer) with 1000 units.

Figure 5: Architecture of EfficientNet-B3

- CNN
[...] an MRI. Slices are obtained using three different image planes: the axial or horizontal plane, the coronal or frontal plane, and the sagittal or center plane. In addition, we added additional training examples by using horizontal flipping. Neurons in convolutional neural networks have weights and biases that are tuned to match the different objects in the image.
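The horizontal-flipping augmentation mentioned for the CNN can be illustrated with a minimal sketch. It assumes each slice is a 2-D grid of grayscale intensities stored as nested lists; the helper names `horizontal_flip` and `augment_with_flips` and the toy pixel values are hypothetical and are not taken from the authors' pipeline.

```python
from typing import List

# One grayscale slice as rows of pixel intensities (assumed representation).
Image = List[List[int]]


def horizontal_flip(image: Image) -> Image:
    """Return a left-right mirrored copy of a 2-D image."""
    return [row[::-1] for row in image]


def augment_with_flips(images: List[Image]) -> List[Image]:
    """Return the original images followed by their flipped copies."""
    return images + [horizontal_flip(img) for img in images]


# Toy 2x3 "slice" with hypothetical values, not real MRI data.
toy_slice = [[1, 2, 3],
             [4, 5, 6]]
augmented = augment_with_flips([toy_slice])
print(len(augmented))  # 2
print(augmented[1])    # [[3, 2, 1], [6, 5, 4]]
```

Flipping doubles the number of training examples while keeping the class label unchanged. In practice this step is usually delegated to the augmentation utilities of the deep learning framework in use.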
The practice of adapting a model trained on one task to
another that is similar is known as transfer learning in
machine learning. By transferring knowledge from one
domain to another, transfer learning can improve the
model's performance on the new task, particularly in
scenarios when training data is scarce.
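To make this concrete for VGG-16, the sketch below traces the feature-map sizes through the five convolutional blocks and counts parameters. It assumes the standard VGG-16 configuration (224x224x3 input, 3x3 convolutions that preserve spatial size, 2x2 max pooling with stride 2). It then shows how few parameters remain trainable if, as one possible transfer-learning setup, the pretrained layers are frozen and only the 1000-unit softmax is replaced by a 4-unit layer for the four AD stages. The freezing policy and the name `trace_vgg16` are illustrative assumptions, not the authors' reported configuration.

```python
# VGG-16 convolutional blocks: (number of 3x3 conv layers, output channels).
VGG16_BLOCKS = [(2, 64), (2, 128), (3, 256), (3, 512), (3, 512)]
FC_UNITS = 4096  # width of the two fully connected hidden layers


def trace_vgg16(input_size=224, in_channels=3, num_classes=1000):
    """Return (block output shapes, conv params, hidden FC params, head params)."""
    size, channels = input_size, in_channels
    shapes, conv_params = [], 0
    for num_convs, out_channels in VGG16_BLOCKS:
        for _ in range(num_convs):
            # 3x3 kernel weights plus one bias per output channel.
            conv_params += 3 * 3 * channels * out_channels + out_channels
            channels = out_channels
        size //= 2  # 2x2 max pooling with stride 2 halves height and width
        shapes.append((size, size, channels))
    flat = size * size * channels
    fc_params = (flat * FC_UNITS + FC_UNITS) + (FC_UNITS * FC_UNITS + FC_UNITS)
    head_params = FC_UNITS * num_classes + num_classes
    return shapes, conv_params, fc_params, head_params


shapes, conv_params, fc_params, head_1000 = trace_vgg16()
print(shapes)
# [(112, 112, 64), (56, 56, 128), (28, 28, 256), (14, 14, 512), (7, 7, 512)]
print(conv_params + fc_params + head_1000)  # 138357544 parameters in total

# Assumed transfer setup: freeze everything except a new 4-class output layer.
head_4 = trace_vgg16(num_classes=4)[3]
print(head_4)  # 16388 trainable parameters
```

Under these assumptions only the new output layer is learned from the MRI data, which is why transfer learning remains feasible when training data is scarce.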
Figure 6: Architecture of the CNN

B. Model Architecture Modification:
The selected pretrained model architecture needs to be modified for the specific purpose of Alzheimer's disease stage classification. The last few layers of the model, including the fully connected layers, are typically adjusted in this step to better match the desired output classes and dimensions.

IV. RESULT AND ANALYSIS
Dataset
The dataset includes preprocessed MRI (Magnetic Resonance Imaging) images. The data was collected from a variety of websites, healthcare facilities, and free sources. Each image has been resized to 128 by 128 pixels. The dataset consists of 6400 MRI scans divided into four image classes:
Class 1: Non Demented (896 images)
Class 2: Moderate Demented (64 images)
Class 3: Mild Demented (3200 images)
Class 4: Very Mild Demented (2240 images)

Evaluation metrics
The validation set is used to assess the transfer learning model's performance following classifier training. Measures such as accuracy, precision, recall, and F1-score are calculated to evaluate how well the model performs in classification. To enhance performance, the model's parameters can be adjusted further as necessary. In the formulas below, TP, TN, FP, and FN denote the numbers of true positives, true negatives, false positives, and false negatives.
- Accuracy (%):
Accuracy assesses the overall correctness of the model's predictions. It is the proportion of the dataset's samples that are correctly identified. ACC (%) denotes the accuracy attained by the model after a particular epoch, expressed as a percentage.
\[ \mathrm{ACC} = \frac{TP + TN}{TP + TN + FP + FN} \tag{1} \]
- Precision (%):
Precision measures how many of the samples predicted as positive are actually positive. It is the ratio of true positives to the sum of true positives and false positives. P (%) denotes the precision attained by the model at the end of a given epoch.
\[ P = \frac{TP}{TP + FP} \tag{2} \]
- Recall (%):
Recall, also known as true positive rate or sensitivity, assesses the model's ability to find the actual positive samples. It is calculated by dividing the number of true positives by the sum of true positives and false negatives. R (%) denotes the recall that the model achieved at the end of a given epoch.
\[ R = \frac{TP}{TP + FN} \tag{3} \]

VALIDATION OF THE PROPOSED FRAMEWORK
a) Accuracy Plot
For the CNN, a three-layer convolutional model is used to convert the input image size (227x227) to (127x127), yielding a feature matrix with 58 dense layers. Figure 8 shows a pretrained CNN. The plot's accuracy after 10 epochs is 88.89%. The loss plot in Figure 8 depicts an exponential decrease in loss after 8 epochs.
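Because the four classes are far from balanced, it can help to read the reported accuracy against a simple reference point. The sketch below uses only the class counts listed in the Dataset subsection (896, 64, 3200, and 2240 images). It computes the class proportions and the accuracy of a trivial classifier that always predicts the largest class. It assumes, as a simplification, that the validation split preserves these proportions; the helper names are hypothetical.

```python
# Image counts per class, as listed in the Dataset subsection.
CLASS_COUNTS = {
    "Non Demented": 896,
    "Moderate Demented": 64,
    "Mild Demented": 3200,
    "Very Mild Demented": 2240,
}


def class_proportions(counts):
    """Return the fraction of the dataset that belongs to each class."""
    total = sum(counts.values())
    return {name: n / total for name, n in counts.items()}


def majority_baseline_accuracy(counts):
    """Accuracy of always predicting the most frequent class."""
    return max(counts.values()) / sum(counts.values())


total_images = sum(CLASS_COUNTS.values())
proportions = class_proportions(CLASS_COUNTS)
baseline = majority_baseline_accuracy(CLASS_COUNTS)

print(total_images)                              # 6400
print(f"{baseline:.2%}")                         # 50.00%
print(f"{proportions['Moderate Demented']:.2%}")  # 1.00%
```

With the smallest class making up about 1% of the images, overall accuracy says little about that class, which is one reason to inspect per-class results as well.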
a) Confusion Matrix
The confusion matrices of the proposed models are given in the figure below.
Figure 11: Confusion matrices. A: VGG-16, B: CNN, and C: EfficientNet
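A multi-class confusion matrix can be reduced to the TP, TN, FP, and FN counts used by the accuracy, precision, and recall metrics by treating one class as positive and the rest as negative. The following minimal sketch does this for a four-class problem and also computes the F1-score as the harmonic mean of precision and recall. The labels are hypothetical toy values, not results from the paper, and the function names are illustrative.

```python
def confusion_matrix(y_true, y_pred, num_classes):
    """Build a matrix whose rows are true classes and columns predicted classes."""
    matrix = [[0] * num_classes for _ in range(num_classes)]
    for t, p in zip(y_true, y_pred):
        matrix[t][p] += 1
    return matrix


def per_class_metrics(matrix, cls):
    """One-vs-rest accuracy, precision, recall and F1 for class `cls`."""
    total = sum(sum(row) for row in matrix)
    tp = matrix[cls][cls]
    fn = sum(matrix[cls]) - tp               # true class missed
    fp = sum(row[cls] for row in matrix) - tp  # wrongly predicted as cls
    tn = total - tp - fn - fp
    accuracy = (tp + tn) / total if total else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


# Hypothetical labels for 10 samples over 4 classes (0..3).
y_true = [0, 0, 0, 1, 1, 2, 2, 2, 3, 3]
y_pred = [0, 0, 1, 1, 1, 2, 2, 0, 3, 2]

cm = confusion_matrix(y_true, y_pred, num_classes=4)
print(cm)  # [[2, 1, 0, 0], [0, 2, 0, 0], [1, 0, 2, 0], [0, 0, 1, 1]]

overall_accuracy = sum(cm[i][i] for i in range(4)) / len(y_true)
print(overall_accuracy)  # 0.7

class0 = per_class_metrics(cm, 0)
print(class0["accuracy"])  # 0.8
```

The diagonal of the matrix holds the correctly classified samples, so the overall accuracy is the diagonal sum divided by the number of samples, while the per-class values show which stages are confused with each other.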
Conclusion
REFERENCES
[1] I. Elghaieb, A. Souid, A. Zouinkhi, and H. Sakli, “Defeating Alzheimer's: AI Perspective from Diagnostics to Prognostics: Literature Summary,” in Machine Learning and Deep Learning Techniques for Medical Image Recognition, CRC Press, 2024, pp. 245–256.
[2] H. Nawaz, M. Maqsood, S. Afzal, F. Aadil, I. Mehmood, and S. Rho, “A deep feature-based real-time system for Alzheimer disease stage detection,” Multimedia Tools and Applications, vol. 80, no. 28–29, pp. 35789–35807, 2021, doi: 10.1007/s11042-020-09087-y.
[3] A. K. Bhandari, V. K. Singh, A. Kumar, and G. K. Singh, “Cuckoo search algorithm and wind driven optimization-based study of satellite image segmentation for multi-level thresholding using Kapur’s entropy,” Expert Systems with Applications, vol. 41, no. 7, pp. 3538–3560, Jun. 2014, doi: 10.1016/j.eswa.2013.10.059.