Articulo 009
Articulo 009
sciences
Review
Deep Learning Applications for Dyslexia Prediction
Norah Dhafer Alqahtani 1,2, *, Bander Alzahrani 1 and Muhammad Sher Ramzan 1, *
1 Faculty of Computing and Information Technology, King Abdulaziz University, Jeddah 21589, Saudi Arabia
2 Information Systems, King Khaled University, Abha 61421, Saudi Arabia
* Correspondence: [email protected] (N.D.A.); [email protected] (M.S.R.)
Abstract: Dyslexia is a neurological problem that leads to obstacles and difficulties in the learning
process, especially in reading. Generally, people with dyslexia suffer from weak reading, writing,
spelling, and fluency abilities. However, these difficulties are not related to their intelligence. An
early diagnosis of this disorder will help dyslexic children improve their abilities using appropriate
tools and specialized software. Machine learning and deep learning methods have been implemented
to recognize dyslexia with various datasets related to dyslexia acquired from medical and educational
organizations. This review paper analyzed the prediction performance of deep learning models for
dyslexia and summarizes the challenges researchers face when they use deep learning models for clas-
sification and diagnosis. Using the PRISMA protocol, 19 articles were reviewed and analyzed, with a
focus on data acquisition, preprocessing, feature extraction, and the prediction model performance.
The purpose of this review was to aid researchers in building a predictive model for dyslexia based
on available dyslexia-related datasets. The paper demonstrated some challenges that researchers
encounter in this field and must overcome.
1. Introduction
Dyslexia is a common learning difficulty that people encounter throughout their
Citation: Alqahtani, N.D.; Alzahrani, learning journey, which affects the reading, writing, spelling, fluency, word decoding, and
B.; Ramzan, M.S. Deep Learning dictation processes. However, it is not related to an individual’s level of intelligence. The
Applications for Dyslexia Prediction. term originates from the ancient Greek, with the prefix “dis” referring to disorder and the
Appl. Sci. 2023, 13, 2804. https:// root “lexia” to language. Hence, dyslexia signifies a language defect or disorder [1]. Many
doi.org/10.3390/app13052804 children with this disorder have normal intelligence and receive appropriate education
Academic Editor: José Salvador and parental support but have difficulty with learning certain skills. Today, dyslexia is
Sánchez Garreta the most frequent childhood learning disorder, accounting for up to 80% of all identified
learning disabilities [2].
Received: 27 January 2023 The World Federation of Neurology identifies dyslexia as a disturbance where the
Revised: 17 February 2023
child’s spelling, writing, or reading skills fail to meet predicted levels based on age and
Accepted: 20 February 2023
intellectual performance despite attending school regularly [3]. From a neuropsychological
Published: 22 February 2023
approach, these disorders result from one or more malfunctioning learning-related brain
systems [3], where the functions of the left hemisphere are imbalanced, such as impairment
in the area concerned with short-term memory, motor skills, visual perceptions, language
Copyright: © 2023 by the authors.
processing, auditory, speed, and speaking (Figure 1).
Licensee MDPI, Basel, Switzerland. The early diagnosis of suspected dyslexia in children is essential as it will increase
This article is an open access article the likelihood of the dyslexic child benefiting from effective intervention programs and
distributed under the terms and improving his or her abilities [4]. Traditional tests for diagnosing dyslexia depend on the
conditions of the Creative Commons evaluation of reading words and text, writing, and working memory. Experts in this field
Attribution (CC BY) license (https:// have defined and normalized the scores of these tests. Some screening tests include CASL,
creativecommons.org/licenses/by/ TAPS, CTOPP-2, WRMT, GSRT, and TEWL, which have become available on the Web due
4.0/). to ICT that authorized the digitalization of these tests [5]. ICT refers to a varied set of
due to ICT that authorized the digitalization of these tests [5]. ICT refers to a varied set of
resources and technological tools utilized to create, transfer, store, interchange, or share
resources and technological tools utilized to create, transfer, store, interchange, or share
information. All these tests are available for subscription under specific criteria.
information. All these tests are available for subscription under specific criteria.
Figure 1. This image shows that any disorder in the brain’s left hemisphere would lead to important
Figure 1. This image shows that any disorder in the brain’s left hemisphere would lead to important
difficulties in a person’s ability to read and write and some other skills.
difficulties in a person’s ability to read and write and some other skills.
Neurological aspects have also been considered in modern technologies for diagnosing
Neurological aspects have also been considered in modern technologies for diagnosing
dyslexia, helping to increase detection accuracy and reliability. Medical devices enable
dyslexia, helping to increase detection accuracy and reliability. Medical devices enable the ob-
the observation of the brain structure of dyslexics [6]. Habib [7] proposed three triggering
servation of the brain structure of dyslexics [6]. Habib [7] proposed three triggering mecha-
mechanisms of dyslexia: attentiveness disturbance (visual–attentional dyslexia), language
nisms of dyslexia:
disturbance attentiveness
(phonological disturbance
dyslexia), and motor (visual–attentional dyslexia), form
disturbance (a dyspraxic language disturb-
of dyslexia).
anceIn(phonological dyslexia), and motor disturbance (a dyspraxic
“phonological dyslexia”, during the reading, fMRI displays paralysis of three re-form of dyslexia).
gionsIn “phonological
that are concerneddyslexia,”
with languageduring the reading,
production andfMRI displays
grasping paralysis Wernicke’s,
(Geschwind’s, of three re-
gions that are concerned with language production and grasping
and Broca’s territory) [8]. Additionally, compared to controls, DTI reveals white matter (Geschwind’s, Wer-
nicke’s, and Broca’s territory) [8]. Additionally, compared to controls,
variation in the language area [9]. In visual attentional dyslexia, when dyslexic participants DTI reveals white
matter
are taskedvariation in thecongruous
to recognize language area stimuli[9].pairs,
In visual
fMRIattentional
displays a dyslexia,
separation when
betweendyslexic
the
participants are tasked to recognize congruous stimuli pairs, fMRI
temporal visual system and parietal attentional system as well as a disconnection, in the left displays a separation
between the of
hemisphere, temporal visualand
the temporal system and parietal
occipital attentional
zones [9]. system asform
In the “dyspraxic wellofasdyslexia”,
a discon-
nection, in the left hemisphere, of the temporal and occipital zones
there is inactivation in the cerebellum–ventral frontotemporal and cerebellum–dorsal fron- [9]. In the “dyspraxic
form of dyslexia,”
toparietal pathway there is inactivation
[10]. Besides fMRI, the in the cerebellum–ventral
surface measurement offrontotemporal
brain potential,and knowncer-
ebellum–dorsal frontoparietal pathway [10]. Besides fMRI, the
as EEG, assists in identifying brain activation patterns. During spelling tests, phoneme dele-surface measurement of
brain potential, known as EEG, assists in identifying brain activation
tion, the rapid naming of letters, and articulation, increased vigor is observed in the theta patterns. During
spelling
and deltatests,
EEG phoneme
frequencydeletion,
bands in thethe rapid
frontalnaming
and rightof letters,
temporal and articulation,
zones in dyslexics increased
[11].
vigorResearchers
is observedhave in the theta anddifferent
suggested delta EEG MLfrequency
methods bands in the frontal
for predicting dyslexia andinright tem-
children
poral zones
utilizing in dyslexics
datasets related[11].to dyslexics. Such datasets can be acquired from medical and
Researchers
educational have suggested
institutions as well asdifferent
throughML methods
special games forconstructed
predicting dyslexia
speciallyinfor children
them.
utilizing
For example,datasets related tobrain
eye tracking, dyslexics.
imaging, Such datasets
EEG, can beand
test scores, acquired from medical
handwriting have been and
educational
used in dyslexiainstitutions
prediction. as well as through special games constructed specially for them.
For example,
AI refers to eyethe tracking,
development brain imaging,
of machines EEG,andtest scores,toand
systems handwriting
enable have been
them to implement
used in dyslexia
functions and tasks prediction.
that demand human intelligence, such as translation, decision making,
visualAI perception,
refers to the anddevelopment
speech recognition.
of machinesML is considered
and systems a part of AI, them
to enable whichto concentrates
implement
on the evolution of computer programs that use different datasets
functions and tasks that demand human intelligence, such as translation, decision mak- to learn for themselves.
DL
ing,isvisual
a subset of ML that
perception, and seeks
speechtorecognition.
simulate the MLhuman brain, allowing
is considered a part of DL AI, systems
which con- to
cluster
centratesdataonandthemake incredibly
evolution accurateprograms
of computer predictions.thatTheuseliterature
differenthas demonstrated
datasets to learnthe for
success of MLDL
themselves. methods in classification
is a subset of ML thatproblems, particularly
seeks to simulate thethe classification
human of diseases.
brain, allowing DL
Moreover,
systems toML methods
cluster data haveand been
makefound to have
incredibly outstanding
accurate accuracy
predictions. Thefor diagnosing
literature has
dyslexia [12]. However, traditional ML methods fail to use raw
demonstrated the success of ML methods in classification problems, particularly the clas- data to implement these
tasks [13]. of
sification Todiseases.
overcome this obstacle,
Moreover, ML models
ML methods havehave
been used
found a layered
to havelearning
outstanding approach,
accu-
known as the DL. The general difference between DL models
racy for diagnosing dyslexia [12]. However, traditional ML methods fail to use raw and traditional ML models
data
is that DL models do not require the engineering step of feature extraction, which is
inherent to conventional models, as they automatically learn abstract hierarchical feature
Appl. Sci. 2023, 13, 2804 3 of 17
representations from data [14]. ML might be utilized to accomplish specific tasks, for
instance, the identification of objects in images, speech-to-text transformation, and isolation
of items in categories [13].
DL utilizes ANNs that simulate the brain’s function. Since ANNs were introduced,
they have undergone considerable alteration and development; however, the basic principle
has remained the same [13]. The ANN structure essentially comprises nodes and edges
that, together, create a structure that resembles a neuron and are in charge of transmitting
the information [13]. It consists of three strata: input strata, several hidden strata, and
output strata. In each layer, every node is linked to every other node in the next layer.
The network becomes deeper as the number of hidden layers increases, leading to a DNN
model. A DNN converts raw inputs to helpful features. Subsequent layers elicit a group of
features that are considered more abstract and help to achieve a desired task. The essential
processes in DL are feature extraction and selection [13]. DNNs have various architectures.
A basic one is a feed-forward network, in which the structure has links among layers in a
single direction (forward), and there are no existing loops or cycles in the whole structure.
Other architectures include particular functions, such as RNNs, which are implemented
with sequential datasets, whereas CNNs are implemented with grid-structured data-like
images. MLP and CNN models have been applied at different times and places to reveal
and classify dyslexia utilizing special data related to dyslexic children. This review paper
focuses on the use of advanced ML methods (DL) to help in diagnosing dyslexia disorder
and constructing a predictive model for prediction. The following points are investigated:
- The types of datasets that are used by prediction models.
- Different DL models that are utilized to predict dyslexia disorder.
- The performance of DL models in dyslexia prediction.
To the best of our knowledge, this is the first review to investigate current works that
specifically utilized DL models to diagnose dyslexia. This will allow us to investigate the
points above, seeking to find a new way to predict dyslexia. Previous reviews on dyslexia
prediction have focused on ML models in general, not on DL models.
Following this initial introduction, Section 2 provides an overview of related works,
both review papers and systematic reviews, and illustrates the difference between previous
reviews and the current work. Section 3 discusses the search strategy and explains the
stages of the article selection process, including the criteria. Section 4 provides a detailed
explanation of every article in the results section, including information on dataset acquisi-
tion, dataset preprocessing, feature selection, and model prediction performance. Section 5
lists and discusses some challenges related to scanning the articles, and Section 6 presents
concludes the paper.
2. Related Work
Several surveys and review papers have addressed the use of ML in dyslexia disorder
classification. During the research process, we found nine papers that were related to the
diagnosis of dyslexia disorders using ML methods. In a review paper [15], the research
covered several dimensions related to dyslexia prediction with ML methods and image
processing techniques. In addition, regarding design assessment tests and tools for prop
dyslexics, most of the research utilized ML techniques to predict dyslexia. A survey
paper [16] summarized the techniques for diagnosing dyslexia that use ML approaches. It
screened 13 studies that applied ML methods, and only one of these studies applied DL
methods for dyslexia classification. In three review papers [6,17,18], the authors conducted
literature surveys focused on ML methods utilized in dyslexia prediction. The surveys did
not include studies on dyslexia prediction based on handwriting datasets, which started
in 2019. A systematic review [19] has focused on ML and DL methods that have been
utilized for identifying dyslexia and its biomarkers and concluded that Support Vector
Machine (SVM) is the most frequently applied ML method for identifying and predicting
dyslexia. Moreover, it noted that the utilization of DL algorithms is still in its infant stage.
An extensive review [20] focused not only on the prediction of dyslexia disorder using
surveys did not include studies on dyslexia prediction based on handwriting datasets,
which started in 2019. A systematic review [19] has focused on ML and DL methods that
have been utilized for identifying dyslexia and its biomarkers and concluded that Support
Vector Machine (SVM) is the most frequently applied ML method for identifying and pre-
Appl. Sci. 2023, 13, 2804 4 of 17
dicting dyslexia. Moreover, it noted that the utilization of DL algorithms is still in its infant
stage. An extensive review [20] focused not only on the prediction of dyslexia disorder
using ML methods but also included studies on the prediction of attention deficit hyper-
ML methods
activity but also
disorder. Ahireincluded studies
et al. [21] on the
conducted prediction of attention
a comprehensive review deficit
on the hyperactivity
classification
disorder.
of dyslexiaAhire
using et
MLal.methods,
[21] conducted
focusingaspecifically
comprehensive review
on studies onused
that the classification
EEG signals forof
dyslexia usingThey
classification. ML found
methods,
thatfocusing specificallyother
SVM outperformed on studies that used
ML methods EEGsignal
in EEG signals for
clas-
classification.
sification. They found
Poornappriya andthat SVM outperformed
Gopinath [22] consideredother ML methods
studies in EEG signal
that implemented ML
classification. Poornappriya and Gopinath [22] considered studies that implemented
methods for dyslexia prediction in addition to studies that helped dyslexics to improve ML
methods for dyslexia prediction in addition to studies that helped dyslexics to
their reading and other skills. Their review was different from the other review papers, as improve
ittheir reading
focused on and other skills.ofTheir
the utilization review was
DL methods different
in the from of
prediction thedyslexia
other review
usingpapers, as
different
it focused on thedatasets.
dyslexia-related utilization of DL methods in the prediction of dyslexia using different
dyslexia-related datasets.
3. Research Strategy
3. Research Strategy
In this paper, a critical review of the literature has been performed to collect a broad
In this paper, a critical review of the literature has been performed to collect a broad
variety of studies that used DL methods for dyslexia prediction and classification and to
variety of studies that used DL methods for dyslexia prediction and classification and to
investigate the research points listed above. The identification and selection of pertinent
investigate the research points listed above. The identification and selection of pertinent
articles followed the Preferred Reporting Items for Systematic Reviews and Meta-Anal-
articles followed the Preferred Reporting Items for Systematic Reviews and Meta-Analyses
yses (PRISMA) guidelines [23]. As shown in Figure 2, the article selection process included
(PRISMA) guidelines [23]. As shown in Figure 2, the article selection process included three
three phases: identification, screening, and inclusion phases.
phases: identification, screening, and inclusion phases.
Figure
Figure 2.
2. PRISMA
PRISMA flow
flow diagram
diagram for
for selecting
selecting 19
19 definitive
definitive articles.
articles.
In the identification phase, we searched for appropriate articles from 2010 to 2022 in
different databases (Figure 2), including IEEE Xplore, Google Scholar, PubMed, Springer,
Web of Science (WoS), and Science Direct. The following keywords were used in the
search queries: “Dyslexia classification”, “Dyslexia Prediction”, “ML in Dyslexia”, “Deep
Learning in Dyslexia”, “Diagnosis Dyslexia”, and “CNN in Dyslexia”. The search produced
1995 articles, of which 138 were from IEEE, 667 from Google Scholar, 665 from PubMed,
Appl. Sci. 2023, 13, 2804 5 of 17
217 from Springer, and 248 and 60 from WoS and Science Direct, respectively. We excluded
13 non-English articles and 411 duplicate articles.
In the screening phase, there were 1571 articles we needed to screen. The reviewers
excluded 1428 irrelevant articles. Then, of the remaining 143 articles, 101 articles were
excluded for different reasons. Some of the articles focused on dyslexia with other learn-
ing disorders, some focused on predicting dyslexia using games and applications, and
some others were review papers. Then, the screening advanced after fulfillment of the
inclusion criteria.
As shown in Table 1, the inclusion criteria were as follows: (1) articles released
between 2010 and 2022 in English, (2) articles that utilized a DL method or combinations
with traditional ML for the identification of dyslexia, and (3) articles that utilized datasets
related to dyslexia.
Subsequently, in the inclusion phase, the articles were reviewed for further considera-
tion of their eligibility, based on the criteria shown in Table 1. Ultimately, only 19 articles
were chosen for critical review. Following a previous study [13], we included ANN and
Appl. Sci. 2023, 13, x FOR PEER REVIEW
MLP models in this review, where ANN include DL feed-forward networks as 6 of 18 as MLP.
well
Figure 3 illustrates the number of articles selected per year from 2010 to 2022.
5
NO. OF ARTICLES
0
2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022
YEAR OF PUBLICATION
Figure 3.
Figure 3.Annual
Annualdistribution
distributionof the 19 articles
of the that were
19 articles carefully
that were screened.
carefully screened.
4. Research
4. ResearchResults
Results
Dyslexia
Dyslexiaprediction using
prediction DLDL
using models involves
models several
involves steps, starting
several from acquiring
steps, starting from acquiring
the datasets to evaluating the prediction models. This can be seen in the articles selected
the datasets to evaluating the prediction models. This can be seen in the articles selected
for review.
for review.
4.1. Data Acquisition
The first step in predicting dyslexia disorders using DL methods is the acquisition of
datasets related to dyslexia. As mentioned above, brain imaging and eye-tracking data in
addition to traditional data (e.g., test scores and time consumption when performing spe-
cific tasks) can be used to diagnose dyslexia. In [19], the dyslexia datasets were divided
Appl. Sci. 2023, 13, 2804 6 of 17
images was collected through different reading tasks: a lexical decision task, semantic
categorization task, and lexical orthographic matching. The datasets were collected from
55 Spanish children between 9 and 12 years of age, who were recruited from schools and
the University Hospital of Cruces in Spain. Chimeno et al. [38] gathered 3D images from
both DTI and fMRI scans of 52 schoolchildren between nine and 12 years of age at various
times. In 2021, an fMRI dataset was collected on 32 Portuguese children, divided into
16 typical readers and 16 dyslexics aged 8–12 years [39]. All children were right-handed
and matched for IQ, age, and sex.
The final dataset category relates to patterns of eye movement when performing
cognitive tasks, as recent studies have used eye movements to differentiate between dyslexic
and non-dyslexic individuals. Statistical measures have been used to identify features
of children’s eye movement through cognitive exercise. The authors in [40,41] used the
same dataset containing raw eye movement data on 185 subjects, 88 at low risk of dyslexia
and 97 with a high risk of dyslexia. These data were collected in 2016 [42] from a wide
population of 2165 school children in second grade. Recently, Vajs et al. [43] collected and
analyzed a dataset on 30 subjects aged 7–13 years, 15 of whom had dyslexia and 15 of
whom were normal participants (11 male and 19 female), as they read a Serbian written
text with 13 different color configurations.
One study used an EOG in the prediction of dyslexia [44]. EOG is a method that
relies on the screening of the electrical potential for eye movements, which is helpful for
analyzing various types of eye movements, such as saccades, smooth pursuits, vergence,
blinks and gaze fixation. This study collected data on 33 children aged between 8 and
11 years of age (20 with dyslexia and 13 healthy; 17 female and 16 male). None of the
Appl. Sci. 2023, 13, x FOR PEER REVIEW 8 of 18
dyslexic children had hearing or vision defects. Figure 4 illustrates the usage percentage of
different datasets in the articles that employed DL methods to predict dyslexia.
EGG
21.10%
handwriting
15.79%
MRI & fMRI & DTI
Eye Tracking
Figure 4.
Figure 4. Distribution
Distribution of
of articles
articles that
that used
used DL
DL methods
methods based
based on
ondatasets
datasetsfrom
from2010
2010to
to2022.
2022.
4.2.
4.2. Data
Data Preprocessing
Preprocessing
Datasets
Datasets related
related toto dyslexia
dyslexia need
need to
to be
be processed
processed and and prepared
prepared before
before used
used in
in DL
DL
techniques. The goal of this task is to enable the classifier to elicit the most related
techniques. The goal of this task is to enable the classifier to elicit the most related inter- interpretable
features
pretablefrom the preprocessed
features dataset. There
from the preprocessed are several
dataset. Thereobjectives of preprocessing
are several objectives ofprocesses,
prepro-
including modulation, tissue segmentation, data normalization, smoothing,
cessing processes, including modulation, tissue segmentation, data normalization, alignment with a
particular
smoothing, alignment with a particular image template, and data normalization [19]. [30]
image template, and data normalization [19]. The study by Kohli and Prasad The
was
study different
by Kohlifrom
andthePrasad
other studies
[30] wasindifferent
this review, as the
from it utilized manual in
other studies data
thispreprocessing
review, as it
methods. One study
utilized manual data[32] applied decimation
preprocessing methods.and OneSTFT to the
study [32]fMRI signals,
applied which were
decimation and
filtered
STFT tousing a bandpass
the fMRI signals,procedure to remove
which were noises
filtered usingandairrelevant
bandpassartifacts.
procedure to remove
noises and irrelevant artifacts.
In addition, an FSL instrument, the FMRIB software library, was employed for pro-
cessing DTI and fMRI [38]. Usman and Muniyadi changed all of the T1w neuro-images
(images of the brain) into a format that could be used by FreeSurfer software [12]. Then,
Appl. Sci. 2023, 13, 2804 8 of 17
In addition, an FSL instrument, the FMRIB software library, was employed for pro-
cessing DTI and fMRI [38]. Usman and Muniyadi changed all of the T1w neuro-images
(images of the brain) into a format that could be used by FreeSurfer software [12]. Then, the
intensity of the images was adjusted so that they all had the same brightness and contrast.
The normalization of intensity was based on histograms. A FSL FNIRT software instrument
was utilized for non-rigid registration in the brain template of MNI152, and a Gaussian
filter was used to minimize noise. In [37], the preprocessing step included a DICOM to
NIFTI conversion, taking the “Digital Imaging and Communications in Medicine” file
and converting it into a “Neuroimaging Informatics Technology Initiative” file. This con-
version was necessary in order to use the data in SPM (Statistical Parametric Mapping)
software to analyze the brain image data. Normalization and smoothing were applied
in addition to the adjustment of head motion. In another study [39], the preprocessing
of the task-based fMRI involved the correction of slice time and motion, smoothing, and
normalization to a template of a voxel. Barhamtoshy and Motaweh [33] used standard
algorithms to filter irrelevant records and noise from the dataset. In addition, they removed
noise from EEG scans using a Fourier transform algorithm. Other authors used various
wavelet transformation approaches to convert EEG scans into low-pass and high-pass
filters [34]. Independent component analysis was used in the preprocessing of EEG signals
to remove artifacts due to recording eye blinking signals along with EEG signals [35]. The
studies that used the eye-tracking dataset to predict dyslexia focused on eliminating blinks
and processing missing data [40,41,43]. The study that used EOG signals utilized the
Butterworth bandpass filter to eliminate noise, followed by EOG segmentation. The aim
of segmentation was to balance the dataset (dyslexic and healthy groups). With regard to
a handwriting dataset, Spoon et al. [25,26] applied the DeepWriter concept [45] to create
50 random patches of handwriting features from each image in the handwriting dataset. Yo-
garajah and Bhushan [28] utilized the same concept to process Hindi letter datasets. Some
studies included different steps for processing unstructured data (handwriting images), as
shown in [27], including RGB to grayscale conversion, maximally stable extremal region,
canny edges detector, stroke width filter, and morphology as a final step. Isa et al. [4,29]
employed the same dataset. For preprocessing, they interchanged the foreground and
background to minimize computational overhead when an image had a lower black point
(value 0) than white point (value 1), which required more memory and power consumption
in training the image. In addition, they cropped handwriting images to the actual writing
part, thus resizing the images.
Table 2. Feature extraction methods for dyslexia disorder detection utilizing Deep Learning Methods.
Table 2. Cont.
presented in this paper except [30], which dealt with numerical data. Although some
studies used ANNs to deal with EGG datasets and achieved high accuracy (96%) [33], others
achieved lower accuracy (78%) [34]. In addition, some studies that adopted handwriting
datasets used CNN models and reported accuracies between 86% and 95% [4,28,29], while
others achieved low accuracy (maximum 77%) [22,23,25]. Typically, the datasets were
partitioned to train and test the prediction models. Most of the data (approximately more
than or equal to 70% of all the data) were utilized for training, and the smaller part was
allocated for testing model. Some studies allocated part of the dataset (might be 10% or
20%) as a validation dataset, which is a sample of data that is restricted from the process of
the training model. These data are used to provide an estimate of the model’s skill while
the hyperparameters are being tuned. This division varied between studies. In refs. [12,29],
70% of the dataset was allocated for training, and the remainder was allocated for validation
and testing. Some studies used a higher percentage. For example, refs. [39,41,43] allocated
80% for training, while study [40] allocated 90%. Recent studies have utilized ANN, MLP,
Appl. Sci. 2023, 13, x FOR PEER REVIEW
and CNN models for dyslexia prediction and most have reported high accuracy in12most of 18
studies. Figure 5 organizes the reviewed studies according to the DL algorithms adopted.
Figure 5. The reviewed studies organized according to the DL algorithms that each model adopted.
Figure 5. The reviewed studies organized according to the DL algorithms that each model adopted.
TheANN
The ANNmodel
modelsuggested
suggestedby bythe
theauthors
authorsin in[30]
[30]achieved
achievedananaccuracy
accuracyof of75%
75%after
after
ten-fold cross-validation when used with a dataset on pupils’ performance acquired
ten-fold cross-validation when used with a dataset on pupils’ performance acquired from from
aastructured
structuredquestionnaire.
questionnaire. In
In addition,
addition, the
the same
samemodel
modelachieved
achieved the
thehighest
highestaccuracy
accuracy
amongother
among othermodels
modelsinin two
two studies
studies (89.7%
(89.7% andand 94.87%)
94.87%) [29,33,34].
[29,33,34]. SpoonSpoon
et al.etwere
al. were the
the first
first to use a handwriting dataset for the prediction of dyslexia disorder. They
to use a handwriting dataset for the prediction of dyslexia disorder. They proposed a CNN proposed
a CNN model to automatically recognize dyslexic children based on their handwriting
[25]. They achieved an accuracy of 55.7 ± 1.4% by employing five-fold CV. Then, they
developed their model further [26], reaching an accuracy of 77.6%. Yogarajah et al. [28]
utilized handwriting images from children’s notebooks as a dataset to build a CNN
model, achieving good accuracy of 86.14 ± 1.02%. Two consecutive studies [4,29] used the
Appl. Sci. 2023, 13, 2804 12 of 17
model to automatically recognize dyslexic children based on their handwriting [25]. They
achieved an accuracy of 55.7 ± 1.4% by employing five-fold CV. Then, they developed their
model further [26], reaching an accuracy of 77.6%. Yogarajah et al. [28] utilized handwriting
images from children’s notebooks as a dataset to build a CNN model, achieving good
accuracy of 86.14 ± 1.02%. Two consecutive studies [4,29] used the same datasets from
NIST 2019 to build a prediction model. The first used different CNN models, such as
CNN-1, CNN-2, CNN-3, and LeNet-5, to compare the prediction performance. All models
showed an accuracy of more than 87%, with CNN-1 providing the highest accuracy. The
second study utilized a LeNet-5 model and achieved an outstanding accuracy of 95%, which
is the highest accuracy for predicting dyslexia using a handwriting dataset. Regarding the
performance of DL models for predicting dyslexia using MRI and fMRI, Zahia et al. [37]
presented a 3D CNN model that achieved an accuracy of 72.7%, a sensitivity of 75%, an
F1-score of 67%, a precision of 60%, and a specificity of 71.4%. Muniyandi and Usman [12]
proposed a two-path cascading CNN model and achieved an accuracy of 84.6%, a specificity
of 78.2%, and a sensitivity of 76.5%. In addition, Silva et al. [39] developed a CNN model
and obtained the highest accuracy of 94%. Karim et al. utilized an MLP classifier to monitor
accuracy with eyes opened and closed, reporting accuracies of 86% and 84.95% for eyes
opened and closed, respectively.
Eye-tracking datasets have been used to build a CNN model for dyslexia prediction,
which showed good results [40,41,43]. Nerušil et al. [40] achieved the highest accuracy
of 96.6%, while Vags et al. [43] achieved an accuracy of 87%. Moreover, Appadurai and
Bhargavi [41] achieved an accuracy of 82% for scan path images and 87% for fixation images.
Table 3 summarizes the performance of the predictive model of each study in this review.
Table 3. Summarizes the performance of the predictive model of each study in this review.
Table 3. Cont.
the size of the training dataset, such as cropping, translation, and rotation. Rotation
and noise injection techniques have been used in the study [29]; they contributed to
enlarging the training size dataset and solved the imbalanced class problem.
The dataset preprocessing step is a critical step, and the formative features extracted
from datasets using DL models, especially CNNs, depend on good processing, which
affects the classification accuracy. The better and more accurate the processing, the better
the performance of the model.
Hyper-parameter selection has a substantial effect on CNN performance. Any vari-
ation in the values of the hyper-parameters will affect the CNN’s overall performance.
Consequently, proper parameter selection is a crucial factor that should be considered
throughout the creation of optimization schemes [48].
Effective CNN training necessitates powerful hardware resources, such as GPUs,
which are robust concerning memory usage and processing speed. Implementing DL
models on a system with these hardware resources will increase the classification speed
for dyslexia.
The classification of dyslexia using DL methods requires more attention and research.
In particular, DL models have a strong ability to extract features from unstructured data
and, thus, to perform accurate classification. Moreover, some tools may be developed to
assist in diagnosing dyslexia, such as handwriting images, which were first utilized in
2019 for prediction.
6. Conclusions
Dyslexia is a learning difficulty that impacts the reading, writing, spelling, and dicta-
tion processes. It usually results from a deficit in the language phonological component,
which is often unforeseen by teachers or parents of kids who suffer from this disorder.
About 10% of the world’s population suffers from this disorder, and it is important to
discover it early to reduce its impact and improve the skills of dyslexics. Researchers
have suggested multiple techniques for identifying dyslexia in children. Recently, DL
methods have contributed significantly to diagnosing dyslexia. This review summarized
the dyslexia detection techniques that have employed DL approaches. Furthermore, this
review investigated the significant factors related to dyslexia prediction, seeking to help
researchers build a predictive model with good accuracy. In the future, more attention
could be given to the collection of dyslexia-related datasets.
Abbreviations
References
1. Granet, D.B. Learning disabilities, dyslexia, and vision: The role of the pediatric ophthalmologist. J. AAPOS 2011, 15, 119–120.
[CrossRef]
2. Wajuihian, S.O.; Od, O.; Pgcertmod, U. Dyslexia : An overview. Afr. Vis. Eye Health 2011, 43, 24–33. [CrossRef]
3. Laffey, J.L.; Siffin, C.F. ERIC/CRIER Reading Review Series. 1969. Available online: https://files.eric.ed.gov/fulltext/ED032453.pdf
(accessed on 26 January 2023).
4. Isa, I.S.; Zahir, M.A.; Ramlan, S.A.; Li-Chih, W.; Sulaiman, S.N. CNN Comparisons Models on Dyslexia Handwriting Classification.
ESTEEM Acad. J. 2021, 17, 12–25.
5. Drigas, A.S.; Politi-Georgousi, S. ICTs as a distinct detection approach for dyslexia screening: A contemporary view. Int. J. Online
Biomed. Eng. 2019, 15, 46–60. [CrossRef]
6. Jankovic, M.M. Biomarker-based approaches for dyslexia screening: A review. IEEE Zooming Innov. Consum. Technol. Conf. 2022,
2022, 28–33. [CrossRef]
7. Habib, M. The neurological basis of developmental dyslexia an overview and working hypothesis. Brain 2000, 123, 2373–2399.
[CrossRef]
8. Blau, V.; Reithler, J.; Van Atteveldt, N.; Seitz, J.; Gerretsen, P.; Goebel, R.; Blomert, L. Deviant processing of letters and speech
sounds as proximate cause of reading failure: A functional magnetic resonance imaging study of dyslexic children. Brain 2010,
133, 868–879. [CrossRef]
9. Vandermosten, M.; Cuynen, L.; Vanderauwera, J.; Wouters, J.; Ghesquière, P. White matter pathways mediate parental effects on
children’s reading precursors. Brain Lang. 2017, 173, 10–19. [CrossRef]
10. Alvarez, T.A.; Fiez, J.A. Current perspectives on the cerebellum and reading development. Neurosci. Biobehav. Rev. 2018, 92, 55–66.
[CrossRef]
11. Arns, M.; Peters, S.; Breteler, R.; Verhoeven, L. Different brain activation patterns in dyslexic children: Evidence from EEG power
and coherence patterns for the double-deficit theory of dyslexia. J. Integr. Neurosci. 2007, 6, 175–190. [CrossRef]
12. Usman, O.L.; Muniyandi, R.C. CryptoDL: Predicting dyslexia biomarkers from encrypted neuroimaging dataset using energy-
efficient residue number system and deep convolutional neural network. Symmetry 2020, 12, 836. [CrossRef]
Appl. Sci. 2023, 13, 2804 16 of 17
13. Sharma, D.K.; Chatterjee, M.; Kaur, G.; Vavilala, S. Deep Learning Applications for Disease Diagnosis; Elsevier: Amsterdam, The
Netherlands, 2022; pp. 31–51. [CrossRef]
14. Carin, L.; Pencina, M.J. On deep learning for medical image analysis. J. Am. Med. Assoc. 2018, 320, 1192–1193. [CrossRef]
15. Prabha, A.J.; Bhargavi, R. Prediction of Dyslexia Using Machine Learning—A Research Travelogue. Res. Travel. Lect. Notes Electr.
Eng. 2019, 556, 1–8. [CrossRef]
16. Kaisar, S. Developmental dyslexia detection using machine learning techniques: A survey. ICT Express 2020, 6, 181–184. [CrossRef]
17. Vanitha, G.; Kasthuri, M. Dyslexia Prediction Using Machine Learning Algorithms—A Review. Int. J. Aquat. Sci. 2021, 12, 3372–3380.
Available online: http://www.journal-aquaticscience.com/article_135190.html%0Ahttp://www.journal-aquaticscience.com/
article_135190_a14355942a6a1ab5eebf9c4eae0aa078.pdf (accessed on 26 January 2023).
18. Dhas, J.; Jose, S.H. A Study on Dyslexia Using Machine Learning-Review. EasyChair Prepr. 2021. Available online: https:
//easychair.org/publications/preprint/gLjk (accessed on 26 January 2023).
19. Usman, O.L.; Muniyandi, R.C.; Omar, K.; Mohamad, M. Advance Machine Learning Methods for Dyslexia Biomarker Detection:
A Review of Implementation Details and Challenges. IEEE Access 2021, 9, 36879–36897. [CrossRef]
20. Kothapalli, P.K.V.; Rathikarani, V.; Nookala, G.K.M. A Comprehensive Survey on Predicting Dyslexia and ADHD Using Machine
Learning Approaches. In Inventive Systems and Control; Springer: Singapore, 2022; pp. 105–121. [CrossRef]
21. Ahire, N.; Awale, R.; Patnaik, S.; Wagh, A. A comprehensive review of machine learning approaches for dyslexia diagnosis.
Multimed. Tools Appl. 2022, 1–12. [CrossRef]
22. Poornappriya, T.; Gopinath, R. Application of Machine Learning Techniques for Improving Learning Disabilities. Int. J. Electr.
Eng. Technol. 2020, 11, 403–411. [CrossRef]
23. Moher, D.; Liberati, A.; Tetzlaff, J.; Altman, D.G. Preferred reporting items for systematic reviews and meta-analyses: The PRISMA
statement. J. Clin. Epidemiol. 2009, 62, 1006–1012. [CrossRef]
24. Hebert, M.; Kearns, D.M.; Hayes, J.B.; Bazis, P.; Cooper, S. Why children with dyslexia struggle with writing and how to help
them. Lang. Speech Hear. Serv. Sch. 2018, 49, 843–863. [CrossRef]
25. Spoon, K.; Crandall, D.; Siek, K. Towards Detecting Dyslexia in Children’s Handwriting Using Neural Networks. In Proceedings
of the International Conference on Machine Learning AI for Social Good Workshop, Long Beach, CA, USA, 9–15 June 2019;
pp. 1–5.
26. Spoon, K.; Siek, K.; Crandall, D.; Fillmore, M. Can We (and Should We) Use AI to Detect Dyslexia in Children’s Handwriting? In
Proceedings of the International Conference on Machine Learning AI for Social Good Workshop, Long Beach, CA, USA, 9–15 June
2019; pp. 1–6.
27. Isa, I.S.; Rahimi, W.N.S.; Ramlan, S.A.; Sulaiman, S.N. Automated Detection of Dyslexia Symptom Based on Handwriting Image
for Primary School Children. Procedia Comput. Sci. 2019, 163, 440–449. [CrossRef]
28. Yogarajah, P. Deep Learning Approach to Automated Detection of Dyslexia-Dysgraphia. In Proceedings of the 25th IEEE
International Conference on Pattern Recognition, Milan, Italy, 10–15 January 2021; pp. 1–12. Available online: https://www.micc.
unifi.it/icpr2020/ (accessed on 12 November 2020).
29. Bin Rosli, M.S.A.; Isa, I.S.; Ramlan, S.A.; Sulaiman, S.N.; Maruzuki, M.I.F. Development of CNN Transfer Learning for Dyslexia
Handwriting Recognition. In Proceedings of the 2021 11th IEEE International Conference on Control System, Computing and
Engineering (ICCSCE), Penang, Malaysia, 27–28 August 2021; pp. 194–199. [CrossRef]
30. Kohli, M.; Prasad, T.V. Identifying dyslexic students by using artificial neural networks. In Proceedings of the World Congress on
Engineering 2010, London, UK, 30 June–2 July 2010; Volume 1, pp. 118–121.
31. Perera, H.; Shiratuddin, M.F.; Wong, K.W. Review of EEG-based pattern classifcation frameworks for dyslexia. Brain Inform. 2018,
5, 4. [CrossRef]
32. Karim, I.; Abdul, W.; Kamaruddin, N. Classification of Dyslexic and Normal Children during resting condition using KDE and
MLP. In Proceedings of the 2013 5th International Conference on Information and Communication Technology for the Muslim
World, Rabat, Morocco, 26–27 March 2013. [CrossRef]
33. Al-Barhamtoshy, H.M.; Motaweh, D.E.M. Diagnosis of Dyslexia using Computing Analysis. In Proceedings of the 2017 In-
ternational Conference on Informatics, Health & Technology (ICIHT), Riyadh, Saudi Arabia, 21–23 February 2017; pp. 1–7.
[CrossRef]
34. Frid, A.; Manevitz, L.M. Features and Machine Learning for Correlating and Classifying between Brain Areas and Dyslexia. arXiv
2018, arXiv:1812.10622.
35. Ortiz, A.; Martínez-Murcia, F.J.; Formoso, M.A.; Luque, J.L.; Sánchez, A. Dyslexia detection from EEG signals using SSA
component correlation and Convolutional Neural Networks. In Hybrid Artificial Intelligent Systems; Springer: Berlin/Heidelberg,
Germany, 2020; pp. 655–664.
36. Elnakib, A.; Soliman, A.; Nitzken, M.; Casanova, M.F.; Gimel’Farb, G.; El-Baz, A. Magnetic resonance imaging findings for
dyslexia: A review. J. Biomed. Nanotechnol. 2014, 10, 2778–2805. [CrossRef]
37. Zahia, S.; Garcia-Zapirain, B.; Saralegui, I.; Fernandez-Ruanova, B. Dyslexia detection using 3D convolutional neural networks
and functional magnetic resonance imaging. Comput. Methods Programs Biomed. 2020, 197, 105726. [CrossRef]
38. Chimeno, Y.G.; Zapirain, B.G.; Prieto, I.S.; Fernandez-Ruanova, B. Automatic classification of dyslexic children by applying
machine learning to fMRI images. Bio-Med. Mater. Eng. 2014, 24, 2995–3002. [CrossRef]
Appl. Sci. 2023, 13, 2804 17 of 17
39. Da Silva, L.T.; Esper, N.B.; Ruiz, D.D.; Meneguzzi, F.; Buchweitz, A. Visual Explanation for Identification of the Brain Bases for
Developmental Dyslexia on fMRI Data. Front. Comput. Neurosci. 2021, 15, 594659. [CrossRef]
40. Nerušil, B.; Polec, J.; Škunda, J.; Kačur, J. Eye tracking based dyslexia detection using a holistic approach. Sci. Rep. 2021, 11, 15687.
[CrossRef]
41. Appadurai, J.P.; Bhargavi, R. Eye Movement Feature Set and Predictive Model for Dyslexia: Feature Set and Predictive Model for
Dyslexia. Int. J. Cogn. Inform. Nat. Intell. 2021, 15, 22. [CrossRef]
42. Benfatto, M.N.; Seimyr, G.; Ygge, J.; Pansell, T.; Rydberg, A.; Jacobson, C. Screening for dyslexia using eye tracking during reading.
PLoS ONE 2016, 11, e0165508.
43. Vajs, I.; Kovic, V.; Papic, T.; Savic, A.M.; Jankovic, M.M. Dyslexia detection in children using eye tracking data based on
VGG16 network. In Proceedings of the European Signal Processing Conference, Belgrade, Serbia, 29 August–2 September 2022;
pp. 1601–1605.
44. Ileri, R.; Latifoğlu, F.; Demirci, E. A novel approach for detection of dyslexia using convolutional neural network with EOG
signals. Med. Biol. Eng. Comput. 2022, 60, 3041–3055. [CrossRef]
45. Xing, L.; Qiao, Y. DeepWriter: A multi-stream deep CNN for text-independent writer identification. In Proceedings of the
International Conference on Frontiers in Handwriting Recognition, ICFHR, Shenzhen, China, 23–26 October 2016; pp. 584–589.
[CrossRef]
46. Dara, S.; Tumma, P. Feature Extraction By Using Deep Learning: A survey. In Proceedings of the 2018 Second Interna-
tional Conference on Electronics, Communication and Aerospace Technology (ICECA), Coimbatore, India, 29–31 March 2018;
pp. 1795–1801. [CrossRef]
47. Khan, A.; Sohail, A.; Zahoora, U.; Qureshi, A.S. A Survey of the Recent Architectures of Deep Convolutional Neural Networks; Springer:
Cham, The Netherlands, 2020; Volume 53. [CrossRef]
48. Alzubaidi, L.; Zhang, J.; Humaidi, A.J.; Al-Dujaili, A.; Duan, Y.; Al-Shamma, O.; Santamaría, J.; Fadhel, M.A.; Al-Amidie, M.;
Farhan, L. Review of deep learning: Concepts, CNN architectures, challenges, applications, future directions. J. Big Data 2021,
8, 53. [CrossRef]
49. Shorten, C.; Khoshgoftaar, T.M. A survey on Image Data Augmentation for Deep Learning. J. Big Data 2019, 6, 60. [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual
author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to
people or property resulting from any ideas, methods, instructions or products referred to in the content.