Classification of Pathological Signs For Diabetic Retinopathy Diagnosis Using Image Enhancement Technique and Convolution Neural Network
Classification of Pathological Signs For Diabetic Retinopathy Diagnosis Using Image Enhancement Technique and Convolution Neural Network
net/publication/340691417
CITATIONS READS
5 54
6 authors, including:
All content following this page was uploaded by Abd Aziz Nor Azimah on 05 February 2022.
Abstract—Diagnosis of diabetic retinopathy (DR) involves Recently, the development of intelligent screening and
visual examination of retinal images by ophthalmologist to detection system of the DR using CNN shows an increasing
detect pathological signs such as exudate, haemorrhage (HEM) trend among researchers. CNN is a class of deep neural
and microaneurysm (MA). This process is conducted manually, networks learning algorithm most commonly use to
therefore it is time-consuming and subjected to human error. analyzing visual imagery. CNN involves very little pre-
This paper develops an automatic and intelligent machine processing as it learns image feature that in traditional
learning algorithm for the detection of diabetic retinopathy algorithms were hand-engineered. This independence from
(DR) in fundus image. It involves image enhancement and
human effort and prior knowledge in complex feature design
classification of pathological signs using convolution neural
is a major advantage.
network (CNN) for the DR pathological signs classification. In
the image enhancement process, high-pass filter and histogram
Motivated by the work of [8], this paper proposed an
equalization are applied to improve visual quality of fundus automatic and intelligent machine learning algorithm for the
images. A five layers CNN architecture is implemented to detection of diabetic retinopathy (DR) in fundus image.
classify the three pathological signs; exudate, HEM and MA. However, this work is different with [8] in image
Two dataset, DIARETDB1 and e-Ophtha are used to evaluate enhancement approach and network architecture.
the performance of the system. Simulation results using
enhanced DR images show significant improvement in II. CLINICAL FEATURES
classification accuracy compared to those images without In order to diagnose the DR disease, several clinical
enhancement for both datasets. features need to be identified by ophthalmologist from the
fundus images of the suspected DR patient. The features
Keywords- Diabetic retinopathy, convolutional neural network, include MA, HEM and exudate as shown in Fig. 1.
histogram equalization, high pass filter.
I. INTRODUCTION
Diabetes mellitus (DM) which is a non-communicable
disease directly affecting millions of people worldwide.
World Health Organization (WHO) estimates by 2030,
Malaysia would have 2.48 million people with DM [1]. DM
is caused by insufficient or resistance of insulin in the human
body [2]. This disease may lead to organ complications. One
of most common affected organs is the retina of the eye. The
specific disease is known as diabetic retinopathy (DR).
Prolong and unattended DR may lead to visual loss. Early
detection and treatment will save cost and prevention from
blindness [3].
A traditional DR screening is through manual inspection
of the retina based non-digital ophthalmoscopy method. Fig. 1. Example of common pathological signs of DR.
However, this conventional method is time-consuming,
tedious, and may lead to inaccurate diagnosis [2]. In the past
three decades, an automatic screening of diabetic retinopathy MA is a formation of small, balloon-round shape and dark
using computer vision and intelligent image processing red spot occur due to high blood sugar (hyperglycemia) in
technique become popular among medical researchers. It help blood vessels walls [11]. The number on MA increase as
lessen the work-load of ophthalmologist and provide timely increase stage of DR. Dimension range of MA is between
diagnosis for its early treatment [2]. For automatic screening, 20um and 200um equivalent 1-3 pixel in different image [12].
several machine learning has been developed to detect HEM occurs due to leakages or ruptured of blood from
common DR pathological sign such as microaneurysm (MA), capillaries or blood vessel. It appears as red flamed, dot and
haemorrhages (HEM), and exudate [4]. These include the use blot in size [13]. Usually HEM is 125µm greater than MA in
of artificial neural network (ANN) [5], support vector size [14]. Exudate appears in bright white or yellowish-white
machine (SVM) [6], k-nearest neighbor (K-NN) [7] and colour with sharp edges against the background from blood
convolution neural network (CNN) [4], [8]–[10]. vessel [12]. It is due to lipoproteins leaking out from retinal
978-1-7281-3500-7/19/$31.00 ©2019 IEEE 221
2019 9th IEEE International Conference on Control System, Computing and Engineering (ICCSCE), 29 Nov.–1 Dec. 2019, Penang, Malaysia
B. Image Enhancement
The one of problem retinal image has variety in term of
brightness and contrast which result hard to identify DR
feature. Moreover, every eye has their own unique shape and
feature [11]. There are several reason fundus images was not
able to take clearly from ophthalmologist such as pupil unable
to open (constrict) at bright light condition and aging cataract
clouding at eye lens which lead to less light unable to pass
through the pupil. Since colour of retinal image must be tuned
to allowed CNN performing robust, our proposed method to
enhance the image using high pass filter technique and
Fig. 3. Patches are grouped into MA, HEM, exudate and no-Sign.
histogram equalization to remove noise.
These resulted 5569, 3328, 4816 patches for MA, HEM,
High pass filter is a filter that removes low frequency
exudate and 6642 patches with no pathological sign for
(blur) signal to become sharp signal [17]. It is usually used for
DIARETDB1. While 2292, 4193 patches for MA, exudate and
edge detection and image sharpening. The transfer function of
6863 patches with no signs for e-Optha dataset. The total
the filter of order n with distance D(u,v) and a cut-off
number patch is divided into 60% training and 40% testing for
frequency at a distance D0 is given by:
validation are shown in Table II and Table III:
1
𝐻(𝑢, 𝑣) = (1)
1+[𝐷0 /D(u,v)]2𝑛
222
2019 9th IEEE International Conference on Control System, Computing and Engineering (ICCSCE), 29 Nov.–1 Dec. 2019, Penang, Malaysia
TABLE II. STATISTIC INFORMATION OF SIGN PATCHES OF DIARETDB1 Solver Method SGDM
Type of Microaneurysm Haemorrhage Exudate No- Filter Size Step Down (11,8,7,6,5,4,3,2,1)
Dataset Sign Feature Map Step Up (64,128,256,512)
Training 3341 1997 2890 3986 Activate Function ReLU, Softmax
Testing 2228 1331 1926 2656
Total Number 5569 3328 4816 6642 IV. PERFORMANCE EVALUATION
TABLE III. STATISTIC INFORMATION OF SIGN PATCHES OF E-OPHTA After In medical, the performance of the system will be
Type of Microaneurysm Haemorrhage Exudate No- evaluated based on accuracy. Accuracy of a clinical test refers
Dataset Sign to how correct a diagnostic test identifies and excludes a given
Training 1375 N/A 2516 4118 condition. The performance validation is given as follows:
Testing 917 N/A 1677 2745 𝐴𝑐𝑐𝑢𝑟𝑎𝑐𝑦 =
𝑇𝑃+𝑇𝑁
(2)
Total Number 2292 N/A 4193 6863 𝑇𝑃+𝑇𝑁+𝐹𝑃+𝐹𝑁
where TP, TN, FP and FN are true positive, true negative, false
D. CNN - Patch Training positive and false negative respectively.
The CNN will be implemented on MATLAB 2017b
platform supported by NVIDIA's GTX980 graphics V. RESULTS AND DISCUSSION
processing unit (GPU). To design CNN network architecture
in Fig. 4, four (4) convolution layer were designed using 64 The fundus image has been successfully enhanced by first
feature maps in each convolution layer by kernel size of (3x3) applying the high-pass filter followed by histogram
pixels. The rectified liner unit (ReLU) activate function was equalization method. This resulted image gives clearer and
used in this research to avoid saturation and allows it to more visible pathological signs compared to the original
compute faster than sigmoid. Max-Pooling reduce the size of image. This can be observed in Fig. 5.
feature map with kernel (2x2) pixel to simplify the output,
reducing number of parameters that network need to learn.
The output of Max-Pooling was normalized by normalization
layer (NM) to speed up network training and reduce the
sensitivity to network initialization. Last 512 feature were
extracted from last Max-Pooling Layer connected together
using Fully-Connected (FC) layer layer become 512 neuron
link together. The softmax activated function normalized
output result of FC will determine which classification
probabilities by the classification layer. Fig. 5. Original image and the result after applying high-pass filter and
histogram equalization.
223
2019 9th IEEE International Conference on Control System, Computing and Engineering (ICCSCE), 29 Nov.–1 Dec. 2019, Penang, Malaysia
learn important features required to classify the pathological image enhancement technique helps in improving visual
signs from the given dataset. quality of the fundus image as well as highlighting the
The CNN performance are further measured using pathological signs. The CNN architecture implemented in
testing dataset. The detailed test performance of the CNN is this study has achieved good classification accuracy for both
presented in confusion matrix. The confusion matrix gives DIARETDB1 and e-Ophtha datasets. The classification
detail information about the classification accuracy for each accuracy for DIARETDB1 and e-Ophtha has improved by
classes after the training process. Fig.7 shows an example of 3.6% and 2.5% respectively as compared to those images
confusion matrix for DIARETDB1 with image enhancement. without enhancement. Future work will consider other
enhancement method to be investigated for improve overall
accuracy when tested with multiple pathological signs.
ACKNOWLEDGMENT
Author would like to acknowledge Faculty of Electrical
Engineering, UiTM Shah Alam and Penang Branch,
Department of Ophthalmology, and Faculty of Medicine,
UiTM Sungai Buloh for supporting this research.
REFERENCES
224
2019 9th IEEE International Conference on Control System, Computing and Engineering (ICCSCE), 29 Nov.–1 Dec. 2019, Penang, Malaysia
225