Pre Defence Report
Pre Defence Report
By
Md Abu Saeed
Roll:1907057
Supervisor:
Kazi Saeed Alam
Assistant Professor
Deapartment of CSE
khulna university of engineering & technology
1
Contents
1 Introduction 4
2 Related work 5
3 Methodology 7
3.1 Dataset Selection . . . . . . . . . . . . . . . . . . . . . . . . . . 8
3.2 Image Pre-processing . . . . . . . . . . . . . . . . . . . . . . . . 8
3.2.1 Extract the vessels . . . . . . . . . . . . . . . . . . . . . 8
3.2.2 Extract the microaneurysm . . . . . . . . . . . . . . . . . 10
3.2.3 Merge previous results . . . . . . . . . . . . . . . . . . . 11
3.3 Data Augmentation . . . . . . . . . . . . . . . . . . . . . . . . . 12
3.4 Model Training . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
4 Current Result 13
4.1 Future work . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
5 Conclusion 15
6 References 16
2
List of Figures
1 Methodology . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
2 Vessels extraction steps . . . . . . . . . . . . . . . . . . . . . . . 9
3 Microaneurysm extraction steps . . . . . . . . . . . . . . . . . . 10
4 Final merged image . . . . . . . . . . . . . . . . . . . . . . . . . 11
5 Data augmentation . . . . . . . . . . . . . . . . . . . . . . . . . 12
3
1 Introduction
Diabetic retinopathy (DR) is a severe eye condition that arises as a complication
of diabetes, resulting from prolonged high blood sugar levels. It leads to
progressive damage to the blood vessels in the retina, which may cause vision
impairment and, if left untreated, permanent blindness. The retina is a critical part
of the eye that senses light and sends visual signals to the brain, and damage to its
blood vessels severely affects vision. DR progresses through various stages,
starting from mild, non-proliferative forms where blood vessels may leak or swell
(Non-Proliferative Diabetic Retinopathy, NPDR) to more advanced stages
marked by abnormal blood vessel growth (Proliferative Diabetic Retinopathy,
PDR). In its early stages, diabetic retinopathy is often asymptomatic, making
timely detection and treatment difficult. However, as the disease progresses,
patients may experience blurred vision, floaters, and eventual vision loss
4
2 Related work
In recent years, the application of convolutional neural networks (CNNs) to
medical image analysis has gained significant traction, particularly in the
detection of diabetic retinopathy (DR) [1]. A notable study focused on
developing a computer-assisted tool using CNNs for rapid and accurate
classification of retinal images to diagnose diabetic retinopathy [2]. The tool
identifies key features such as exudates, microaneurysms, and hemorrhages from
retinal images, achieving a specificity of 73.65% and an accuracy of 73.68% in
validation tests on 35,126 images from the EyePACS dataset. This approach
demonstrates the potential of CNNs in reducing the time and effort required for
manual diagnosis by clinicians [3].
Several studies have explored similar deep learning approaches to automate the
diagnosis of DR. One paper employed a deep neural network, specifically a CNN,
to classify retinal images into diabetic and non-diabetic categories, achieving over
85% accuracy during training [4], [5]. This significant improvement over
traditional algorithms highlights the efficiency of CNNs in feature extraction,
leveraging layers like ReLU activation and MaxPooling to identify key patterns in
the images [6], [7]. Another study focused on screening for non-proliferative
diabetic retinopathy (NPDR) using a combination of advanced image processing
techniques such as recursive region growing segmentation and the Moat Operator.
This automated system demonstrated high sensitivity and specificity, comparable
to the performance of experienced ophthalmologists [8], [9].
5
achieved around 85% accuracy in two-class classification, further showcasing the
effectiveness of CNNs in medical imaging [10].
The VGGNet [11] model has also been effectively employed in the automatic
classification of DR. In one study, CNNs were used to extract features such as
blood vessels and microaneurysms, leading to an accuracy of 95.41% in detecting
the presence of diabetic retinopathy. The high accuracy of this model underscores
the importance of early detection through automated systems, which can play a
crucial role in preventing severe vision loss [3] [6].
6
3 Methodology
The full methodology can be described as follows:
3. Data Augmentation
Figure 1: Methodology
7
3.1 Dataset Selection
10. Small thin vessels are removed having length less than 30 pixels.
8
Figure 2: Vessels extraction steps
9
3.2.2 Extract the microaneurysm
6. Then large white areas are removed having length greater than 20 pixels.
10
3.2.3 Merge previous results
Both the images are merged by taking the maximum intensity at each pixel.
11
3.3 Data Augmentation
Data augmentation is employed to increase the size of the training dataset and
improve the generalization ability of the model. The augmentation process
involves creating new variations of each image by modifying its brightness and
applying rotations. The steps are as follows:
2. Rotation: Six variations of each image are generated. These include: The
original image. The brightness-adjusted image. 90-degree and 180-degree
rotated versions of both the original and brightness-adjusted images.
This augmentation strategy ensures that the model is trained on a diverse set of
images, improving its robustness to variations in lighting and orientation.
12
3.4 Model Training
4 Current Result
Model accuracy on validation set is around 70%. (unstable)
13
4.1 Future work
4. Expanding the Dataset to include more diverse and larger datasets, such as
those with multi-modal imaging techniques, to improve the generalization of
the model across different populations.
14
5 Conclusion
This thesis focuses on developing a model for classifying diabetic retinopathy
using convolutional neural networks (CNNs) and advanced image processing
techniques. The methodology includes preprocessing techniques to enhance
retinal features, feature extraction to isolate blood vessels and microaneurysms,
and data augmentation to create robust training datasets. Both custom CNN
models and transfer learning with VGG16 have been employed to get good result.
Since the system demonstrates unstable performance in classifying retinal
images, future work will focus on to resolve it, combining additional
preprocessing techniques to further enhance feature visibility and applying more
advanced transfer learning models to improve accuracy. The ongoing expansion
of the dataset and exploration of ensemble methods will further refine the model,
ultimately aiming for a more accurate and scalable solution for diabetic
retinopathy detection.
15
6 References
References
[1] C. Sinthanayothin, J. F. Boyce, T. H. Williamson, H. L. Cook, E. Mensah,
S. Lal, and D. Usher, “Automated detection of diabetic retinopathy on digital
fundus images,” Diabetic medicine, vol. 19, no. 2, pp. 105–112, 2002.
16
retinopathy status prediction using fundus images,” in 2020 IEEE region 10
symposium (TENSYMP). IEEE, 2020, pp. 190–193.
[10] K. Xu, D. Feng, and H. Mi, “Deep convolutional neural network-based early
automated detection of diabetic retinopathy using fundus image,”
Molecules, vol. 22, no. 12, p. 2054, 2017.
17