Report_mini_project_2
The dataset consists of two sets of images: Set A contains sharp images, while Set B comprises the
same images with varying levels of blur introduced through Gaussian filters of different kernel sizes and
sigma values. Set A serves as the ground truth for evaluation, while Set B is used as input for deblurring.
Initially, images from Set A are downscaled to a resolution of (256, 448) for uniform processing. This was done in Python, and the resized images were saved locally. Set B is generated by applying Gaussian filters with kernel sizes of 3x3, 7x7, and 11x11, and corresponding sigma values of 0.3, 1, and 1.6, respectively. This diverse dataset provides a challenging training environment.
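A minimal sketch of how Set B could be generated (the report does not name the imaging library; OpenCV is assumed here, and the directory paths are hypothetical):

import os
import cv2

# Kernel-size / sigma pairs from the report.
BLUR_CONFIGS = [(3, 0.3), (7, 1.0), (11, 1.6)]

def generate_set_b(sharp_dir, out_dirs):
    '''Apply each Gaussian filter to every downscaled sharp image.'''
    for name in os.listdir(sharp_dir):
        img = cv2.imread(os.path.join(sharp_dir, name))
        for (ksize, sigma), out_dir in zip(BLUR_CONFIGS, out_dirs):
            blurred = cv2.GaussianBlur(img, (ksize, ksize), sigma)
            cv2.imwrite(os.path.join(out_dir, name), blurred)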
The objective is to design a neural network capable of deblurring images from Set B to resemble their
sharp counterparts in Set A. However, model complexity is restricted to 15 million parameters to ensure
efficiency.
2 Assumption
Resized images have a height of 256 and a width of 448 (rather than height 448 and width 256) to preserve the aspect ratio.
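If OpenCV is used for the resizing (an assumption; the report only says Python), note that cv2.resize expects the target size as (width, height):

import cv2

img = cv2.imread('frame.png')          # hypothetical input frame
resized = cv2.resize(img, (448, 256))  # (width, height) -> 256 rows x 448 columns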
3 Pre-Processing
3.1 Data Compression
Initially, the models could not be run on a personal laptop due to a slow GPU and limited RAM, so it was necessary to shift to Google Colab. The full dataset was 32 GB and could not be processed on Google Colab because of limited memory. Even compressed, the dataset was about 4.5 GB each for the sharp images and for each of the three sets of filtered images, so uploading all of them would require about 18 GB in total, which is not feasible.
To solve this, I explored the sub-folders. There were 240 sub-folders, each containing 100 image frames captured with a very small time gap, so consecutive frames are nearly identical. Of these 100 frames, the 1st, 40th, and 80th were chosen; thus only 3 out of every 100 images were actually used for training, as sketched below.
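A sketch of this sub-sampling step (the directory layout and 0-indexed frame ordering are assumptions):

import os
import shutil

KEEP = (0, 39, 79)  # the 1st, 40th and 80th frames, 0-indexed

def subsample(dataset_dir, out_dir):
    '''Copy only 3 of the 100 frames from each of the 240 sub-folders.'''
    for sub in sorted(os.listdir(dataset_dir)):
        frames = sorted(os.listdir(os.path.join(dataset_dir, sub)))
        os.makedirs(os.path.join(out_dir, sub), exist_ok=True)
        for i in KEEP:
            shutil.copy(os.path.join(dataset_dir, sub, frames[i]),
                        os.path.join(out_dir, sub, frames[i]))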
4 Models Tried
In this section, we discuss the various models explored for the image deblurring task. We experimented
with different architectures to find the most suitable one for our problem.
4.2 U-Net
Next, we experimented with the U-Net architecture, which is widely used for image segmentation tasks.
U-Net consists of an encoder-decoder structure with skip connections between corresponding encoder
and decoder layers. These skip connections allow the network to preserve spatial information during
upsampling. Despite its success in segmentation tasks, we found that U-Net struggled to deblur images effectively.
U-Net seemed to be a good approach, but it was not very successful. Investigating the structure of U-Net, I found that it contains several up-sampling layers, some close to the output layer. This could have been a cause of the blurred output: up-sampling an image introduces blur, and it is difficult to deblur an image with a single convolutional layer.
Thus, to tackle this problem, a few additional convolutional layers were added before the output. However, as the input had gone through multiple convolution, pooling, and up-sampling operations, it was no longer a simple representation of the input image but rather a representation of its features.
This meant the initial image had to be introduced once again so that the information in it is used directly; this way, the output is guaranteed to be at least as good as the input. This was done through concatenation: the input image and the output of the preceding convolutional block were concatenated just before the final output layers. Finally, this architecture worked!
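The key modification can be sketched in Keras as follows (layer widths follow Table 1; the activations and exact wiring are assumptions, so this is an illustrative reconstruction rather than the exact training code):

from tensorflow.keras import layers

def deblur_head(inp, features):
    # features: the 64-channel output of the last decoder Conv2D block
    # inp: the original (256, 448, 3) input image, re-injected here
    x = layers.Concatenate()([features, inp])        # 64 + 3 = 67 channels
    x = layers.Conv2D(128, 3, padding='same', activation='relu')(x)
    x = layers.Conv2D(128, 3, padding='same', activation='relu')(x)
    return layers.Conv2D(3, 1, padding='same')(x)    # 1x1 conv produces the RGB output

Because the input image is carried straight into this head, the identity mapping (output = input) is easy for the network to represent, so the prediction should be no worse than the blurred input.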
[Architecture diagram: Input → Conv2D + MaxPool encoder blocks → Conv2DTranspose + Concatenate decoder blocks → several Conv2D layers → Concatenate with the input image → Conv2D layers just before the Output]
Table 1: Model summary

Layer (type)      Output Shape           Param #    Connected to
InputLayer        (None, 256, 448, 3)    0          []
Conv2D            (None, 256, 448, 64)   1792       ['input_1[0][0]']
Conv2D            (None, 256, 448, 64)   36928      ['conv2d[0][0]']
MaxPooling2D      (None, 128, 224, 64)   0          ['conv2d_1[0][0]']
Conv2D            (None, 128, 224, 128)  73856      ['max_pooling2d[0][0]']
Conv2D            (None, 128, 224, 128)  147584     ['conv2d_2[0][0]']
MaxPooling2D      (None, 64, 112, 128)   0          ['conv2d_3[0][0]']
Conv2D            (None, 64, 112, 256)   295168     ['max_pooling2d_1[0][0]']
Conv2D            (None, 64, 112, 256)   590080     ['conv2d_4[0][0]']
MaxPooling2D      (None, 32, 56, 256)    0          ['conv2d_5[0][0]']
Conv2D            (None, 32, 56, 512)    1180160    ['max_pooling2d_2[0][0]']
Conv2D            (None, 32, 56, 512)    2359808    ['conv2d_6[0][0]']
Conv2D            (None, 32, 56, 512)    2359808    ['conv2d_7[0][0]']
Conv2DTranspose   (None, 64, 112, 256)   524544     ['conv2d_8[0][0]']
Concatenate       (None, 64, 112, 512)   0          ['conv2d_transpose[0][0]', 'conv2d_5[0][0]']
Conv2D            (None, 64, 112, 256)   1179904    ['concatenate[0][0]']
Conv2D            (None, 64, 112, 256)   590080     ['conv2d_9[0][0]']
Conv2DTranspose   (None, 128, 224, 128)  131200     ['conv2d_10[0][0]']
Concatenate       (None, 128, 224, 256)  0          ['conv2d_transpose_1[0][0]', 'conv2d_3[0][0]']
Conv2D            (None, 128, 224, 128)  295040     ['concatenate_1[0][0]']
Conv2D            (None, 128, 224, 128)  147584     ['conv2d_11[0][0]']
Conv2D            (None, 128, 224, 128)  147584     ['conv2d_12[0][0]']
Conv2DTranspose   (None, 256, 448, 64)   32832      ['conv2d_13[0][0]']
Concatenate       (None, 256, 448, 128)  0          ['conv2d_transpose_2[0][0]', 'conv2d_1[0][0]']
Conv2D            (None, 256, 448, 64)   73792      ['concatenate_2[0][0]']
Conv2D            (None, 256, 448, 64)   36928      ['conv2d_14[0][0]']
Conv2D            (None, 256, 448, 64)   36928      ['conv2d_15[0][0]']
Conv2D            (None, 256, 448, 64)   36928      ['conv2d_16[0][0]']
Conv2D            (None, 256, 448, 64)   36928      ['conv2d_17[0][0]']
Conv2D            (None, 256, 448, 64)   36928      ['conv2d_18[0][0]']
Conv2D            (None, 256, 448, 64)   36928      ['conv2d_19[0][0]']
Concatenate       (None, 256, 448, 67)   0          ['conv2d_20[0][0]', 'input_1[0][0]']
Conv2D            (None, 256, 448, 128)  77312      ['concatenate_3[0][0]']
Conv2D            (None, 256, 448, 128)  147584     ['conv2d_21[0][0]']
Conv2D            (None, 256, 448, 3)    387        ['conv2d_22[0][0]']

Total params: 10,577,667 (40.35 MB)

At about 10.58 million parameters, the model is comfortably within the 15-million-parameter budget.
6 Training Details
Given a batch size of 10, the data loader returns 10 random blurred images and their corresponding sharp images. Since three blur kernels were provided, the kernel for each sample is chosen uniformly at random, and the sub-folder is likewise selected uniformly at random from the 240 available. From the provided code, the following training details are used:
• Batch Size: 10
• Epochs: 40
• Loss Function: Mean Squared Error (MSE)
• Optimizer: Adam
• Steps per Epoch: 64 (Number of gradient steps taken per epoch)
Callbacks:
• ModelCheckpoint: Used to save the best model during training.
Training Procedure:
from tensorflow.keras.callbacks import ModelCheckpoint

batch_size = 10
epochs = 40

# Monitor the training loss, since no validation set is used.
checkpoint = ModelCheckpoint('/content', monitor='loss', verbose=1,
                             save_best_only=True)

model = create_unet_model2()
model.compile(optimizer='adam', loss='mean_squared_error')
model.summary()

# The dataloader is a generator that already yields whole batches,
# so no batch_size argument is passed to fit().
model.fit(dataloader(blurred_image_dir_1, blurred_image_dir_2, blurred_image_dir_3,
                     downscaled_image_dir, batch_size),
          epochs=epochs,
          steps_per_epoch=64,
          callbacks=[checkpoint])
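The dataloader itself is not reproduced in the report; a minimal sketch consistent with the description above (the directory layout and normalization are assumptions) could look like:

import os
import random
import numpy as np
import cv2

def dataloader(dir_k1, dir_k2, dir_k3, sharp_dir, batch_size):
    '''Yield (blurred, sharp) batches indefinitely.

    Each sample draws a sub-folder and a blur kernel uniformly at random.
    '''
    blurred_dirs = [dir_k1, dir_k2, dir_k3]
    subfolders = sorted(os.listdir(sharp_dir))      # the 240 sub-folders
    while True:
        xs, ys = [], []
        for _ in range(batch_size):
            sub = random.choice(subfolders)
            frame = random.choice(os.listdir(os.path.join(sharp_dir, sub)))
            blur_dir = random.choice(blurred_dirs)  # uniform over the 3 kernels
            xs.append(cv2.imread(os.path.join(blur_dir, sub, frame)))
            ys.append(cv2.imread(os.path.join(sharp_dir, sub, frame)))
        yield (np.asarray(xs, np.float32) / 255.0,
               np.asarray(ys, np.float32) / 255.0)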
7 Training Curves
8 Results
8.1 Qualitative Results
8.1.1 Some Examples
8.2 Quantitative Results
8.2.1 Kernel 1
Although the average PSNR for this kernel is higher than for kernels 2 and 3, it is lower than the PSNR of the blurred input itself. This is because the kernel size and sigma of this Gaussian are very small, so the blurred image is already close to the ground truth: PSNR is extremely sensitive when comparing nearly identical images and is therefore unreliable in this range.
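For reference, the standard PSNR definition is assumed here (the report does not state the exact variant used):

\mathrm{PSNR} = 10\,\log_{10}\!\left(\frac{\mathrm{MAX}_I^{2}}{\mathrm{MSE}}\right),
\qquad
\mathrm{MSE} = \frac{1}{HWC}\sum_{i}\left(I_i - \hat{I}_i\right)^{2}

As the MSE approaches zero for nearly identical images, the PSNR grows without bound, which is why small differences between a lightly blurred input and the ground truth produce large, unstable PSNR values.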
8.2.2 Kernel 2
• Average PSNR of Predicted Images: 35.88
The results are very good in this category: there is a significant improvement in the PSNR value.
8.2.3 Kernel 3
The results are decent in this category: there is a good improvement in the PSNR value.
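A sketch of how these average PSNR values over the test set could be computed (assuming images scaled to [0, 1]; the function and variable names are illustrative):

import tensorflow as tf

def average_psnr(model, blurred, sharp):
    '''Mean PSNR of predictions vs. sharp ground truth.

    blurred, sharp: float32 arrays of shape (N, 256, 448, 3) in [0, 1].
    '''
    pred = model.predict(blurred)
    pred = tf.clip_by_value(pred, 0.0, 1.0)  # keep predictions in the valid range
    return float(tf.reduce_mean(tf.image.psnr(pred, sharp, max_val=1.0)))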
Figure 7: Prediction and Blurred Image PSNR for all Test Data