Convolutional Neural Networks

The document provides an overview of Convolutional Neural Networks (CNNs), explaining their structure and functionality in image processing and classification. It details the convolution operation, the role of filters in feature detection, and the importance of padding and strided convolutions to manage image size and data loss. Additionally, it describes the pooling layers and the transition from convolutional layers to fully connected layers for classification tasks.

CNNs

We have understood neural networks and how they can be used to build
robust models using numerical data.

Let's assume we have an image with height = 6, width = 6, and the
number of channels = 3 (known as RGB, where each channel (red,
green, and blue) stores the intensity of that color as an integer between 0
and 255). So, there are 6 x 6 x 3 = 108 input values (36 pixels, each with
3 channel values). Each value will be a node in the input layer.

Now, consider that the first hidden layer of the neural network has
10 units. Since each node is connected to every node of the subsequent
layer in a feed-forward neural network, the total number of parameters
(weights + biases) is 108 x 10 + 10 = 1090. So, we need 1090 parameters for
only one layer, and images are generally much larger than 6 x 6 x 3,
usually 224 x 224 x 3. In such cases, the number of trainable parameters
explodes, which makes training computationally expensive and makes the
model prone to overfitting.
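The arithmetic above can be checked with a short sketch (the helper name `dense_params` is ours, not from the original text):

```python
def dense_params(inputs, units):
    """Weights + biases for one fully connected layer."""
    return inputs * units + units

# 6 x 6 x 3 image flattened into 108 inputs, first hidden layer of 10 units:
print(dense_params(6 * 6 * 3, 10))      # 1090

# A more realistic 224 x 224 x 3 image makes the count explode:
print(dense_params(224 * 224 * 3, 10))  # 1505290
```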

To deal with such problems, we have a special type of neural network
called the Convolutional Neural Network (CNN), widely used in image
processing and image classification. It takes the pixels of an image as
input and generates the desired output.

Convolutional Operation

Let’s understand how a convolution operation works.

The first step of a CNN is to detect features like edges and shapes,
which is done by applying a convolution operation to the image
using filters (filters are responsible for extracting features from the
image).

Let’s understand this using an example. Consider a greyscale image of
size 6 x 6 x 1 (the number of channels is 1 for greyscale images)
represented as a matrix, where each entry represents a pixel intensity.

We can convolve this 6 x 6 matrix with a 3 x 3 filter.

After the convolution, we will get a 4 x 4 image. To calculate the first
element of the 4 x 4 matrix, we take the first 3 x 3 patch of the 6 x 6
matrix and multiply it element-wise with the filter. The first element of
the 4 x 4 matrix is the sum of these element-wise products, i.e.,
3*1 + 0*0 + 1*(-1) + 1*1 + 5*0 + 8*(-1) + 2*1 + 7*0 + 2*(-1) = -5.

Similarly, we will convolve over the entire image and get a 4 X 4 matrix.
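The sliding computation described above can be sketched as a small NumPy function (a minimal illustration; `convolve2d_valid` is our own helper name, and it performs the un-flipped correlation CNNs actually use; the example image is chosen so its top-left patch matches the worked example above):

```python
import numpy as np

def convolve2d_valid(image, kernel):
    """'Valid' convolution as used in CNNs (no kernel flip)."""
    n, f = image.shape[0], kernel.shape[0]
    out = np.zeros((n - f + 1, n - f + 1))
    for i in range(n - f + 1):
        for j in range(n - f + 1):
            # Sum of the element-wise product of the patch and the filter.
            out[i, j] = np.sum(image[i:i + f, j:j + f] * kernel)
    return out

image = np.array([
    [3, 0, 1, 2, 7, 4],
    [1, 5, 8, 9, 3, 1],
    [2, 7, 2, 5, 1, 3],
    [0, 1, 3, 1, 7, 8],
    [4, 2, 1, 6, 2, 8],
    [2, 4, 5, 2, 3, 9],
])
kernel = np.array([[1, 0, -1]] * 3)  # vertical edge detector

out = convolve2d_valid(image, kernel)
print(out.shape)   # (4, 4)
print(out[0, 0])   # -5.0, matching the hand computation above
```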

In 3D convolution, the total number of parameters for 10 filters is 3 x 3 x 3
(size of each filter) x 10 (filters) + 10 (biases) = 280. This is far
fewer parameters than in a fully connected ANN.

Note: The size of the filter is 3 x 3 x 3 because we are assuming a colored
image, so there is one 3 x 3 slice for each channel: R, G, and B.

Filters

Filters are responsible for locating objects in an image by detecting
changes in the pixel intensities of the image.

Generally, we have an edge detector that detects edges in an image. For
example:

1 0 -1
1 0 -1
1 0 -1

This filter is responsible for detecting vertical edges. Let's see how it
works. If the filter slides over a region of the image with similar
pixel values, the result of the convolution is zero, because the positives
and the negatives cancel each other out. However, if the filter slides over
a region with a vertical edge, there are differently colored pixels on the
left and right, so the result of the convolution is non-zero, detecting an
edge there.

1 1 1

0 0 0

-1 -1 -1

The above filter is responsible for detecting horizontal edges. If there
are differently colored pixels above and below the region the filter is
sliding over, the result of the convolution is non-zero, whereas regions
with uniform pixels give zero.
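The cancellation argument can be demonstrated with two hand-picked 3 x 3 regions (an illustrative sketch; the sample pixel values are ours):

```python
import numpy as np

vertical = np.array([[1, 0, -1],
                     [1, 0, -1],
                     [1, 0, -1]])  # vertical edge detector

uniform = np.full((3, 3), 7)       # region of similar pixels
edge = np.array([[10, 10, 0],
                 [10, 10, 0],
                 [10, 10, 0]])     # bright on the left, dark on the right

# Uniform region: positives and negatives cancel.
print(np.sum(uniform * vertical))  # 0
# Vertical edge: left and right columns differ, so the sum is non-zero.
print(np.sum(edge * vertical))     # 30
```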

Padding

We have seen that convolving an input of 6 x 6 dimensions with a 3 x 3
filter results in a 4 x 4 matrix. We can generalize this and say that if the
input is n x n and the filter size is f x f, then the output size will be
(n-f+1) x (n-f+1):

 Input size: n x n

 Filter size: f x f

 Output size: (n-f+1) x (n-f+1)

But there are certain disadvantages of a convolutional filter:

1. When we apply a convolutional filter, the size of the image reduces.

2. Pixels at the corners and edges of the image take part in far fewer
convolutions than the central pixels, so information near the borders
is under-represented.

To avoid these issues, we can add a border around the input image. This
border is called padding. If we apply a padding of 1, it means that the
input will be an 8 X 8 matrix (instead of a 6 x 6 matrix). Applying a
convolution of 3 x 3 on the padded input will result in a 6 x 6 matrix,
which is the original shape of the image.

 Input size: n x n

 Padding size: p

 Filter size: f x f

 Output size: (n+2p-f+1) x (n+2p-f+1)

There are two common choices for padding:

1. Valid: No padding. If we are using valid padding, the output will be
(n-f+1) x (n-f+1).

2. Same: Here, we apply just enough padding that the output size is the
same as the input size, i.e., n+2p-f+1 = n, so p = (f-1)/2.

We now know how to use padded convolution. This way we don’t lose a lot
of information and the image does not shrink either.
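Padding is straightforward to apply in practice; one way is NumPy's `np.pad`, which surrounds the array with zeros by default (a minimal sketch, not tied to any particular framework):

```python
import numpy as np

image = np.arange(36).reshape(6, 6)  # a 6 x 6 input
padded = np.pad(image, pad_width=1)  # padding p = 1, zero-filled border

print(padded.shape)  # (8, 8)
# Convolving this 8 x 8 padded input with a 3 x 3 filter gives
# 8 - 3 + 1 = 6, so "same" padding with p = (f-1)/2 = 1 preserves
# the original 6 x 6 shape.
```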

Strided Convolutions

Stride is the number of pixels the filter shifts over the input matrix at
each step. Suppose we choose a stride of 2. Then, while convolving over
the image, we take two-pixel steps, both in the horizontal and vertical
directions. The dimensions for stride s will be:

 Input size: n x n

 Padding size: p

 Stride: s

 Filter size: f x f

 Output size: [⌊(n+2p-f)/s⌋+1] x [⌊(n+2p-f)/s⌋+1], where ⌊·⌋ is the
floor, needed when the stride does not divide (n+2p-f) evenly

Stride helps to reduce the size of the image.
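The general output-size formula covers all the cases above, which a short helper makes easy to check (the function name `conv_output_size` is our own):

```python
import math

def conv_output_size(n, f, p=0, s=1):
    """Spatial output size of a convolution: floor((n + 2p - f) / s) + 1."""
    return math.floor((n + 2 * p - f) / s) + 1

print(conv_output_size(6, 3))            # 4, valid convolution
print(conv_output_size(6, 3, p=1))       # 6, "same" padding
print(conv_output_size(7, 3, p=1, s=2))  # 4, strided convolution
```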

Pooling Layers

Pooling layers are generally used to reduce the size of the input image
and to increase the speed of computation.

Consider a 4 x 4 matrix. Applying max-pooling with a 2 x 2 window on this
matrix will result in a 2 x 2 output.

For every 2 x 2 box, we take the maximum value. Here, we have applied a
filter of size 2 and a stride of 2; these are the hyperparameters of the
pooling layer. Apart from max-pooling, we can also apply average-pooling,
where, instead of taking the maximum of the numbers, we take their
average.
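Max-pooling with a 2 x 2 window and stride 2 can be sketched as follows (an illustrative helper; the sample matrix and the name `max_pool` are ours):

```python
import numpy as np

def max_pool(x, size=2, stride=2):
    """Max-pooling with a square window; size and stride are the hyperparameters."""
    n = (x.shape[0] - size) // stride + 1
    out = np.zeros((n, n))
    for i in range(n):
        for j in range(n):
            # Maximum value inside each window.
            out[i, j] = x[i * stride:i * stride + size,
                          j * stride:j * stride + size].max()
    return out

x = np.array([[1, 3, 2, 1],
              [4, 6, 5, 0],
              [7, 2, 9, 8],
              [3, 1, 4, 5]])
print(max_pool(x))  # 2 x 2 output: [[6, 5], [7, 9]]
```

Swapping `.max()` for `.mean()` turns this into average-pooling.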

Convolution Layer

We have seen how the convolution operation works. Now, we will see how
convolutional layers operate in a Neural Network setting.

In artificial neural networks, each layer has multiple neurons. Similarly,
in CNNs, we have multiple filters in each layer. We specify the shape of
these filters and other parameters such as stride and padding. The output
of the convolution operation with each filter is a 2-D matrix, as discussed
above.

Since we have multiple convolutional layers, each layer receives its
input channels from the output of the previous layer. For the first layer,
the three input channels are the red, green, and blue pixel values of the
image. The output channels of each layer (since each filter produces one
output channel) act as the inputs for the next convolutional layer.

At the start of the fully connected section of our architecture, we perform
a flatten operation that converts these feature maps into a 1-D data format,
which makes them suitable for the feed-forward layers lying ahead.

Fully Connected Layer


We can extract different features of the image using combinations of
convolutional layers and the other techniques mentioned above, but
convolutional layers alone cannot perform classification or regression. For
this, we add a fully connected layer. However, after applying the various
filters and layers, the output is a matrix, so we have to flatten that
matrix into a vector to feed it into the fully connected layer.

After the image goes through the convolutional layers, the resulting
matrix is flattened, and the flattened vector acts as the input for the
fully connected layers.
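The flatten step itself is a single reshape (a minimal sketch; the feature-map shape 4 x 4 x 10 is a hypothetical output of a last convolutional/pooling layer):

```python
import numpy as np

# Hypothetical output of the last convolutional/pooling layer:
# ten 4 x 4 feature maps.
feature_maps = np.random.rand(4, 4, 10)

# Flatten into a 1-D vector for the fully connected layers.
flattened = feature_maps.reshape(-1)
print(flattened.shape)  # (160,)
```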
