Convolutional Neural Network
Dense neural network and Convolutional neural network
Grayscale vs. color image
Convolutional kernel

Filter 1 (3 x 3):
 1 -1 -1
-1  1 -1
-1 -1  1

6 x 6 image:
1 0 0 0 0 1
0 1 0 0 1 0
0 0 1 1 0 0
1 0 0 0 1 0
0 1 0 0 1 0
0 0 1 0 1 0

Convolution with stride = 1: the filter slides over the image one pixel at a time; the dot product at the first position is 3, and at the next position it is -1.

Convolution with stride = 2: the filter moves two pixels at a time; the first two dot products are 3 and -3.

Importance of stride:
• Less overlap between the image pixels and the filter mask
• Smaller output volume
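As an illustration (not part of the original slides), here is a minimal NumPy sketch that slides the 3 x 3 filter over the 6 x 6 image with stride 1 and stride 2 and reproduces the dot products 3, -1 and 3, -3 shown above.

import numpy as np

# 6 x 6 image and Filter 1 from the example above
image = np.array([
    [1, 0, 0, 0, 0, 1],
    [0, 1, 0, 0, 1, 0],
    [0, 0, 1, 1, 0, 0],
    [1, 0, 0, 0, 1, 0],
    [0, 1, 0, 0, 1, 0],
    [0, 0, 1, 0, 1, 0],
])
filter1 = np.array([
    [ 1, -1, -1],
    [-1,  1, -1],
    [-1, -1,  1],
])

def conv2d(img, kernel, stride=1):
    """Valid convolution as used in CNNs (kernel is not flipped)."""
    k = kernel.shape[0]
    out_size = (img.shape[0] - k) // stride + 1
    out = np.zeros((out_size, out_size), dtype=int)
    for i in range(out_size):
        for j in range(out_size):
            patch = img[i * stride:i * stride + k, j * stride:j * stride + k]
            out[i, j] = np.sum(patch * kernel)
    return out

print(conv2d(image, filter1, stride=1))  # 4 x 4 output; first row starts 3, -1, ...
print(conv2d(image, filter1, stride=2))  # 2 x 2 output; first row is 3, -3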
A convolutional layer
A CNN is a neural network with some convolutional layers (and some other layers).
A convolutional layer has a number of filters that perform the convolution operation.
Convolutional kernel
A filter (kernel): these are the network parameters to be learned.

Filter 1:
 1 -1 -1
-1  1 -1
-1 -1  1

Filter 2:
-1  1 -1
-1  1 -1
-1  1 -1

…

Each filter is convolved with the 6 x 6 image and detects a small pattern (3 x 3).
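A minimal PyTorch sketch of such a layer (an illustration, not from the slides): a Conv2d layer holds its filters as learnable parameters; here the two filters above are written into the weights by hand so the layer produces one feature map per filter.

import torch
import torch.nn as nn

# A convolutional layer with two 3 x 3 filters; the filter weights are the
# parameters training would learn (set by hand here for illustration).
conv = nn.Conv2d(in_channels=1, out_channels=2, kernel_size=3, stride=1, bias=False)

filter1 = torch.tensor([[ 1., -1., -1.],
                        [-1.,  1., -1.],
                        [-1., -1.,  1.]])
filter2 = torch.tensor([[-1.,  1., -1.],
                        [-1.,  1., -1.],
                        [-1.,  1., -1.]])
with torch.no_grad():
    conv.weight[0, 0] = filter1
    conv.weight[1, 0] = filter2

image = torch.tensor([[1, 0, 0, 0, 0, 1],
                      [0, 1, 0, 0, 1, 0],
                      [0, 0, 1, 1, 0, 0],
                      [1, 0, 0, 0, 1, 0],
                      [0, 1, 0, 0, 1, 0],
                      [0, 0, 1, 0, 1, 0]], dtype=torch.float32)

# Input shape (batch, channels, height, width) -> two 4 x 4 feature maps, one per filter.
feature_maps = conv(image.unsqueeze(0).unsqueeze(0))
print(feature_maps.shape)  # torch.Size([1, 2, 4, 4])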
Consider learning an image:
• Some patterns are much smaller than the whole image,
e.g. a "beak" detector only needs to look at a small region of the image.
• In the filters, 1 = edge and -1 = not edge.
Rectified linear unit (ReLU)
ReLU is applied to the feature maps after convolution, replacing negative values with zero: f(x) = max(0, x).
Pooling:
(Figure: the 6 x 6 image is convolved with Filter 1 and Filter 2; the final image after the set of convolution operations is flattened into a vector x1, x2, …, x36 and fed to a fully-connected layer.)
Fully-connected
Softmax unit:
The softmax function is often used as the last activation function of a neural network to
normalize the output of the network to a probability distribution over the predicted output classes.
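A minimal NumPy sketch of the softmax function (an illustration using hypothetical class scores, not from the slides):

import numpy as np

def softmax(z):
    # Subtract the max for numerical stability; the result sums to 1.
    e = np.exp(z - np.max(z))
    return e / e.sum()

logits = np.array([1.0, 2.0, 0.5, 4.0])   # hypothetical class scores
probs = softmax(logits)
print(probs, probs.sum())                  # a probability distribution; sums to 1
print(np.argmax(probs))                    # 3: the class that will be activated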
Final block diagram of CNN
(Figure: the softmax layer outputs one score per class, e.g. 0.001, 0.11, 0.12, 0.101, 0.80, 0.13, 0.113, 0.14, 0.15, 0.12; the class with the highest score, here class 4 with 0.80, will be activated.)
To be covered:
Sparsely connected image matrix
Padding
In a convolutional layer, we observe that the pixels located on the corners
and the edges are used much less than those in the middle.
•
Example:
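A minimal NumPy sketch of zero padding (an illustration, not from the slides): adding one ring of zeros around a 6 x 6 image gives an 8 x 8 input, so a 3 x 3 filter with stride 1 produces a 6 x 6 output and the corner pixels now appear in more filter positions.

import numpy as np

image = np.ones((6, 6))                     # any 6 x 6 input
padded = np.pad(image, pad_width=1)         # one ring of zeros -> 8 x 8

kernel_size, stride = 3, 1
out_valid = (image.shape[0] - kernel_size) // stride + 1    # 4: without padding
out_same = (padded.shape[0] - kernel_size) // stride + 1    # 6: with padding
print(padded.shape, out_valid, out_same)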
Dilated convolution
• Another way to get a larger output size is to spread out the input image by inserting
padding both around and between the input elements.
• This is called dilated convolution.
• Let's say we have a 3 x 3 input image.
• Rather than using the 3 x 3 image as a whole, we split the image into individual pixels and
add padding between the pixels as well as along the boundaries.
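A minimal NumPy sketch of the idea described above (an illustration, not from the slides): the 3 x 3 input is spread out by inserting zeros between its pixels and along the boundaries before the convolution is applied.

import numpy as np

x = np.arange(1, 10).reshape(3, 3)       # a 3 x 3 input image

# Insert one zero between neighbouring pixels (spread out the input) ...
spread = np.zeros((5, 5), dtype=x.dtype)
spread[::2, ::2] = x

# ... and add zero padding along the boundaries as well.
spread = np.pad(spread, pad_width=1)     # 7 x 7 input for the convolution
print(spread)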
Convolution algorithms:
1. Strided convolution
2. Padded convolution
3. Transposed convolution
4. Dilated convolution
Parameters
• Besides these, other parameters such as the learning rate, loss function, batch
size, and initial weights also have to be chosen according to the problem.
Other Architectural considerations
Efficient Architectures in
Neural Networks
Major Architectures
• All Convolutional Net:
no pooling layers, just use strided convolution to shrink representation size
• Inception:
complicated architecture designed to achieve high accuracy with low computational
cost
• ResNet:
blocks of layers with the same spatial size, with each layer's output added to the same
buffer that is repeatedly updated. Very many updates = a very deep net, but without
vanishing gradients (see the sketch below).
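A minimal PyTorch sketch of a ResNet-style block (an illustration with assumed layer sizes, not from the slides): the block's input is added back to its output, so the same "buffer" is repeatedly updated by small corrections while the spatial size stays unchanged.

import torch
import torch.nn as nn

class ResidualBlock(nn.Module):
    """Two 3 x 3 convolutions whose output is added to the block's input."""
    def __init__(self, channels):
        super().__init__()
        self.conv1 = nn.Conv2d(channels, channels, kernel_size=3, padding=1)
        self.conv2 = nn.Conv2d(channels, channels, kernel_size=3, padding=1)
        self.relu = nn.ReLU()

    def forward(self, x):
        out = self.relu(self.conv1(x))
        out = self.conv2(out)
        return self.relu(out + x)   # skip connection: the input is added back ("same buffer")

x = torch.randn(1, 64, 32, 32)
print(ResidualBlock(64)(x).shape)   # torch.Size([1, 64, 32, 32]): spatial size unchanged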
Convolution and Pooling as an
Infinitely Strong Prior
Prior Parameter Distribution
• What is a prior?
A probability distribution over the parameters of the model that encodes our belief about what
models are reasonable.
• What is a weight prior?
Assumptions about the weights (before learning) in terms of acceptable values and range are
encoded into the prior distribution of the weights.
These assumptions are based on the types of operations performed by convolution and pooling
layers, which impose specific characteristics or expectations on the data.
• Prior parameter distribution
The role of a prior probability distribution over the parameters of a model is to
encode our belief as to what models are reasonable before seeing the data.
For example:
• The "prior" enforced by convolution is
Important patterns or features are likely to be found in local regions of the input data.
• The "prior" enforced by pooling is that
Reducing the spatial dimensions of the data while preserving important information can
be beneficial for recognition tasks.
• Convolution and Pooling as an Infinitely Strong Prior means that
These operations provide a very strong and effective set of assumptions or constraints
that guide the neural network's learning process and make it better at tasks like image
recognition.
Weak and Strong Priors
• A weak prior
• A distribution with high entropy
• e.g., Gaussian with high variance
• A weak prior has a high variance and shows that there is low confidence in the initial
value of the weight.
• Data can move parameters freely
• A strong prior
• It has very low entropy
• E.g., a Gaussian with low variance
• A strong prior in turn shows a narrow range of values about which we are confident
before learning begins.
• Such a prior plays a more active role in determining where the parameters end up
Infinitely strong prior
• An infinitely strong prior places zero probability on some parameters
• It says that some parameter values are forbidden regardless of support from data
• With an infinitely strong prior, the constraint cannot be changed, irrespective of the data.
Convolutional Network
• Convolutional networks are simply neural
networks that use convolution in place of
general matrix multiplication in at least
one of their layers.
Convolution as infinitely strong prior
• A convolutional net is similar to a fully connected net but with an infinitely strong prior over its weights.
• This prior says that the weights for one hidden unit must be identical to the weights of its neighbour, but
shifted in space.
• The prior also says that the weights must be zero, except for in the small, spatially contiguous receptive
field assigned to that hidden unit (see the sketch after this list).
• Convolution introduces an infinitely strong prior probability distribution over the parameters of a layer
• This prior says that the function the layer should learn contains only local interactions and is
equivariant to translation
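A minimal NumPy sketch of this prior (an illustration with a hypothetical 1-D kernel, not from the slides): writing a 1-D convolution as a fully connected weight matrix makes the constraints visible; every row contains the same kernel, shifted in space, with zeros outside the receptive field.

import numpy as np

kernel = np.array([1.0, -2.0, 1.0])      # hypothetical 3-tap kernel
n_in = 8
n_out = n_in - len(kernel) + 1           # valid convolution output size

# Equivalent fully connected weight matrix: identical weights shifted per row,
# and zero outside each unit's small receptive field.
W = np.zeros((n_out, n_in))
for i in range(n_out):
    W[i, i:i + len(kernel)] = kernel

x = np.arange(n_in, dtype=float)
conv_out = np.array([np.dot(kernel, x[i:i + len(kernel)]) for i in range(n_out)])
print(np.allclose(W @ x, conv_out))      # True: the dense layer under this prior equals the convolution
print(W)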
Convolution as infinitely strong prior
• In CNNs, convolution involves sliding a small filter (a matrix of weights) over the
input data (e.g., an image).
• At each step, the filter multiplies its values with the corresponding values in the
input and then sums them up.
• This helps detect patterns or features in different parts of the input.
• The key idea is that it enforces locality – meaning it looks for patterns in small,
nearby regions of the input.
• Example:
Imagine you want to detect edges in a black-and-white image.
Convolutional layers will help the network focus on local patterns like edges,
corners, or textures by sliding a small filter over the image to detect these
features.
Pooling as infinitely strong prior
• After convolution, pooling is often applied.
• Pooling reduces the spatial dimensions of the data by selecting the most important
information from a group of neighboring values.
• Max pooling, for instance, takes the maximum value from a group of values, which
helps preserve the most significant features while reducing the amount of data.
• Example:
Suppose you have an image with a cat, and you want to recognize it.
After convolution, pooling helps focus on the most important parts of the image
like the cat's ears, eyes, and nose, while reducing less important details.
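A minimal NumPy sketch of 2 x 2 max pooling (an illustration, not from the slides), applied to the 4 x 4 feature map that Filter 1 yields on the 6 x 6 image from earlier in this section:

import numpy as np

def max_pool_2x2(x):
    """2 x 2 max pooling with stride 2: keep the largest value in each block."""
    h, w = x.shape
    return x.reshape(h // 2, 2, w // 2, 2).max(axis=(1, 3))

# 4 x 4 feature map produced by Filter 1 on the 6 x 6 image
feature_map = np.array([
    [ 3, -1, -3, -1],
    [-3,  1,  0, -3],
    [-3, -3,  0,  1],
    [ 3, -2, -2, -1],
])
print(max_pool_2x2(feature_map))
# [[3 0]
#  [3 1]]  -> the strongest responses survive while the map shrinks to 2 x 2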
Why is this a "strong prior"?
• Locality:
Convolution enforces the idea that important features are found in local regions of the data.
This is a strong prior knowledge because, in many real-world scenarios, objects or patterns have
local characteristics.
For example, in an image, edges or textures are typically found in small regions.
• Hierarchy of Features:
By using multiple layers of convolution and pooling, CNNs build a hierarchy of features.
Early layers detect basic features like edges, while deeper layers combine them to detect more
complex patterns.
This hierarchy is a strong prior because it mimics how our brains perceive and recognize objects –
from simple features to complex objects.
Efficient Convolution Algorithms
• How to speed up convolution?
Parallel Computation Resources
Selecting Appropriate Algorithms
Fourier transform:
o Convert the input and the kernel into frequency space.
o Perform point-wise multiplication.
o Convert the result back to the time (spatial) domain using an inverse Fourier transform.
When a d-dimensional kernel can be expressed as the outer product of d vectors, the kernel is called separable.
o Composing d 1-D convolutions, one with each of these vectors, is significantly faster than performing one d-dimensional
convolution with their outer product.
o The naive approach requires O(w^d) runtime and parameter storage space; the separable approach
requires O(w × d) runtime and storage space (where w is the kernel width).
Even techniques that improve the efficiency of only forward propagation are useful because, in
commercial settings, it is typical to devote more resources to the deployment of a network than to its training.
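A minimal NumPy/SciPy sketch of the two ideas above (an illustration, not from the slides): FFT-based convolution matches direct convolution, and a separable 2-D kernel can be applied as two cheaper 1-D convolutions.

import numpy as np
from scipy.signal import convolve2d

image = np.random.rand(64, 64)

# 1) Fourier-transform approach: multiply in frequency space, then invert.
kernel = np.random.rand(5, 5)
out_direct = convolve2d(image, kernel, mode='full')
shape = out_direct.shape
out_fft = np.real(np.fft.ifft2(np.fft.fft2(image, shape) * np.fft.fft2(kernel, shape)))
print(np.allclose(out_direct, out_fft))           # True

# 2) Separable kernel: the outer product of two 1-D vectors, so two 1-D
#    convolutions replace one 2-D convolution.
v, h = np.array([1.0, 2.0, 1.0]), np.array([1.0, 0.0, -1.0])
sep_kernel = np.outer(v, h)                       # a 3 x 3 separable kernel
out_2d = convolve2d(image, sep_kernel, mode='full')
out_sep = convolve2d(convolve2d(image, v[:, None], mode='full'), h[None, :], mode='full')
print(np.allclose(out_2d, out_sep))               # True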
Random or Unsupervised Features
• Typically, the most expensive part of conv network training is learning the features. There are 3
basic strategies for obtaining convolution kernels without supervised training.
1. Simply initialize convolutional kernels randomly:
Random filters often work well in convolutional networks; this is an inexpensive way to choose the
architecture of a convolutional network (a sketch follows below).
2. Design them by hand
3. Learn the kernels with an unsupervised method:
Learning the features with an unsupervised method allows them to be determined separately
from the classifier layer at the top of the architecture.
• An intermediate approach is greedy layer-wise pretraining, e.g. the Convolutional Deep Belief Network.
• Instead of training an entire convolutional layer at a time, we can train a model on a small patch and
use the parameters from this patch-based model to define the kernels of a convolutional layer.
• Today, most convolutional networks are trained in a purely supervised fashion, using full forward
and back-propagation through the entire network on each training iteration.
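A minimal PyTorch sketch of strategy 1 (an illustration with assumed sizes, not from the slides): randomly initialized convolutional kernels are kept frozen as the feature extractor, and only the classifier on top would be trained.

import torch
import torch.nn as nn

# Randomly initialized convolutional features, kept frozen (strategy 1);
# only the linear classifier on top is trainable.
features = nn.Sequential(nn.Conv2d(1, 8, 3), nn.ReLU(), nn.MaxPool2d(2))
for p in features.parameters():
    p.requires_grad = False        # the random kernels are never updated

classifier = nn.Linear(8 * 13 * 13, 10)            # for 28 x 28 inputs (MNIST-sized)
optimizer = torch.optim.SGD(classifier.parameters(), lr=0.1)   # updates classifier only

x = torch.randn(4, 1, 28, 28)      # a dummy batch of images
h = features(x).flatten(1)         # fixed random features
logits = classifier(h)
print(logits.shape)                # torch.Size([4, 10])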
Popular Architectures in
Neural Networks