465-Lecture 5-6

The document discusses Convolutional Neural Networks (CNNs) and their application in image categorization by using spatial features and filters for feature extraction. It highlights the importance of convolution operations, pooling layers, and 1x1 convolutions in reducing dimensionality and computational complexity. Additionally, it mentions the training of CNNs through backpropagation and their applications in fields like self-driving cars and image captioning.

CSE 465

Lecture 5 & 6
CNN – Convolutional Neural Network
CNN in a nutshell
Images and videos are just matrices of numbers
What do we want to do?
How to categorize images?

• Use features, for example:
  • Rectangular shape, has bevels, there is a logo of Apple, white color
  • Long thick lines, hanging ropes, structure hanging over poles
  • Has a nose, has long ears, has eyes
Manual feature detection

• Must have domain knowledge
• Must have previous experience with best practices
• Define features
  • Reuse previously known features
  • Generate new features
• Use the features to classify
  • The classifier operates on the features
Manual feature detection: Problems

• Occlusion
  • Objects partially blocked
• Different illumination
  • Changes in the amount of light/brightness
• Scale variation/deformation
• Viewpoint variation
How can we use machine to learn features?
Implementation so far

• Use a fully connected feed-forward neural network with many layers
• Hopefully each layer will learn some important features
• And finally we will be able to represent the correct function
• However, there is no spatial information
• And many, many features
• Input
  • A flattened 1-D vector of the numbers representing the image
  • However, the important spatial information is gone!
What can we do?

• We need to use the spatial features
• How?
  • We can use filters to detect visual features like lines/segments, etc.
  • Do not use fully connected layers for the entire input
    • Image sizes can be more than 256×256 nowadays
    • If we use 1000 neurons for the first hidden layer, we need to learn around 200 million parameters for that layer alone
    • Anything bigger than that is impossible to fit inside a single computer's memory
  • Use an architecture that reduces the images into features
    • Learn the features first
    • Then use the features to classify/recognize
First: Use spatial feature
And a match made in heaven - convolution
What does it do?
In practice: same filter sliding window algorithm
Feature extraction with convolution

• Filter of size 4x4: 16 different weights
• Apply this same filter to 4x4 patches in the input
• Shift by 1 or 2 pixels (the stride) for the next patch
• Apply a set of weights – a filter – to extract local features
• Use multiple filters to extract different features
• Spatially share the parameters of each filter
Convolution Operation
Convolution operation (2)
Filter sliding
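The sliding-window operation described above can be sketched in a few lines of NumPy. This is a minimal illustration of the mechanics (technically cross-correlation, which is what CNN libraries actually compute), not how real frameworks implement it — they use vectorized or FFT-based routines:

```python
import numpy as np

def conv2d(image, kernel, stride=1, padding=0):
    """Slide a 2-D filter over a 2-D image: at each position,
    multiply the patch by the filter elementwise and sum."""
    if padding:
        image = np.pad(image, padding)
    kh, kw = kernel.shape
    ih, iw = image.shape
    oh = (ih - kh) // stride + 1
    ow = (iw - kw) // stride + 1
    out = np.zeros((oh, ow))
    for i in range(oh):
        for j in range(ow):
            patch = image[i*stride:i*stride+kh, j*stride:j*stride+kw]
            out[i, j] = np.sum(patch * kernel)
    return out

# A vertical-edge filter applied to a tiny 5x5 image whose left
# half is bright (10) and right half is dark (0).
img = np.array([[10, 10, 0, 0, 0]] * 5, dtype=float)
edge = np.array([[1, 0, -1],
                 [1, 0, -1],
                 [1, 0, -1]], dtype=float)
print(conv2d(img, edge))  # strong response where the vertical edge sits
```

The filter responds strongly exactly where the brightness changes, which is how a single shared set of 9 weights detects the same local feature everywhere in the image.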
Padding

• Padding preserves the spatial size of the input image/volume
  • So the input and output width and height remain the same
• This is important for building deeper networks
  • Otherwise, the height/width would shrink as we go to deeper layers
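A quick sketch of what zero-padding does to the spatial size (illustrative values):

```python
import numpy as np

feature_map = np.arange(16, dtype=float).reshape(4, 4)

# Zero-pad by 1 pixel on every side: a 4x4 map becomes 6x6.
# A 3x3 filter sliding over the 6x6 padded map with stride 1
# then produces a 4x4 output — the same size as the input.
padded = np.pad(feature_map, pad_width=1, mode="constant", constant_values=0)
print(padded.shape)               # (6, 6)
print((padded.shape[0] - 3) + 1)  # 4: output height matches the input
```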
Stride

• The number of pixels the filter slides over the image is called the stride
• For example, to slide the convolution filter one pixel at a time, the stride is 1
• To jump two pixels at a time, the stride is 2
• Strides of 3 or more are rare in practice
• Jumping pixels produces spatially smaller output volumes
• A stride of 1 keeps the output roughly the same height and width as the input, while a stride of 2 makes the output roughly half the input size
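The standard output-size formula makes the halving effect concrete. For input size n, filter size k, padding p, and stride s, the output size is floor((n + 2p − k) / s) + 1 (the 224 input size below is just an illustrative choice):

```python
def output_size(n, k, p, s):
    """Spatial output size of a conv layer: floor((n + 2p - k) / s) + 1."""
    return (n + 2 * p - k) // s + 1

# 224x224 input, 3x3 filter, padding 1 ("same" padding for a 3x3 filter)
print(output_size(224, 3, 1, 1))  # 224: stride 1 preserves the size
print(output_size(224, 3, 1, 2))  # 112: stride 2 roughly halves it
```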
Pooling

• The goal of the pooling layer is to downsample the feature maps produced by the convolutional layer, reducing the number of values passed on and thus the computational complexity
• Pooling filters have no weights or values to learn
  • They simply slide over the feature map created by the previous convolutional layer and select one pixel value to pass along to the next layer, ignoring the remaining values
Pooling

• Max pooling: selects the maximum of the values in the window
• Average pooling: selects the average of the values in the window
• Global average pooling: selects the average of all the pixels in the feature map
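The three variants above can be sketched directly; note that, unlike a convolution filter, none of them has weights to learn:

```python
import numpy as np

def max_pool2d(x, size=2, stride=2):
    """Max pooling: keep the largest value in each window, drop the rest."""
    h, w = x.shape
    oh, ow = (h - size) // stride + 1, (w - size) // stride + 1
    out = np.zeros((oh, ow))
    for i in range(oh):
        for j in range(ow):
            out[i, j] = x[i*stride:i*stride+size,
                          j*stride:j*stride+size].max()
    return out

fmap = np.array([[1, 3, 2, 4],
                 [5, 6, 1, 0],
                 [7, 2, 9, 8],
                 [4, 1, 3, 5]], dtype=float)
print(max_pool2d(fmap))  # 4x4 map reduced to 2x2
print(fmap.mean())       # global average pooling: one number per feature map
```

Average pooling is the same loop with `.mean()` in place of `.max()`.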
Convolution blocks
The complete network
Parameter Count for CNN

• Parameters
  • Values inside the filters
  • W matrices
  • b vectors
• Number of operations
  • Number of multiplications for the convolution operations
  • How this changes for different values of the padding or stride
  • Number of additions for the convolution operations
• Number of filters
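The parameter count of a conv layer follows directly from the filter shape: each filter has kh × kw × (input channels) weights plus one bias. A small sketch (the 3→64 channel layer is an illustrative example):

```python
def conv_params(kh, kw, in_channels, num_filters):
    """Weights per filter = kh*kw*in_channels; one bias per filter."""
    weights = kh * kw * in_channels * num_filters
    biases = num_filters
    return weights + biases

# A 3x3 conv layer mapping a 3-channel input to 64 filters:
print(conv_params(3, 3, 3, 64))  # 1792 = 3*3*3*64 + 64
```

Note the count is independent of the input's height and width — that is the parameter sharing that makes CNNs so much smaller than fully connected networks.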
1X1 Convolution
Why 1X1 Convolution

• The simplest explanation is that 1x1 convolution leads to dimension reduction
• For example, an image of 200 x 200 with 50 feature channels, convolved with 20 filters of 1x1, results in a volume of size 200 x 200 x 20
• Is this the best way to do dimensionality reduction in a convolutional neural network? What about efficacy vs. efficiency?
Why 1X1 Convolution

• Although 1x1 convolution is a 'feature pooling' technique, there is more to it than just sum pooling of features across the channels/feature maps of a given layer
• 1x1 convolution acts like a coordinate-dependent transformation in filter space
  • This transformation is strictly linear, but in most applications the 1x1 convolution is followed by a non-linear activation layer like ReLU
  • The transformation is learned through (stochastic) gradient descent
• An important distinction is that it suffers less from over-fitting due to the small kernel size (1x1)
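The 200 x 200 x 50 → 200 x 200 x 20 example from the slide can be reproduced in NumPy. A 1x1 convolution is just a per-pixel linear map across channels, so it reduces to one matrix multiply applied independently at every spatial position:

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.normal(size=(200, 200, 50))  # H x W x C feature volume
w = rng.normal(size=(50, 20))        # 20 filters of shape 1x1x50

# Every pixel's 50-channel vector is mapped to 20 channels;
# spatial positions are transformed independently of each other.
y = np.einsum("hwc,cd->hwd", x, w)
print(y.shape)        # (200, 200, 20): channels reduced from 50 to 20
y = np.maximum(y, 0)  # in practice followed by a non-linearity like ReLU
```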
1X1 Convolution
The feature extractor

[Architecture diagram: input image → Convolution → Max Pooling → Convolution → Max Pooling → Flattened → Fully Connected Feedforward network → class scores (cat, dog, …). The convolution and max-pooling blocks form the feature extractor; the fully connected layers perform the classification.]
A CNN compresses a fully connected network

• Reduces the number of connections
• Weights are shared across edges
• Max pooling further reduces the complexity
Training CNN

• Learn weights for convolutional filters and fully connected layers using
backpropagation and the log loss (cross-entropy loss) function
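The log loss (cross-entropy) that backpropagation minimizes can be sketched for a single example; the three-class logits below are illustrative values, not from any real network:

```python
import numpy as np

def softmax(z):
    z = z - z.max()  # subtract max for numerical stability
    e = np.exp(z)
    return e / e.sum()

def cross_entropy(logits, true_class):
    """Log loss for one example: -log p(true class)."""
    p = softmax(logits)
    return -np.log(p[true_class])

logits = np.array([2.0, 0.5, -1.0])  # raw network outputs for 3 classes
print(cross_entropy(logits, 0))      # small loss: class 0 already favored
print(cross_entropy(logits, 2))      # large loss: class 2 is unlikely
```

Backpropagation pushes the filter weights and fully connected weights in the direction that shrinks this loss over the training set.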
CNN Application: Self driving cars/drones

• Convolution, then de-convolution
• Produces a pixel-level map of the scene
Self driving cars/Drones
Generate image caption/description

• Use a CNN to detect the image features
• Then, instead of using the fully connected network,
• Use an RNN to generate the description
