Convolutional Neural Networks

The document provides an overview of Convolutional Neural Networks (CNNs), highlighting their advantages over fully-connected networks for image data, including sparse connectivity and shared weights. It details the architecture of CNNs, which typically includes input, convolution, pooling, and fully connected layers, along with essential components like activation functions and downsampling operations. Additionally, it discusses techniques such as batch normalization, dropout, and data augmentation to enhance training and model robustness.


Convolutional Neural Networks
Why CNN?

Problems of fully-connected neural networks in handling image data:

● The number of input values is generally quite large
● The number of weights grows substantially with the size of the input images
● Distant pixels are less correlated

CNN:
● Sparse connectivity (local connectivity): a hidden unit is only connected to a local patch
(the weights connected to the patch are called a filter or kernel)
● Growing receptive fields: units in the deeper layers may indirectly interact with a larger
portion of the input
● Shared weights at different spatial locations: hidden nodes at different locations share the
same weights → reduces the number of parameters (see the sketch below)
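To make the parameter savings concrete, here is a rough comparison (a minimal sketch; the layer sizes are illustrative assumptions, not taken from the slides):

# Rough parameter-count comparison (illustrative sizes).
# Fully-connected layer: a 224x224x3 image flattened, feeding 100 hidden units.
fc_params = (224 * 224 * 3) * 100        # ≈ 15 million weights

# Convolutional layer: 100 filters of size 3x3 over the same 3-channel image.
conv_params = 100 * (3 * 3 * 3)          # 2,700 weights, shared at every location

print(fc_params, conv_params)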
Architecture of CNN
A typical CNN has four types of layers:
● Input layer
● Convolution layer
● Pooling layer
● Fully connected layer
Building blocks of convolutional neural networks
Essential components of a CNN:
● the convolutional layers for feature extraction
● the activations to support learning of non-linear interactions
● the downsampling operations (pooling or striding)
● the fully connected layers and a Softmax layer to transform the extracted features into class scores
Optional components: batch normalization to speed up training and dropout to prevent overfitting
Convolution layer
A convolution matrix (filter) is used in image processing for tasks such as edge detection,
blurring, sharpening, etc. → producing feature maps
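For instance, a classic 3 by 3 edge-detection kernel can be applied with TensorFlow (a minimal sketch; the kernel values are a standard Laplacian-style example, not from the slides):

import numpy as np
import tensorflow as tf

# A classic 3x3 edge-detection (Laplacian-style) kernel.
kernel = np.array([[-1, -1, -1],
                   [-1,  8, -1],
                   [-1, -1, -1]], dtype=np.float32)

# tf.nn.conv2d expects the image as NHWC and the kernel as HWIO.
image = tf.random.uniform((1, 28, 28, 1))     # one 28x28 grayscale image
kernel = kernel.reshape(3, 3, 1, 1)           # height, width, in-channels, out-channels

feature_map = tf.nn.conv2d(image, kernel, strides=1, padding='VALID')
print(feature_map.shape)                      # (1, 26, 26, 1)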
Convolution operator parameters
● Filter size
● Padding
● Stride
● Dilation
● Activation function
Filter size

● Filter size can be 5 by 5, 3 by 3, and so on
● Larger filter sizes should be avoided, as the learning algorithm needs to learn the filter values (weights)
● Odd-sized filters are preferred to even-sized ones: they have the nice geometric property
that all input pixels are centered around the output pixel
Padding
The image shrinks after each convolution operation → after many layers → a very small output.
Pixels on the corners or edges are used much less than pixels in the middle → information
from the edges is lost.
→ Pad the image with additional border(s), setting the border pixel values to 0.
Types of padding:
● Valid padding: no padding
● Same padding: add ‘p’ padding layers such that the output has the same dimensions as the input
● Padding with ‘p’ layers: add ‘p’ padding layers
3 by 3 filter with padding of 1
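A quick way to see the effect of the two main padding modes (a minimal sketch using Keras layers):

import tensorflow as tf

x = tf.random.uniform((1, 32, 32, 3))  # one 32x32 RGB image

valid = tf.keras.layers.Conv2D(8, kernel_size=3, padding='valid')(x)
same  = tf.keras.layers.Conv2D(8, kernel_size=3, padding='same')(x)

print(valid.shape)  # (1, 30, 30, 8) — shrinks by filter size minus 1
print(same.shape)   # (1, 32, 32, 8) — spatial dimensions preserved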
Stride
Stride controls how far the filter shifts at each step → increase the stride if we want receptive fields
to overlap less and if we want smaller output dimensions → downsampling
3 by 3 filter with stride of 2
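Combining filter size f, padding p, and stride s, the output size for an n by n input is given by the standard formula (stated here for reference, not from the slides):

o = ⌊(n + 2p − f) / s⌋ + 1

For example, a 7 by 7 input with a 3 by 3 filter, padding 0, and stride 2 gives o = ⌊(7 + 0 − 3) / 2⌋ + 1 = 3.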
Dilation (Dilated Convolution)

● Dilation: to have a larger receptive field (the portion of the image affecting the filter’s output)
● If dilation is set to 2, instead of a contiguous 3 by 3 subset of the image, every other pixel
of a 5 by 5 subset of the image affects the output
3 by 3 filter with dilation of 2
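In Keras this corresponds to the dilation_rate argument of Conv2D (a minimal sketch):

import tensorflow as tf

x = tf.random.uniform((1, 32, 32, 3))

# A 3x3 filter with dilation 2 covers a 5x5 region, skipping every other pixel.
dilated = tf.keras.layers.Conv2D(8, kernel_size=3, dilation_rate=2)(x)
print(dilated.shape)  # (1, 28, 28, 8) — effective filter size is 5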
Activation function
After the filter is applied to the whole image, apply an activation function to the output to introduce non-linearity.
The preferred activation function in CNNs is ReLU.
ReLU activation function
ReLU leaves outputs with positive values as is and replaces negative values with 0.
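Equivalently, ReLU(x) = max(0, x); a one-line sketch in NumPy:

import numpy as np

def relu(x):
    # Keep positive values, replace negatives with 0.
    return np.maximum(0, x)

print(relu(np.array([-2.0, -0.5, 0.0, 1.5, 3.0])))  # [0.  0.  0.  1.5 3. ]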
2D Convolution Summary
Multiple input channels
● Have a kernel for each channel → sum the results over channels (see the sketch below)
Convolutions Over Channels
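A minimal NumPy sketch of how one output pixel combines multiple input channels (the shapes are illustrative assumptions):

import numpy as np

# One 3x3 patch of a 3-channel input and one 3-channel kernel.
patch  = np.random.rand(3, 3, 3)   # height, width, channels
kernel = np.random.rand(3, 3, 3)   # one kernel slice per input channel

# Multiply elementwise per channel, then sum over all positions and channels:
# this yields a single scalar in the output feature map.
out_pixel = np.sum(patch * kernel)
print(out_pixel)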
Convolution layer
Pooling
● The pooling layer is used to reduce the spatial size of the representation
● A pooling layer is usually attached after a convolutional layer
● It helps to reduce the number of parameters and speed up the computation
● Types:
- Max pooling (most popular)
- Average pooling
- L2 norm of a rectangular neighborhood
● It has hyperparameters but no parameters to learn (see the sketch below)
Max Pooling
Average Pooling
Pooling layer
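A minimal sketch of both common pooling types in Keras:

import tensorflow as tf

x = tf.random.uniform((1, 4, 4, 1))  # one 4x4 single-channel feature map

max_pooled = tf.keras.layers.MaxPool2D(pool_size=2, strides=2)(x)
avg_pooled = tf.keras.layers.AveragePooling2D(pool_size=2, strides=2)(x)

print(max_pooled.shape)  # (1, 2, 2, 1) — each 2x2 window reduced to its max
print(avg_pooled.shape)  # (1, 2, 2, 1) — each 2x2 window reduced to its mean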
Fully-connected layer
● The last layer in a CNN
● Connects all nodes from the previous layer to this fully connected layer,
○ which is responsible for the classification of the image
Batch Normalization
The feature vector of length C at each pixel location of the P × Q × C feature map is treated as
a sample when calculating the sample mean and sample standard deviation for normalization
→ makes training faster and more stable
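In Keras this is a single layer, typically placed between a convolution and its activation (a minimal sketch):

import tensorflow as tf

block = tf.keras.Sequential([
    tf.keras.layers.Conv2D(32, kernel_size=3, padding='same'),
    tf.keras.layers.BatchNormalization(),  # normalize per channel over the batch
    tf.keras.layers.ReLU(),
])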
Dropout
The dropout layer acts as a mask, eliminating some neurons’ contributions to the subsequent layer
while leaving all other neurons unchanged → reduces overfitting
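In Keras (a minimal sketch; the rate of 0.5 is a common default, not from the slides):

import tensorflow as tf

classifier = tf.keras.Sequential([
    tf.keras.layers.Dense(1024, activation='relu'),
    tf.keras.layers.Dropout(0.5),  # randomly zero 50% of activations during training
    tf.keras.layers.Dense(10, activation='softmax'),
])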
Data Augmentation
Helps improve model robustness and reduce overfitting.
Methods: Horizontal flips, random crops/scales, translation, color jitter, rotation,…
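Several of these transforms are available as Keras preprocessing layers (a minimal sketch; the parameter values are illustrative assumptions):

import tensorflow as tf

augment = tf.keras.Sequential([
    tf.keras.layers.RandomFlip('horizontal'),       # horizontal flips
    tf.keras.layers.RandomRotation(0.1),            # small random rotations
    tf.keras.layers.RandomTranslation(0.1, 0.1),    # random shifts
    tf.keras.layers.RandomZoom(0.1),                # random crops/scales
])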
Example
import tensorflow as tf

def generate_model():
    model = tf.keras.Sequential([
        # first convolutional layer
        tf.keras.layers.Conv2D(32, kernel_size=3, activation='relu'),
        tf.keras.layers.MaxPool2D(pool_size=2, strides=2),

        # second convolutional layer
        tf.keras.layers.Conv2D(64, kernel_size=3, activation='relu'),
        tf.keras.layers.MaxPool2D(pool_size=2, strides=2),

        # fully connected classifier
        tf.keras.layers.Flatten(),
        tf.keras.layers.Dense(1024, activation='relu'),
        tf.keras.layers.Dense(10, activation='softmax')  # 10 outputs
    ])
    return model
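Usage (a minimal sketch; the 28 × 28 × 1 input shape is an illustrative assumption, not from the slides):

model = generate_model()
model.build(input_shape=(None, 28, 28, 1))
model.summary()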
