ch4_CNN

The document provides an overview of Convolutional Neural Networks (CNNs) and their applications in deep learning, including object detection, image segmentation, and facial recognition. It discusses the architecture of CNNs, including convolution layers, pooling layers, and hyperparameters like stride and padding, as well as techniques such as transfer learning and fine-tuning. Additionally, it highlights notable CNN architectures like LeNet, AlexNet, VGG-Net, ResNet, and Inception models.


CH4-CNN

DEEP LEARNING
CNN USE CASES

 Object detection
 Image segmentation
 Facial recognition
INTRODUCTION TO IMAGES

RGB IMAGE

Shape : (n * m * 3)
n * m : image resolution
3 : number of channels (Red, Green, Blue)

GRAYSCALE IMAGE

Pixel value in [0, 255]
Shape : (n * m * 1), or simply (n * m)
1 : one color channel
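
For illustration, a minimal NumPy sketch of these shapes (the 4x6 resolution and the random values are arbitrary assumptions):

    import numpy as np

    # Hypothetical 4x6 images with pixel values in [0, 255]
    rgb_image = np.random.randint(0, 256, size=(4, 6, 3), dtype=np.uint8)   # n * m * 3
    gray_image = np.random.randint(0, 256, size=(4, 6), dtype=np.uint8)     # n * m (one channel)

    print(rgb_image.shape)   # (4, 6, 3) -> three channels: Red, Green, Blue
    print(gray_image.shape)  # (4, 6)    -> a single color channel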
INTRODUCTION TO CNN

 The first convolutional neural network (CNN) was introduced in 1990 (LeNet).

 Why CNN?
 Images contain a very large number of pixels!
 A fully-connected neural network would have far too many parameters!
 Images exhibit translational invariance (the same pattern can appear anywhere in the image).

Yann LeCun
WHAT IS A CNN?

 A CNN is a type of artificial neural network designed to extract features from high-dimensional data (such as images) and to classify it.
 Basic Architecture :
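
As an illustration of this conv/pool/fully-connected layout, a minimal Keras sketch (the 28x28 grayscale input, the layer sizes and the 10-class output are assumptions for the example, not taken from the slides):

    from tensorflow import keras
    from tensorflow.keras import layers

    # Feature extraction (convolution + pooling) followed by classification (dense)
    model = keras.Sequential([
        layers.Input(shape=(28, 28, 1)),             # e.g. 28x28 grayscale images
        layers.Conv2D(32, (3, 3), activation="relu"),
        layers.MaxPooling2D((2, 2)),
        layers.Conv2D(64, (3, 3), activation="relu"),
        layers.MaxPooling2D((2, 2)),
        layers.Flatten(),
        layers.Dense(10, activation="softmax"),      # assumed 10-class problem
    ])
    model.summary()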

CONVOLUTION LAYER : 1 CHANNEL

 Convolving a 5x5x1 image with a 3x3x1 kernel (or filter) gives a 3x3x1 convolved feature.
 By convention, the filter size f is usually odd in computer vision.
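
A minimal NumPy sketch of this operation, in the cross-correlation form used by CNNs (the input and kernel values are arbitrary):

    import numpy as np

    image = np.arange(25).reshape(5, 5)       # arbitrary 5x5 single-channel input
    kernel = np.array([[1, 0, -1],
                       [1, 0, -1],
                       [1, 0, -1]])           # 3x3 vertical-edge-style filter

    out = np.zeros((3, 3))
    for i in range(3):
        for j in range(3):
            # Slide the filter over the image, multiply element-wise and sum
            out[i, j] = np.sum(image[i:i+3, j:j+3] * kernel)
    print(out.shape)                          # (3, 3) convolved feature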

CONVOLUTION LAYER : 3 CHANNELS

 In the case of images with multiple channels (e.g. RGB), the kernel has the same depth as the input image.

IMAGE KERNELS

 https://setosa.io/ev/image-kernels/

MULTIPLE CONVOLUTION LAYERS

CONVOLUTIONAL LAYER - HYPERPARAMETERS : STRIDE

 The stride is the step size by which the filter moves horizontally and vertically over the pixels of the input image during convolution.
 Choosing the stride value :
 To capture fine-grained features : use a small stride.
 If only macro-level features are of interest : use a larger stride.

 Input image size : n x n
 Filter size : f x f
 Stride : s
 Output image size : ((n - f) / s + 1) x ((n - f) / s + 1)

Stride = 2
CONVOLUTIONAL LAYER - HYPERPARAMETERS : PADDING

 Downsides of convolution :
 After each convolution operation the image shrinks : we lose a lot of information.
 During convolution, the pixels in the corners and at the edges are used far less often than central pixels : a lot of information near the edge of the image is thrown away.
 Solution : 'pad' the image.
 Padding : p
 Output image size : ((n + 2p - f) / s + 1) x ((n + 2p - f) / s + 1) (see the numeric check below)

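A small sketch to check both formulas numerically (the values n = 6, n = 7 and f = 3 are only illustrative):

    def conv_output_size(n, f, s=1, p=0):
        """Spatial size of a convolution output: (n + 2p - f) / s + 1 (p = 0 is the unpadded case)."""
        return (n + 2 * p - f) // s + 1

    print(conv_output_size(6, 3, s=1, p=0))  # 4 : the image shrinks without padding
    print(conv_output_size(6, 3, s=1, p=1))  # 6 : 'same' padding keeps the size
    print(conv_output_size(7, 3, s=2, p=0))  # 3 : a larger stride gives a smaller output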
POOLING LAYERS

 A pooling layer is added after the convolutional layer(s).
 Pooling is a form of sub-sampling.
 The pooling filter size is usually 2x2 (more generally N x N).
 Pooling usually reduces each side of the feature map by a factor of N (e.g. N = 2 for a 2x2 filter).
POOLING LAYERS

 Pooling has the advantage of making the representation more compact by reducing the spatial size of the feature
maps, thereby reducing the number of parameters to be learnt.
 The pooling layer has ‘NO PARAMETERS’ i.e. ‘ZERO TRAINABLE PARAMETERS’.
 Illustrations :

EXAMPLE MAXPOOLING

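As a stand-in for the illustration, a minimal NumPy sketch of 2x2 max pooling with stride 2 on a 4x4 feature map (the input values are arbitrary):

    import numpy as np

    x = np.array([[1, 3, 2, 1],
                  [4, 6, 5, 2],
                  [7, 2, 9, 0],
                  [1, 8, 3, 4]])

    # 2x2 max pooling with stride 2: keep the maximum of each non-overlapping 2x2 block
    pooled = x.reshape(2, 2, 2, 2).max(axis=(1, 3))
    print(pooled)
    # [[6 5]
    #  [8 9]]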
TRANSFER LEARNING

TRANSFER LEARNING

 A technique where knowledge acquired from solving one task is reused to enhance performance on a
related task.
 Key points about transfer learning:
 Transfer learning has been studied since the 1970s.
 Transfer learning finds applications in various domains, including cancer subtype discovery, text
classification, medical imaging, and spam filtering.
 By reusing information from previously learned tasks, transfer learning significantly improves
learning efficiency.

BENEFITS OF USING TRANSFER LEARNING

 Reduces the amount of training time required for a new task.
 The knowledge learned on the pre-training dataset often generalizes to related tasks in the same domain.
 Small datasets are prone to overfitting; starting from already-learned features helps mitigate this issue.
 Building a model from scratch is computationally expensive; transfer learning helps reduce the training time.
IMPLEMENTING TRANSFER LEARNING

1. Get the pre-trained model: obtain a pre-trained model suited to the problem.
2. Create a base model: instantiate the base model using one of the known architectures.
3. Freeze layers so they don't change during training: base_model.trainable = False
4. Add new trainable layers on top.
5. Train the new layers on the dataset.
6. Enhance the model with fine-tuning (a sketch of steps 1-5 follows below).

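A minimal Keras sketch of steps 1-5, assuming an ImageNet-pretrained VGG16 base and a hypothetical 5-class target task (both are illustrative choices, not requirements):

    from tensorflow import keras
    from tensorflow.keras import layers

    # 1-2. Get a pre-trained model and use it as the base (here: VGG16 without its classifier head)
    base_model = keras.applications.VGG16(weights="imagenet", include_top=False,
                                          input_shape=(224, 224, 3))

    # 3. Freeze the base so its weights don't change during training
    base_model.trainable = False

    # 4. Add new trainable layers on top
    inputs = keras.Input(shape=(224, 224, 3))
    x = base_model(inputs, training=False)
    x = layers.GlobalAveragePooling2D()(x)
    outputs = layers.Dense(5, activation="softmax")(x)   # assumed 5-class task
    model = keras.Model(inputs, outputs)

    # 5. Train only the new layers on the new dataset
    model.compile(optimizer="adam", loss="categorical_crossentropy", metrics=["accuracy"])
    # model.fit(train_ds, validation_data=val_ds, epochs=5)   # train_ds / val_ds : your datasets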
FINE TUNING

 Fine-tuning refers to taking a pre-trained model and further training it on a new dataset.
 Fine-tuning involves training the entire model, including the initial layers.
 It is performed by unfreezing the base model (or part of it) and retraining the whole model on the new dataset at a very low learning rate.
 Optionally, the later layers can use a higher learning rate than the earlier ones so that they adapt more strongly to the new dataset.

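Continuing the transfer learning sketch above (reusing base_model, model and the keras import from there), fine-tuning unfreezes the base and recompiles with a much lower learning rate; the value 1e-5 is an assumption for illustration:

    # Unfreeze the base model (or only its last few layers) for fine-tuning
    base_model.trainable = True

    # Recompile with a very low learning rate so the pre-trained weights change only slightly
    model.compile(optimizer=keras.optimizers.Adam(learning_rate=1e-5),
                  loss="categorical_crossentropy",
                  metrics=["accuracy"])
    # model.fit(train_ds, validation_data=val_ds, epochs=5)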
FINE TUNING

WAYS TO FINE TUNE THE MODEL

 Feature extraction : remove the output layer and use the entire network as a fixed feature extractor for the new dataset.
 Use the architecture of the pre-trained model : keep the architecture, initialize all the weights randomly, and train the model on the new dataset.
 Train some layers while freezing others : keep the weights of the initial layers frozen while retraining only the higher layers.
WAYS TO FINE TUNE THE MODEL

The choice depends on the size of the new dataset and its similarity to the pre-training data:

 Large dataset, low similarity : it is best to train the neural network from scratch on the new data.
 Large dataset, high similarity : retain the architecture and the initial weights of the model, then retrain the whole model starting from the pre-trained weights.
 Small dataset, low similarity : freeze the initial k layers of the pre-trained model and train just the remaining (n-k) layers again; the top layers are then customized to the new dataset.
 Small dataset, high similarity : customize and modify the output layers according to the problem statement and use the pre-trained model as a feature extractor.


COMMON ARCHITECTURES

 LeNet-5: 1998
 AlexNet: 2012
 VGG-Net : 2014
 Inception-v1 to v3
 ResNet: 2015

LENET-5

 Proposed by Yann LeCun and others in 1998.
 A multi-layer convolutional neural network for image classification.
 Used for recognizing handwritten and machine-printed characters.
ALEXNET

▪ Alex Krizhevsky and colleagues released AlexNet.
▪ It won the ImageNet Large Scale Visual Recognition Challenge in 2012.
▪ AlexNet is a deeper and much wider version of LeNet.
▪ The use of ReLU as the activation function accelerated training by almost six times.
▪ Dropout layers prevented the model from overfitting.
▪ The use of padding prevents the feature maps from shrinking drastically.
▪ The model was trained on the ImageNet dataset: about 1.2 million training images across a thousand classes (the full ImageNet collection contains over 14 million images).
ALEXNET

VGG-NET

 VGG-Net is one of the most popular pre-trained models for image classification.
 Introduced at the ILSVRC 2014 competition.
 Developed by the Visual Geometry Group at the University of Oxford.
 VGG-16 surpassed AlexNet and was quickly adopted by researchers and industry for image classification tasks.

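For reference, the ImageNet-pretrained VGG-16 can be loaded directly from Keras; a minimal sketch (224x224x3 is the standard VGG-16 input size):

    from tensorflow.keras.applications import VGG16

    # Full 1000-class ImageNet classifier (about 138 million parameters)
    model = VGG16(weights="imagenet")
    model.summary()

    # Convolutional base only (no classifier head), as typically used for transfer learning
    base = VGG16(weights="imagenet", include_top=False, input_shape=(224, 224, 3))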
RESNET : RESIDUAL BLOCKS

 The VGG-Net approach works with a relatively small number of convolutional layers.
 Subsequent research discovered that increasing the number of layers could significantly improve CNN performance.
 The ResNet architecture introduces the simple concept of adding a block's input directly to the output of its series of convolution layers (a skip connection, sketched below).
 This technique smooths out the gradient flow during backpropagation, enabling the network to scale to 50, 100, or even 150 layers.

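A minimal Keras sketch of one such residual block; the 64 filters and 3x3 kernels are illustrative, and a real ResNet also uses 1x1 convolutions (and batch normalization) to handle shape changes:

    from tensorflow.keras import layers

    def residual_block(x, filters=64):
        """Two convolutions whose output is added back to the block's input (the skip connection)."""
        shortcut = x
        y = layers.Conv2D(filters, (3, 3), padding="same", activation="relu")(x)
        y = layers.Conv2D(filters, (3, 3), padding="same")(y)
        y = layers.Add()([shortcut, y])      # add the block's input to the convolution output
        return layers.Activation("relu")(y)

    # Usage (x must already have `filters` channels for the addition to be valid):
    # x = residual_block(x, filters=64)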
RESNET

INCEPTION-1 :

 Problem :
 Salient information can appear at very different scales and locations in an image, so choosing the right kernel size for the convolution operation is tough.
 Very deep networks are prone to overfitting, and it is also hard to pass gradient updates through the entire network.
 Naively stacking large convolution operations is computationally expensive.
 Solution : use filters of multiple sizes that operate at the same level (see the sketch below).

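A minimal Keras sketch of this idea: parallel branches with different kernel sizes applied to the same input, with their outputs concatenated. The filter counts are arbitrary, and the real Inception module additionally uses 1x1 convolutions to reduce computation plus a pooling branch:

    from tensorflow.keras import layers

    def naive_inception_module(x, filters=32):
        """Apply 1x1, 3x3 and 5x5 convolutions to the same input and concatenate the results."""
        b1 = layers.Conv2D(filters, (1, 1), padding="same", activation="relu")(x)
        b3 = layers.Conv2D(filters, (3, 3), padding="same", activation="relu")(x)
        b5 = layers.Conv2D(filters, (5, 5), padding="same", activation="relu")(x)
        return layers.Concatenate()([b1, b3, b5])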
INCEPTION-1 :

 Inception is a deep convolutional neural network architecture that was introduced in 2014.
 It won the ImageNet Large-Scale Visual Recognition Challenge (ILSVRC14).
 It was mostly developed by Google researchers.

INCEPTION-1 : GOOGLENET

 GoogLeNet has 9 inception modules stacked linearly.


 It is 22 layers deep (27, including the pooling layers).
 It uses global average pooling at the end of the last inception module.

INCEPTION-3

▪ Inception-v3 incorporated the following upgrades :

1. RMSProp optimizer.
2. Factorized 5x5 and 7x7 convolutions.
3. BatchNorm in the auxiliary classifiers.
4. …
