Week 7: ConvNets and Transfer Learning
LeNet-5
▪ Created by Yann LeCun in the 1990s
▪ Used on the MNIST data set
▪ Novel idea: use convolutions to efficiently learn features from the data set
LeNet—Structure Diagram
Input: a 32 x 32 grayscale image (a 28 x 28 MNIST image with 2 pixels of padding all around).
Next, we have a convolutional layer with six 5x5 kernels, giving a 6x28x28 output.
What is the total number of weights in this layer?
Answer: Each kernel has 5x5 = 25 weights (plus a bias term, so actually 26 weights). So total weights = 6 x 26 = 156.
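As a quick check on that arithmetic, here is a minimal sketch, assuming PyTorch (the slides show no code), that builds the same layer and counts its parameters:

import torch.nn as nn

# Six 5x5 kernels over a single (grayscale) input channel, as in LeNet's first conv layer.
conv1 = nn.Conv2d(in_channels=1, out_channels=6, kernel_size=5)

# 6 kernels x (5*5 weights + 1 bias) = 6 x 26 = 156 parameters.
print(sum(p.numel() for p in conv1.parameters()))  # 156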
Next is a 2x2 pooling layer (with stride 2).
So the output size is 6x14x14 (we downsample by a factor of 2).
Note: The original paper actually uses a more complicated pooling operation than max or average pooling, but this is considered obsolete now.
No weights! (Pooling layers have no weights to be learned – pooling is a fixed operation.)
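To illustrate both points, a small sketch, again assuming PyTorch and using max pooling as the modern stand-in for the original pooling operation:

import torch
import torch.nn as nn

pool = nn.MaxPool2d(kernel_size=2, stride=2)

# A dummy batch shaped like the first conv layer's output: (batch, 6, 28, 28).
x = torch.randn(1, 6, 28, 28)
print(pool(x).shape)                              # torch.Size([1, 6, 14, 14])
print(sum(p.numel() for p in pool.parameters()))  # 0 -- nothing to learn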
The next convolutional layer has sixteen 5x5 kernels. The kernels “take in” the full depth of the previous layer, so each 5x5 kernel now “looks at” 6x5x5 pixels.
Each kernel has 6x5x5 = 150 weights + a bias term = 151.
A second 2x2 pooling layer (stride 2) then downsamples the 16x10x10 output to 16x5x5.
We “flatten” this 16x5x5 output to a length-400 vector (not shown in the diagram).
The following layers are just fully connected layers!
From 400 to 120, then from 120 to 84, then from 84 to 10, and finally a softmax output of size 10 for the 10 digits.
LeNet-5
How many total weights in the network?
Conv1: 1*6*5*5 + 6 = 156
Conv3: 6*16*5*5 + 16 = 2416
FC1: 400*120 + 120 = 48120
FC2: 120*84 + 84 = 10164
FC3: 84*10 + 10 = 850
Total: 61706
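The same total can be reproduced with a short sketch. This is one plausible PyTorch rendering of the layer sizes above (tanh activations and max pooling are stand-ins, not exactly what the original paper used); its parameter count comes out to 61706:

import torch.nn as nn

# A LeNet-5-style network following the layer sizes in the slides.
lenet5 = nn.Sequential(
    nn.Conv2d(1, 6, kernel_size=5),   # 1x32x32 -> 6x28x28   (156 weights)
    nn.Tanh(),
    nn.MaxPool2d(2, 2),               # -> 6x14x14           (no weights)
    nn.Conv2d(6, 16, kernel_size=5),  # -> 16x10x10          (2416 weights)
    nn.Tanh(),
    nn.MaxPool2d(2, 2),               # -> 16x5x5            (no weights)
    nn.Flatten(),                     # -> 400
    nn.Linear(400, 120),              # (48120 weights)
    nn.Tanh(),
    nn.Linear(120, 84),               # (10164 weights)
    nn.Tanh(),
    nn.Linear(84, 10),                # (850 weights); softmax is applied on these 10 outputs
)

print(sum(p.numel() for p in lenet5.parameters()))  # 61706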
Motivation
▪ Early layers in a neural network are the hardest (i.e. slowest) to train
▪ This is due to the vanishing gradient problem
▪ But these “primitive” features should be general across many image classification tasks
Motivation
▪ Later layers in the network capture features that are more particular to the specific image classification problem
▪ Later layers are easier (quicker) to train, since adjusting their weights has a more immediate impact on the final result
Motivation
▪ Famous, competition-winning models are difficult to train from scratch
– Huge datasets (like ImageNet)
– Large number of training iterations
– Very heavy computing machinery
– Time spent experimenting to get hyper-parameters just right
Transfer Learning
▪ However, the basic features (edges, shapes) learned in the early layers of the network should generalize
▪ The results of training are just weights (numbers) that are easy to store
▪ Idea: keep the early layers of a pre-trained network, and re-train the later layers for a specific application
▪ This is called Transfer Learning
Transfer Learning
Diagram: the pre-trained network, with convolutional layers feeding into fully connected layers and a softmax classifier.
First, train just the last layer on the new data, keeping the pre-trained convolutional and fully connected layers fixed.
Perhaps, after a while, train back a few more layers (or even the whole network) on the new data.
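A minimal sketch of this workflow, assuming PyTorch with torchvision 0.13 or later and an ImageNet-pretrained ResNet-18 as the source network (the slides do not name a specific model, so the layer names here are illustrative):

import torch.nn as nn
from torchvision import models

# Start from a network whose weights were pre-trained on ImageNet.
model = models.resnet18(weights="IMAGENET1K_V1")

# Freeze all pre-trained layers: their weights will not be updated.
for p in model.parameters():
    p.requires_grad = False

# Replace the final fully connected layer with a fresh one for the new task
# (e.g. 2 classes: dogs vs. cats). Only this new layer is trained at first.
model.fc = nn.Linear(model.fc.in_features, 2)

# Later, to fine-tune "a few more layers", unfreeze the last convolutional block too
# (unfreezing everything would re-train the entire network from this starting point).
for p in model.layer4.parameters():
    p.requires_grad = True

# The optimizer should then be given only the trainable parameters, e.g.
# torch.optim.Adam(p for p in model.parameters() if p.requires_grad).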
Transfer Learning Options
▪ The additional training of a pre-trained network on a specific new dataset is referred to as “fine-tuning”
▪ There are different options for “how much” and “how far back” to fine-tune
– Should I train just the very last layer?
– Go back a few layers?
– Re-train the entire network (from the starting point of the existing network)?
Guiding Principles for Fine-Tuning
While there are no “hard and fast” rules, there are some guiding principles to keep in mind.
1) The more similar your data and problem are to the source data of the pre-trained network, the less fine-tuning is necessary.
E.g. Using a network trained on ImageNet to distinguish “dogs” from “cats” should need relatively little fine-tuning. It already distinguishes different breeds of dogs and cats, so it likely has all the features you will need.
2) The more data you have about your specific problem, the more the network will benefit from longer and deeper fine-tuning.
E.g. If you have only 100 dogs and 100 cats in your training data, you probably want to do very little fine-tuning. If you have 10,000 dogs and 10,000 cats, you may get more value from longer and deeper fine-tuning.
3) If your data is substantially different in nature from the data the source model was trained on, Transfer Learning may be of little value.
E.g. A network that was trained to recognize typed Latin alphabet characters would not be useful for distinguishing cats from dogs. But it likely would be useful as a starting point for recognizing Cyrillic alphabet characters.