Famous Networks
LeNet-5
The first successful applications of Convolutional Networks were developed by Yann LeCun in
the 1990s. Of these, the best known is the LeNet architecture, which was used to read zip codes,
digits, etc.
AlexNet
The first work that popularized Convolutional Networks in Computer Vision was AlexNet,
developed by Alex Krizhevsky, Ilya Sutskever and Geoff Hinton. AlexNet was submitted to
the ImageNet ILSVRC challenge in 2012 and significantly outperformed the runner-up
(top-5 error of 16% compared to the runner-up's 26%). The network had a very similar
architecture to LeNet, but was deeper, bigger, and featured Convolutional Layers stacked on top
of each other (previously it was common to only have a single CONV layer always immediately
followed by a POOL layer). It has about 60M parameters.
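To make "Convolutional Layers stacked on top of each other" concrete, here is a minimal sketch of an AlexNet-like feature extractor. PyTorch and the exact layer sizes are assumptions made purely for illustration; this is a sketch, not the exact published model.

import torch.nn as nn

# Illustrative AlexNet-like feature extractor (approximate channel counts).
alexnet_features = nn.Sequential(
    nn.Conv2d(3, 96, kernel_size=11, stride=4),     # 227x227 -> 55x55
    nn.ReLU(inplace=True),
    nn.MaxPool2d(kernel_size=3, stride=2),          # 55x55 -> 27x27
    nn.Conv2d(96, 256, kernel_size=5, padding=2),
    nn.ReLU(inplace=True),
    nn.MaxPool2d(kernel_size=3, stride=2),          # 27x27 -> 13x13
    nn.Conv2d(256, 384, kernel_size=3, padding=1),  # stacked CONV layers...
    nn.ReLU(inplace=True),
    nn.Conv2d(384, 384, kernel_size=3, padding=1),  # ...with no pooling
    nn.ReLU(inplace=True),
    nn.Conv2d(384, 256, kernel_size=3, padding=1),  # ...between them
    nn.ReLU(inplace=True),
    nn.MaxPool2d(kernel_size=3, stride=2),          # 13x13 -> 6x6
)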
Figure 1.2: AlexNet. There is an error in the figure of the original paper: the input image
must be 227 × 227 so that the rest of the volume sizes are consistent.
VGGNet
The runner-up in ILSVRC 2014 was the network from Karen Simonyan and Andrew Zisserman
that became known as VGGNet. Its main contribution was showing that the depth of
the network is a critical component of good performance. Their final best network contains 16
CONV/FC layers and, appealingly, features an extremely homogeneous architecture that only
performs 3 × 3 convolutions and 2 × 2 pooling from beginning to end. Their pretrained
model is available for plug-and-play use in Caffe. A downside of VGGNet is that it is
more expensive to evaluate and uses a lot more memory and parameters (140M). Most of these
parameters are in the first fully connected layer, and it has since been found that these FC layers
can be removed with no performance degradation, significantly reducing the number of necessary
parameters.
Most of the network's parameters are in the FC layers, while most of the memory required by
the network is used in the first two conv layers.
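To make the homogeneous pattern concrete, the sketch below builds a VGG-style convolutional stack out of a repeated block of 3 × 3 convolutions followed by a 2 × 2 max-pool. PyTorch is assumed here purely for illustration; the block counts mirror the 16-layer configuration, but this is a sketch, not the published model.

import torch.nn as nn

def vgg_block(in_ch, out_ch, num_convs):
    # A repeating VGG-style block: num_convs 3x3 convs, then a 2x2 max-pool.
    layers = []
    for _ in range(num_convs):
        layers += [nn.Conv2d(in_ch, out_ch, kernel_size=3, padding=1),
                   nn.ReLU(inplace=True)]
        in_ch = out_ch
    layers.append(nn.MaxPool2d(kernel_size=2, stride=2))  # halves the spatial size
    return nn.Sequential(*layers)

# Channels double (up to 512) each time the spatial resolution halves.
vgg_features = nn.Sequential(
    vgg_block(3, 64, 2),
    vgg_block(64, 128, 2),
    vgg_block(128, 256, 3),
    vgg_block(256, 512, 3),
    vgg_block(512, 512, 3),
)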
GoogLeNet
The ILSVRC 2014 winner was a Convolutional Network from Szegedy et al. from Google.
Its main contribution was the development of an Inception Module that dramatically reduced
the number of parameters in the network (4M, compared to AlexNet's 60M). Additionally,
this paper uses Average Pooling instead of Fully Connected layers at the top of the ConvNet,
eliminating a large number of parameters that do not seem to matter much. There are also
several follow-up versions of GoogLeNet, most recently Inception-v4.
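As a rough sketch of the Inception idea (PyTorch-style, with illustrative channel counts that are assumptions rather than the published GoogLeNet values): several branches with different filter sizes process the same input in parallel and are concatenated along the channel dimension, with cheap 1 × 1 convolutions acting as bottlenecks that keep the parameter count low.

import torch
import torch.nn as nn

class InceptionModule(nn.Module):
    # Simplified Inception module: parallel 1x1, 3x3, 5x5 and pooling branches,
    # concatenated along the channel dimension.
    def __init__(self, in_ch):
        super().__init__()
        self.branch1 = nn.Conv2d(in_ch, 64, kernel_size=1)
        self.branch3 = nn.Sequential(
            nn.Conv2d(in_ch, 64, kernel_size=1),            # 1x1 bottleneck
            nn.Conv2d(64, 128, kernel_size=3, padding=1))
        self.branch5 = nn.Sequential(
            nn.Conv2d(in_ch, 32, kernel_size=1),            # 1x1 bottleneck
            nn.Conv2d(32, 64, kernel_size=5, padding=2))
        self.branch_pool = nn.Sequential(
            nn.MaxPool2d(kernel_size=3, stride=1, padding=1),
            nn.Conv2d(in_ch, 32, kernel_size=1))

    def forward(self, x):
        return torch.cat([self.branch1(x), self.branch3(x),
                          self.branch5(x), self.branch_pool(x)], dim=1)

Replacing the final fully connected layers with global average pooling (for example nn.AdaptiveAvgPool2d(1) in this framework) is what removes most of the remaining parameters.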
ResNet
Let H(x) be the function you want to obtain. In a typical net you would compute a sequence
of steps ReLU(ReLU(x·W1 + b1)·W2 + b2) to transform x into H(x). Instead, in a ResNet you compute
a delta F(x) that is added to the original input to obtain H(x) = x + F(x).
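A minimal sketch of a residual block in this spirit (PyTorch-style; batch normalization and dimension-changing shortcuts are omitted for brevity, so this is an illustration rather than the exact published block):

import torch.nn as nn

class ResidualBlock(nn.Module):
    # Output is H(x) = x + F(x), where F is a small stack of conv/ReLU layers
    # computing the "delta" that is added back onto the input.
    def __init__(self, channels):
        super().__init__()
        self.conv1 = nn.Conv2d(channels, channels, kernel_size=3, padding=1)
        self.conv2 = nn.Conv2d(channels, channels, kernel_size=3, padding=1)
        self.relu = nn.ReLU(inplace=True)

    def forward(self, x):
        delta = self.conv2(self.relu(self.conv1(x)))  # F(x), the delta
        return self.relu(x + delta)                   # H(x) = x + F(x)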
What is nice about this is that in plain nets the gradient must flow through all of the transformations.
In residual nets, because addition distributes the gradient equally to all of its inputs, the gradient
flows through the (weights, ReLU) path but also skips these transformations and goes directly to the
previous block. In this way the gradient can bypass all the transformations and reach the first layer
directly, so the first layer, which is computing simple statistics, trains very quickly, and the rest
of the layers learn to add to the signal in between to make the whole thing work at the end.
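A tiny self-contained example of this effect (PyTorch autograd is assumed here only for illustration): with a "dead" transformation whose weights are zero, the plain path passes no gradient back to the input, while the residual path still delivers it through the identity.

import torch

w = torch.zeros(3)  # a transformation that contributes nothing

x_plain = torch.ones(3, requires_grad=True)
(x_plain * w).sum().backward()
print(x_plain.grad)   # tensor([0., 0., 0.])  -- gradient blocked by the dead path

x_res = torch.ones(3, requires_grad=True)
(x_res + x_res * w).sum().backward()
print(x_res.grad)     # tensor([1., 1., 1.])  -- gradient flows through the skip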
Figure 1.7: ResNet (many more layers than shown in the diagram)
Another way of seeing it is that ResNets only compute a delta on top of the identity, which
makes them easier to optimize.
The Residual Network developed by Kaiming He et al. was the winner of ILSVRC 2015. It features
special skip connections and heavy use of batch normalization. The architecture also omits
fully connected layers at the end of the network. ResNets are currently by far the state-of-the-art
Convolutional Neural Network models and are the default choice for using ConvNets in practice
(as of May 10, 2016). In particular, also see the more recent developments that tweak the original
architecture from Kaiming He et al., Identity Mappings in Deep Residual Networks (published
March 2016). It is interesting that after the first layer they do pooling (the only pooling in the
whole net), scaling the 224 × 224 input image down to 56 × 56, and the net still works that well.
It is remarkable that every layer except the first works with 56 × 56 × ? volumes, and even after
compressing the data this much the network achieves high accuracy.
Figure 1.8: ResNet structure. 3.6% top-5 error on ImageNet, 152 layers, 2-3 weeks of training
on an 8-GPU machine, faster at test time than VGGNet.
In plot ?? it is clear that networks are getting deeper and deeper, but we have to be careful.
Plot ?? shows the CIFAR-10 training error: on the left, plain nets (weight layer + ReLU); on the
right, ResNets. How is it possible to get a higher training error (dashed lines) with a higher
number of layers? It should not happen, since the model is more complex. The explanation is that
we are still not capable of optimizing such deep plain networks well enough.
So the answer to the question is that we should keep adding more layers, but not in a naive way:
we should do it in the ResNet way.