0% found this document useful (0 votes)

8 views

Lecture2 Advanced CNN

CNN

Uploaded by

Quang Uy Nguyen

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

8 views

Lecture2 Advanced CNN

CNN

Uploaded by

Quang Uy Nguyen

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 55

Advanced Convolutional Neural Networks

Nguyen Quang Uy

1
Outline
1. Alexnet

2. VGGnet

3. Googlenet

4. Resnet

5. Mobilenet

6. Efficientnet
2
Legends

3
Layers

4
Activation functions

5
Modules/Blocks

6
Repeated layers

7
Alexnet

8
Overview
• Paper: ImageNet Classification with Deep Convolutional Neural Networks
• Published in: NeurIPS 2012.
• Considered to be the most impact in computer vision.

9
Novelties
• Use Rectified Linear Units (ReLUs) as activation functions.
• Use Dropout layer.
• Use data augmentation.

10
Architecture
• AlexNet has 8 layers — 5 convolutional and 3 fully-connected.
• AlexNet Has 60M parameters.

11
Results
• Top-1 error rates is 37.5%
• Top-5 error rates 17.0%

12
VGG

13
Overview
• VGG: Visual Geometry Group
• Paper: Very Deep Convolutional Networks for Large-Scale Image
Recognition
• Published in arXiv 2014

14
Novelties
• Designing of deeper networks (roughly twice as deep as AlexNet). This was done by
stacking uniform convolutions.
• They use only 3x3 kernels, as opposed to AlexNet 11x11. This design decreases the
number of parameters.

15
Architecture
• VGG has 13 convolutional and 3 fully-connected layers.
• This network stacks more layers onto AlexNet.
• It consists of 138M parameters.

16
VGG result
• Top-1 accuracy is 71.5%
• Top-1 accuracy 90.1%

17
Googlenet

18
Overview
• Also known as Inception-v1
• Paper: Going Deeper with Convolutions
• Published in: 2015 IEEE Conference on Computer Vision and Pattern
Recognition (CVPR).
• Achieve competitive result compared to human

19
Novelties
• Building networks using modules/blocks, instead of stacking convolutional layers.
• 1×1 conv are used for dimensionality reduction to remove computational bottlenecks.
• Have parallel convolutions with filters at 1×1, 3×3 and 5×5, followed by concatenation.
• Use two auxiliary classifiers to encourage discrimination in the lower stages.

20
Architecture

21
Architecture
• Stem and Inception module.

22
Results
• Top-1 accuracy is 78.2%
• Top-5 accuracy is 94.1%
• Human error is 5%-8%.

23
Resnet

24
Overview
• Paper: Deep Residual Learning for Image Recognition.
• Published in: 2016 IEEE Conference on Computer Vision and Pattern
Recognition (CVPR).
• The first network achieves better result then human.

25
Novelties
• Popularise skip connections (they weren’t the first to use skip connections).
• Design even deeper CNNs (up to 152 layers) without compromising model’s
generalisation power
• Among the first to use batch normalisation.

26
Architecture
• Conv block and Identity module.

27
Architecture
• Conv block and Identity module.

28
Resnet result
• Top-1 accuracy is 87.0%.
• Top-5 accuracy 96.3%.
• Top-5 human accuracy: 95.0%

29
Mobilenet

30
Overview
• Paper: MobileNets: Efficient Convolutional Neural Networks for Mobile Vision
Applications
• Published in: 2017 IEEE Conference on Computer Vision and Pattern Recognition
(CVPR).
• Specially designed to be used in mobile devices.

31
Novelties
• MobileNet uses depthwise separable convolutions. It significantly reduces the number of
parameters.
• It introduces two shrinking hyperparameters that efficiently trade off between latency and
accuracy

32
Architecture

33
Architecture
• Deepwise separable convolution.

34
Architecture
• Deepwise convolution:
• In a normal convolution, all channels of a kernel are used to produce a feature map.
• In a depthwise convolution, each channel of a kernel is used to produce a feature map.

35
Architecture
• Pointwise convolution.
• In a normal convolution, we just have to use 256 filters of size 5x5x3.
• In a pointwise convolution, we just have to use 256 filters of size 1x1x3.

36
Computation cost
• Standard convolution

• The computational cost can be calculated as

• Where DF is the dimensions of the input feature map and DK is the

size of the convolution kernel, M and N are the number of input and
output channels respectively.
37
Computation cost
• Depthwise convolution

• The computational cost can be calculated as

38
Computation cost
• Depthwise convolution

• The computational cost can be calculated as

39
Computation cost
• The total computational cost of Depthwise separable convolutions can be
calculated as.

• Comparing it with the computational cost of standard convolution, we get

the reduction in computation.

40
Results
• Mobilenet is better than Googlenet and VGG with much lower number of
operators and parameters.

41
Efficientnet

42
Overview
• Paper: EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks
• Published in: International Conference on Machine Learning, 2019.
• It is considered as the state-of-the-art until today.

43
Novelties
• Compound Scaling from B0 to B7.
• The EfficientNet Architecture (developed using Neural Architecture Search)

44
Architecture

45
Compound scaling
• The most common way to scale up ConvNets was either depth (number of layers), width
(number of channels) or image resolution (image size).
• EfficientNets perform Compound Scaling to scale all three dimensions while mantaining a
balance between all dimensions of the network.

46
Compound scaling
• This idea of compound scaling makes sense because if the input image is
bigger then the network needs more layers (depth) and more channels
(width) to capture more fine-grained patterns.

47
Neural Architecture Search
• This is a reinforcement learning based approach used to develop Efficient-B0 by
leveraging a multi-objective search that optimizes for both Accuracy and FLOPS.

48
Neural Architecture Search
• The objective function can formally be defined as:

49
Mobile inverted bottleneck convolution (MBConv)
• MBConv without squeeze and excitation operation

50
Mobile inverted bottleneck convolution (MBConv)
• MBConv with squeeze and excitation operation

51
Squeeze and excitation operation
• Access to global information
• Modelling channel interdependencies
• Which can be regarded as a self-attention function on channels

52
Scaling Efficient-B0 to get B1-B7
• Let the network depth(d), widt(w) and input image resolution(r) be:

• We then fix α, β, γ as constants and scale up baseline network with

different φ using Equation 3, to obtain EfficientNet-B1 to B7
53
Results

54
Q&A
Thank you!

Hourglass Workout Program by Luisagiuliet 2
76% (21)
Hourglass Workout Program by Luisagiuliet 2
51 pages
12 Week Program: Summer Body Starts Now
87% (46)
12 Week Program: Summer Body Starts Now
70 pages
Read People Like A Book by Patrick King-Edited
58% (81)
Read People Like A Book by Patrick King-Edited
12 pages
Livingood, Blake - Livingood Daily Your 21-Day Guide To Experience Real Health
77% (13)
Livingood, Blake - Livingood Daily Your 21-Day Guide To Experience Real Health
260 pages
Cheat Code To The Universe
94% (79)
Cheat Code To The Universe
34 pages
Facial Gains Guide (001 081)
91% (45)
Facial Gains Guide (001 081)
81 pages
Curse of Strahd
95% (467)
Curse of Strahd
258 pages
The Psychiatric Interview - Daniel Carlat
91% (34)
The Psychiatric Interview - Daniel Carlat
473 pages
The Borax Conspiracy
91% (57)
The Borax Conspiracy
14 pages
The Secret Language of Attraction
86% (107)
The Secret Language of Attraction
278 pages
How To Develop and Write A Grant Proposal
83% (542)
How To Develop and Write A Grant Proposal
17 pages
Penis Enlargement Secret
60% (124)
Penis Enlargement Secret
12 pages
Workbook For The Body Keeps The Score
89% (53)
Workbook For The Body Keeps The Score
111 pages
Donald Trump & Jeffrey Epstein Rape Lawsuit and Affidavits
83% (1016)
Donald Trump & Jeffrey Epstein Rape Lawsuit and Affidavits
13 pages
KamaSutra Positions
78% (69)
KamaSutra Positions
55 pages
7 Hermetic Principles
93% (30)
7 Hermetic Principles
3 pages
27 Feedback Mechanisms Pogil Key
77% (13)
27 Feedback Mechanisms Pogil Key
6 pages
Frank Hammond - List of Demons
92% (92)
Frank Hammond - List of Demons
3 pages
Phone Codes
79% (28)
Phone Codes
5 pages
36 Questions That Lead To Love
91% (35)
36 Questions That Lead To Love
3 pages
How 2 Setup Trust
97% (307)
How 2 Setup Trust
3 pages
The 36 Questions That Lead To Love - The New York Times
94% (34)
The 36 Questions That Lead To Love - The New York Times
3 pages
100 Questions To Ask Your Partner
78% (36)
100 Questions To Ask Your Partner
2 pages
Satanic Calendar
25% (56)
Satanic Calendar
4 pages
The 36 Questions That Lead To Love - The New York Times
95% (21)
The 36 Questions That Lead To Love - The New York Times
3 pages
14 Easiest & Hardest Muscles To Build (Ranked With Solutions)
100% (8)
14 Easiest & Hardest Muscles To Build (Ranked With Solutions)
27 pages
Jeffrey Epstein39s Little Black Book Unredacted PDF
75% (12)
Jeffrey Epstein39s Little Black Book Unredacted PDF
95 pages
1001 Songs
69% (72)
1001 Songs
1,798 pages
The 4 Hour Workweek, Expanded and Updated by Timothy Ferriss - Excerpt
23% (954)
The 4 Hour Workweek, Expanded and Updated by Timothy Ferriss - Excerpt
38 pages
Zodiac Sign & Their Most Common Addictions
63% (30)
Zodiac Sign & Their Most Common Addictions
9 pages
Ch-3 Convolutional Neural Networks (CNNs)
No ratings yet
Ch-3 Convolutional Neural Networks (CNNs)
11 pages
ch4_CNN
No ratings yet
ch4_CNN
35 pages
Classify Webcam Images Using Deep Learning
No ratings yet
Classify Webcam Images Using Deep Learning
17 pages
CS231n Convolutional Neural Networks For Visual Recognition
No ratings yet
CS231n Convolutional Neural Networks For Visual Recognition
2 pages
Chapter 5 Deep Learning
No ratings yet
Chapter 5 Deep Learning
35 pages
MLT CNN Architectures
No ratings yet
MLT CNN Architectures
104 pages
Convolutional Networks
No ratings yet
Convolutional Networks
211 pages
Convolutional Networks
No ratings yet
Convolutional Networks
25 pages
Deep Learning (MODULE-3) (1)
No ratings yet
Deep Learning (MODULE-3) (1)
85 pages
Identify Web Cam Images Using Neural Networks
No ratings yet
Identify Web Cam Images Using Neural Networks
17 pages
5-Convolutional Neural Network
No ratings yet
5-Convolutional Neural Network
43 pages
Convolutional Neural Networks (Cnns / Convnets)
No ratings yet
Convolutional Neural Networks (Cnns / Convnets)
21 pages
Deeplearning - Ai Deeplearning - Ai
No ratings yet
Deeplearning - Ai Deeplearning - Ai
51 pages
Mobile Net
No ratings yet
Mobile Net
9 pages
CNN Apps
No ratings yet
CNN Apps
17 pages
MN906 AI Watermarking
No ratings yet
MN906 AI Watermarking
99 pages
Deep Learning: Seungsang Oh
No ratings yet
Deep Learning: Seungsang Oh
39 pages
Convolutional Neural Network2 26112024 015227pm
No ratings yet
Convolutional Neural Network2 26112024 015227pm
41 pages
Aidl 2023s DL 08 CNN Architectures
No ratings yet
Aidl 2023s DL 08 CNN Architectures
51 pages
CS60010: Deep Learning CNN - Part 3: Sudeshna Sarkar
No ratings yet
CS60010: Deep Learning CNN - Part 3: Sudeshna Sarkar
167 pages
Convolution Neural Networks
No ratings yet
Convolution Neural Networks
80 pages
CNN Architectures 01
No ratings yet
CNN Architectures 01
66 pages
cnn (1)_unit 3_merged
No ratings yet
cnn (1)_unit 3_merged
14 pages
Military AI-Week 05-AI in Computer Vision
No ratings yet
Military AI-Week 05-AI in Computer Vision
65 pages
ML II - Unit IV
No ratings yet
ML II - Unit IV
20 pages
Convolutional Neural Networks
No ratings yet
Convolutional Neural Networks
21 pages
Oct2022 CSC649 SupervisedDL - CNN
No ratings yet
Oct2022 CSC649 SupervisedDL - CNN
79 pages
CS231n - Convolutional-Networks 1
No ratings yet
CS231n - Convolutional-Networks 1
3 pages
Deep Learning Approach For Object Detection Using CNN: Abstract
No ratings yet
Deep Learning Approach For Object Detection Using CNN: Abstract
7 pages
4b Image Processing
No ratings yet
4b Image Processing
63 pages
5b Dana
No ratings yet
5b Dana
67 pages
Deep Convolutional Neural Networks For Image Classification: Many Slides From Rob Fergus (NYU and Facebook)
No ratings yet
Deep Convolutional Neural Networks For Image Classification: Many Slides From Rob Fergus (NYU and Facebook)
55 pages
Advanced DL Computer Vision
No ratings yet
Advanced DL Computer Vision
10 pages
4a Convolutional Neural Networks
No ratings yet
4a Convolutional Neural Networks
56 pages
CV Ss16 0609 Deep Learning
No ratings yet
CV Ss16 0609 Deep Learning
91 pages
Convolutional Neural Networks
No ratings yet
Convolutional Neural Networks
17 pages
Convolutional Neural Network Report
No ratings yet
Convolutional Neural Network Report
5 pages
Efficient CNN Architecture Design Guided by Visualization
No ratings yet
Efficient CNN Architecture Design Guided by Visualization
6 pages
Deep Learning Unit2
No ratings yet
Deep Learning Unit2
43 pages
TResNet
No ratings yet
TResNet
37 pages
VGG net
No ratings yet
VGG net
6 pages
Convolutional Neural Networks: CS 535 Deep Learning, Winter 2020 Fuxin Li
No ratings yet
Convolutional Neural Networks: CS 535 Deep Learning, Winter 2020 Fuxin Li
44 pages
MobileNetV2 Inverted Residuals and Linear Bottlenecks
No ratings yet
MobileNetV2 Inverted Residuals and Linear Bottlenecks
11 pages
CS436_CS5310_EE513_L05_CNN2
No ratings yet
CS436_CS5310_EE513_L05_CNN2
27 pages
DL6 - Convnets 4
No ratings yet
DL6 - Convnets 4
57 pages
Cs437 Cs5317 Ee414 Ee513 l10 Cnncasestudies
No ratings yet
Cs437 Cs5317 Ee414 Ee513 l10 Cnncasestudies
55 pages
Multi-Layered Deep Convolutional Neural Network For Object Detection
No ratings yet
Multi-Layered Deep Convolutional Neural Network For Object Detection
6 pages
Data Science Interview Preparation (#DAY 14)
No ratings yet
Data Science Interview Preparation (#DAY 14)
11 pages
Image Classification Using Convolutional Neural Networks (CNNS)
No ratings yet
Image Classification Using Convolutional Neural Networks (CNNS)
61 pages
DLP
No ratings yet
DLP
50 pages
10. Image Processing With Deep Learning
No ratings yet
10. Image Processing With Deep Learning
39 pages
L3 - UUCLxDeepMind DL2020
No ratings yet
L3 - UUCLxDeepMind DL2020
110 pages
GoogleNet
No ratings yet
GoogleNet
40 pages
Szegedy Rethinking The Inception CVPR 2016 Paper PDF
No ratings yet
Szegedy Rethinking The Inception CVPR 2016 Paper PDF
9 pages
Deep Learning: Alberto Ezpondaburu
No ratings yet
Deep Learning: Alberto Ezpondaburu
58 pages
Classic Cnn
No ratings yet
Classic Cnn
39 pages
An Overview of Convolutional Neural Network Architectures For Deep Learning
No ratings yet
An Overview of Convolutional Neural Network Architectures For Deep Learning
22 pages
DNN Architectures
No ratings yet
DNN Architectures
12 pages
Mesh Generation: Advances and Applications in Computer Vision Mesh Generation
From Everand
Mesh Generation: Advances and Applications in Computer Vision Mesh Generation
Fouad Sabry
No ratings yet
Cisco Packet Tracer Implementation: Building and Configuring Networks: 1, #1
From Everand
Cisco Packet Tracer Implementation: Building and Configuring Networks: 1, #1
S. R. Jena
No ratings yet
Mobilenet SSDv2An Improved Object Detection Model For Embedded Systems
No ratings yet
Mobilenet SSDv2An Improved Object Detection Model For Embedded Systems
5 pages
1 s2.0 S1568494623000844 Main
No ratings yet
1 s2.0 S1568494623000844 Main
18 pages
14 - Pedestrian Detection Based On YOLO Network Model
No ratings yet
14 - Pedestrian Detection Based On YOLO Network Model
5 pages
Gebreegziabher 2020
No ratings yet
Gebreegziabher 2020
5 pages
Instant download Deep Learning Vol 1 From Basics to Practice Andrew Glassner pdf all chapter
100% (5)
Instant download Deep Learning Vol 1 From Basics to Practice Andrew Glassner pdf all chapter
65 pages
PyTorch Geometric Temporal Spatiotemporal Signal Processing
No ratings yet
PyTorch Geometric Temporal Spatiotemporal Signal Processing
10 pages
Deepfake Detection Techniques: A Review: Neeraj Guhagarkar, Sanjana Desai Swanand Vaishyampayan, Ashwini Save
No ratings yet
Deepfake Detection Techniques: A Review: Neeraj Guhagarkar, Sanjana Desai Swanand Vaishyampayan, Ashwini Save
10 pages
Artificial Intelligence in Medicine
No ratings yet
Artificial Intelligence in Medicine
10 pages
Instant download Intelligent Computing Proceedings of the 2020 Computing Conference Volume 3 Kohei Arai pdf all chapter
100% (3)
Instant download Intelligent Computing Proceedings of the 2020 Computing Conference Volume 3 Kohei Arai pdf all chapter
62 pages
A Two Stage Estimation Method Based On Conceptors Aided Unsup 2023 Expert Sy
No ratings yet
A Two Stage Estimation Method Based On Conceptors Aided Unsup 2023 Expert Sy
17 pages
Image Restoration Via Frequency Selection
No ratings yet
Image Restoration Via Frequency Selection
16 pages
Face Recognition Based Attendance Management System
No ratings yet
Face Recognition Based Attendance Management System
5 pages
Plant Disease Detection by CNN
No ratings yet
Plant Disease Detection by CNN
10 pages
Plainmamba: Improving Non-Hierarchical Mamba in Visual Recognition
No ratings yet
Plainmamba: Improving Non-Hierarchical Mamba in Visual Recognition
22 pages
Applied Ai Schedule
No ratings yet
Applied Ai Schedule
19 pages
AI IITRopar Brochure Entrance Test (1)
No ratings yet
AI IITRopar Brochure Entrance Test (1)
12 pages
Aircraft Visual Inspection A Systematic Literature Review
No ratings yet
Aircraft Visual Inspection A Systematic Literature Review
15 pages
Smart Glasses For Visually Impaired Using Image Processing Techniques
No ratings yet
Smart Glasses For Visually Impaired Using Image Processing Techniques
6 pages
YOLOv12_A Breakdown of the Key Architectural Features
No ratings yet
YOLOv12_A Breakdown of the Key Architectural Features
9 pages
Draft 2
No ratings yet
Draft 2
71 pages
Fully Convolutional Networks With Sequential Information For Robust Crop and Weed Detection in Precision Farming
No ratings yet
Fully Convolutional Networks With Sequential Information For Robust Crop and Weed Detection in Precision Farming
16 pages
Department of Information Science and Engineering Technical Seminar (18Css84) Convolutional Neural Networks
No ratings yet
Department of Information Science and Engineering Technical Seminar (18Css84) Convolutional Neural Networks
15 pages
DL, Course Introduction
No ratings yet
DL, Course Introduction
9 pages
B19 PPT PRC 2
No ratings yet
B19 PPT PRC 2
27 pages
Black and White Both Sides MAIN
No ratings yet
Black and White Both Sides MAIN
23 pages
AI-Based Helmet Violation Detection For Traffic Ma
No ratings yet
AI-Based Helmet Violation Detection For Traffic Ma
17 pages
Fakespotter: A Simple Yet Robust Baseline For Spotting Ai-Synthesized Fake Faces
No ratings yet
Fakespotter: A Simple Yet Robust Baseline For Spotting Ai-Synthesized Fake Faces
8 pages
Neo ppt
No ratings yet
Neo ppt
18 pages
Atlas Medical Ultrasonografie
No ratings yet
Atlas Medical Ultrasonografie
130 pages
Holberton School Syllabus
No ratings yet
Holberton School Syllabus
47 pages