0% found this document useful (0 votes)

6 views

Week 11 - Convolutional

This document discusses Convolutional Neural Networks (CNNs) and their applications in image classification, object detection, and image segmentation. It covers key concepts such as invariance, equivariance, convolutional layers, and the differences between fully connected networks and convolutional networks. The document also highlights the architecture and performance of various CNN models, including AlexNet and VGG, along with techniques like data augmentation and transfer learning.

Uploaded by

sawerayaseen654

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

6 views

Week 11 - Convolutional

Uploaded by

sawerayaseen654

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 78

Week 11

Convolutional Neural Networks

Dr. Muhammad Wasim

Slides adopted from Prof. Simon Prince (Understanding Deep Learning Book)
Convolutional networks
• Networks for images
• Invariance and equivariance
• 1D convolution
• Convolutional layers
• Channels
• Convolutional network for MNIST 1D
Image classification

• Multiclass classification problem (discrete classes, >2 possible classes)

• Convolutional network
Object detection
Image segmentation

• Multivariate binary classification problem (many outputs, two discrete classes)

• Convolutional encoder-decoder network
Networks for images
• Problems with fully-connected networks
1. Size
• 224x224 RGB image = 150,528 dimensions
• Hidden layers generally larger than inputs
• One hidden layer = 150,520x150,528 weights -- 22 billion
2. Nearby pixels statistically related
• But could permute pixels and relearn and get same results with FC
3. Should be stable under transformations
• Don’t want to re-learn appearance at different parts of image
Convolutional networks
• Parameters only look at local image patches
• Share parameters across image
Convolutional networks
• Networks for images
• Invariance and equivariance
• 1D convolution
• Convolutional layers
• Channels
• Convolutional network for MNIST 1D
Invariance
• A function f[x] is invariant to a transformation t[] if:

i.e., the function output is the same even after the transformation is
applied.
Invariance example
e.g., Image classification
• Image has been translated, but we want our classifier to give the same result
Equivariance
• A function f[x] is equivariant to a transformation t[] if:

i.e., the output is transformed in the same way as the input

Equivariance example
e.g., Image segmentation
• Image has been translated and we want segmentation to translate with it
Convolutional networks
• Networks for images
• Invariance and equivariance
• 1D convolution
• Convolutional layers
• Channels
• Convolutional network for MNIST 1D
Convolution* in 1D
• Input vector x:

• Output is weighted sum of neighbors:

• Convolutional kernel or filter:

Kernel size = 3

* Not really technically convolution

Convolution with kernel size 3
Convolution with kernel size 3
Convolution with kernel size 3

Equivariant to translation of input

Zero padding

Treat positions that are beyond end of the input as zero.

“Valid” convolutions

Only process positions where kernel falls in image (smaller output).

Stride, kernel size, and dilation
• Stride = shift by k positions for each output
• Decreases size of output relative to input
• Kernel size = weight a different number of inputs for each output
• Combine information from a larger area
• But kernel size 5 uses 5 parameters
• Dilated or atrous convolutions = intersperse kernel values with zeros
• Combine information from a larger area
• Fewer parameters
1
1 1
1 1 1
1 1 1 2
Convolutional networks
• Networks for images
• Invariance and equivariance
• 1D convolution
• Convolutional layers
• Channels
• Receptive fields
• Convolutional network for MNIST 1D
Convolutional layer
Special case of fully-connected
network
Convolutional network:

Fully connected network:

Special case of fully-connected
network
Convolutional network:

3 weights, 1 bias

Fully connected network:

weights, D biases
Special case of fully-connected
network

Fully connected network

Special case of fully-connected
network

Fully connected network Convolution, kernel 3,

stride 1, dilation 1
Special case of fully-connected
network

Fully connected network Convolution, size 3, stride 1, Convolution, size 3, stride 2,

dilation 1, zero padding dilation 1, zero padding
Question 1

• Kernel size?
• Stride?
• Dilation?
• Zero padding / valid?
Question 2

• Kernel size?
• Stride?
• Dilation?
• Zero padding / valid?
Question 3

• Kernel size?
• Stride?
• Dilation?
• Zero padding / valid?
Convolutional networks
• Networks for images
• Invariance and equivariance
• 1D convolution
• Convolutional layers
• Channels
• Convolutional network for MNIST 1D
Channels
• The convolutional operation averages together the inputs
• Plus passes through ReLU function
• Has to lose information
• Solution:
• apply several convolutions and stack them in channels
• Sometimes also called feature maps
Two output channels, one input
channel
Two output channels, one input
channel
Two input channels, one output
channel
How many parameters?
• If there are input channels and kernel size K

Kernel size, stride, dilation all

work as you would expect
How many parameters?
• If there are input channels and kernel size K x K

• If there are input channels and output channels

Convolution #2
• 2D Convolution
• Downsampling and upsampling, 1x1 convolution
• Image classification
• Object detection
• Semantic segmentation
Downsampling

Sample every other

position (equivalent to
stride two)
Downsampling

Sample every other Max pooling

position (equivalent to (partial invariance to
stride two) translation)
Downsampling

Sample every other Max pooling Mean pooling

position (equivalent to (partial invariance to
stride two) translation)
Upsampling

Duplicate
Upsampling

Duplicate Max-upsampling
Upsampling

Duplicate Max-upsampling Bilinear interpolation

Convolution #2
• 2D Convolution
• Downsampling and upsampling, 1x1 convolution
• Image classification
• Object detection
• Semantic segmentation
ImageNet database

• 224 x 224 images

• 1,281,167 training images, 50,000 validation images, and 100,000 test images
• 1000 classes
AlexNet (2012)

Almost all the 60 million

parameters
parameters are in fully
connected layers
Data augmentation

• Data augmentation a factor of 2048 using (i) spatial transformations

and (ii) modifications of the input intensities.
Dropout

• Dropout was applied in the fully connected layers

Details
• At test time average results from five different cropped and
mirrored versions of the image
• SGD with a momentum coeﬀicient of 0.9 and batch size of 128.
• L2 (weight decay) regularizer used.
• This system achieved a 16.4% top-5 error rate and a 38.1%
top-1 error rate.
VGG (2015)
Details
• 19 hidden layers
• 144 million parameters
• 6.8% top-5 error rate, 23.7% top-1 error rate
Convolution #2
• 2D Convolution
• Downsampling and upsampling, 1x1 convolution
• Image classification
• Object detection
• Semantic segmentation
• Residual networks
• U-Nets and hourglass networks
You Only Look Once (YOLO)

• Network similar to VGG (448x448 input)

• 7×7 grid of locations
• Predict class at each location
• Predict 2 bounding boxes at each location
• Five parameters –x,y, height, width, and confidence
• Momentum, weight decay, dropout, and data augmentation
• Heuristic at the end to threshold and decide final boxes
Object detection (YOLO)
Transfer learning

Transfer learning from ImageNet classification

Results
Convolution #2
• 2D Convolution
• Downsampling and upsampling, 1x1 convolution
• Image classification
• Object detection
• Semantic segmentation
Semantic Segmentation (2015)

Encoder Decoder
Semantic segmentation results

CompTIA Network+ Review Guide: Exam N10-008
From Everand
CompTIA Network+ Review Guide: Exam N10-008
Jon Buhagiar
No ratings yet
CS60010_CNN
No ratings yet
CS60010_CNN
39 pages
AE556_2024_Topic4_CNN
No ratings yet
AE556_2024_Topic4_CNN
26 pages
Cnnbasics 171028092801
No ratings yet
Cnnbasics 171028092801
43 pages
Unit - 2
No ratings yet
Unit - 2
51 pages
Convolutional Neural Networks: CS 535 Deep Learning, Winter 2020 Fuxin Li
No ratings yet
Convolutional Neural Networks: CS 535 Deep Learning, Winter 2020 Fuxin Li
44 pages
CNN Iitkgp
No ratings yet
CNN Iitkgp
112 pages
Sarma Cnn Vce Oct 2022
No ratings yet
Sarma Cnn Vce Oct 2022
63 pages
Convolutional Neural Networks - Part 1
No ratings yet
Convolutional Neural Networks - Part 1
44 pages
Module 3
No ratings yet
Module 3
67 pages
05introduction To Convolutional Neural Networks
No ratings yet
05introduction To Convolutional Neural Networks
72 pages
CS601 Machine Learning Unit 3
No ratings yet
CS601 Machine Learning Unit 3
47 pages
Cnn
No ratings yet
Cnn
123 pages
Lecture-CNN
No ratings yet
Lecture-CNN
68 pages
Lec5 CNN RNN Attention
No ratings yet
Lec5 CNN RNN Attention
71 pages
5-Convolutional Neural Network
No ratings yet
5-Convolutional Neural Network
43 pages
DL6 - Convnets 4
No ratings yet
DL6 - Convnets 4
57 pages
UNIT 2 Study Materials 1
No ratings yet
UNIT 2 Study Materials 1
42 pages
Lecture_3
No ratings yet
Lecture_3
48 pages
Convolutional Neural Network (CNN)
No ratings yet
Convolutional Neural Network (CNN)
38 pages
[Fall 2024] Images and Convolutions
No ratings yet
[Fall 2024] Images and Convolutions
69 pages
Cnns Convolution Neural Networks
No ratings yet
Cnns Convolution Neural Networks
50 pages
Convolutional Neural Networks
No ratings yet
Convolutional Neural Networks
35 pages
Convolutional Neural Networks
No ratings yet
Convolutional Neural Networks
102 pages
AIML_ECE_UNIT-5
No ratings yet
AIML_ECE_UNIT-5
48 pages
Convolutional Neural Network
No ratings yet
Convolutional Neural Network
55 pages
Convolutional Neural Network
No ratings yet
Convolutional Neural Network
11 pages
Week6_Intro to Convolutional Neural Networks
No ratings yet
Week6_Intro to Convolutional Neural Networks
25 pages
Convolutional Neural Networks: Convolutions, Pooling and Cnns. Neural Architectures For Computer Vision
No ratings yet
Convolutional Neural Networks: Convolutions, Pooling and Cnns. Neural Architectures For Computer Vision
64 pages
Convolutional_Networks_2024
No ratings yet
Convolutional_Networks_2024
44 pages
1.neural Networks and Convolutional Processing
No ratings yet
1.neural Networks and Convolutional Processing
94 pages
Military AI-Week 05-AI in Computer Vision
No ratings yet
Military AI-Week 05-AI in Computer Vision
65 pages
Unit 3 - Machine Learning
No ratings yet
Unit 3 - Machine Learning
29 pages
Assignment #1: Afzal Ali (11282) Muhammad Hammad (11293) Muhammad Bilal (11291) Mehran Ahmed (11287) Date 20/03/2019
No ratings yet
Assignment #1: Afzal Ali (11282) Muhammad Hammad (11293) Muhammad Bilal (11291) Mehran Ahmed (11287) Date 20/03/2019
7 pages
21CS743_DL_Module4_notes
No ratings yet
21CS743_DL_Module4_notes
7 pages
Module 3 Notes
No ratings yet
Module 3 Notes
22 pages
21CS743_Module4_notes
No ratings yet
21CS743_Module4_notes
15 pages
3.3 - CNNs
No ratings yet
3.3 - CNNs
29 pages
Convolutional Neural Networks - Part 2
No ratings yet
Convolutional Neural Networks - Part 2
49 pages
Lab 5 - Intro To Convolutional Neural Networks
No ratings yet
Lab 5 - Intro To Convolutional Neural Networks
52 pages
Unit Iii Convolutional Networks and Sequence Modelling
No ratings yet
Unit Iii Convolutional Networks and Sequence Modelling
38 pages
Iii Unit - Deeplearning
No ratings yet
Iii Unit - Deeplearning
93 pages
CC511 Week 7 - Deep - Learning
No ratings yet
CC511 Week 7 - Deep - Learning
33 pages
PNAL9_CNNs
No ratings yet
PNAL9_CNNs
61 pages
HODL Lec 3 DNNs For Vision 1
No ratings yet
HODL Lec 3 DNNs For Vision 1
36 pages
Lec 8
No ratings yet
Lec 8
60 pages
Convolutional Neural Networks
No ratings yet
Convolutional Neural Networks
77 pages
Convolutional Neural Networks - Annotated
No ratings yet
Convolutional Neural Networks - Annotated
83 pages
Understanding of Convolutional Neural Network (CNN) - Deep Learning
No ratings yet
Understanding of Convolutional Neural Network (CNN) - Deep Learning
7 pages
Lecture 6
No ratings yet
Lecture 6
17 pages
02 Cnn Slides
No ratings yet
02 Cnn Slides
77 pages
Module 3
No ratings yet
Module 3
46 pages
CNNs
No ratings yet
CNNs
22 pages
CNN
No ratings yet
CNN
62 pages
Ch. 10: Introduction To Convolution Neural Networks CNN and Systems
No ratings yet
Ch. 10: Introduction To Convolution Neural Networks CNN and Systems
69 pages
10-Variants of Convolution Function-21-Sep-2020Material I 21-Sep-2020 Module5 CNN
No ratings yet
10-Variants of Convolution Function-21-Sep-2020Material I 21-Sep-2020 Module5 CNN
23 pages
CNN2
No ratings yet
CNN2
70 pages
CONVOLUTIONAL NEURAL NETWORK
No ratings yet
CONVOLUTIONAL NEURAL NETWORK
36 pages
Convolutional Neural Networks: Fundamentals and Applications for Analyzing Visual Imagery
From Everand
Convolutional Neural Networks: Fundamentals and Applications for Analyzing Visual Imagery
Fouad Sabry
No ratings yet
Machine Learning - Advanced Concepts
From Everand
Machine Learning - Advanced Concepts
Derrick Mwiti
No ratings yet
MB Manual B550m-Aorus-Elite e 1101
No ratings yet
MB Manual B550m-Aorus-Elite e 1101
44 pages
Risk Assess T-17 - Using Portable Hand Tools
No ratings yet
Risk Assess T-17 - Using Portable Hand Tools
4 pages
Establishing a Hematopoietic Stem Cell Transplantation Unit A Practical Guide EPUB DOCX PDF Download
No ratings yet
Establishing a Hematopoietic Stem Cell Transplantation Unit A Practical Guide EPUB DOCX PDF Download
15 pages
UniPunch System
No ratings yet
UniPunch System
52 pages
PPA_Unit1 Indian Administration Kautilya
No ratings yet
PPA_Unit1 Indian Administration Kautilya
7 pages
Public Transportation Systems
No ratings yet
Public Transportation Systems
16 pages
85100-100 EN Product-Catalog 2023-10
No ratings yet
85100-100 EN Product-Catalog 2023-10
220 pages
Evangelista, Rizamae - Case Study 1
No ratings yet
Evangelista, Rizamae - Case Study 1
6 pages
Concurrency Control Dbms
No ratings yet
Concurrency Control Dbms
49 pages
Vijayanagara Sri Krishnadevaraya University, Ballari: A Business Familiarisation Report OF
No ratings yet
Vijayanagara Sri Krishnadevaraya University, Ballari: A Business Familiarisation Report OF
33 pages
Bu4 Act
No ratings yet
Bu4 Act
5 pages
Rohin Arora & Another vs. D.D.A PDF
No ratings yet
Rohin Arora & Another vs. D.D.A PDF
6 pages
The Spirit of A Biophilic Shopping Mall Final Version P5 Roos Bolleboom 4809076
No ratings yet
The Spirit of A Biophilic Shopping Mall Final Version P5 Roos Bolleboom 4809076
31 pages
22 NM
No ratings yet
22 NM
12 pages
NIT22
No ratings yet
NIT22
275 pages
Fae376 A20 Mitsumi
No ratings yet
Fae376 A20 Mitsumi
2 pages
The Book of Riddles - Anonymous
No ratings yet
The Book of Riddles - Anonymous
12 pages
Chapter 5
No ratings yet
Chapter 5
8 pages
Syteline UsingBOMs&EngChgNotices Slides
No ratings yet
Syteline UsingBOMs&EngChgNotices Slides
157 pages
Customer Feedback Management System
No ratings yet
Customer Feedback Management System
51 pages
Graphics and Digital Photo-Editing
No ratings yet
Graphics and Digital Photo-Editing
10 pages
Workmen Compensation Act 1923
No ratings yet
Workmen Compensation Act 1923
27 pages
SSA-A12 Paper Cup Machine Book Let
No ratings yet
SSA-A12 Paper Cup Machine Book Let
3 pages
Itu-T: Generic Functional Architecture of Transport Networks
No ratings yet
Itu-T: Generic Functional Architecture of Transport Networks
58 pages
Mexico Catalogue
No ratings yet
Mexico Catalogue
16 pages
Updated Entertainment List
No ratings yet
Updated Entertainment List
13 pages
Sand Art Club
No ratings yet
Sand Art Club
18 pages
Nitk Academic Calendar
No ratings yet
Nitk Academic Calendar
1 page
Lesson 19: Comparison Shopping-Unit Price and Related Measurement Conversions
No ratings yet
Lesson 19: Comparison Shopping-Unit Price and Related Measurement Conversions
7 pages
Case Study Assignment - Technology Strategy - WWW - Topgradepapers
No ratings yet
Case Study Assignment - Technology Strategy - WWW - Topgradepapers
7 pages