08-Convolution Neural Network

The document discusses the transition from Multilayer Perceptrons (MLP) to Convolutional Neural Networks (CNN), emphasizing the importance of spatial relationships in image data. CNNs leverage the structure of images to create more efficient models, reducing the number of parameters needed for training. The document also highlights concepts like translation invariance and locality, which are fundamental to the design of convolutional layers.


CSD456

Deep Learning
Convolution
Neural Network
Till Now…

• Multilayer Perceptron
• Training
• Activation
• Loss
Convolution Neural Network

• So far, we have ignored this rich structure and treated images as vectors of numbers by flattening them, irrespective of the spatial relation between pixels.

• It was necessary to feed the resulting one-dimensional vectors through a fully connected MLP.

• Since an MLP is invariant to the order of its input features, we would get similar results whether or not we preserve an ordering that corresponds to the spatial structure of the pixels.
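As a concrete illustration of the flattening step above (a minimal sketch; the lecture names no framework, so PyTorch and a 28x28 image size are assumptions made purely for illustration):

import torch
import torch.nn as nn

# A single grayscale image, e.g. 28x28 pixels (batch of 1).
image = torch.randn(1, 28, 28)

# Flatten the 2D pixel grid into a 1D vector of 784 numbers.
# Any spatial relationship between neighbouring pixels is discarded here.
flat = image.reshape(1, -1)          # shape: (1, 784)

# A fully connected (MLP) layer now treats every pixel as an
# independent, unordered feature.
fc = nn.Linear(in_features=784, out_features=256)
hidden = fc(flat)                    # shape: (1, 256)

Permuting the input pixels (and the corresponding weight columns) would leave the model class unchanged, which is exactly the order-invariance described above.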
Convolution Neural Network (CNN)

• We should leverage our prior knowledge that nearby pixels are typically related to each other to build efficient models for learning from image data -- CNN (LeCun et al., 1995).

• Modern CNNs, as they are called colloquially, owe their design to inspirations from biology, group theory, and a healthy dose of experimental tinkering.

• CNNs tend to be computationally efficient, both because they require fewer parameters than a fully connected MLP and because convolutions are easy to parallelize across GPU cores.
From MLP to Convolution

• The models that we have discussed so far remain appropriate options when we are dealing with tabular data.

• With tabular data, we do not assume any structure a priori concerning how the features interact.

• However, for high-dimensional perceptual data (e.g., images, video), such structureless networks can grow unwieldy.
Distinguishing cats from dogs -- Example

• Let's say we have collected an annotated dataset of one-megapixel photographs.

• This means that each input to the network has one million
dimensions.
• Even an aggressive reduction to one thousand hidden dimensions would require a fully connected layer characterized by 10^6 × 10^3 = 10^9 parameters.
• Learning the parameters of this network may turn out to be infeasible.
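To make the arithmetic concrete, a minimal sketch in plain Python (the 4-byte float32 storage estimate is an assumption added for illustration):

# Fully connected layer from a flattened 1-megapixel image to 1,000 hidden units.
in_features  = 10**6   # one million input pixels
out_features = 10**3   # one thousand hidden dimensions

n_weights = in_features * out_features   # 10^9 weights
n_biases  = out_features                 # 10^3 biases
n_params  = n_weights + n_biases

print(f"{n_params:,} parameters")                  # 1,000,001,000 parameters
print(f"~{n_params * 4 / 1e9:.1f} GB in float32")  # ~4.0 GB just to store them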
Invariance

• Imagine that we want to detect an object in an image.


• It seems reasonable that whatever method we use to
recognize objects should not be overly concerned with the
precise location of the object in the image.
• We can now make these intuitions more concrete with the following points.
1. Early layers should respond similarly to the same patch, wherever it appears in the image.
2. Early layers should focus on local regions.
3. Deeper layers should capture longer-range features of the image.
Constraining MLP

• Consider an MLP with two-dimensional images 𝐗 as inputs and their immediate hidden representations 𝐇 similarly represented as matrices.

• For now, both 𝐗 and 𝐇 have the same shape.
• At the individual pixel level: H_{i,j} = U_{i,j} + Σ_{k,l} W_{i,j,k,l} X_{k,l}, where W is a fourth-order weight tensor and U contains the biases.
Constraining MLP

• We simply re-index the subscripts (k, l) such that k = i + a and l = j + b; here (a, b) can also be negative.
• In other words, we set V_{i,j,a,b} = W_{i,j,i+a,j+b}, which gives H_{i,j} = U_{i,j} + Σ_{a,b} V_{i,j,a,b} X_{i+a,j+b}.

• Number of parameters: 10^12 (infeasible)
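A minimal NumPy sketch of this fully general formulation, on a tiny toy image so that the fourth-order tensor fits in memory (the 8x8 size is an illustrative assumption):

import numpy as np

h = w = 8                              # toy 8x8 "image" instead of 1000x1000
X = np.random.randn(h, w)              # input image
W = np.random.randn(h, w, h, w)        # fourth-order weight tensor W[i, j, k, l]
U = np.random.randn(h, w)              # biases

# H[i, j] = U[i, j] + sum over (k, l) of W[i, j, k, l] * X[k, l]
H = U + np.einsum('ijkl,kl->ij', W, X)

print(W.size)   # 4096 entries here; (10^6)^2 = 10^12 entries in the megapixel case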
Respond similarly to the same patch -- Translation Invariance

• This implies that a shift in the input 𝐗 should simply lead to a shift in the hidden representation 𝐇.
• This is possible only if V and U do not depend on (i, j).
• That is, V_{i,j,a,b} = V_{a,b} and U is a constant u, so H_{i,j} = u + Σ_{a,b} V_{a,b} X_{i+a,j+b}.

• THIS IS CONVOLUTION !!!!


• Number of parameters: 4 × 10^6
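A minimal NumPy sketch of this weight-sharing operation (technically a cross-correlation, which is what deep learning libraries implement as convolution); the kernel size and values are illustrative assumptions:

import numpy as np

def conv2d(X, V, u=0.0):
    """H[i, j] = u + sum over (a, b) of V[a, b] * X[i + a, j + b], offsets >= 0."""
    kh, kw = V.shape
    out_h, out_w = X.shape[0] - kh + 1, X.shape[1] - kw + 1
    H = np.empty((out_h, out_w))
    for i in range(out_h):
        for j in range(out_w):
            # The same kernel V and bias u are used at every location (i, j).
            H[i, j] = u + np.sum(V * X[i:i + kh, j:j + kw])
    return H

X = np.arange(36, dtype=float).reshape(6, 6)
V = np.array([[1.0, -1.0],
              [1.0, -1.0]])   # a single 2x2 kernel shared across the whole image
print(conv2d(X, V))           # each output is a local weighted sum of X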
Focus on local regions -- Locality

• We should not have to look very far away from location (i, j) in order to compute H_{i,j}.
• This means that outside some range |a| > Δ or |b| > Δ, we set V_{a,b} = 0.

• Number of parameters: 4 × Δ^2
• This is called a convolutional layer.
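A minimal sketch of the resulting parameter count (PyTorch and the choice Δ = 2 are assumptions made purely for illustration):

import torch.nn as nn

# Kernel covering offsets -Δ..Δ in each direction, i.e. a (2Δ+1) x (2Δ+1) window.
delta = 2
conv = nn.Conv2d(in_channels=1, out_channels=1,
                 kernel_size=2 * delta + 1, padding=delta)

n_params = sum(p.numel() for p in conv.parameters())
print(n_params)   # (2Δ+1)^2 weights + 1 bias = 26, on the order of the 4Δ^2 above

# Contrast with the unconstrained fully connected mapping on a 1000x1000 image,
# which needed on the order of 10^12 parameters.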
Convolution

[Figure slides illustrating the convolution operation]

Channels

[Figure slides illustrating convolutions over multiple channels]
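Since the channel slides are figures only, here is a minimal PyTorch sketch of a convolution over multiple input and output channels (all shapes are illustrative assumptions):

import torch
import torch.nn as nn

# An RGB image: 3 input channels, 32x32 pixels, batch of 1.
x = torch.randn(1, 3, 32, 32)

# Map 3 input channels to 16 output channels with a 3x3 kernel
# shared across all spatial locations.
conv = nn.Conv2d(in_channels=3, out_channels=16, kernel_size=3, padding=1)

y = conv(x)
print(y.shape)                                    # torch.Size([1, 16, 32, 32])
print(sum(p.numel() for p in conv.parameters()))  # 3*16*3*3 + 16 = 448 parameters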
