12 Convolutional Neural Networks
Fundamentals of Image Processing
(Slides adapted from Fei-Fei Li, Andrej Karpathy & Justin Johnson.)
[Figure: a 32x32x3 input image, i.e. 32 (height) x 32 (width) x 3 (depth)]
Convolution Layer
[Figure: a 5x5x3 filter over a 32x32x3 image]
Convolution Layer
Filters always extend the full depth of the input volume.
[Figure: a 5x5x3 filter over a 32x32x3 image]
Convolution Layer
Convolving the filter with the image at one position gives 1 number: the result of taking a dot product between the filter and a small 5x5x3 chunk of the image.
Convolution Layer
[Figure: sliding the 5x5x3 filter over all spatial locations of the 32x32x3 image yields a 28x28x1 activation map]
Convolution Layer
Consider a second (green) filter: convolving it over the same 32x32x3 image yields a second 28x28x1 activation map.
Convolution Layer
For example, if we had six 5x5 filters, we'd get 6 separate activation maps. We stack these up to get a "new image" of size 28x28x6.
Preview: a ConvNet is a sequence of convolutional layers, interspersed with activation functions.
[Figure: 32x32x3 input -> CONV + ReLU (e.g. six 5x5x3 filters) -> 28x28x6 output]
Preview: a ConvNet is a sequence of convolutional layers, interspersed with activation functions.
[Figure: 32x32x3 input -> CONV + ReLU (six 5x5x3 filters) -> 28x28x6 -> CONV + ReLU (ten 5x5x6 filters) -> 24x24x10 -> CONV + ReLU -> ...]
Preview [From recent Yann LeCun slides]
One filter => one activation map.
[Figure: example 5x5 filters (32 total) and their activation maps]
We call the layer convolutional because it is related to the convolution of two signals: an elementwise multiplication and sum of a filter and the signal (image).
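To make "elementwise multiplication and sum" concrete, here is a minimal numpy sketch at one filter position (sizes follow the running 32x32x3 example; the bias value is an arbitrary assumption):

```python
import numpy as np

# One output value of a conv layer: elementwise multiply-and-sum
# of a filter with one chunk of the image.
rng = np.random.default_rng(0)
image = rng.standard_normal((32, 32, 3))   # 32x32x3 input volume
filt = rng.standard_normal((5, 5, 3))      # 5x5x3 filter (full depth)
bias = 0.1                                 # assumed bias, for illustration

chunk = image[0:5, 0:5, :]                 # one 5x5x3 chunk of the image
value = np.sum(chunk * filt) + bias        # 75-dimensional dot product + bias
print(value)                               # 1 number
```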
Preview
A closer look at spatial dimensions:
[Figure: a 5x5x3 filter over a 32x32x3 image produces a 28x28x1 activation map]
A closer look at spatial dimensions:
7x7 input (spatially), assume a 3x3 filter applied with stride 1
=> 5x5 output
7x7 input (spatially), assume a 3x3 filter applied with stride 2
=> 3x3 output!
7x7 input (spatially), a 3x3 filter applied with stride 3?
Doesn't fit! We cannot apply a 3x3 filter to a 7x7 input with stride 3.
Output size, for input width N and filter size F:
(N - F) / stride + 1
e.g. N = 7, F = 3:
stride 1 => (7 - 3)/1 + 1 = 5
stride 2 => (7 - 3)/2 + 1 = 3
stride 3 => (7 - 3)/3 + 1 = 2.33 (not an integer: doesn't fit!)
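As a quick sketch, the same formula in Python (the helper name is ours):

```python
def conv_output_size(n, f, stride):
    """Spatial output size of a conv layer: (N - F) / stride + 1."""
    if (n - f) % stride != 0:
        raise ValueError(f"{f}x{f} filter at stride {stride} doesn't fit a {n}x{n} input")
    return (n - f) // stride + 1

print(conv_output_size(7, 3, 1))  # 5
print(conv_output_size(7, 3, 2))  # 3
# conv_output_size(7, 3, 3) -> ValueError: doesn't fit
```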
In practice: common to zero-pad the border.
e.g. input 7x7, 3x3 filter applied with stride 1, padded with a 1-pixel border of zeros => what is the output?
(recall: (N - F) / stride + 1)
7x7 output! With padding P the formula generalizes to (N + 2P - F) / stride + 1, giving (7 + 2*1 - 3)/1 + 1 = 7 here.
In general, it is common to see CONV layers with stride 1, filters of size FxF, and zero padding of (F-1)/2, which preserves the spatial size:
e.g. F = 3 => zero pad with 1
F = 5 => zero pad with 2
F = 7 => zero pad with 3
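Extending the earlier sketch to include padding (again, a helper of our own naming):

```python
def conv_output_size_padded(n, f, stride, pad):
    """Spatial output size with zero padding: (N + 2P - F) / stride + 1."""
    return (n + 2 * pad - f) // stride + 1

# Padding with (F - 1) / 2 at stride 1 preserves spatial size:
for f in (3, 5, 7):
    print(f, conv_output_size_padded(7, f, stride=1, pad=(f - 1) // 2))  # all print 7
```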
Remember back: a 32x32 input convolved repeatedly with 5x5 filters shrinks volumes spatially (32 -> 28 -> 24 -> ...). Shrinking too fast is not good; it doesn't work well. Zero padding lets us preserve spatial size instead.
Recap: Convolution Layer
- Accepts a volume of size W1 x H1 x D1
- Requires four hyperparameters: number of filters K, their spatial extent F, the stride S, the amount of zero padding P
- Produces a volume of size W2 x H2 x D2, where W2 = (W1 - F + 2P)/S + 1, H2 = (H1 - F + 2P)/S + 1, D2 = K
- With parameter sharing, it introduces F*F*D1 weights per filter, for a total of (F*F*D1)*K weights and K biases
Example time:
Input volume: 32x32x3; 10 5x5 filters with stride 1, pad 2.
Output volume size? (32 + 2*2 - 5)/1 + 1 = 32 spatially, so: 32x32x10.
Number of parameters in this layer? Each filter has 5*5*3 + 1 = 76 params (including the bias), so 76 * 10 = 760 in total.
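A small sketch that checks both answers (the function name is ours):

```python
def conv_layer_stats(in_h, in_w, in_depth, num_filters, f, stride, pad):
    """Output shape and parameter count for a conv layer."""
    out_h = (in_h + 2 * pad - f) // stride + 1
    out_w = (in_w + 2 * pad - f) // stride + 1
    params = num_filters * (f * f * in_depth + 1)  # +1 bias per filter
    return (out_h, out_w, num_filters), params

shape, params = conv_layer_stats(32, 32, 3, num_filters=10, f=5, stride=1, pad=2)
print(shape, params)  # (32, 32, 10) 760
```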
Common settings:
- K (number of filters) = powers of 2, e.g. 32, 64, 128, 512
- F = 3, S = 1, P = 1
- F = 5, S = 1, P = 2
- F = 5, S = 2, P = whatever fits
- F = 1, S = 1, P = 0
(btw, 1x1 convolution layers make perfect sense)
[Figure: a 56x56x64 volume through a 1x1 CONV with 32 filters -> 56x56x32. Each filter has size 1x1x64 and performs a 64-dimensional dot product.]
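A numpy sketch of why this makes sense: a 1x1 convolution is just a per-pixel dot product across the depth dimension.

```python
import numpy as np

# 1x1 convolution = a 64-dimensional dot product at every spatial position.
rng = np.random.default_rng(0)
x = rng.standard_normal((56, 56, 64))   # input volume
w = rng.standard_normal((64, 32))       # 32 filters, each of size 1x1x64

y = x @ w                               # dot product over depth at each pixel
print(y.shape)                          # (56, 56, 32)
```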
The brain/neuron view of a CONV layer
[Figure: a 5x5x3 filter over a 32x32x3 image]
1 number: the result of taking a dot product between the filter and this part of the image (i.e. a 5*5*3 = 75-dimensional dot product)
In this view, a "5x5 filter" is a "5x5 receptive field for each neuron": each neuron in the 28x28 activation map looks only at a local 5x5 region of the input.
Activation Functions
- Sigmoid: σ(x) = 1 / (1 + e^(-x))
- tanh(x)
- ReLU: max(0, x)
Sigmoid: 3 problems:
1. Saturated neurons "kill" the gradients
2. Sigmoid outputs are not zero-centered
3. exp() is a bit compute expensive
tanh(x): squashes numbers to the range [-1, 1]; zero-centered (nice); but still kills gradients when saturated.
ReLU (Rectified Linear Unit):
- Computes f(x) = max(0, x)
- Does not saturate (in the + region)
- Very computationally efficient
- Converges much faster than sigmoid/tanh in practice (e.g. 6x)
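A minimal sketch of the three activations discussed above:

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))   # saturates; not zero-centered; uses exp()

def tanh(x):
    return np.tanh(x)                  # zero-centered, but still saturates

def relu(x):
    return np.maximum(0.0, x)          # no saturation in the + region; cheap

x = np.array([-2.0, 0.0, 2.0])
print(sigmoid(x), tanh(x), relu(x))
```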
Two more layers to go: POOL/FC.
Pooling layer
- makes the representations smaller and more manageable
- operates over each activation map independently:
Max Pooling
Single depth slice; max pool with 2x2 filters and stride 2:

1 1 2 4
5 6 7 8   ->   6 8
3 2 1 0        3 4
1 2 3 4
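The same example as a minimal numpy sketch (2x2 filters, stride 2, one depth slice):

```python
import numpy as np

x = np.array([[1, 1, 2, 4],
              [5, 6, 7, 8],
              [3, 2, 1, 0],
              [1, 2, 3, 4]])

# Reshape each 2x2 block onto its own axes, then take the max per block.
h, w = x.shape
pooled = x.reshape(h // 2, 2, w // 2, 2).max(axis=(1, 3))
print(pooled)  # [[6 8]
               #  [3 4]]
```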
Common settings for pooling:
F = 2, S = 2
F = 3, S = 2
Fully Connected Layer (FC layer)
- Contains neurons that connect to the entire input volume, as in ordinary neural networks
[ConvNetJS demo: training on CIFAR-10]
http://cs.stanford.edu/people/karpathy/convnetjs/demo/cifar10.html
Case Studies
Case Study: LeNet-5 [LeCun et al., 1998]
Case Study: AlexNet [Krizhevsky et al. 2012]
Pooling layers have no weights to learn, so for them: parameters: 0!
AlexNet was the best model in the ILSVRC 2012 competition.
Case Study: VGGNet [Simonyan and Zisserman, 2014]

(not counting biases)
INPUT:     [224x224x3]   memory: 224*224*3   = 150K  params: 0
CONV3-64:  [224x224x64]  memory: 224*224*64  = 3.2M  params: (3*3*3)*64    = 1,728
CONV3-64:  [224x224x64]  memory: 224*224*64  = 3.2M  params: (3*3*64)*64   = 36,864
POOL2:     [112x112x64]  memory: 112*112*64  = 800K  params: 0
CONV3-128: [112x112x128] memory: 112*112*128 = 1.6M  params: (3*3*64)*128  = 73,728
CONV3-128: [112x112x128] memory: 112*112*128 = 1.6M  params: (3*3*128)*128 = 147,456
POOL2:     [56x56x128]   memory: 56*56*128   = 400K  params: 0
CONV3-256: [56x56x256]   memory: 56*56*256   = 800K  params: (3*3*128)*256 = 294,912
CONV3-256: [56x56x256]   memory: 56*56*256   = 800K  params: (3*3*256)*256 = 589,824
CONV3-256: [56x56x256]   memory: 56*56*256   = 800K  params: (3*3*256)*256 = 589,824
POOL2:     [28x28x256]   memory: 28*28*256   = 200K  params: 0
CONV3-512: [28x28x512]   memory: 28*28*512   = 400K  params: (3*3*256)*512 = 1,179,648
CONV3-512: [28x28x512]   memory: 28*28*512   = 400K  params: (3*3*512)*512 = 2,359,296
CONV3-512: [28x28x512]   memory: 28*28*512   = 400K  params: (3*3*512)*512 = 2,359,296
POOL2:     [14x14x512]   memory: 14*14*512   = 100K  params: 0
CONV3-512: [14x14x512]   memory: 14*14*512   = 100K  params: (3*3*512)*512 = 2,359,296
CONV3-512: [14x14x512]   memory: 14*14*512   = 100K  params: (3*3*512)*512 = 2,359,296
CONV3-512: [14x14x512]   memory: 14*14*512   = 100K  params: (3*3*512)*512 = 2,359,296
POOL2:     [7x7x512]     memory: 7*7*512     = 25K   params: 0
FC:        [1x1x4096]    memory: 4096               params: 7*7*512*4096  = 102,760,448
FC:        [1x1x4096]    memory: 4096               params: 4096*4096     = 16,777,216
FC:        [1x1x1000]    memory: 1000               params: 4096*1000     = 4,096,000

TOTAL memory: 24M * 4 bytes ~= 93MB / image (forward pass only; roughly double for the backward pass)
TOTAL params: 138M parameters

Note: most of the memory is in the early CONV layers; most of the parameters are in the late FC layers.
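A quick check of the parameter total above, with the layer list reproduced in code:

```python
# Tally the VGGNet parameter counts from the table (biases omitted for
# conv layers, as in the table; FC rows as listed).
layers = [
    ("CONV3-64", 3*3*3*64), ("CONV3-64", 3*3*64*64),
    ("CONV3-128", 3*3*64*128), ("CONV3-128", 3*3*128*128),
    ("CONV3-256", 3*3*128*256), ("CONV3-256", 3*3*256*256), ("CONV3-256", 3*3*256*256),
    ("CONV3-512", 3*3*256*512), ("CONV3-512", 3*3*512*512), ("CONV3-512", 3*3*512*512),
    ("CONV3-512", 3*3*512*512), ("CONV3-512", 3*3*512*512), ("CONV3-512", 3*3*512*512),
    ("FC-4096", 7*7*512*4096), ("FC-4096", 4096*4096), ("FC-1000", 4096*1000),
]
total = sum(p for _, p in layers)
print(f"{total:,}")  # 138,344,128 -> ~138M parameters
```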
Case Study: GoogLeNet [Szegedy et al., 2014]
[Figure: the Inception module]
Case Study: ResNet [He et al., 2015]
ILSVRC 2015 winner (3.6% top-5 error)
2-3 weeks of training on an 8-GPU machine.
At runtime: faster than a VGGNet! (even though it has 8x more layers)
[Figure: the 224x224x3 input is reduced to a spatial dimension of only 56x56 within the first layers]
[Figure: an AlexNet-style pipeline (slide by Yisong Yue)]
RGB input image (224x224x3)
-> 7x7x3 convolution (96 filters), 3x3 max pooling, downsample 4x -> 55x55x96
-> 5x5x96 convolution (256 filters), 3x3 max pooling, downsample 4x -> 13x13x256
-> 3x3x256 convolution (384 filters) -> 13x13x384
-> 3x3x384 convolution (384 filters) -> 13x13x384
-> 3x3x384 convolution (256 filters), 3x3 max pooling, downsample 2x -> 6x6x256
-> standard layer (4096 units) -> standard layer (4096 units) -> logistic regression (~1000 classes)
http://www.image-net.org/

Visualizing CNN features (Layers 1 through 5)
[Figures: per-layer feature visualizations from Zeiler & Fergus]
http://cs.nyu.edu/~fergus/papers/zeilerECCV2014.pdf
http://cs.nyu.edu/~fergus/presentations/nips2013_final.pdf
Tips and Tricks
• Shuffle the training samples
Input representation
“Given a rectangular image, we first rescaled the image such that the shorter side was of length 256, and then cropped out the central 256×256 patch from the resulting image”
Data Augmentation
• The neural net has 60M real-valued parameters and 650,000 neurons
• It overfits a lot. Therefore, they train on 224x224 patches extracted randomly from 256x256 images, and also their horizontal reflections.
“This increases the size of our training set…” (slide by Alex Krizhevsky)
Random mix/combinations of:
- translation
- rotation
- stretching
- shearing
- lens distortions, … (go crazy)
A minimal sketch of the basic crop/flip version follows below.
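A numpy sketch of the AlexNet-style scheme described above: random 224x224 crops of a 256x256 image plus random horizontal reflections (the function is ours, for illustration):

```python
import numpy as np

rng = np.random.default_rng(0)

def augment(image, crop=224):
    """Random crop plus random horizontal reflection."""
    h, w, _ = image.shape
    top = rng.integers(0, h - crop + 1)
    left = rng.integers(0, w - crop + 1)
    patch = image[top:top + crop, left:left + crop, :]
    if rng.random() < 0.5:
        patch = patch[:, ::-1, :]      # horizontal reflection
    return patch

image = rng.standard_normal((256, 256, 3))
print(augment(image).shape)            # (224, 224, 3)
```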
Transfer Learning with ConvNets
1. Train on ImageNet.
2. Small dataset: use the ConvNet as a feature extractor. Freeze the lower layers and train only the top classifier layer.
3. Medium dataset: finetuning. More data = retrain more of the network (or all of it), freezing only the earliest layers.
Tip: when finetuning, use only ~1/10th of the original learning rate for the top layer, and ~1/100th for intermediate layers. (A sketch follows below.)
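One possible realization of this recipe in PyTorch; the model choice, the 10-class head, and the base learning rate are illustrative assumptions, not part of the slides:

```python
import torch
import torchvision

# 1. Start from an ImageNet-pretrained model.
model = torchvision.models.resnet18(weights="IMAGENET1K_V1")

# 2. Small dataset: freeze everything, train only a new top layer.
for param in model.parameters():
    param.requires_grad = False
model.fc = torch.nn.Linear(model.fc.in_features, 10)  # new classifier head

# 3. Medium dataset: unfreeze and finetune with scaled-down learning rates.
for param in model.parameters():
    param.requires_grad = True

base_lr = 0.01  # assumed original learning rate
optimizer = torch.optim.SGD([
    {"params": model.fc.parameters(), "lr": base_lr / 10},   # top layer: ~1/10th
    {"params": (p for name, p in model.named_parameters()
                if not name.startswith("fc")),
     "lr": base_lr / 100},                                   # intermediate: ~1/100th
])
```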
Today ConvNets are everywhere:
- Classification and retrieval [Krizhevsky 2012]
- Detection [Faster R-CNN: Ren, He, Girshick, Sun 2015] and segmentation [Farabet et al., 2012]
- Self-driving cars (e.g. running on the NVIDIA Tegra X1)
- Face recognition [Taigman et al. 2014]
- Playing Atari games [Mnih 2013]
- Image captioning
- DeepDream art: reddit.com/r/deepdream