cs231n: Understanding CNNs
Conv/FC Filters. The second common strategy is to visualize the weights. These are usually most interpretable
on the first CONV layer which is looking directly at the raw pixel data, but it is possible to also show the filter
weights deeper in the network. The weights are useful to visualize because well-trained networks usually display
nice and smooth filters without any noisy patterns. Noisy patterns can be an indicator of a network that hasn’t
been trained for long enough, or possibly a very low regularization strength that may have led to overfitting.
Typical-looking filters on the first CONV layer (left), and the 2nd CONV layer (right) of a trained AlexNet. Notice that the first-
layer weights are very nice and smooth, indicating a nicely converged network. The color/grayscale features are clustered
because the AlexNet contains two separate streams of processing, and an apparent consequence of this architecture is that
one stream develops high-frequency grayscale features and the other low-frequency color features. The 2nd CONV layer
weights are not as interpretable, but it is apparent that they are still smooth, well-formed, and absent of noisy patterns.
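As a concrete illustration, the snippet below plots the first-layer filters of a pretrained AlexNet as an image grid. This is a minimal sketch assuming a recent torchvision; the layer index and filter shape are specific to torchvision's AlexNet implementation.

```python
import torch
import torchvision
import matplotlib.pyplot as plt

model = torchvision.models.alexnet(weights="IMAGENET1K_V1")
model.eval()

# First CONV layer: 64 filters of shape 3x11x11, looking directly at RGB pixels.
filters = model.features[0].weight.data.clone()            # (64, 3, 11, 11)

# Arrange the filters into a single image grid; scale_each rescales every
# filter to [0, 1] independently so its color structure is visible.
grid = torchvision.utils.make_grid(filters, nrow=8, padding=1,
                                   normalize=True, scale_each=True)

plt.figure(figsize=(6, 6))
plt.imshow(grid.permute(1, 2, 0).numpy())                  # CHW -> HWC for matplotlib
plt.axis("off")
plt.title("First-layer CONV filters")
plt.show()
```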
One problem with this approach is that ReLU neurons do not necessarily have any semantic meaning by
themselves. Rather, it is more appropriate to think of multiple ReLU neurons as the basis vectors of some space
that represents image patches. In other words, the visualization is showing the patches at the edge of the
cloud of representations, along the (arbitrary) axes that correspond to the filter weights. This can also be seen by
the fact that neurons in a ConvNet operate linearly over the input space, so any arbitrary rotation of that space is
a no-op. This point was further argued in Intriguing properties of neural networks by Szegedy et al., where they
perform a similar visualization along arbitrary directions in the representation space.
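The "arbitrary rotation is a no-op" argument can be checked numerically in a toy linear setting: if a code is produced by a linear map and read out by a linear classifier, rotating the code space while folding the inverse rotation into the classifier leaves the scores unchanged, so the individual axes carry no special meaning. The numpy sketch below uses made-up dimensions purely for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)
d_in, d_code, n_classes = 20, 8, 5

W = rng.standard_normal((d_code, d_in))       # layer producing the code
V = rng.standard_normal((n_classes, d_code))  # linear classifier on the code
x = rng.standard_normal(d_in)

# Random rotation of the code space (orthogonal matrix from a QR decomposition).
Q, _ = np.linalg.qr(rng.standard_normal((d_code, d_code)))

scores_original = V @ (W @ x)
scores_rotated  = (V @ Q.T) @ ((Q @ W) @ x)   # rotate the code, un-rotate in the classifier

print(np.allclose(scores_original, scores_rotated))  # True
```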
To produce an embedding, we can take a set of images and use the ConvNet to extract the CNN codes (e.g. in
AlexNet the 4096-dimensional vector right before the classifier, and crucially, including the ReLU non-linearity).
We can then plug these into t-SNE and get a 2-dimensional vector for each image. The corresponding images can
then be visualized in a grid:
t-SNE embedding of a set of images based on their CNN codes. Images that are nearby each other are also close in the CNN
representation space, which implies that the CNN "sees" them as being very similar. Notice that the similarities are more often
class-based and semantic rather than pixel- and color-based. For more details on how this visualization was produced, the
associated code, and related visualizations at different scales, refer to t-SNE visualization of CNN codes.
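A rough sketch of this pipeline is shown below, assuming torchvision and scikit-learn are available; `image_paths` is a hypothetical list of image files, and the 4096-dimensional code is taken after the ReLU that follows AlexNet's penultimate fully-connected layer.

```python
import numpy as np
import torch
import torchvision
import torchvision.transforms as T
from PIL import Image
from sklearn.manifold import TSNE

model = torchvision.models.alexnet(weights="IMAGENET1K_V1").eval()

# Keep everything up to (and including) the ReLU after the penultimate FC layer,
# dropping the final classifier layer, so the output is the 4096-d CNN code.
feature_extractor = torch.nn.Sequential(
    model.features, model.avgpool, torch.nn.Flatten(),
    *list(model.classifier.children())[:-1],
)

preprocess = T.Compose([
    T.Resize(256), T.CenterCrop(224), T.ToTensor(),
    T.Normalize(mean=[0.485, 0.456, 0.406], std=[0.229, 0.224, 0.225]),
])

codes = []
with torch.no_grad():
    for path in image_paths:                  # image_paths: your own image files (hypothetical)
        x = preprocess(Image.open(path).convert("RGB")).unsqueeze(0)
        codes.append(feature_extractor(x).squeeze(0).numpy())
codes = np.stack(codes)                       # (N, 4096)

# t-SNE maps the 4096-d codes down to a 2-d position for each image.
xy = TSNE(n_components=2, init="pca", perplexity=30).fit_transform(codes)
```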
Occluding parts of the image
One way to investigate which part of the image a classification decision comes from is to plot the probability of the class of
interest as a function of the position of an occluder object: we slide a zeroed-out (grey) patch over the image and record the
class probability at each position.
Three input images (top). Notice that the occluder region is shown in grey. As we slide the occluder over the image we record
the probability of the correct class and then visualize it as a heatmap (shown below each image). For instance, in the left-most
image we see that the probability of Pomeranian plummets when the occluder covers the face of the dog, giving us some level
of confidence that the dog's face is primarily responsible for the high classification score. Conversely, zeroing out other parts of
the image has a relatively negligible impact.
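The experiment itself is only a few lines. The sketch below assumes a pretrained model and a preprocessed input tensor `x` of shape (1, 3, H, W), with `target_class` the index of the class of interest (both hypothetical inputs); the patch size, stride, and fill value are arbitrary choices.

```python
import torch
import torch.nn.functional as F

def occlusion_heatmap(model, x, target_class, patch=32, stride=16, fill=0.0):
    """Slide a zeroed-out patch over the image and record the probability of
    the target class at each position; low values mark regions the prediction
    depends on."""
    model.eval()
    _, _, H, W = x.shape
    rows = (H - patch) // stride + 1
    cols = (W - patch) // stride + 1
    heatmap = torch.zeros(rows, cols)
    with torch.no_grad():
        for i in range(rows):
            for j in range(cols):
                occluded = x.clone()
                r, c = i * stride, j * stride
                occluded[:, :, r:r + patch, c:c + patch] = fill   # occlude one patch
                probs = F.softmax(model(occluded), dim=1)
                heatmap[i, j] = probs[0, target_class]
    return heatmap
```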
Visualizing the data gradient and friends
Data Gradient (Deep Inside Convolutional Networks: Visualising Image Classification Models and Saliency Maps); a minimal sketch of this method is given below.
DeconvNet.
Guided Backpropagation.
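Of these, the data gradient is the simplest to sketch: the gradient of the raw class score with respect to the input pixels indicates which pixels most influence that score. The helper below is a minimal illustration; the pretrained model and preprocessed input tensor are assumed to be given.

```python
import torch

def saliency_map(model, x, target_class):
    """Data gradient / saliency map: gradient of the class score w.r.t. the input pixels."""
    model.eval()
    x = x.clone().requires_grad_(True)          # x: (1, 3, H, W), hypothetical input
    scores = model(x)                           # raw (unnormalized) class scores
    scores[0, target_class].backward()          # backprop the score of the class of interest
    # Take the maximum absolute gradient over the color channels for each pixel.
    return x.grad.detach().abs().max(dim=1)[0].squeeze(0)   # (H, W)
```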
Fooling ConvNets
Explaining and Harnessing Adversarial Examples
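The fast gradient sign method from that paper is essentially a one-step perturbation: move every pixel by a small epsilon in the direction that increases the loss, which is often enough to flip the predicted class while the change stays imperceptible. The sketch below assumes a pretrained model, a preprocessed input `x`, and its label tensor `y` (all hypothetical); clamping the result back to a valid pixel range is omitted for brevity.

```python
import torch
import torch.nn.functional as F

def fgsm(model, x, y, epsilon=0.01):
    """Fast gradient sign method: one signed gradient step on the image itself."""
    model.eval()
    x = x.clone().requires_grad_(True)
    loss = F.cross_entropy(model(x), y)
    loss.backward()
    return (x + epsilon * x.grad.sign()).detach()
```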
Comparing ConvNets to Human labelers
What I learned from competing against a ConvNet on ImageNet