cs231n_2019_lecture01
cs231n_2019_lecture01
Lecture 1: Introduction
Top row, left to right: Middle row, left to right Bottom row, left to right
Image by Roger H Goun is licensed under CC BY 2.0 Image by BGPHP Conference is licensed under CC BY 2.0; changes made Image is CC0 1.0 public domain
Image is CC0 1.0 public domain Image is CC0 1.0 public domain Image by Derek Keats is licensed under CC BY 2.0; changes made
Image is CC0 1.0 public domain Image by NASA is licensed under CC BY 2.0 Image is public domain
Image is CC0 1.0 public domain Image is CC0 1.0 public domain Image is licensed under CC-BY 2.0; changes made
Mathematics
• CS231n overview
This image is licensed under CC-BY 2.5 This image is licensed under CC-BY 3.0
Leonardo da Vinci,
16th Century AD
This work is in the public domain
Complex cells:
Response to light orientation
and movement Stimulus
Hypercomplex cells: response
to movement with an end point
Stimulus Response
No response Response Cat image by CNX OpenStax is licensed
(end point) under CC BY 4.0; changes made
(a) Original picture (b) Differentiated picture (c) Feature points selected
This image is CC0 1.0 public domain This image is CC0 1.0 public domain
Image is public
domain
Level 0 Level 1
Spatial Pyramid Matching, Lazebnik, Schmid & Ponce, 2006
Train
Person
Airplane
Output: Output:
Scale Scale
T-shirt
Steel drum
Drumstick
✔ T-shirt
Giant panda
Drumstick
✗
Mud turtle Mud turtle
• CS231n overview
Image by Kippelboy is licensed under CC BY-SA 3.0 Image by Christina C. is licensed under CC BY-SA 4.0
Person on Bike
Person
Hammer
Person Bike
conv-128
Dense descriptor grid:
conv-128
HOG, LBP
maxpool
conv-256
Coding: local coordinate, conv-256
super-vector maxpool
conv-512
conv-512
Pooling, SPM maxpool
conv-512
conv-512
Linear SVM maxpool
fc-4096
fc-4096
fc-1000
softmax
Figure copyright Alex Krizhevsky, Ilya [Szegedy arxiv 2014] [Simonyan arxiv 2014] [He ICCV 2015]
Lion image by Swissfrog is
Sutskever, and Geoffrey Hinton, 2012.
licensed under CC BY 3.0
Reproduced with permission.
1998
LeCun et al.
K Output
Fully Connected
Convolutions
Subsampling
2012
Krizhevsky et al.
Data
Computation
14
GTX 1080 Ti
12
10
8
GeForce
6
GTX 580
(AlexNet)
4
GeForce
2
8800 GTX
0
1/2004 10/2006 7/2009 4/2012 12/2014 9/2017
Time
20 GTX 1080 Ti
15
GeForce
10 GTX 580
GeForce (AlexNet)
5 8800 GTX
0
1/2004 10/2006 7/2009 4/2012 12/2014 9/2017
Time
Wall Laptop
Glass Wire
Image is GFDL
Desk
Waving
Teaching Assistants
• Deep Learning by
Goodfellow, Bengio,
and Courville
• Free online