0% found this document useful (0 votes)
20 views

Lecture 1 Part 1

Uploaded by

abczyxpqr
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
20 views

Lecture 1 Part 1

Uploaded by

abczyxpqr
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 68

CS231n: Deep Learning for

Computer Vision

Lecture 1: Introduction

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 1 April 2, 2024


Welcome to CS231n

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 2 April 2, 2024


Welcome to CS231n
2015

2016
2017 2020

2018 2019 2021

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 3 April 2, 2024


Artificial Intelligence

Slide inspiration: Justin Johnson

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 4 April 2, 2024


Artificial Intelligence

Machine Learning

Computer
Vision

Slide inspiration: Justin Johnson

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 5 April 2, 2024


Artificial Intelligence

Machine Learning

Computer Deep
Vision Learning

Slide inspiration: Justin Johnson

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 6 April 2, 2024


This class
Artificial Intelligence

Machine Learning

Computer Deep
Vision Learning

Slide inspiration: Justin Johnson

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 7 April 2, 2024


This class
Artificial Intelligence

Machine Learning

Computer Deep
Vision Learning

n g u age
atu ral La ing
N ss
Proce

Slide inspiration: Justin Johnson

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 8 April 2, 2024


This class
Artificial Intelligence

Machine Learning

Computer Deep
Vision Learning

n g u age
ral La ing e e ch
u p
Nat ss S ition
Proce Reco
g n

Slide inspiration: Justin Johnson

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 9 April 2, 2024


This class
Artificial Intelligence

Rob
o tics Machine Learning

Computer Deep
Vision Learning

n g u age
ral La ing e e ch
u p
Nat ss S ition
Proce Reco
g n

Slide inspiration: Justin Johnson

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 10 April 2, 2024


Mathematics Artificial Intelligence This class

Neuroscience
Rob
o tics Machine Learning

Computer Deep
Vision Learning
Computer
Science
n g u age
ral La ing e e ch
u p
Nat ss S ition
Proce Reco
g n
Physics Psychology
Biology Slide inspiration: Justin Johnson

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 11 April 2, 2024


Today’s agenda
• A brief history of computer vision and deep learning

• CS231n overview

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 12 April 2, 2024


Evolution’s Big Bang:
Cambrian Explosion, 530-540million years, B.C.

This image is licensed under CC-BY 2.5

This image is licensed under CC-BY 2.5


This image is licensed under CC-BY 3.0

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 13 April 2, 2024


April 2, 2024 14
Camera Obscura
Gemma Frisius, 1545 Encyclopedia, 18th Century

This work is in the public


domain

Leonardo da Vinci,
16th Century AD
This work is in the public domain

This work is in the public


domain

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 15 April 2, 2024


Computer Vision is everywhere!
Left to right:
Image by Roger H Goun is
licensed under CC BY 2.0
Image is CC0 1.0 public domain
Image is CC0 1.0 public domain
Image is CC0 1.0 public domain

Left to right:
Image is free to use
Image is CC0 1.0 public
domain
Image by NASA is licensed
under CC BY 2.0
Image is CC0 1.0 public
domain

Bottom row, left to right


Image is CC0 1.0 public
domain
Image by Derek Keats is
licensed under CC BY 2.0;
changes made
Image is public domain
Image is licensed under CC-BY
2.0; changes made

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 16 April 2, 2024


Where did we come from?

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 17 April 2, 2024


Hubel and Wiesel, 1959
Measure
brain activity

Simple cells:
Response to specific
rotation and orientation

Complex cells:
Response to light
orientation and
Cat image by CNX OpenStax is licensed movement, some
under CC BY 4.0; changes made
translation invariance

1959
Hubel & Wiesel
Response Stimulus
No
response
Slide inspiration: Justin Johnson

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 18 April 2, 2024


Larry Roberts, 1963

(a) Original picture (b) Differentiated picture (c) Feature points selected

1959 1963
Hubel & Wiesel Roberts

Lawrence Gilman Roberts, “Machine Perception of Three-Dimensional Solids”, 1963 Slide inspiration: Justin Johnson

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 19 April 2, 2024


1959 1963
Hubel & Wiesel Roberts

https://dspace.mit.edu/handle/1721.1/6125 Slide inspiration: Justin Johnson

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 20 April 2, 2024


2 ½-D sketch 3-D model
Input image Edge image

This image is CC0 1.0 public domain This image is CC0 1.0 public domain

Input Primal 2 ½-D 3-D Model


Image Sketch Sketch Representation

Zero crossings, Local surface 3-D models


blobs, edges, orientation and hierarchically
Perceived bars, ends, discontinuities organized in
intensities virtual lines, in depth and in terms of surface
groups, curves surface and volumetric
boundaries orientation primitives
1959 1963 1970s
Hubel & Wiesel Roberts David Marr

Stages of Visual Representation, David Marr, 1970s


Slide inspiration: Justin Johnson

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 21 April 2, 2024


Recognition via Parts (1970s)

Generalized Cylinders, Pictorial Structures,


Brooks and Binford, Fischler and Elshlager, 1973
1979
1959 1963 1970s 1979
Hubel & Wiesel Roberts David Marr Gen. Cylinders

Slide inspiration: Justin Johnson

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 22 April 2, 2024


Recognition via Edge Detection (1980s)

1959 1963 1970s 1979 1986 John Canny, 1986


Hubel & Wiesel Roberts David Marr Gen. Cylinders Canny David Lowe, 1987

Image is CC0 1.0 public domain Slide inspiration: Justin Johnson

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 23 April 2, 2024


Arriving at an “AI winter”

- Enthusiasm (and funding!) for AI research dwindled


- ”Expert Systems” failed to deliver on their promises
- But subfields of AI continues to grow
- Computer vision, NLP, robotics, compbio, etc.

1959
1963 1970s 1979 1986
Hubel &
Roberts David Marr Gen. Cylinders Canny
Wiesel
AI Winter

Left Image is CC BY 3.0 Middl Image is public Right Image is CC-BY 2.0; changes made Slide inspiration: Justin Johnson
domain

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 24 April 2, 2024


In the meantime…seminal work in
cognitive and neuroscience

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 25 April 2, 2024


I. Biederman, Science, 1972
Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 26 April 2, 2024
Rapid Serial Visual Perception (RSVP)

Potter, etc. 1970s

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 27 April 2, 2024


150 ms !! Thorpe, et al. Nature, 1996
Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 -
Neural correlates of object & scene recognition

Kanwisher et al. J. Neuro. 1997 Epstein & Kanwisher, Nature, 1998

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 29 April 2, 2024


Visual recognition is a fundamental
task for visual intelligence

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 30 April 2, 2024


Recognition via Grouping (1990s)

1959 1963 1970s 1979 1986 1997


Hubel & Wiesel Roberts David Marr Gen. Cylinders Canny Norm. Cuts
Normalized Cuts, Shi and Malik, 1997
AI Winter

Left Image is CC BY 3.0 Middl Image is public Right Image is CC-BY 2.0; changes made Slide inspiration: Justin Johnson
domain

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 31 April 2, 2024


Recognition via Matching (2000s)

Image is public domain


Image is public domain

1959
Hubel & Wiesel
1963
Roberts
1970s
David Marr
1979
Gen. Cylinders
1986
Canny
1997
Norm. Cuts
1999
SIFT
SIFT, David
Lowe, 1999
AI Winter

Slide inspiration: Justin Johnson

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 32 April 2, 2024


Face Detection

Viola and Jones, 2001

One of the first successful


applications of machine
learning to vision

1959 1963 1970s 1979 1986 1997 1999 2001


Hubel & Wiesel Roberts David Marr Gen. Cylinders Canny Norm. Cuts SIFT V&J

AI Winter

Slide inspiration: Justin Johnson

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 33 April 2, 2024


PASCAL Visual
Object Challenge
Image is CC0 1.0 public domain

Train
Person

Airplane

Image is CC0 1.0 public domain

1959 1963 1970s 1979 1986 1997 1999 2001 2004, 2007
Caltech101;
Hubel & Wiesel Roberts David Marr Gen. Cylinders Canny Norm. Cuts SIFT V&J PASCAL

AI Winter

Slide inspiration: Justin Johnson

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 34 April 2, 2024


Perceptron

Frank Rosenblatt, ~1957


1959 1963 1970s 1979 1986 1997 1999 2001 2004, 2007
Caltech101;
Hubel & Wiesel Roberts David Marr Gen. Cylinders Canny Norm. Cuts SIFT V&J PASCAL

AI Winter
1958
Perceptron Slide inspiration: Justin Johnson

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 35 April 2, 2024


Minsky and Papert, 1969
X Y F(x,y) y
0 0 0

0 1 1

1 0 1

1 1 0
x

Showed that Perceptrons could not learn the XOR


function
Caused a lot of disillusionment in the field
1959 1963 1970s 1979 1986 1997 1999 2001 2004, 2007
Caltech101;
Hubel & Wiesel Roberts David Marr Gen. Cylinders Canny Norm. Cuts SIFT V&J PASCAL

AI Winter
1958 1969
Perceptron Minsky & Papert Slide inspiration: Justin Johnson

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 36 April 2, 2024


Neocognitron: Fukushima, 1980

Computational model the visual system,


directly inspired by Hubel and Wiesel’s
hierarchy of complex and simple cells

Interleaved simple cells (convolution)


and complex cells (pooling)

No practical training algorithm


1959 1963 1970s 1979 1986 1997 1999 2001 2004, 2007
Caltech101;
Hubel & Wiesel Roberts David Marr Gen. Cylinders Canny Norm. Cuts SIFT V&J PASCAL

AI Winter
1958 1969 1980
Perceptron Minsky & Papert Neocognitron Slide inspiration: Justin Johnson

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 37 April 2, 2024


Backprop: Rumelhart, Hinton, and Williams, 1986
Introduced
backpropagation
for computing recognizable
gradients in neural math
networks

Successfully trained
perceptrons with
multiple layers Illustration of Rum elhart et al., 1986 by Lane M cIntosh,
copyright CS231n 2017

1959 1963 1970s 1979 1986 1997 1999 2001 2004, 2007
Caltech101;
Hubel & Wiesel Roberts David Marr Gen. Cylinders Canny Norm. Cuts SIFT V&J PASCAL

AI Winter
1958 1969 1980 1985
Perceptron Minsky & Papert Neocognitron Backprop Slide inspiration: Justin Johnson

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 38 April 2, 2024


Convolutional Networks: LeCun et al, 1998

Applied backprop algorithm to a Neocognitron-like architecture


Learned to recognize handwritten digits
Was deployed in a commercial system by NEC, processed handwritten checks
Very similar to our modern convolutional networks!

1959 1963 1970s 1979 1986 1997 1999 2001 2004, 2007
Caltech101;
Hubel & Wiesel Roberts David Marr Gen. Cylinders Canny Norm. Cuts SIFT V&J PASCAL

AI Winter
1958 1969 1980 1985 1998
Perceptron Minsky & Papert Neocognitron Backprop LeNet Slide inspiration: Justin Johnson

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 39 April 2, 2024


2000s: “Deep Learning”
People tried to train neural networks that
were deeper and deeper

Not a mainstream research topic at this time

Hinton and Salakhutdinov, 2006


Bengio et al, 2007

Slide inspiration: Justin Johnson


Lee et al, 2009
Glorot and Bengio, 2010

1959 1963 1970s 1979 1986 1997 1999 2001 2007


Hubel & Wiesel Roberts David Marr Gen. Cylinders Canny Norm. Cuts SIFT V&J PASCAL

AI Winter
1958 1969 1980 1985 1998 2006
Perceptron Minsky & Papert Neocognitron Backprop LeNet Deep Learning

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 40 April 2, 2024


2000s: “Deep Learning”
People tried to train neural networks that
were deeper and deeper

Not a mainstream research topic at this time

No good dataset to work on

Hinton and Salakhutdinov, 2006


Bengio et al, 2007

Slide inspiration: Justin Johnson


Lee et al, 2009
Glorot and Bengio, 2010
1959 1963 1970s 1979 1986 1997 1999 2001 2007
Hubel & Wiesel Roberts David Marr Gen. Cylinders Canny Norm. Cuts SIFT V&J PASCAL

AI Winter
1958 1969 1980 1985 1998 2006
Perceptron Minsky & Papert Neocognitron Backprop LeNet Deep Learning

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 41 April 2, 2024


Output:
The Image Classification Challenge: Scale
T-shirt
1,000 object classes Steel drum
1,431,167 images Drumstick
Mud turtle

Deng et al, 2009


Russakovsky et al. IJCV 2015

1959 1963 1970s 1979 1986 1997 1999 2001 2004, 2007 2009
Caltech101;
Hubel & Wiesel Roberts David Marr Gen. Cylinders Canny Norm. Cuts SIFT V&J PASCAL ImageNet

AI Winter
1958 1969 1980 1985 1998 2006
Perceptron Minsky & Papert Neocognitron Backprop LeNet Deep Learning

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 42 April 2, 2024


1959 1963 1970s 1979 1986 1997 1999 2001 2004, 2007 2009
Caltech101;
Hubel & Wiesel Roberts David Marr Gen. Cylinders Canny Norm. Cuts SIFT V&J PASCAL ImageNet

AI Winter
1958 1969 1980 1985 1998 2006
Perceptron Minsky & Papert Neocognitron Backprop LeNet Deep Learning

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 43 April 2, 2024


AlexNet, 2012

1959 1963 1970s 1979 1986 1997 1999 2001 2004, 2007 2009
Caltech101;
Hubel & Wiesel Roberts David Marr Gen. Cylinders Canny Norm. Cuts SIFT V&J PASCAL ImageNet

AI Winter
1958 1969 1980 1985 1998 2006 2012
Perceptron Minsky & Papert Neocognitron Backprop LeNet Deep Learning AlexNet

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 44 April 2, 2024


AlexNet: Deep Learning Goes Mainstream

Slide inspiration: Justin Johnson


Krizhevsky, Sutskever, and Hinton, NeurIPS 2012

1959 1963 1970s 1979 1986 1997 1999 2001 2004, 2007 2009
Caltech101;
Hubel & Wiesel Roberts David Marr Gen. Cylinders Canny Norm. Cuts SIFT V&J PASCAL ImageNet

AI Winter
1958 1969 1980 1985 1998 2006 2012
Perceptron Minsky & Papert Neocognitron Backprop LeNet Deep Learning AlexNet

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 45 April 2, 2024


AlexNet vs. Neocognitron: 32 years apart

Slide inspiration: Justin Johnson


1959 1963 1970s 1979 1986 1997 1999 2001 2004, 2007 2009
Caltech101;
Hubel & Wiesel Roberts David Marr Gen. Cylinders Canny Norm. Cuts SIFT V&J PASCAL ImageNet

AI Winter
1958 1969 1980 1985 1998 2006 2012
Perceptron Minsky & Papert Neocognitron Backprop LeNet Deep Learning AlexNet

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 46 April 2, 2024


2012 to Present: Deep Learning Explosion
CVPR Papers
8000
6000
Subm…
4000 Acce…
2000
0
1985 1990 1995 2000 2005 2010 2015 2020

Slide inspiration: Justin Johnson


Publications at top Computer Vision conference arXiv papers per month (source)
1959 1963 1970s 1979 1986 1997 1999 2001 2004, 2007 2009
Caltech101;
Hubel & Wiesel Roberts David Marr Gen. Cylinders Canny Norm. Cuts SIFT V&J PASCAL ImageNet

AI Winter
1958 1969 1980 1985 1998 2006 2012
Perceptron Minsky & Papert Neocognitron Backprop LeNet Deep Learning AlexNet

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 47 April 2, 2024


2012 to Present: Deep Learning is Everywhere
Year 2010 Year 2012 Year 2014 Year 2015
NEC-UIUC SuperVision GoogLeNet VGG MSRA
Image
Pooling
Convoluti conv-64
on conv-64
Softmax maxpool
Other conv-
Dense descriptor grid: 128
conv-
HOG, LBP 128
maxpool
conv-
256
conv-
Coding: local coordinate,
super-vector 256
maxpool
conv-
512
conv-
Pooling, SPM 512
maxpool
conv-
512
conv-
512
Linear SVM maxpool

fc-4096
fc-4096
fc-1000
softmax

[Lin CVPR 2011] [Krizhevsky NIPS 2012]

Figure copyright Alex Krizhevsky, [Szegedy arxiv 2014]


Ilya and Geoffrey Hinton, [Simonyan arxiv 2014] [He ICCV 2015]
Lion image by Swissfrog Figures copyright Alex Krizhevsky, Ilya Sutskever, 2012. Reproduced with permission.
Sutskever, and Geoffrey Hinton, 2012.
is
Reproduced with permission.
licensed under CC BY 3.0

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 48 April 2, 2024


2012 to Present: Deep Learning is Everywhere
Image Classification Image Retrieval

Slide inspiration: Justin Johnson


Figures copyright Alex Krizhevsky, Ilya Sutskever, and Geoffrey Hinton, 2012. Reproduced with permission.

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 49 April 2, 2024


2012 to Present: Deep Learning is Everywhere
Object Detection Image Segmentation

Slide inspiration: Justin Johnson


Ren, He, Girshick, and Sun, 2015 Fabaret et al, 2012

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 50 April 2, 2024


2012 to Present: Deep Learning is Everywhere

Video Classification Activity Recognition

Slide inspiration: Justin Johnson


Simonyan et al, 2014

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 51 April 2, 2024


2012 to Present: Deep Learning is Everywhere
Pose Recognition (Toshev and Szegedy, 2014)

Playing Atari games (Guo et al, 2014)

Slide inspiration: Justin Johnson


Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 52 April 2, 2024
2012 to Present: Deep Learning is Everywhere
Medical Imaging
Whale recognition

Levy et al, 2016 Figure reproduced with perm ission

Slide inspiration: Justin Johnson


Galaxy Classification

Dieleman et al, 2014


From left to right: public dom ain by NASA, usage perm itted by
ESA/Hubble, public dom ain by NASA, and public dom ain. Kaggle Challenge This im age by Christin Khan is in the public dom ain and
originally cam e from the U.S. NOAA.

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 53 April 2, 2024


2012 to Present: Deep Learning is Everywhere

A white teddy bear A man in a baseball A woman is holding


Image Captioning sitting in the grass uniform throwing a ball a cat in her hand
Vinyals et al, 2015
Karpathy and Fei-Fei,
2015

Slide inspiration: Justin Johnson


All im ages are CC0 Public dom ain:
https://pixabay.com /en/luggage-antique-cat-1643010/
https://pixabay.com /en/teddy-plush-bears-cute-teddy-bear-1623436/
https://pixabay.com /en/surf-wave-sum m er-sport-litoral-1668716/
A man riding a wave A cat sitting on a A woman standing on a
on top of a surfboard
https://pixabay.com /en/wom an-fem ale-m odel-portrait-adult-983967/
https://pixabay.com /en/handstand-lake-m editation-496008/
https://pixabay.com /en/baseball-player-shortstop-infield-1045263/
suitcase on the floor beach holding a surfboard
Captions generated by Justin Johnson using Neuraltalk2

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 54 April 2, 2024


2012 to Present: Deep Learning is Everywhere
Results:
spatial, comparative, asymmetrical,
verb, prepositional

taller than

person person
left of
wear on wear

shirt snow ski


Krishna*, Lu*, Bernstein, Fei-Fei, ECCV 2016

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 55 April 2, 2024


Slide inspiration: Justin Johnson
Original im age is CC0 public dom ain
Starry Night and Tree Roots by Van Gogh are in the public dom ain
Bokeh im age is in the public dom ain Mordvinsev et al, 2015
Stylized im ages copyright Justin Johnson, 2017;
reproduced with perm ission Gatys et al, 2016
Figures copyright Justin Johnson, 2015. Reproduced with perm ission. Generated using the Inceptionism approach from a blog post by Google Research.

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 56 April 2, 2024


2012 to Present: Deep Learning is Everywhere

Slide inspiration: Justin Johnson


Karras et al, “Progressive Growing of GANs for Improved Quality, Stability, and Variation”, ICLR 2018

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 57 April 2, 2024


2012 to Present: Deep Learning is Everywhere

Slide inspiration: Justin Johnson


Ramesh et al, “DALL·E: Creating Images from Text”, 2021. https://openai.com/blog/dall-e/

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 58 April 2, 2024


2012 to Present: Deep Learning is Everywhere

Slide inspiration: Justin Johnson


Ramesh et al, “DALL·E: Creating Images from Text”, 2021. https://openai.com/blog/dall-e/

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 59 April 2, 2024


Computation
April 2, 2024 Algorithms Data 60
GFLOP per Dollar
CPU GPU (FP32)
RTX 3080
50
RTX 3090
40
Deep Learning Explosion
30
GTX 1080
20 GeForce
Ti RTX 2080
Ti

Slide inspiration: Justin Johnson


GeForce GTX 580
8800 GTX
10 (AlexNet
)

0
Jan-04 Jul-05 Jan-07 Jul-08 Jan-10 Jul-11 Jan-13 Jul-14 Jan-16 Jul-17 Jan-19 Jul-20
Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 61 April 2, 2024
GFLOP per Dollar
CPU GPU (FP32) GPU (Tensor Core)
200 Recent GPUs have
“Tensor Cores”:
Special hardware
150
for deep learning!
100

Slide inspiration: Justin Johnson


50

0
Jan-04 Jul-05 Jan-07 Jul-08 Jan-10 Jul-11 Jan-13 Jul-14 Jan-16 Jul-17 Jan-19 Jul-20
Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 62 April 2, 2024
AI’s Explosive Growth & Impact

Number of attendance Startups Developing AI Enterprise Application AI


At AI conferences Systems Revenue
Source: The Gradient Source: Crunchbase, VentureSource, Sand Source: Statista
Hill Econometrics

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 63 April 2, 2024


Despite the successes, computer
vision still has a long way to go

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 64 April 2, 2024


Computer Vision Can Cause Harm
Harmful Stereotypes Affect people’s lives

Barocas et al, “The Problem With Bias: Allocative Versus Representational Harms in Machine Learning”, SIGCIS 2017
Kate Crawford, “The Trouble with Bias”, NeurIPS 2017 Keynote Source: https://www.washingtonpost.com/technology/2019/10/22/ai-hiring-face-scanning-algorithm-increasingly-decides-whether-you-deserve-job/
Source: https://twitter.com/jackyalcine/status/615329515909156865 (2015) https://www.hirevue.com/platform/online-video-interviewing-software
Example Credit: Timnit Gebru

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 65 April 2, 2024


Computer Vision Can Save Lives

Slide inspiration: Justin Johnson


Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 66 April 2, 2024
And there is a lot we don’t know how to do

Slide inspiration: Andrej Karpathy


https://fedandfit.com/wp- This image is
content/uploads/2020/06/summer-activities- copyright-free United
for-kids_optimized-scaled.jpeg States government
work

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 67 April 2, 2024


Today’s agenda
• A brief history of computer vision & deep learning

• CS231n overview

Fei-Fei Li & Ehsan Adeli CS231n: Lecture 1 - 68 April 2, 2024

You might also like