CS 444: Deep Learning for Computer Vision

D. Hockney, Pool with two figures, 1972

https://slazebni.cs.illinois.edu/spring23/
Lecture overview
• About the class
• Milestones of deep learning
• Recent successes and origins
• Visual recognition
• Natural language understanding
• Generative modeling
• Games
• Robotics
• Topics to be covered in class
A few historical milestones
• 1958: Rosenblatt’s perceptron

Frank Rosenblatt (1928-1971)

A few historical milestones
• 1958: Rosenblatt’s perceptron
• 1969: Minsky and Papert Perceptrons book
• Fascinating reading: M. Olazaran, A Sociological Study
of the Official History of the Perceptrons Controversy,
Social Studies of Science, 1996
A few historical milestones
• 1958: Rosenblatt’s perceptron
• 1969: Minsky and Papert Perceptrons book
• 1980: Fukushima’s Neocognitron
• Video (short version)
• Inspired by the findings of Hubel & Wiesel
about the hierarchical organization
of the visual cortex in cats and monkeys (1959-1977)
Kunihiko Fukushima

Image source
A few historical milestones
• 1958: Rosenblatt’s perceptron
• 1969: Minsky and Papert Perceptrons book
• 1980: Fukushima’s Neocognitron
• 1986: Back-propagation
• Origins in control theory and optimization: Kelley (1960), Dreyfus (1962),
Bryson & Ho (1969), Linnainmaa (1970)
• Application to neural networks: Werbos (1974)
• Popularized by Rumelhart, Hinton & Williams (1986)
A few historical milestones
• 1958: Rosenblatt’s perceptron
• 1969: Minsky and Papert Perceptrons book
• 1980: Fukushima’s Neocognitron
• 1986: Back-propagation
• 1989 – 1998: Convolutional neural networks
• LeNet to LeNet-5

Yann LeCun
2018 ACM Turing Award winner
(with Hinton and Bengio)
A few historical milestones
• 1958: Rosenblatt’s perceptron
• 1969: Minsky and Papert Perceptrons book
• 1980: Fukushima’s Neocognitron
• 1986: Back-propagation
• 1989 – 1998: Convolutional neural networks
• 2012: AlexNet

Photo source
A few historical milestones
• 1958: Rosenblatt’s perceptron
• 1969: Minsky and Papert Perceptrons book
• 1980: Fukushima’s Neocognitron
• 1986: Back-propagation
• 1989 – 1998: Convolutional neural networks
• 2012: AlexNet
• Fascinating reading: The secret auction that set off the race for AI supremacy,
Wired, 3/16/2021
A few historical milestones
• 1958: Rosenblatt’s perceptron
• 1969: Minsky and Papert Perceptrons book
• 1980: Fukushima’s Neocognitron
• 1986: Back-propagation
• 1989 – 1998: Convolutional neural networks
• 2012: AlexNet
• 2012 – present: deep learning explosion

Source, via J. Johnson

Lecture overview
• About the class
• Milestones of deep learning
• Progress in the last decade
• Visual recognition
• Natural language understanding
• Generative modeling
• Games
• Robotics
Recognition: ImageNet Challenge

[Figure: ILSVRC results over time, marking the period before deep learning, convolutional architectures, and the human baseline]

Figure source
ImageNet is obsolete?

“Programmer”

K. Yang, K. Qinami, L. Fei-Fei, J. Deng, O. Russakovsky, Towards Fairer Datasets: Filtering and Balancing the Distribution of the People Subtree in the ImageNet Hierarchy, Conference on Fairness, Accountability, and Transparency (FAccT), 2020

L. Beyer et al. Are we done with ImageNet? arXiv:2006.07159, 2020
Object instance segmentation

K. He, G. Gkioxari, P. Dollar, and R. Girshick, Mask R-CNN, ICCV 2017 (Best Paper Award)
Recognition on my iPhone
Recognition: Concerns

How China Uses High-Tech Surveillance to Subdue Minorities – New York Times, 5/22/2019
The Secretive Company That Might End Privacy As We Know It – New York Times, 1/18/2020
Wrongfully Accused by an Algorithm – New York Times, 6/24/2020
Lecture overview
• About the class
• Milestones of deep learning
• Progress in the last decade
• Visual recognition
• Natural language understanding
Neural machine translation

Google Neural Machine Translation (GNMT)
Y. Wu et al. Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation. arXiv 2016
https://mobile.nytimes.com/2016/12/14/magazine/the-great-ai-awakening.html

Transformers (BLEU score)
Previous system (before deep learning): PBMT (2014): 37 BLEU
A. Vaswani et al. Attention is all you need. NeurIPS 2017
Figure source
Large language models: Google BERT
• Self-supervised pre-training task: masked token prediction
Bidirectional Encoder Representations from Transformers (BERT)

Figure source

J. Devlin et al. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. NAACL 2019
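
As a rough illustration of masked token prediction, the sketch below hides a random subset of tokens and records the originals as prediction targets. The names `mask_tokens`, `mask_prob`, and `seed` are assumptions made for this example. The 15% rate follows the BERT paper, but the paper's extra cases (replacing a chosen token with a random one, or leaving it unchanged) are left out for brevity.

```python
import random

MASK = "[MASK]"  # placeholder string standing in for the mask token


def mask_tokens(tokens, mask_prob=0.15, seed=0):
    """Return (masked_input, targets) for masked token prediction.

    targets[i] holds the original token wherever a mask was applied,
    and None wherever the input token was left visible.
    """
    rng = random.Random(seed)
    masked, targets = [], []
    for tok in tokens:
        if rng.random() < mask_prob:
            masked.append(MASK)   # the model sees the mask ...
            targets.append(tok)   # ... and must predict the hidden token
        else:
            masked.append(tok)
            targets.append(None)  # no loss at visible positions
    return masked, targets


if __name__ == "__main__":
    sentence = "the quick brown fox jumps over the lazy dog".split()
    inputs, labels = mask_tokens(sentence, mask_prob=0.3)
    print(inputs)
    print(labels)
```

Because the model sees the tokens on both sides of each mask, the context is bidirectional, which is the "B" in BERT.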
Large language models: OpenAI GPT
• Self-supervised pre-training task: next token prediction

Figure source

GPT: A. Radford et al. Improving language understanding with unsupervised learning. 2018
GPT-2 (1.5B parameters): A. Radford et al. Language models are unsupervised multitask learners. 2019
GPT-3 (175B parameters): T. Brown et al. Language models are few-shot learners. NeurIPS 2020 (Best Paper Award)
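
Next token prediction can be sketched in a few lines: every prefix of a token sequence becomes a training context, and the token that follows it becomes the target. The function name `next_token_pairs` is an assumption for illustration, and real models train on all positions of a sequence in parallel rather than materializing each prefix.

```python
def next_token_pairs(tokens):
    """Return (context, target) pairs: each prefix predicts the token after it."""
    return [(tokens[:i], tokens[i]) for i in range(1, len(tokens))]


if __name__ == "__main__":
    for context, target in next_token_pairs(["the", "cat", "sat"]):
        print(context, "->", target)
    # prints:
    # ['the'] -> cat
    # ['the', 'cat'] -> sat
```

Unlike masked token prediction, the context here only ever looks to the left, which is what allows the same model to generate text one token at a time.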
Stochastic parrots or sentient entities?*
*Asking either question will get you fired from Google

https://www.technologyreview.com/2020/12/04/1013294/google-ai-ethics-research-paper-forced-out-timnit-gebru/
https://www.cnn.com/2022/07/23/business/google-ai-engineer-fired-sentient/index.html

E. Bender et al., On the dangers of stochastic parrots: Can language models be too big? FAccT 2021
InstructGPT and ChatGPT
Reinforcement Learning from Human Feedback (RLHF)

L. Ouyang et al. Training language models to follow instructions with human feedback. NeurIPS 2022
https://openai.com/blog/chatgpt/
ChatGPT

Generated on 1/10/2023
ChatGPT: Concerns

https://www.nytimes.com/2023/01/16/technology/chatgpt-artificial-intelligence-universities.html
ChatGPT: Concerns – and opportunities

Some Google search results as of 1/10/2023

Lecture overview
• About the class
• Milestones of deep learning
• Progress in the last decade
• Vision
• Language
• Generative modeling
Progress in face generation
Progress in general category generation

GAN-generated dogs in 2017 (Source: EBGAN)
GAN-generated dogs in 2018 (Source: BigGAN)

Text-to-image generation: OpenAI DALL-E

A. Ramesh et al., Zero-Shot Text-to-Image Generation, ICML 2021

https://openai.com/blog/dall-e/
Text-to-image generation: OpenAI DALL-E
• Underlying technology: autoregressive generation using a
transformer decoder

Text prompt encoding (256 tokens), then image encoding (1024 = 32x32 tokens), decoded to a 256x256 image

A. Ramesh et al., Zero-Shot Text-to-Image Generation, ICML 2021

https://openai.com/blog/dall-e/
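
The token counts on this slide imply a fixed sequence layout, which the short sketch below spells out. All names are hypothetical, and the row-major (raster) ordering of image tokens is an assumption made for illustration; only the counts 256, 1024 = 32x32, and 256x256 come from the slide.

```python
TEXT_TOKENS = 256            # text prompt encoding
GRID = 32                    # image tokens form a 32x32 grid
IMAGE_TOKENS = GRID * GRID   # 1024 image tokens
IMAGE_SIZE = 256             # decoded image is 256x256 pixels
PIXELS_PER_TOKEN_SIDE = IMAGE_SIZE // GRID  # each token covers an 8x8 patch


def sequence_length():
    """Total number of tokens the decoder generates over: text, then image."""
    return TEXT_TOKENS + IMAGE_TOKENS


def image_token_position(seq_index):
    """Map a sequence index in the image part to (row, col) in the token grid.

    Assumes image tokens follow the text tokens in row-major order.
    """
    if not TEXT_TOKENS <= seq_index < sequence_length():
        raise ValueError("index is not in the image part of the sequence")
    return divmod(seq_index - TEXT_TOKENS, GRID)


if __name__ == "__main__":
    print(sequence_length())           # 1280
    print(image_token_position(256))   # (0, 0)
    print(image_token_position(1279))  # (31, 31)
```

Autoregressive generation then means sampling the 1024 image tokens one at a time, each conditioned on the text tokens and on the image tokens produced so far.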
Text-to-image generation: OpenAI DALL-E 2

A. Ramesh et al. Hierarchical text-conditional image generation with CLIP latents. 2022
Diffusion models
• Idea: convert noise to an image in multiple passes

J. Ho et al. Denoising diffusion probabilistic models. NeurIPS 2020

Blog introduction: https://lilianweng.github.io/posts/2021-07-11-diffusion-models/
Diffusion models
• Idea: convert noise to an image in multiple passes
• Proliferation of models: Imagen, Stable Diffusion, Midjourney, …
• Text-to-video, text-to-3D, …
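
To make "convert noise to an image in multiple passes" concrete, here is a minimal sketch of a DDPM-style sampling loop in plain Python. It is a toy under stated assumptions, not the implementation of any of the models above: the "image" is a short list of floats, the function names are hypothetical, and `oracle_for` stands in for a trained noise-prediction network by computing the exact noise for a single known target. The linear beta schedule and the choice of step noise follow Ho et al.

```python
import math
import random


def make_schedule(num_steps, beta_start=1e-4, beta_end=0.02):
    """Linear noise schedule: per-step alphas and their running products."""
    betas = [beta_start + (beta_end - beta_start) * i / (num_steps - 1)
             for i in range(num_steps)]
    alphas = [1.0 - b for b in betas]
    alpha_bars, prod = [], 1.0
    for a in alphas:
        prod *= a
        alpha_bars.append(prod)
    return alphas, alpha_bars


def sample(predict_noise, dim, num_steps=50, seed=0):
    """Start from Gaussian noise and denoise it over num_steps passes."""
    rng = random.Random(seed)
    alphas, alpha_bars = make_schedule(num_steps)
    x = [rng.gauss(0.0, 1.0) for _ in range(dim)]  # pure noise
    for t in reversed(range(num_steps)):
        a, ab = alphas[t], alpha_bars[t]
        eps = predict_noise(x, t, ab)              # estimated noise in x
        coef = (1.0 - a) / math.sqrt(1.0 - ab)
        mean = [(xi - coef * ei) / math.sqrt(a) for xi, ei in zip(x, eps)]
        sigma = math.sqrt(1.0 - a) if t > 0 else 0.0  # no noise on last pass
        x = [m + sigma * rng.gauss(0.0, 1.0) for m in mean]
    return x


def oracle_for(target):
    """Stand-in for a trained network: exact noise when data is one point."""
    def predict_noise(x, t, alpha_bar):
        s, n = math.sqrt(alpha_bar), math.sqrt(1.0 - alpha_bar)
        return [(xi - s * ti) / n for xi, ti in zip(x, target)]
    return predict_noise


if __name__ == "__main__":
    target = [0.5, -1.0, 2.0]
    print(sample(oracle_for(target), dim=len(target)))
```

With the oracle, the loop recovers the target from pure noise; in a real model, `predict_noise` is a neural network trained to estimate the noise that was added to training images.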
Diffusion models: The next gold rush?

https://www.foley.com/en/insights/publications/2022/12/venture-capital-investors-betting-generative-ai
Generative modeling: Concerns
• Deepfakes
https://www.wired.com/story/zelensky-deepfake-facebook-twitter-playbook/
• Biases, toxic content
DALL-E 2 images of lawyers, flight attendants (source)
• AI replacing artists?
AI-generated work wins first prize at art fair
Lecture overview
• About the class
• Milestones of deep learning
• Progress in the last decade
• Vision
• Language
• Generative modeling
• Games
Games

• 2013: DeepMind uses deep reinforcement learning to beat humans at some Atari games
• 2016: DeepMind’s AlphaGo system beats Go grandmaster Lee Sedol 4-1
• 2017: AlphaZero learns to play Go and chess from scratch
• 2019: DeepMind’s StarCraft 2 AI is better than 99.8 p
Lecture overview
• About the class
• Milestones of deep learning
• Progress in the last decade
• Vision
• Language
• Generative modeling
• Games
• Robotics
Sensorimotor learning

Overview video, training video

S. Levine, C. Finn, T. Darrell, P. Abbeel, End-to-end training of deep visuomotor policies, JMLR 2016
Sensorimotor learning

A. Agarwal, A. Kumar, J. Malik, and D. Pathak. Legged Locomotion in Challenging Terrains using Egocentric Vision. CoRL 2022
Lecture overview
• About the class
• Milestones of deep learning
• Progress in the last decade
• Vision
• Language
• Generative modeling
• Games
• Robotics
• Topics to be covered in class
Topics to be covered in class
• ML basics, linear classifiers
• Multilayer neural networks, backpropagation
• Convolutional networks for classification
• Networks for detection, dense prediction
• Self-supervised learning
• Generative models (GANs, image-to-image translation, diffusion models)
• Recurrent models
• Transformers, large language models, transformers for vision
• Deep reinforcement learning
Fascinating historical reading
• 1943: McCulloch and Pitts neurons
• The Man Who Tried to Redeem the World with Logic, Nautilus, 2/5/2015

Walter Pitts (1923-1969)

Fascinating historical reading
• 1959: First pattern recognition benchmark, training-test split

1500 characters (26 letters, 10 digits from 50 writers), 12x12 resolution, stored on IBM 704 punch cards
Bill Highleyman and Louis Kamentsky, Bell Labs
