Lecture Notes 01
Lecture Notes 01
Xuming He
SIST, ShanghaiTech
Fall, 2020
Grading policy
Pre-requisite / Syllabus
https://d2l.ai/
TAs:
Haozhe Wang, Qiuyue Wang, Guoxing Sun, Yannan He, Quan
Meng, Yinwenqi Jiang
Plagiarism punishment
When one student copies from another student, both students
are responsible
Zero point on the assignment or exam in question
Repeated violation will result in an F grade for this course as well
as further discipline at the school/university level
Pre-requisite
Proficiency in Python
All class assignments will be in Python (and use numpy)
A Python tutorial available on Piazza
Calculus, Linear Algebra, Probability and Statistics
Undergrad course level
Equivalent knowledge of Andrew Ng’s CS229 (Machine
Learning)
Formulating cost functions
Taking derivatives
Performing optimization with gradient descent
Will be evaluated in next quiz (Wednesday)
9/7/2020 16
A data-driven approach
Each task as a mapping function (or a model)
Mapping function
Input data Expected output
Mapping function
x: input data
y: expected output
: parameters to be estimated
9/7/2020 18
What is deep learning?
Using deep neural networks as the mapping function
9/7/2020 19
What is deep learning?
Using deep neural networks as the mapping function
9/7/2020 20
Why deep networks?
Inspiration from visual cortex
9/7/2020 21
Why deep networks?
A deep architecture can represent certain functions
(exponentially) more compactly
Learning a rich representation of input data
9/7/2020 22
Recent success with DL
Recent
Somesuccess with
recent neuralwith
success networks
neural networks
Steel drum
The Im ag e C lassification C halleng e:
1,000 ob ject classes
1,431,167 im ag es
9/7/2020 23
Summary: Why deep learning?
One of the major thrust areas recently in various pattern
recognition, prediction and data analysis
Efficient representation of data and computation
Other key factors: large datasets and hardware
9/7/2020 24
Is it alchemy?
Future development
Its limitation and weakness?
Supervised learning
Artificial neurons
Sufficient condition
Hessian is positive definite
Maximum likelihood
Perceptron algorithm
https://www.youtube.com/watch?v=m0rHZ_RDdyQ
9/7/2020 Xuming He – CS 280 Deep Learning 53
Artificial Neuron
Biological inspiration
9/7/2020 55
Activation functions
https://towardsdatascience.com/perceptron-explanation-implementation-and-a-visual-example-3c8e76b4e2d1
Task formulation
Next time
Basic neural networks
First Quiz on prerequisite