Lecture 04 (6hrs) Neural Network and Deep Learning
Xizhao WANG
Big Data Institute
College of Computer Science
Shenzhen University
March 2021
Outline
Gradient Descent Algorithm
BP Algorithm for Feed-Forward Neural Network Model
Convolutional Neural Network
Deep Learning
Machine Learning Lecture – Xizhao Wang Lecture 03: Neural Network and Deep Learning
Gradient Descent Algorithm
1. Definition of Gradient
2. Gradient Descent Algorithm (GDA)
3. Difference between GDA and Newton's Method
4. An example
Lecture 01
Gradient Descent Algorithm
Then $\frac{\partial f}{\partial l} = \tan\theta$.
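$\frac{\partial f}{\partial l} = \frac{\partial f}{\partial x}\cos\alpha + \frac{\partial f}{\partial y}\cos\beta + \frac{\partial f}{\partial z}\cos\gamma,$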
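$\nabla f = \frac{\partial f}{\partial x}\,\mathbf{i} + \frac{\partial f}{\partial y}\,\mathbf{j} + \frac{\partial f}{\partial z}\,\mathbf{k}$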
In the above two images, the values of the function are represented in black and white, with black representing higher values, and the corresponding gradient is represented by blue arrows.
$\operatorname{grad} f(x, y) = \nabla f(x, y) = \frac{\partial f}{\partial x}\,\mathbf{i} + \frac{\partial f}{\partial y}\,\mathbf{j}.$
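$\frac{\partial f}{\partial l} = \frac{\partial f}{\partial x}\cos\alpha + \frac{\partial f}{\partial y}\cos\beta = \left(\frac{\partial f}{\partial x}, \frac{\partial f}{\partial y}\right)\cdot(\cos\alpha, \cos\beta)$
$= \operatorname{grad} f(x, y)\cdot e = |\operatorname{grad} f(x, y)|\,|e|\cos\langle\operatorname{grad} f(x, y), e\rangle.$
If $\cos\langle\operatorname{grad} f(x, y), e\rangle = 1$, then the directional derivative $\frac{\partial f}{\partial l}$ attains its maximum value, which equals the norm of the gradient, i.e.
$|\operatorname{grad} f(x, y)| = \sqrt{\left(\frac{\partial f}{\partial x}\right)^2 + \left(\frac{\partial f}{\partial y}\right)^2}.$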
Then when variables change along the gradient direction, the rate of change of a
function attains its maximum value, which is the norm of the gradient.
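The claim above can be checked numerically. The following is a minimal sketch, assuming a hypothetical function f(x, y) = x^2 + 3y^2 (not from the lecture) and a unit direction e = (cos a, sin a); it scans directions and compares the largest directional derivative with the gradient norm.

```python
import math

def grad_f(x, y):
    # Analytic gradient of the hypothetical f(x, y) = x**2 + 3*y**2
    return (2.0 * x, 6.0 * y)

def directional_derivative(x, y, angle):
    # e = (cos(angle), sin(angle)) is a unit vector, so df/dl = grad f . e
    gx, gy = grad_f(x, y)
    return gx * math.cos(angle) + gy * math.sin(angle)

def best_direction(x, y, steps=3600):
    # Scan unit directions and keep the one with the largest directional derivative
    angles = [2.0 * math.pi * k / steps for k in range(steps)]
    return max(angles, key=lambda a: directional_derivative(x, y, a))

gx, gy = grad_f(1.0, 1.0)
grad_norm = math.hypot(gx, gy)          # norm of the gradient at (1, 1)
grad_angle = math.atan2(gy, gx)         # direction of the gradient
best = best_direction(1.0, 1.0)         # direction found by the scan
max_rate = directional_derivative(1.0, 1.0, best)
```

Up to the resolution of the scan, `best` should coincide with `grad_angle` and `max_rate` with `grad_norm`.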
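$\nabla f(X) = \begin{pmatrix} \frac{\partial f}{\partial x_1} \\ \frac{\partial f}{\partial x_2} \\ \vdots \\ \frac{\partial f}{\partial x_n} \end{pmatrix} = \left(\frac{\partial f}{\partial x_1}, \frac{\partial f}{\partial x_2}, \ldots, \frac{\partial f}{\partial x_n}\right)^T.$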
[Figure: descent path starting from an initial point]
The gradient descent algorithm may converge to a local optimum; the global optimum can be ensured when the loss function is convex.
2. Another termination condition is a maximum number of iterations (time control).
3. The learning rate, alpha, controls the step size. A value that is too small leads to slow convergence (low efficiency), while a value that is too large causes oscillation (non-convergence). The appropriate value depends on the specific function being minimized.
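The effect of the learning rate and of the two kinds of termination condition can be sketched as follows. This is an illustration only: the function f(x) = (x - 3)^2, the gradient tolerance `tol`, and all names are assumptions, not taken from the lecture.

```python
def gradient_descent(grad, x0, alpha, tol=1e-8, max_iter=1000):
    """Minimize a 1-D function given its derivative `grad`.

    Stops when |grad(x)| < tol (assumed first termination condition)
    or after max_iter iterations (iteration-count condition).
    Returns the final x and the number of update steps taken.
    """
    x = x0
    for k in range(max_iter):
        g = grad(x)
        if abs(g) < tol:
            return x, k
        x = x - alpha * g  # step against the gradient, scaled by alpha
    return x, max_iter

def grad(x):
    # Derivative of the hypothetical f(x) = (x - 3)**2, minimized at x = 3
    return 2.0 * (x - 3.0)

x_small, n_small = gradient_descent(grad, 0.0, alpha=0.001)  # too small: slow
x_good, n_good = gradient_descent(grad, 0.0, alpha=0.1)      # converges
x_big, n_big = gradient_descent(grad, 0.0, alpha=1.1)        # too big: diverges
```

With these assumed settings, only the middle run reaches the minimum before the iteration limit; the small step is still far from x = 3, and the large step oscillates with growing amplitude.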
3. Difference between GDA and Newton's method
Its gradient is
then
where $H(x_k)$ is the Hessian matrix of $f(x)$ at the point $x_k$.
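For reference, the usual derivation of the Newton step can be sketched as follows, assuming a second-order Taylor expansion of f around x_k and an invertible Hessian; the exact form used on the slide may differ.

```latex
f(x) \approx f(x_k) + \nabla f(x_k)^T (x - x_k)
       + \tfrac{1}{2} (x - x_k)^T H(x_k) (x - x_k)

\nabla f(x) \approx \nabla f(x_k) + H(x_k)(x - x_k)

\nabla f(x) = 0 \;\Longrightarrow\; x_{k+1} = x_k - H(x_k)^{-1} \nabla f(x_k)
```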
$f_T(x) = f_T(x_n + \Delta x) \approx f(x_n) + f'(x_n)\,\Delta x + \frac{1}{2} f''(x_n)\,\Delta x^2.$
$x_{n+1} = x_n - [Hf(x_n)]^{-1}\,\nabla f(x_n), \quad n \ge 0.$
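In one dimension the Hessian reduces to the second derivative, so the update becomes x_{n+1} = x_n - f'(x_n) / f''(x_n). A minimal sketch, assuming a hypothetical function f(x) = exp(x) - 2x (minimum at x = ln 2) and a starting point chosen only for illustration:

```python
import math

def newton_minimize(d1, d2, x0, tol=1e-10, max_iter=50):
    """1-D Newton iteration for a stationary point of f.

    d1 and d2 are the first and second derivatives of f.
    Returns the final x and the number of update steps taken.
    """
    x = x0
    for k in range(max_iter):
        g = d1(x)
        if abs(g) < tol:
            return x, k
        x = x - g / d2(x)  # Newton step: gradient scaled by inverse curvature
    return x, max_iter

def d1(x):
    return math.exp(x) - 2.0  # f'(x) for the hypothetical f(x) = exp(x) - 2x

def d2(x):
    return math.exp(x)        # f''(x), positive everywhere

x_star, n_iter = newton_minimize(d1, d2, x0=2.0)
```

Unlike gradient descent, no learning rate is chosen by hand here: the second derivative sets the step length, at the cost of having to compute (and, in n dimensions, invert) the Hessian.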
4. An example
The initial value of x is 2, and the step length is 0.1.
The initial value of x is 2.
Gradient Descent Algorithm
The End.
BP Algorithm for Feed-Forward Neural Network Model
Outline
1. Brief Introduction
2. Feedforward NN
3. BP algorithm
4. Notes on BP
5. An application
6. Questions
David Rumelhart
$x = (A^T A)^{-1} A^T b,$
Iterative methods approach the optimal solution gradually through successive update steps.
Gradient descent, which is an iterative method, can be applied to least squares problems.
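The two routes can be compared on a small problem. The sketch below assumes a hypothetical two-parameter fit (data and names are illustrative only): it solves the normal equations directly and then runs gradient descent on (1/2)||Ax - b||^2, whose gradient is A^T(Ax - b).

```python
def normal_equation_2d(A, b):
    # Closed form x = (A^T A)^{-1} A^T b for a two-column matrix A
    s00 = sum(r[0] * r[0] for r in A)
    s01 = sum(r[0] * r[1] for r in A)
    s11 = sum(r[1] * r[1] for r in A)
    t0 = sum(r[0] * bi for r, bi in zip(A, b))
    t1 = sum(r[1] * bi for r, bi in zip(A, b))
    det = s00 * s11 - s01 * s01  # determinant of the 2 x 2 matrix A^T A
    return ((s11 * t0 - s01 * t1) / det, (s00 * t1 - s01 * t0) / det)

def least_squares_gd(A, b, alpha=0.05, iters=2000):
    # Iterative route: repeat x <- x - alpha * A^T (A x - b)
    x = [0.0, 0.0]
    for _ in range(iters):
        residual = [r[0] * x[0] + r[1] * x[1] - bi for r, bi in zip(A, b)]
        g0 = sum(r[0] * e for r, e in zip(A, residual))
        g1 = sum(r[1] * e for r, e in zip(A, residual))
        x = [x[0] - alpha * g0, x[1] - alpha * g1]
    return tuple(x)

# Hypothetical data lying exactly on the line b = 1 + 2 t
A = [[1.0, 0.0], [1.0, 1.0], [1.0, 2.0], [1.0, 3.0]]
b = [1.0, 3.0, 5.0, 7.0]

x_closed = normal_equation_2d(A, b)
x_gd = least_squares_gd(A, b)
```

Both routes should recover the intercept 1 and slope 2; the step size 0.05 is an assumption that happens to be small enough for this particular matrix.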
• Essence of learning:
– Dynamically updating the connection weights
• Learning rule:
– The rule that specifies how the connection weights are updated
2. Feature: the inputs of the algorithm, which are used to describe the
samples.
The key differences between standard gradient descent and stochastic gradient
descent are:
Questions:
Feedforward NN and BP Algorithm
The End.
Convolutional Neural Network
Outline
1. Convolution definition
2. Convolutional Layer
3. Pooling Layer
4. Fully Connected Layer
5. Example
1. Convolution definition
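Convolution is closely related to the Fourier transform. Using the property that the product of the Fourier transforms of two functions equals the Fourier transform of their convolution, many problems in Fourier analysis can be simplified.
The concept of convolution can also be generalized to sequences, measures, and generalized functions.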
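In symbols, and assuming the Fourier transform convention without extra normalization constants, the definition of convolution and the property mentioned above read:

```latex
(f * g)(t) = \int_{-\infty}^{\infty} f(\tau)\, g(t - \tau)\, d\tau

\mathcal{F}\{f * g\} = \mathcal{F}\{f\} \cdot \mathcal{F}\{g\}
```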
2. Convolution layer
1 0 -1 -1 -1 -1
The weights can take other values; what we need to do is train the weights and biases.
The filters can become more intricate as they start incorporating information from an increasingly larger spatial extent.
Local receptive field * filter = 0:
10 10 10       1 0 -1
10 10 10   *   1 0 -1   =   0
10 10 10       1 0 -1
Input rows: 10 10 10 0 0 0. Feature-map rows: 0 30 30 0.
Then slide the local receptive field across the entire input image.
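Sliding the receptive field can be sketched in a few lines. This is an illustration under assumptions: the input is taken to be 6 x 6 with every row equal to 10 10 10 0 0 0, there is no padding, and, as is common in CNNs, the filter is applied without flipping (cross-correlation). The function name is hypothetical.

```python
def conv2d_valid(image, kernel, stride=1):
    """Slide `kernel` over `image` (no padding) and sum the elementwise products."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = (len(image) - kh) // stride + 1
    out_w = (len(image[0]) - kw) // stride + 1
    out = []
    for i in range(out_h):
        row = []
        for j in range(out_w):
            s = 0
            for u in range(kh):
                for v in range(kw):
                    s += image[i * stride + u][j * stride + v] * kernel[u][v]
            row.append(s)
        out.append(row)
    return out

# Assumed 6 x 6 input: bright left half, dark right half
image = [[10, 10, 10, 0, 0, 0] for _ in range(6)]
# Vertical-edge filter with rows 1 0 -1
vertical_edge = [[1, 0, -1] for _ in range(3)]

feature_map = conv2d_valid(image, vertical_edge)
```

Each row of `feature_map` is 0 30 30 0: the response is zero where the receptive field covers a uniform region and large where it straddles the vertical edge.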
Input rows: 10 10 10 0 0 0. Feature-map rows: 0 30 30 0.
[Figure: RGB input (channels = depth) * Filter W1 = Feature1]
Why convolutions?
Parameter sharing
3. Pooling layer
Pooling:
1. Pick a window size (usually 2 or 3).
2. Pick a stride (usually 2).
3. Walk your window across your filtered images.
4. From each window, take the maximum value.
Average pooling (2 x 2 window, stride 2):
…  …  …  …
2  9  1  1        3.75  1.25
2  3  2  3   ->   4     2
5  6  1  2
Less spatial information also means fewer parameters, and so less chance of overfitting.
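The pooling procedure (pick a window, pick a stride, walk the window, reduce each window to one value) can be sketched as follows. The function name and the `mode` switch are hypothetical, and the first row of `fmap` is an assumption chosen to be consistent with the averaged values 3.75 and 1.25.

```python
def pool2d(fmap, size=2, stride=2, mode="max"):
    """Reduce each size x size window to its maximum ("max") or mean ("avg")."""
    out = []
    for i in range(0, len(fmap) - size + 1, stride):
        row = []
        for j in range(0, len(fmap[0]) - size + 1, stride):
            window = [fmap[i + u][j + v] for u in range(size) for v in range(size)]
            row.append(max(window) if mode == "max" else sum(window) / len(window))
        out.append(row)
    return out

fmap = [
    [1, 3, 2, 1],  # assumed row
    [2, 9, 1, 1],
    [2, 3, 2, 3],
    [5, 6, 1, 2],
]

max_pooled = pool2d(fmap)              # 2 x 2 result of max pooling
avg_pooled = pool2d(fmap, mode="avg")  # 2 x 2 result of average pooling
```

A 4 x 4 map shrinks to 2 x 2 in both modes, which is the reduction in spatial information referred to above.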
4. Fully connected layer
5. Example
For example:
Say whether a picture is of an X or an O.
A two-dimensional array of pixels -> CNN -> X or O
-1 -1 -1 -1 -1 -1 -1 -1 -1
-1 1 -1 -1 -1 -1 -1 1 -1
-1 -1 1 -1 -1 -1 1 -1 -1
-1 -1 -1 1 -1 1 -1 -1 -1
-1 -1 -1 -1 1 -1 -1 -1 -1
-1 -1 -1 1 -1 1 -1 -1 -1
-1 -1 1 -1 -1 -1 1 -1 -1
-1 1 -1 -1 -1 -1 -1 1 -1
-1 -1 -1 -1 -1 -1 -1 -1 -1
[Figure: stack of feature maps extracted by three filters; the size of the stack is the depth]
Max pooling
Classifier
Convolutional Layer: another story based on filters
Filter 1: 3 x 3 x channel tensor
Filter 2: 3 x 3 x channel tensor
……
6 x 6 image
(The values in the filters are unknown parameters.)
Convolutional Layer (stride = 1)
Filter 1:
 1 -1 -1
-1  1 -1
-1 -1  1
6 x 6 image:
1 0 0 0 0 1
0 1 0 0 1 0
0 0 1 1 0 0
1 0 0 0 1 0
0 1 0 0 1 0
0 0 1 0 1 0
Feature map (Filter 1):
 3 -1 -3 -1
-3  1  0 -3
-3 -3  0  1
 3 -2 -2 -1
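The Filter 1 computation on this slide can be reproduced with a few lines of NumPy. This is a minimal sketch, not the lecture's own code: the helper name `conv2d_valid` is hypothetical, and it assumes no padding and no kernel flip (the usual CNN convention).

```python
import numpy as np

# 6 x 6 binary image from the slide.
image = np.array([
    [1, 0, 0, 0, 0, 1],
    [0, 1, 0, 0, 1, 0],
    [0, 0, 1, 1, 0, 0],
    [1, 0, 0, 0, 1, 0],
    [0, 1, 0, 0, 1, 0],
    [0, 0, 1, 0, 1, 0],
])

# Filter 1 from the slide (it responds to a top-left to bottom-right diagonal).
filter_1 = np.array([
    [ 1, -1, -1],
    [-1,  1, -1],
    [-1, -1,  1],
])

def conv2d_valid(img, kernel, stride=1):
    """Slide the kernel over img without padding and sum the elementwise products."""
    kh, kw = kernel.shape
    out_h = (img.shape[0] - kh) // stride + 1
    out_w = (img.shape[1] - kw) // stride + 1
    out = np.zeros((out_h, out_w), dtype=img.dtype)
    for i in range(out_h):
        for j in range(out_w):
            r, c = i * stride, j * stride
            out[i, j] = np.sum(img[r:r + kh, c:c + kw] * kernel)
    return out

feature_map_1 = conv2d_valid(image, filter_1)
print(feature_map_1)
# [[ 3 -1 -3 -1]
#  [-3  1  0 -3]
#  [-3 -3  0  1]
#  [ 3 -2 -2 -1]]
```

The value 3 appears in the top-left and bottom-left corners, which is where the image contains the diagonal pattern that Filter 1 encodes.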
Convolutional Layer (stride = 1)
Filter 2:
-1  1 -1
-1  1 -1
-1  1 -1
Do the same process for every filter.
6 x 6 image:
1 0 0 0 0 1
0 1 0 0 1 0
0 0 1 1 0 0
1 0 0 0 1 0
0 1 0 0 1 0
0 0 1 0 1 0
Feature map (Filter 1):
 3 -1 -3 -1
-3  1  0 -3
-3 -3  0  1
 3 -2 -2 -1
Feature map (Filter 2):
-1 -1 -1 -1
-1 -1 -2  1
-1 -1 -2  1
-1  0 -4  3
Convolutional Layer
[Figure: the 4 x 4 feature maps of Filter 1 and Filter 2, stacked.]
Convolution with 64 filters → "Image" with 64 channels → Convolution → ……
Multiple Convolutional Layers
[Figure: the 4 x 4 feature maps of Filter 1 and Filter 2, stacked.]
Convolution with 64 filters → "Image" with 64 channels → Convolution → ……
Filter: 3 x 3 x 64
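The "Filter: 3 x 3 x 64" note means that a filter in the second convolutional layer spans all 64 channels of its input. The sketch below illustrates this with random data; the array sizes other than 3 x 3 x 64 and the function name `conv_multichannel` are assumptions made for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical output of a first layer with 64 filters: 64 channels of size 4 x 4.
channels, height, width = 64, 4, 4
feature_maps = rng.standard_normal((channels, height, width))

# One second-layer filter: a 3 x 3 window across all 64 channels.
second_layer_filter = rng.standard_normal((channels, 3, 3))

def conv_multichannel(x, kernel):
    """Valid, stride-1 convolution of a (C, H, W) input with one (C, kh, kw) filter."""
    _, kh, kw = kernel.shape
    out_h = x.shape[1] - kh + 1
    out_w = x.shape[2] - kw + 1
    out = np.zeros((out_h, out_w))
    for i in range(out_h):
        for j in range(out_w):
            # Sum over the window and over every channel: one number per position.
            out[i, j] = np.sum(x[:, i:i + kh, j:j + kw] * kernel)
    return out

out = conv_multichannel(feature_maps, second_layer_filter)
print(out.shape)  # (2, 2): a single 2-D feature map per filter
```

Each filter still produces one 2-D feature map, so the channel count of the next layer equals the number of filters, not the number of input channels.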
Multiple Convolutional Layers
6 x 6 image → Convolution with 64 filters → feature maps → Convolution → ……
Comparison of Two Stories
Filter (3 x 3 x channel tensor):
 1 -1 -1
-1  1 -1
-1 -1  1
Receptive field (ignore bias in this slide)
The neurons with different receptive fields share the parameters.
[Figure: the 6 x 6 image and a bias input of 1, connected to neurons with different receptive fields.]
Each filter convolves …
[Figure: an image labeled "bird" and its subsampled version, also labeled "bird".]
Pooling – Max Pooling
Filter 1:
 1 -1 -1
-1  1 -1
-1 -1  1
Feature map (Filter 1):
 3 -1 -3 -1
-3  1  0 -3
-3 -3  0  1
 3 -2 -2 -1
Filter 2:
-1  1 -1
-1  1 -1
-1  1 -1
Feature map (Filter 2):
-1 -1 -1 -1
-1 -1 -2  1
-1 -1 -2  1
-1  0 -4  3
Convolutional Layers + Pooling
Convolution → Pooling → Repeat ……
Pooled feature map (Filter 1):
 3  0
 3  1
Pooled feature map (Filter 2):
-1  1
 0  3
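The pooled values on this slide are consistent with taking the maximum of each non-overlapping 2 x 2 block of the two 4 x 4 feature maps. A minimal NumPy sketch, assuming that window size (the slide does not state it explicitly) and using a hypothetical helper name `max_pool_2x2`:

```python
import numpy as np

# Feature maps produced by Filter 1 and Filter 2 on the 6 x 6 image.
feature_map_1 = np.array([
    [ 3, -1, -3, -1],
    [-3,  1,  0, -3],
    [-3, -3,  0,  1],
    [ 3, -2, -2, -1],
])
feature_map_2 = np.array([
    [-1, -1, -1, -1],
    [-1, -1, -2,  1],
    [-1, -1, -2,  1],
    [-1,  0, -4,  3],
])

def max_pool_2x2(fmap):
    """Take the maximum of each non-overlapping 2 x 2 block (assumes even height and width)."""
    h, w = fmap.shape
    blocks = fmap.reshape(h // 2, 2, w // 2, 2)
    return blocks.max(axis=(1, 3))

pooled_1 = max_pool_2x2(feature_map_1)
pooled_2 = max_pool_2x2(feature_map_2)
print(pooled_1)
# [[3 0]
#  [3 1]]
print(pooled_2)
# [[-1  1]
#  [ 0  3]]
```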
The whole CNN
Convolution → Pooling → Convolution → Pooling → Flatten → Fully Connected Layers → softmax → cat, dog, ……
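The pipeline on this slide (Convolution, Pooling, Flatten, Fully Connected Layers, softmax) can be sketched as a forward pass in NumPy. Everything here is illustrative: the weights are random rather than learned, only one Convolution + Pooling round is applied because the toy input is 6 x 6, and there is a single fully connected layer with two output classes standing in for "cat" and "dog".

```python
import numpy as np

rng = np.random.default_rng(0)

def conv_layer(img, filters):
    """Valid, stride-1 convolution of a 2-D image with each 3 x 3 filter."""
    n, kh, kw = filters.shape
    out_h, out_w = img.shape[0] - kh + 1, img.shape[1] - kw + 1
    out = np.zeros((n, out_h, out_w))
    for k in range(n):
        for i in range(out_h):
            for j in range(out_w):
                out[k, i, j] = np.sum(img[i:i + kh, j:j + kw] * filters[k])
    return out

def max_pool(fmaps, size=2):
    """Non-overlapping max pooling applied to every channel."""
    n, h, w = fmaps.shape
    return fmaps.reshape(n, h // size, size, w // size, size).max(axis=(2, 4))

def softmax(z):
    e = np.exp(z - z.max())  # subtract the max for numerical stability
    return e / e.sum()

image = rng.integers(0, 2, size=(6, 6)).astype(float)  # toy binary image
filters = rng.standard_normal((2, 3, 3))                # 2 filters (random, not learned)
fc_weights = rng.standard_normal((2, 8))                # 2 classes x 8 flattened features
fc_bias = np.zeros(2)

features = max_pool(conv_layer(image, filters))  # shape (2, 2, 2)
flat = features.reshape(-1)                      # Flatten: shape (8,)
probs = softmax(fc_weights @ flat + fc_bias)     # class probabilities
print(probs)
```

In a trained network the filter values and fully connected weights would be learned by gradient descent; here they only demonstrate how the shapes flow through the layers.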
Application: Playing Go
Network input: 19 x 19 matrix (image), or a 19 x 19 vector
  Black: 1, white: -1, none: 0
  48 channels in Alpha Go
Network output: next move (19 x 19 positions), i.e. 19 x 19 classes
A fully-connected network can be used, but CNN performs much better.
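The board encoding described on this slide is easy to make concrete. The sketch below uses the 1 / -1 / 0 encoding from the slide; the stone positions and the helper names `position_to_class` and `class_to_position` are made up for illustration.

```python
import numpy as np

BOARD = 19

# Encode the board as a 19 x 19 matrix: black = 1, white = -1, none = 0.
board = np.zeros((BOARD, BOARD), dtype=int)
board[3, 3] = 1      # a black stone (hypothetical position)
board[15, 15] = -1   # a white stone (hypothetical position)

# Fully connected view: flatten the board into a vector of 19 x 19 = 361 entries.
as_vector = board.reshape(-1)

def position_to_class(row, col):
    """Map a board position to one of the 19 x 19 output classes."""
    return row * BOARD + col

def class_to_position(index):
    """Map an output class index back to a (row, col) board position."""
    return divmod(index, BOARD)

print(as_vector.shape)           # (361,)
print(position_to_class(3, 3))   # 60
print(class_to_position(60))     # (3, 3)
```

A CNN would consume `board` directly as an image (with extra channels for additional features), which preserves the spatial neighbourhood that flattening discards.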
Why CNN for Go playing?
Speech: https://dl.acm.org/doi/10.1109/TASLP.2014.2339736
Natural Language Processing: https://www.aclweb.org/anthology/S15-2079/
Convolutional Neural Networks
The End.
1. Introduction
Gradient Descent Algorithm 2. What is Deep Learning
BP Algorithm for Feed-Forward Neural Network Model 3. Partial Connections
Convolutional Neural Network 4. Initial Weights
Deep Learning 5. Biological & Theoretical Justification
6. Looking Forward
Outline
Deep Learning
Deep Learning
1. Introduction
2. What is Deep Learning
3. Partial connections
4. Initial weights
5. Biological & Theoretical Justification
6. Looking Forward
[Figure: multi-layer network with +1 bias units.]
Non-convex
Need a lot of tricks to play with
Hard to do theoretical analysis
72%, 2010
74%, 2011
85%, 2012
The Architecture
5. Further thinking about DL
Fundamental structures
Multi-scale fusion (Inception)
Randomly connected
Pooling:
1. Pick a window size (usually 2 or 3).
2. Pick a stride (usually 2).
3. Walk your window across your filtered images.
4. From each window, take the maximum value.
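The four pooling steps can be sketched as a small function with the window size and stride as parameters. This is an illustrative implementation, not the lecture's code: the name `max_pool` and the 5 x 5 test array are made up, and windows that would run past the edge are simply dropped, which is one common convention among several.

```python
import numpy as np

def max_pool(filtered, window=2, stride=2):
    """Walk a window across a 2-D filtered image and keep the maximum of each window."""
    out_h = (filtered.shape[0] - window) // stride + 1
    out_w = (filtered.shape[1] - window) // stride + 1
    out = np.empty((out_h, out_w), dtype=filtered.dtype)
    for i in range(out_h):
        for j in range(out_w):
            r, c = i * stride, j * stride          # top-left corner of the window
            out[i, j] = filtered[r:r + window, c:c + window].max()
    return out

# Hypothetical 5 x 5 "filtered image" holding the values 0..24 row by row.
filtered = np.arange(25).reshape(5, 5)
pooled = max_pool(filtered, window=3, stride=2)
print(pooled)
# [[12 14]
#  [22 24]]
```

With a window of 3 and a stride of 2 the windows overlap by one row or column, so some input values contribute to more than one output.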
Auto-encoder Neural Network
[Figure: auto-encoder network with +1 bias units.]
Error
Sparse Auto-Encoder
Input → Encoder → Code → Decoder → Prediction
Error
Sparsity Penalty
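The diagram's quantities (Input, Code, Prediction, Error, Sparsity Penalty) can be combined into one training objective. The sketch below is one possible instance under stated assumptions: a sigmoid encoder, a linear decoder, squared reconstruction error, and an L1 penalty on the code. The slide does not specify any of these choices, and the sizes and the `sparsity_weight` value are arbitrary.

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# Hypothetical sizes: 8 inputs compressed to a 4-dimensional code.
n_input, n_code = 8, 4
W_enc = rng.standard_normal((n_code, n_input)) * 0.1
b_enc = np.zeros(n_code)
W_dec = rng.standard_normal((n_input, n_code)) * 0.1
b_dec = np.zeros(n_input)

def sparse_autoencoder_loss(x, sparsity_weight=0.1):
    """Return (loss, code, prediction) for one input vector x."""
    code = sigmoid(W_enc @ x + b_enc)        # Encoder: Input -> Code
    prediction = W_dec @ code + b_dec        # Decoder: Code -> Prediction
    error = np.sum((prediction - x) ** 2)    # Error between Prediction and Input
    # Sparsity Penalty: L1 norm of the code (an assumed choice of penalty).
    penalty = sparsity_weight * np.sum(np.abs(code))
    return error + penalty, code, prediction

x = rng.standard_normal(n_input)
loss, code, prediction = sparse_autoencoder_loss(x)
print(loss)
```

Minimizing this loss by gradient descent trades reconstruction quality against how many code units are active; other penalties, such as a KL-divergence term on average activations, are also used in practice.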
Stacked Auto-Encoders
Visible variables v
Visible variables v
Similarly:
Hidden variables h
W
Visible variables v
Maximize (penalized) log-likelihood objective:
a a
x X’
Why Hierarchy?
Biological:
object models
object parts
edges
pixels
Looking Forward
Plan:
propose explanatory hypotheses
observe the effects of pre-training
infer its role & level of agreement with our hypotheses
Regularization hypothesis:
Optimization hypothesis:
Unsupervised initialization near better local minimum of P(y|x)
Reach lower local minimum not achievable by random initialization.
Looking Forward
Open Questions …
Deep Learning
The End