Lec 7

The document discusses non-Bayes classifiers, including linear discriminants and neural networks. It describes discriminant functions and how linear discriminant functions use hyperplanes as decision surfaces, covers the perceptron cost function and perceptron algorithm, and shows how linear discriminants can be modeled as perceptrons. It then covers artificial neurons, multilayer perceptrons, their discriminating ability, and training by gradient descent and the backpropagation algorithm.

Non-Bayes classifiers.

Linear discriminants,
neural networks.
Discriminant functions (1)

Bayes classification rule: decide $\omega_1$ if $P(\omega_1 \mid x) - P(\omega_2 \mid x) > 0$, otherwise decide $\omega_2$.

Instead we might try to find a function $f_{\omega_1,\omega_2}(x)$ such that we decide $\omega_1$ if $f_{\omega_1,\omega_2}(x) > 0$, otherwise $\omega_2$.

$f_{\omega_1,\omega_2}(x)$ is called the discriminant function.

$\{x \mid f_{\omega_1,\omega_2}(x) = 0\}$ is the decision surface.
Discriminant functions (2)

[Figure: two example decision surfaces separating Class 1 from Class 2]

Linear discriminant function:

$f_{\omega_1,\omega_2}(x) = w^T x + w_0$

The decision surface is the hyperplane $w^T x + w_0 = 0$.
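As a quick illustration, here is a minimal Python sketch of classifying a point by the sign of $w^T x + w_0$; the weight vector and bias values below are hypothetical, not taken from the lecture.

```python
import numpy as np

# Hypothetical example weights for a 2-D linear discriminant.
w = np.array([2.0, -1.0])   # normal vector of the decision hyperplane
w0 = 0.5                    # bias term

def classify(x):
    """Decide class 1 if f(x) = w^T x + w0 > 0, otherwise class 2."""
    return 1 if w @ x + w0 > 0 else 2

print(classify(np.array([1.0, 0.0])))   # f(x) = 2.5  -> class 1
print(classify(np.array([0.0, 2.0])))   # f(x) = -1.5 -> class 2
```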
Linear discriminant – perceptron cost function

Use augmented vectors: replace $w \leftarrow [w^T, w_0]^T$ and $x \leftarrow [x^T, 1]^T$.

The decision function is now $f_{\omega_1,\omega_2}(x) = w^T x$ and the decision surface is $w^T x = 0$.

Perceptron cost function:

$J(w) = \sum_{x} \delta_x \, w^T x$

where
$\delta_x = -1$, if $x \in \omega_1$ and $w^T x < 0$,
$\delta_x = +1$, if $x \in \omega_2$ and $w^T x > 0$,
$\delta_x = 0$, if $x$ is correctly classified.
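A minimal sketch of this cost, assuming augmented sample vectors and labels $+1$ for $\omega_1$ and $-1$ for $\omega_2$ (the data layout is an assumption made for the example, not part of the slide):

```python
import numpy as np

def perceptron_cost(w, X_aug, labels):
    """J(w) = sum over misclassified x of delta_x * w^T x.
    X_aug: N x (l+1) array of augmented samples [x, 1].
    labels: +1 for class omega_1, -1 for class omega_2."""
    cost = 0.0
    for x, lab in zip(X_aug, labels):
        s = w @ x
        if lab == +1 and s < 0:    # omega_1 sample on the wrong side: delta_x = -1
            cost += -s
        elif lab == -1 and s > 0:  # omega_2 sample on the wrong side: delta_x = +1
            cost += s
    return cost                    # 0 when all samples are correctly classified
```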

Linear discriminant – perceptron cost function

[Figure: Class 1 and Class 2 samples separated by a hyperplane, with a misclassified sample x]

Perceptron cost function:

$J(w) = \sum_{x} \delta_x \, w^T x$

The value of $J(w)$ is proportional to the sum of the distances of all misclassified samples to the decision surface.

If the discriminant function separates the classes perfectly, then $J(w) = 0$.

Otherwise, $J(w) > 0$ and we want to minimize it.

$J(w)$ is continuous and piecewise linear, so we might try to use a gradient descent algorithm.
Linear discriminant – perceptron algorithm

Gradient descent:

$w(t+1) = w(t) - \rho_t \left. \dfrac{\partial J(w)}{\partial w} \right|_{w = w(t)}$

At points where $J(w)$ is differentiable,

$\dfrac{\partial J(w)}{\partial w} = \sum_{\text{misclassified } x} \delta_x \, x$

Thus

$w(t+1) = w(t) - \rho_t \sum_{\text{misclassified } x} \delta_x \, x$

The perceptron algorithm converges when the classes are linearly separable, under some conditions on the step size $\rho_t$.
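A sketch of the resulting training loop, assuming a constant step size rho and a simple stopping rule (both are simplifications of the conditions on $\rho_t$ mentioned above):

```python
import numpy as np

def perceptron_train(X_aug, labels, rho=0.1, max_iter=1000):
    """Iterate w(t+1) = w(t) - rho * sum_{misclassified x} delta_x * x.
    X_aug: augmented samples [x, 1]; labels: +1 for omega_1, -1 for omega_2."""
    w = np.zeros(X_aug.shape[1])
    for _ in range(max_iter):
        grad = np.zeros_like(w)
        for x, lab in zip(X_aug, labels):
            s = w @ x
            if lab == +1 and s <= 0:    # misclassified omega_1 sample: delta_x = -1
                grad -= x
            elif lab == -1 and s >= 0:  # misclassified omega_2 sample: delta_x = +1
                grad += x
        if not grad.any():              # nothing misclassified: stop
            break
        w -= rho * grad
    return w
```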
Sum of error squares estimation

Let $y(x) = \pm 1$ denote the desired output: $+1$ for one class and $-1$ for the other.

We want to find a discriminant function $f_{\omega_1,\omega_2}(x) = w^T x$ whose output is similar to $y(x)$.

Use the sum of squared errors as the similarity criterion:

$J(w) = \sum_{i=1}^{N} (y_i - w^T x_i)^2$

$\hat{w} = \arg\min_{w} J(w)$
Sum of error squares estimation

Minimize the sum of squared errors:

$\dfrac{\partial J(w)}{\partial w} = -2 \sum_{i=1}^{N} x_i (y_i - w^T x_i) = 0$

$\left( \sum_{i=1}^{N} x_i x_i^T \right) \hat{w} = \sum_{i=1}^{N} x_i y_i$

Thus

$\hat{w} = \left( \sum_{i=1}^{N} x_i x_i^T \right)^{-1} \left( \sum_{i=1}^{N} x_i y_i \right)$
Neurons

Artificial neuron.

[Figure: inputs $x_1, \dots, x_l$ weighted by $w_1, \dots, w_l$, combined with a bias $w_0$ and passed through a threshold function $f$]

The figure above represents an artificial neuron computing:

$y = f\left( \sum_{i=1}^{l} w_i x_i + w_0 \right)$
Artificial neuron

Threshold functions $f$:

[Figure: plots of the step function and the logistic function]

Step function:

$f(x) = \begin{cases} 1, & x \ge 0 \\ 0, & x < 0 \end{cases}$

Logistic function:

$f(x) = \dfrac{1}{1 + e^{-ax}}$
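A sketch of a single artificial neuron using the two threshold functions above; the input and weight values are made up for illustration:

```python
import numpy as np

def step(v):
    """Step threshold: 1 if v >= 0, else 0."""
    return 1.0 if v >= 0 else 0.0

def logistic(v, a=1.0):
    """Logistic threshold: 1 / (1 + exp(-a v))."""
    return 1.0 / (1.0 + np.exp(-a * v))

def neuron_output(x, w, w0, f):
    """y = f(sum_i w_i x_i + w0) for a single artificial neuron."""
    return f(w @ x + w0)

x = np.array([0.5, -1.0, 2.0])             # hypothetical inputs
w = np.array([1.0, 0.3, -0.2])             # hypothetical weights
print(neuron_output(x, w, 0.1, step))      # 0.0
print(neuron_output(x, w, 0.1, logistic))  # about 0.475
```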
Combining artificial neurons

[Figure: inputs $x_1, x_2, \dots, x_l$ feeding a network of artificial neurons]

Multilayer perceptron with 3 layers.
Discriminating ability of the multilayer perceptron

Since a 3-layer perceptron can approximate any smooth function, it can approximate $F(x) = P(\omega_1 \mid x) - P(\omega_2 \mid x)$, the optimal discriminant function for two classes.
Training of the multilayer perceptron

[Figure: neuron $j$ in layer $r$ receives the outputs $y_k^{r-1}$ of layer $r-1$ through weights $w_{jk}^r$, forms the activation $v_j^r$, and outputs $y_j^r = f(v_j^r)$]
Training and cost function

Desired network output: $x(i) \rightarrow y(i)$
Trained network output: $x(i) \rightarrow \hat{y}(i)$

Cost function for one training sample:

$E(i) = \dfrac{1}{2} \sum_{m=1}^{k_L} \left( y_m(i) - \hat{y}_m(i) \right)^2$

Total cost function:

$J = \sum_{i=1}^{N} E(i)$

Goal of the training: find the values of the weights $w_{jk}^r$ which minimize the cost function $J$.
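These two cost functions are straightforward to express in code; the sketch below assumes the desired and trained outputs are already available as arrays:

```python
import numpy as np

def sample_cost(y_desired, y_hat):
    """E(i) = 1/2 * sum_m (y_m(i) - y_hat_m(i))^2 for one training sample."""
    return 0.5 * np.sum((y_desired - y_hat) ** 2)

def total_cost(Y_desired, Y_hat):
    """J = sum_i E(i) over all N training samples (one row per sample)."""
    return sum(sample_cost(y, yh) for y, yh in zip(Y_desired, Y_hat))
```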
Gradient descent

Denote $w_j^r = [w_{j0}^r, w_{j1}^r, \dots, w_{j k_{r-1}}^r]^T$.

Gradient descent:

$w_j^r(\text{new}) = w_j^r(\text{old}) - \mu \dfrac{\partial J}{\partial w_j^r}$

Since $J = \sum_{i=1}^{N} E(i)$, we might want to update the weights after processing each training sample separately:

$w_j^r(\text{new}) = w_j^r(\text{old}) - \mu \dfrac{\partial E(i)}{\partial w_j^r}$
Gradient descent

Chain rule for differentiating composite functions:

$\dfrac{\partial E(i)}{\partial w_j^r} = \dfrac{\partial E(i)}{\partial v_j^r(i)} \, \dfrac{\partial v_j^r(i)}{\partial w_j^r} = \dfrac{\partial E(i)}{\partial v_j^r(i)} \, y^{r-1}(i)$

Denote:

$\delta_j^r(i) = \dfrac{\partial E(i)}{\partial v_j^r(i)}$
Backpropagation

If $r = L$, then

$\delta_j^L(i) = \dfrac{\partial E(i)}{\partial v_j^L(i)} = \dfrac{\partial}{\partial v_j^L(i)} \left[ \dfrac{1}{2} \sum_{m=1}^{k_L} \left( f(v_m^L(i)) - y_m(i) \right)^2 \right]$
$\qquad = \left( f(v_j^L(i)) - y_j(i) \right) f'(v_j^L(i)) = e_j(i) \, f'(v_j^L(i))$

If $r < L$, then

$\delta_j^{r-1}(i) = \dfrac{\partial E(i)}{\partial v_j^{r-1}(i)} = \sum_{k=1}^{k_r} \dfrac{\partial E(i)}{\partial v_k^r(i)} \, \dfrac{\partial v_k^r(i)}{\partial v_j^{r-1}(i)}$
$\qquad = \sum_{k=1}^{k_r} \delta_k^r(i) \, \dfrac{\partial v_k^r(i)}{\partial v_j^{r-1}(i)} = \left( \sum_{k=1}^{k_r} \delta_k^r(i) \, w_{kj}^r \right) f'(v_j^{r-1}(i))$
Backpropagation algorithm

• Initialization: initialize all weights with random values.
• Forward computations: for each training vector $x(i)$ compute all $v_j^r(i)$, $y_j^r(i)$.
• Backward computations: for each $i$, $j$ and $r = L, L-1, \dots, 2$ compute $\delta_j^{r-1}(i)$.
• Update weights:

$w_j^r(\text{new}) = w_j^r(\text{old}) - \mu \dfrac{\partial E(i)}{\partial w_j^r} = w_j^r(\text{old}) - \mu \, \delta_j^r(i) \, y^{r-1}(i)$
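A compact sketch of one per-sample update for a network with a single hidden layer and logistic activations. The layer layout, the learning rate value, and the choice to store the bias as the last column of each weight matrix are assumptions made for the example, not prescriptions from the slides:

```python
import numpy as np

def logistic(v, a=1.0):
    return 1.0 / (1.0 + np.exp(-a * v))

def logistic_deriv(v, a=1.0):
    s = logistic(v, a)
    return a * s * (1.0 - s)

def backprop_step(x, y, W1, W2, mu=0.1):
    """One weight update for a two-layer perceptron on a single sample (x, y).
    W1: hidden-layer weights, shape (n_hidden, n_in + 1); last column is the bias.
    W2: output-layer weights, shape (n_out, n_hidden + 1); last column is the bias."""
    # Forward computations: activations v^r and outputs y^r of every layer.
    x_aug = np.append(x, 1.0)
    v1 = W1 @ x_aug
    y1 = logistic(v1)
    y1_aug = np.append(y1, 1.0)
    v2 = W2 @ y1_aug
    y_hat = logistic(v2)

    # Backward computations: delta^L at the output layer, then delta^{L-1}.
    delta2 = (y_hat - y) * logistic_deriv(v2)               # e_j(i) * f'(v_j^L)
    delta1 = (W2[:, :-1].T @ delta2) * logistic_deriv(v1)   # (sum_k delta_k w_kj) * f'(v_j^{L-1})

    # Update weights: w(new) = w(old) - mu * delta^r * y^{r-1}.
    W2 -= mu * np.outer(delta2, y1_aug)
    W1 -= mu * np.outer(delta1, x_aug)
    return W1, W2
```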
MLP issues

• What is the best network configuration?
• How do we choose a proper learning parameter $\mu$?
• When should training be stopped?
• Should we choose another threshold function $f$ or another cost function $J$?