Lecture W12ab

The document covers Principal Component Analysis (PCA) and Linear Discriminant Analysis (LDA) as techniques for dimensionality reduction. PCA aims to project high-dimensional data onto a lower-dimensional surface to minimize projection error, while LDA focuses on maximizing class separation in supervised learning contexts. It also discusses various feature scaling methods and the mathematical formulations involved in both PCA and LDA.


CS-871: Machine Learning

Fall 2024 - Week 12


Principal Component Analysis – Linear Discriminant Analysis

Dr. M. Daud Abdullah Asif


Assistant Professor
Faculty of Computing, SEECS
Email: [email protected]
Dimensionality Reduction - Agenda
• Motivation I: Data Compression
• Motivation II: Visualization
• PCA Problem Formulation
• PCA Algorithm
• Reconstruction
• Number of PCs
• Applications of PCA
If we approximate the original data set by projecting all of the original examples onto this green line, we need only one number to represent the location of each training example after it has been projected onto that green line.
How can we visualize this data when it has a large number of features?
Can we simplify the features so that, instead of 50 values for a country, we can represent the information in 2D?
Reduce from 2D to 1D: project onto a straight line
The length of the blue line segments is the projection error. PCA finds a lower-dimensional surface (e.g., a line) onto which to project the data so that the sum of squares of these blue line segments is minimized.
Before applying PCA, it is standard practice to first perform mean normalization and feature scaling so that the features x1 and x2 have zero mean and comparable ranges of values.
Larger projection errors result from an irrelevant / inaccurate choice of projection direction.
Note the direction of the projection lines and the error lines.
Note the vertical axis
Summary
• PCA tries to find a lower dimensional surface onto which to
project the data, to minimize the squared projection error
• The goal is to minimize the square distance between each point
and the location of where it gets projected
• How to find the lower dimensional surface onto which to project
the data?
Different Types of Feature Scaling:

1. Mean normalization
Replace x_i with x_i − μ_i to make features have approximately zero mean (do not apply to the intercept feature x_0), then divide by max − min:

   x_i := (x_i − μ_i) / (max(x_i) − min(x_i))

2. Standardization
Subtract the mean and divide by the standard deviation:

   x_i := (x_i − μ_i) / s_i

3. Min-max Normalization

   x_i := (x_i − min(x_i)) / (max(x_i) − min(x_i)),   so that 0 ≤ x_i ≤ 1
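As an illustration, a minimal NumPy sketch of the three scaling variants above (assuming x is a 1-D array holding one feature across all training examples; the function names are illustrative, not from the lecture):

```python
import numpy as np

def mean_normalize(x):
    # zero mean, then divide by the feature's range (max - min)
    return (x - x.mean()) / (x.max() - x.min())

def standardize(x):
    # zero mean, unit standard deviation
    return (x - x.mean()) / x.std()

def min_max_normalize(x):
    # rescale so the result lies in [0, 1]
    return (x - x.min()) / (x.max() - x.min())
```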
Find U’s and Z’s
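A minimal sketch of how the U's (principal directions) and z's (lower-dimensional projections) could be computed with NumPy, assuming the data has already been mean-normalized; the names pca, U_reduce and Z are illustrative, not from the lecture:

```python
import numpy as np

def pca(X, k):
    # X: (m, n) mean-normalized data matrix, one example per row
    m = X.shape[0]
    Sigma = (X.T @ X) / m              # n x n covariance matrix
    U, S, _ = np.linalg.svd(Sigma)     # columns of U are the principal directions
    U_reduce = U[:, :k]                # keep the first k directions
    Z = X @ U_reduce                   # project each example onto the k directions
    return Z, U_reduce

# Example: reduce 2-D data to 1-D
X = np.random.randn(100, 2)
X = X - X.mean(axis=0)                 # mean normalization
Z, U_reduce = pca(X, k=1)              # Z has shape (100, 1)
```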
Reconstruction
• So, given an unlabeled data set, we know how to apply PCA to take the high-dimensional features x and map them to a lower-dimensional representation z
• We also know how to take this lower-dimensional representation z and map it back to an approximation of the original high-dimensional data
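Continuing the pca sketch above, the approximate reconstruction is a single matrix product (a sketch, not the lecture's notation):

```python
# Map the compressed Z back to an approximation of the original data.
# X_approx has the original dimensionality; it equals X exactly only
# when no variance was discarded.
X_approx = Z @ U_reduce.T
```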
Linear Discriminant Analysis
• Supervised dimensionality reduction technique.

• Pre-processing step in many pattern recognition problems.

• Can be used for feature extraction.

• A linear transformation that maximizes the separation between multiple classes and minimizes the within-class variability.
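In practice, this kind of supervised projection is often obtained from a library. A minimal sketch using scikit-learn's LinearDiscriminantAnalysis on a toy dataset (assuming scikit-learn is available; the dataset is only for illustration):

```python
from sklearn.datasets import load_iris
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis

X, y = load_iris(return_X_y=True)      # 3 classes, 4 features

# With g = 3 classes, LDA yields at most g - 1 = 2 discriminant directions.
lda = LinearDiscriminantAnalysis(n_components=2)
Z = lda.fit_transform(X, y)            # supervised: uses the class labels y
print(Z.shape)                         # (150, 2)
```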
LDA
• Let us start with a data set which we can write as a matrix:
X = [ x_{1,1}  x_{1,2}  ...  x_{1,N}
      x_{2,1}  x_{2,2}  ...  x_{2,N}
        ...      ...           ...
      x_{n,1}  x_{n,2}  ...  x_{n,N} ]
• Each column is one data point and each row is a variable, but take care: sometimes the transpose convention is used
The mean adjusted data matrix
• We form the mean adjusted data matrix by subtracting the mean
of each variable
U = [ x_{1,1} - m_1  x_{1,2} - m_1  ...  x_{1,N} - m_1
      x_{2,1} - m_2  x_{2,2} - m_2  ...  x_{2,N} - m_2
           ...            ...                ...
      x_{n,1} - m_n  x_{n,2} - m_n  ...  x_{n,N} - m_n ]

• m_i is the mean of the data items in row i



Covariance Matrix

• The covariance matrix can be formed from the product:

  S = (1/N) U Uᵀ
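A quick NumPy check of this formula (a sketch; here U is the mean-adjusted matrix with one variable per row, as defined above):

```python
import numpy as np

n, N = 3, 100                                # 3 variables, 100 data points
X = np.random.randn(n, N)
U = X - X.mean(axis=1, keepdims=True)        # subtract each row's mean
S = (U @ U.T) / N                            # covariance matrix as on the slide

# np.cov with bias=True uses the same 1/N normalization
assert np.allclose(S, np.cov(X, bias=True))
```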
Geometric Idea

[Figure: class-labelled data plotted in the (x1, x2) plane, with the PCA directions f1, f2 and the LDA direction u marked.]

• PCA: (f1, f2), the directions of maximum variance
• LDA: u, the single direction that best separates the classes
Method (Additional Notes)
• Let the between-class scatter matrix S_b be defined as

  S_b = Σ_{i=1}^{g} N_i (x̄_i − x̄)(x̄_i − x̄)ᵀ

• and the within-class scatter matrix S_w be defined as

  S_w = Σ_{i=1}^{g} (N_i − 1) S_i = Σ_{i=1}^{g} Σ_{j=1}^{N_i} (x_{i,j} − x̄_i)(x_{i,j} − x̄_i)ᵀ

• where x_{i,j} is the j-th n-dimensional data point from class p_i, x̄_i is the mean of class p_i, x̄ is the overall mean, N_i is the number of training examples from class p_i, and g is the total number of classes or groups
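A sketch of how S_b and S_w could be computed from a labelled data set (names are illustrative; X holds one n-dimensional example per row and y holds the class labels):

```python
import numpy as np

def scatter_matrices(X, y):
    """Between-class (Sb) and within-class (Sw) scatter matrices."""
    overall_mean = X.mean(axis=0)
    n = X.shape[1]
    Sb = np.zeros((n, n))
    Sw = np.zeros((n, n))
    for c in np.unique(y):
        Xc = X[y == c]                        # examples of class c
        Nc = Xc.shape[0]
        mc = Xc.mean(axis=0)                  # class mean
        d = (mc - overall_mean).reshape(-1, 1)
        Sb += Nc * (d @ d.T)                  # N_i (class mean - overall mean)(...)^T
        Dc = Xc - mc
        Sw += Dc.T @ Dc                       # sum over j of (x_{i,j} - class mean)(...)^T
    return Sb, Sw
```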
Method (Additional Notes cont.)
• It has been shown that P_lda is in fact the solution of the following eigensystem problem:

  S_b P − S_w P Λ = 0

• Multiplying both sides by the inverse of S_w:

  S_w⁻¹ S_b P − S_w⁻¹ S_w P Λ = 0
  S_w⁻¹ S_b P − P Λ = 0
  (S_w⁻¹ S_b) P = P Λ
Standard LDA (Additional Notes)
• If S_w is a non-singular matrix, then Fisher's criterion is maximised when the projection matrix P_lda is composed of the eigenvectors of

  S_w⁻¹ S_b

• with at most (g − 1) nonzero corresponding eigenvalues.

• (since there are only g points to estimate S_b)
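Continuing the scatter_matrices sketch above, the projection matrix can be obtained from the eigenvectors of S_w⁻¹ S_b (a sketch that assumes S_w is non-singular and reuses X, y, Sb and Sw from that sketch):

```python
import numpy as np

g = len(np.unique(y))                                       # number of classes
eigvals, eigvecs = np.linalg.eig(np.linalg.solve(Sw, Sb))   # eig of Sw^-1 Sb
order = np.argsort(eigvals.real)[::-1]                      # largest eigenvalues first
P_lda = eigvecs[:, order[:g - 1]].real                      # at most g - 1 useful directions
Z = X @ P_lda                                               # project the data
```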
Questions?
