09_PCA
PCA
Introduction
• Principal component analysis (PCA) is a popular technique for analyzing large datasets with many dimensions/features per observation.
• The greater the variance in a direction, the more information it carries, and vice versa.
PCs
• The first few principal components (PCs) account for most of the information contained in the data; the remaining PCs can be discarded.
• The first PC points along the direction of maximum variation in the data.
• Discarding some of the PCs will reduce the dimensionality of the data.
Dimensionality reduction
• Benefits of dimensionality reduction:
• Reduction of computational overhead of subsequent processing.
• Noise reduction because only the most relevant information will be captured
and kept.
• A projection into a subspace of low dimension is useful for visualizing the
data.
PCA algorithm
• Step 1: Remove (subtract) the mean from the data points (the data is centered around the
origin point).
• Step 2: Calculate the covariance matrix for the features in the dataset.
• Step 3: Calculate the eigenvalues and eigenvectors for the covariance matrix.
• Step 4: Sort eigenvalues and their corresponding eigenvectors.
• Step 5: Pick the k largest eigenvalues (i.e. the best PCs) and form a matrix from their eigenvectors.
• Step 6: Transform the centered data matrix into the new lower-dimensional data.
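A minimal NumPy sketch of these six steps (the function and variable names `pca`, `X`, and `k` are illustrative, not from the slides):

```python
import numpy as np

def pca(X, k):
    """Project the (n_samples, n_features) matrix X onto its first k principal components."""
    # Step 1: subtract the mean so the data is centered at the origin
    Xc = X - X.mean(axis=0)
    # Step 2: covariance matrix of the features (divide by N, as in the slides)
    C = np.cov(Xc, rowvar=False, bias=True)
    # Step 3: eigenvalues and eigenvectors (eigh, since C is symmetric)
    eigvals, eigvecs = np.linalg.eigh(C)
    # Step 4: sort eigenvalues (and their eigenvectors) in decreasing order
    order = np.argsort(eigvals)[::-1]
    eigvals, eigvecs = eigvals[order], eigvecs[:, order]
    # Step 5: keep the k best eigenvectors as the transformation matrix
    U = eigvecs[:, :k]
    # Step 6: transform the centered data to the new k-dimensional form
    return Xc @ U, eigvals
```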
Variance and covariance
• Variance refers to the spread of a data set around its mean value.
$\sigma^2 = \frac{\sum_i (x_i - \bar{x})^2}{N}$
• Covariance provides insight into how two variables vary from the mean with respect to
each other.
$\mathrm{cov}(x, y) = \frac{\sum_i (x_i - \bar{x})(y_i - \bar{y})}{N}$
• The covariance matrix contains the covariance values between all possible pairs of dimensions; the matrix is always symmetric. Example of a three-dimensional covariance matrix:
$C = \begin{bmatrix} \mathrm{cov}(x,x) & \mathrm{cov}(x,y) & \mathrm{cov}(x,z) \\ \mathrm{cov}(y,x) & \mathrm{cov}(y,y) & \mathrm{cov}(y,z) \\ \mathrm{cov}(z,x) & \mathrm{cov}(z,y) & \mathrm{cov}(z,z) \end{bmatrix}$
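A small check of these properties with NumPy (the data values below are a hypothetical example, not from the slides):

```python
import numpy as np

# Hypothetical data with three features x, y, z; rows are observations.
data = np.array([[2.0, 4.0, 1.0],
                 [3.0, 5.0, 0.0],
                 [5.0, 9.0, 2.0],
                 [6.0, 8.0, 3.0]])
C = np.cov(data, rowvar=False, bias=True)  # bias=True divides by N, matching the formulas above
print(C.shape)                 # (3, 3)
print(np.allclose(C, C.T))     # True: cov(x, y) == cov(y, x), so C is symmetric
```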
Eigenvalues and eigenvectors
• Eigenvalues measure the amount of the variation explained by each PC.
• The eigenvalue is largest for the first PC and smaller for each subsequent PC.
• Eigenvectors provide the directions in which the data cloud is stretched most.
• Steps of calculating eigenvalues and eigenvectors of matrix 𝐴:
• Roots of $|A - \lambda I| = 0$ are the eigenvalues ($\lambda_1, \lambda_2, \dots$).
• Solve $Au = \lambda u$ for each $\lambda$ to obtain the corresponding eigenvectors ($u_1, u_2, \dots$).
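NumPy performs both steps at once; a short sketch (the matrix below is an arbitrary symmetric example, not from the slides):

```python
import numpy as np

A = np.array([[2.0, 1.0],
              [1.0, 2.0]])
eigvals, eigvecs = np.linalg.eig(A)   # roots of |A - lambda*I| = 0 and solutions of A u = lambda u
print(eigvals)                        # 3 and 1 (order not guaranteed)
print(eigvecs)                        # columns are the corresponding eigenvectors
# verify A u = lambda u for the first eigenpair
print(np.allclose(A @ eigvecs[:, 0], eigvals[0] * eigvecs[:, 0]))   # True
```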
Example
• We want to find a transformed dataset of the shown one so that it contains only one feature instead of two.

point   x     y
1       126   78
2       128   80
3       128   82
4       130   82
5       130   84
6       132   86
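To follow along in code, the six points can be set up and centered like this (a verification sketch that reproduces the centering step of the next slide):

```python
import numpy as np

D = np.array([[126, 78],
              [128, 80],
              [128, 82],
              [130, 82],
              [130, 84],
              [132, 86]], dtype=float)
print(D.mean(axis=0))        # [129.  82.] -- the feature means
Dc = D - D.mean(axis=0)      # centered data used in the following slides
print(Dc)                    # rows: (-3,-4), (-1,-2), (-1,0), (1,0), (1,2), (3,4)
```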
Solution – centering data & covariance matrix
• The original data and the centered data (original minus the feature means $\bar{x} = 129$, $\bar{y} = 82$) are used for calculating the covariance matrix. The centered points are $(-3,-4), (-1,-2), (-1,0), (1,0), (1,2), (3,4)$.
• Covariance matrix: $C = \begin{bmatrix} 3.6 & 4.6 \\ 4.6 & 6.6 \end{bmatrix}$
Solution – original and centered data points
[Figure: plot of the original data points and the same points after centering at the origin]
Solution – eigenvalues and eigenvectors

$|A - \lambda I| = 0$

$\left|\begin{bmatrix} 3.6 & 4.6 \\ 4.6 & 6.6 \end{bmatrix} - \begin{bmatrix} \lambda & 0 \\ 0 & \lambda \end{bmatrix}\right| = 0$

$\begin{vmatrix} 3.6 - \lambda & 4.6 \\ 4.6 & 6.6 - \lambda \end{vmatrix} = 0$

$\lambda^2 - 10.2\lambda + 2.6 = 0$

• Solving gives the eigenvalues $\lambda_1 = 9.94$ and $\lambda_2 = 0.26$.
• Substituting $\lambda_1 = 9.94$ into $(A - \lambda_1 I)u_1 = 0$ and normalizing gives the first eigenvector $u_1 = \begin{bmatrix} 0.59 \\ 0.81 \end{bmatrix}$.
• By the same procedure, the eigenvector of the second eigenvalue ($\lambda_2 = 0.26$) is $u_2 = \begin{bmatrix} -0.81 \\ 0.59 \end{bmatrix}$.
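The same eigenvalues and eigenvectors can be checked numerically using the rounded covariance matrix computed above (a verification sketch; eigenvector signs returned by `eigh` are arbitrary):

```python
import numpy as np

C = np.array([[3.6, 4.6],
              [4.6, 6.6]])
eigvals, eigvecs = np.linalg.eigh(C)                  # ascending order for symmetric matrices
eigvals, eigvecs = eigvals[::-1], eigvecs[:, ::-1]    # put the largest eigenvalue first
print(np.round(eigvals, 2))    # [9.94 0.26]
print(np.round(eigvecs, 2))    # columns ~ (0.59, 0.81) and (-0.81, 0.59), up to sign
```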
Solution – variations and eigenvectors
• The covariance matrix is symmetric.
• Therefore, the eigenvectors are orthogonal to each
other.
• The angle between the eigenvectors is 90 degrees.
• The eigenvector related to the largest eigenvalue points in the direction of the most variation of the data.
[Figure: the orthogonal eigenvectors $u_1$ and $u_2$ plotted over the centered data]
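A quick numerical check of the orthogonality claim, using the two eigenvectors found above:

```python
import numpy as np

u1 = np.array([0.59, 0.81])
u2 = np.array([-0.81, 0.59])
print(np.dot(u1, u2))   # ~0: the eigenvectors are orthogonal (90 degrees apart)
```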
Solution – PCs matrix
• The eigenvectors are then arranged based on the eigenvalues.
• Since $\lambda_1 = 9.94 > \lambda_2 = 0.26$, PC1 comes first and PC2 next.
• The eigenvector matrix is constructed. The first column is for the eigenvector related to the largest
eigenvalue.
$u = \begin{bmatrix} 0.59 & -0.81 \\ 0.81 & 0.59 \end{bmatrix}$
• The first eigenvector represents PC1 while the second eigenvector represents PC2.
• This matrix is the transformation matrix; it is used to change the original data to a new form.
Solution – data transform
• The matrix ($u$) can be used to transform the centered data ($D$) such that its variables are uncorrelated.
• The transformed data is found by $Du$:
$\begin{bmatrix} -3 & -4 \\ -1 & -2 \\ -1 & 0 \\ 1 & 0 \\ 1 & 2 \\ 3 & 4 \end{bmatrix} \begin{bmatrix} 0.59 & -0.81 \\ 0.81 & 0.59 \end{bmatrix} = \begin{bmatrix} -5 & 0.1 \\ -2.2 & -0.4 \\ -0.6 & 0.8 \\ 0.6 & -0.8 \\ 2.2 & 0.4 \\ 5 & -0.1 \end{bmatrix}$

• The covariance matrix $C'$ of the transformed data is $C' = \begin{bmatrix} 9.94 & 0 \\ 0 & 0.26 \end{bmatrix}$.
• Note that the sum of variances is equal for both: $C = \begin{bmatrix} 3.6 & 4.6 \\ 4.6 & 6.6 \end{bmatrix}$ gives $3.6 + 6.6 = 10.2$, and $C'$ gives $9.94 + 0.26 = 10.2$.
• The eigenvalues represent the variances of PC1 and PC2.
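A small NumPy check of this transform (assuming the rounded covariance matrix above; the overall sign of each eigenvector, and hence of each transformed column, is arbitrary):

```python
import numpy as np

Dc = np.array([[-3., -4.], [-1., -2.], [-1., 0.],
               [ 1.,  0.], [ 1.,  2.], [ 3.,  4.]])
C = np.array([[3.6, 4.6],
              [4.6, 6.6]])
vals, vecs = np.linalg.eigh(C)
u = vecs[:, ::-1]                  # eigenvector matrix, largest eigenvalue first (~[[0.59, -0.81], [0.81, 0.59]])
T = Dc @ u                         # transformed data
print(np.round(T, 1))              # first column ~ +/-[-5, -2.2, -0.6, 0.6, 2.2, 5]
print(np.round(u.T @ C @ u, 2))    # ~ [[9.94, 0], [0, 0.26]] -- the PCs are uncorrelated
```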
Solution – data transform
• The transformed data is the same as the original centered data rotated (clockwise here) until the eigenvectors point along the original axes.
$\mathrm{var}(\mathrm{PC1}) = \frac{9.94}{9.94 + 0.26} = 97.4\%$

$\mathrm{var}(\mathrm{PC2}) = \frac{0.26}{9.94 + 0.26} = 2.5\%$
Solution – new form of data
• Since PC1 accounts for most of the variance in the data (i.e. most of the information in the data is in PC1), PC2 can simply be ignored because it contains almost no information.
• Now, PC1 represents the new form of the original data.
$\begin{bmatrix} -5 \\ -2.2 \\ -0.6 \\ 0.6 \\ 2.2 \\ 5 \end{bmatrix}$

• We changed the data from 2D to 1D.
Number of PCs to retain
• One of several methods can be used to decide on the number of PCs to keep:
• Select the fewest PCs that together hold a specified amount of the total variance (e.g. 90% of the total variance).
• Select the PCs with variance (eigenvalue) greater than the average of all the eigenvalues.
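Both rules are easy to express over the eigenvalues; a sketch (assuming `eigvals` is already sorted in decreasing order, as produced by the PCA steps earlier):

```python
import numpy as np

def pcs_for_variance(eigvals, threshold=0.90):
    """Smallest number of leading PCs explaining at least `threshold` of the total variance."""
    ratio = np.cumsum(eigvals) / np.sum(eigvals)
    return int(np.searchsorted(ratio, threshold)) + 1

def pcs_above_average(eigvals):
    """Number of PCs whose eigenvalue is greater than the average eigenvalue."""
    return int(np.sum(eigvals > np.mean(eigvals)))
```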
Example – selecting PCs
• Suppose we have the shown matrix of PC scores (the data projected onto four PCs, one row per observation), together with the variance (eigenvalue) and variance percentage of each PC.

  PC1      PC2      PC3      PC4
 -0.62    -1.14    -0.10    -0.14
  1.73    -1.32    -0.16     0.14
  0.26    -1.35     0.65     0
 -1.05     0.06    -0.09     0.46
  2.35     1.68    -0.28     0
 -0.59     1.43     0.23     0.16
 -2.47    -0.06    -0.40    -0.17
  0.41     1.51     0.32    -0.24
 -1.58     0.19     0.03    -0.09
  1.58    -1.00    -0.20    -0.13
Variance     2.41     1.45     0.10     0.04
Variance %   60.2     36.2      2.5      1.1

• First method:
• If we want to keep the least number of PCs that account for at least 90% of the total variance, we select only PC1 and PC2, because the sum of their variances is 60.2% + 36.2% = 96.4%.
• Second method:
• PC1 and PC2 will also be selected because their variances are greater than the average of the eigenvalues.
• Average(variance) = (2.41 + 1.45 + 0.1 + 0.04) / 4 = 1
• 2.41 > 1 and 1.45 > 1
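Plugging the variances from this table into both rules (a quick numeric check; the values are taken directly from the table):

```python
import numpy as np

eigvals = np.array([2.41, 1.45, 0.10, 0.04])      # variances of PC1..PC4
ratio = np.cumsum(eigvals) / eigvals.sum()
print(np.round(100 * ratio, 1))                   # ~ [60.2, 96.5, 99.0, 100.0] cumulative variance (%)
print(int(np.searchsorted(ratio, 0.90)) + 1)      # 2 -> PC1 and PC2 reach 90% of the total variance
print(int((eigvals > eigvals.mean()).sum()))      # 2 -> two eigenvalues exceed the average (1.0)
```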