
The Math Behind PCA (Principal Component Analysis)

What is PCA?

Principal Component Analysis (PCA) is a linear transformation technique used to reduce the dimensionality of a dataset while preserving as much variance (information) as possible.

It does this by transforming the original variables into a new set of uncorrelated variables called principal components, ordered by how much variance they capture from the data.

The Mathematical Steps of PCA

Step 1: Standardize the Data

PCA is sensitive to scale, so we start by centering and standardizing the dataset:

x_i^(std) = (x_i − μ_i) / σ_i

Where:

 x_i = original value of the feature

 μ_i = mean of the feature

 σ_i = standard deviation of the feature

This step ensures all features contribute equally.
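A minimal NumPy sketch of this step (the dataset values below are made up purely for illustration):

import numpy as np

# Hypothetical toy dataset: 5 samples, 3 features (values invented for illustration)
X = np.array([[2.5, 2.4, 0.5],
              [0.5, 0.7, 1.9],
              [2.2, 2.9, 0.8],
              [1.9, 2.2, 1.1],
              [3.1, 3.0, 0.4]])

# Standardize each feature: subtract its mean, divide by its standard deviation
mu = X.mean(axis=0)
sigma = X.std(axis=0, ddof=1)      # sample standard deviation
X_std = (X - mu) / sigma

# After standardization every feature has mean ~0 and standard deviation ~1
print(X_std.mean(axis=0))
print(X_std.std(axis=0, ddof=1))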

Step 2: Compute the Covariance Matrix

Next, we measure the relationships (covariances) between all pairs of features.
C = (1 / (n − 1)) X^T X

Where:

 X = standardized data matrix (rows: samples, columns: features)

 C = covariance matrix (symmetric)

Each element c_ij of C tells us the covariance between features i and j.
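Continuing the sketch from Step 1, the covariance matrix follows directly from the standardized matrix and can be checked against np.cov:

n = X_std.shape[0]

# Covariance matrix of the standardized data: C = (1 / (n - 1)) X^T X
C = (X_std.T @ X_std) / (n - 1)

# np.cov computes the same matrix (rowvar=False treats columns as features)
assert np.allclose(C, np.cov(X_std, rowvar=False))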


Step 3: Compute the Eigenvalues and Eigenvectors

We solve the eigen decomposition of the covariance matrix:


C v = λ v

Where:

 λ = eigenvalue (amount of variance captured)

 v = eigenvector (direction of the new axis)

You get:

 A set of eigenvectors (principal directions)

 A set of eigenvalues (explained variance per direction)

The eigenvector with the highest eigenvalue is the first principal component, and so on.
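In code, because C is symmetric, np.linalg.eigh is the natural choice (continuing the sketch):

# eigh handles symmetric matrices and returns eigenvalues in ascending order
eigvals, eigvecs = np.linalg.eigh(C)

# Column eigvecs[:, i] pairs with eigvals[i]; verify C v = λ v for each pair
for lam, v in zip(eigvals, eigvecs.T):
    assert np.allclose(C @ v, lam * v)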

Step 4: Sort Eigenvalues and Select Top k

Sort eigenvalues in descending order and pick the top k components that
capture the most variance.

The explained variance ratio is calculated as:


Explained Variance Ratio_k = λ_k / Σ_i λ_i
This helps in choosing how many components to keep.
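A short continuation of the sketch, sorting the components and computing the explained variance ratios:

# Sort eigenvalues (and their matching eigenvectors) in descending order
order = np.argsort(eigvals)[::-1]
eigvals = eigvals[order]
eigvecs = eigvecs[:, order]

# Explained variance ratio: each eigenvalue divided by the sum of all eigenvalues
explained_ratio = eigvals / eigvals.sum()
print(explained_ratio)             # per-component share of the variance
print(np.cumsum(explained_ratio))  # cumulative share, useful for picking k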

Step 5: Form the Projection Matrix

Let’s say we choose k eigenvectors (columns of matrix W):


W = [v_1, v_2, …, v_k]

Where each v_i is an eigenvector (principal component).
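In the running sketch, the projection matrix is simply the first k columns of the sorted eigenvector matrix (k = 2 here is an arbitrary illustrative choice):

k = 2                      # number of components to keep (illustrative choice)
W = eigvecs[:, :k]         # projection matrix: top-k eigenvectors as columns
print(W.shape)             # (n_features, k)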

Step 6: Project the Data

Transform the original data into the new space:


Z=XW

 X = standardized data matrix

 W = matrix of top k eigenvectors

 Z = transformed data in lower dimensions

Each row of Z is a data point represented in the principal component space.
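The projection itself is a single matrix multiplication (continuing the sketch):

# Project the standardized data onto the top-k principal directions
Z = X_std @ W
print(Z.shape)                  # (n_samples, k)

# Columns of Z are uncorrelated; their variances equal the top-k eigenvalues
print(np.cov(Z, rowvar=False))
print(eigvals[:k])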

Geometric Intuition

 PCA finds the axes of greatest variance in the data.

 These new axes (principal components) are orthogonal (perpendicular).

 Data is rotated and projected onto the new axes.

 The first few principal components often capture most of the variability.
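As a sanity check, the manual steps above should agree with scikit-learn's PCA fitted on the same standardized data, up to the sign of each component (the sign of an eigenvector is arbitrary). A hedged sketch, assuming scikit-learn is available and reusing the arrays from the earlier snippets:

import numpy as np
from sklearn.decomposition import PCA

pca = PCA(n_components=2)
Z_sklearn = pca.fit_transform(X_std)

# Scores match up to a per-component sign flip
assert np.allclose(np.abs(Z), np.abs(Z_sklearn))

# Explained variance ratios match the manual eigenvalue-based calculation
assert np.allclose(explained_ratio[:2], pca.explained_variance_ratio_)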
