2 - 4 Principal Component Analysis (PCA)

Principal component analysis (PCA) is used to reduce the dimensionality of large datasets by transforming correlated variables into a smaller number of uncorrelated variables called principal components. PCA works by computing the eigenvalues and eigenvectors of the covariance matrix of the dataset and using them to change the basis of the data to a new set of orthogonal variables ordered by variability. This transformation projects the data onto a new set of axes such that the first axis captures the largest variability in the data, with each successive axis capturing the next highest variability.


2.1 Data analytics for dimensionality reduction:
Principal Component Analysis (PCA)

Prof. Massimiliano Grosso
University of Cagliari, Italy
[email protected]

GRICU PhD School 2021
Digitalization Tools for the Chemical and Process Industries
March 12, 2021

Outline
• Motivations
• Basic concepts
• Preprocessing
• Mathematical background
• Dimension reduction
• Geometrical interpretation

Motivations
• Concerns when dealing with “huge” amounts of data:
  • The size of the data:
    • The useful information is often «hidden» amongst hundreds/thousands of variables
    • The measurements are often highly correlated with one another (multicollinearity)
    • The number of independent variables (degrees of freedom) is much smaller than the number of measurements on hand
  • Noise in the measurements:
    • Difficulty in distinguishing the noise from the deterministic variations induced by external sources


Motivations
• PCA is a multivariate data analysis method for
  • Exploratory data analysis
  • Outlier detection
  • Rank reduction
  • Graphical clustering
  • Classification
• PCA allows interpretation based on all variables simultaneously, leading to a deeper understanding than is possible by looking at the individual variables alone
• It is typically the first multivariate analysis to be carried out

PCA: Basic concepts
• Aim of the PCA: projection of the original variables onto the principal components (PCs)
  • Original variables: high dimension, strongly correlated
  • Principal components (PCs): artificial variables, much lower dimension, independent

PCA: Basic concepts
• Data must be collected in a matrix X of size I×J (see the sketch below)
  • Column vectors represent the variables (j = 1, …, J): attributes, wavelengths, physical/chemical parameters, etc.
  • Row vectors represent the samples (i = 1, …, I) collected during the experiments
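
The following is a minimal NumPy sketch, not part of the original slides, of what such a matrix X can look like in practice; the variable names and the synthetic data are illustrative assumptions only.

    import numpy as np

    rng = np.random.default_rng(seed=0)
    I, J = 50, 4                                  # 50 samples (rows), 4 variables (columns)
    latent = rng.normal(size=(I, 1))              # one hidden factor driving every variable
    weights = rng.normal(size=(1, J))
    X = latent @ weights + 0.1 * rng.normal(size=(I, J))   # strongly correlated columns plus noise
    print(X.shape)                                # (50, 4): row i = sample, column j = variable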

Preprocessing of the data
• Matrix X can be visualized in a coordinate system made up of J orthogonal axes, each representing one of the original J variables
• Each i-th sample is a J-dimensional row vector
• Two-dimensional example with two highly correlated variables (x1, x2)
• First step: translate the data to the center («mean centering»), as in the sketch below

  x*_ij = x_ij − x̄_j ,   where   x̄_j = (1/I) Σ_i x_ij
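
A minimal sketch of the mean-centering step, assuming a NumPy array X of shape (I, J) like the hypothetical one above:

    import numpy as np

    X = np.random.default_rng(0).normal(size=(50, 4))   # placeholder data matrix (I=50, J=4)
    x_bar = X.mean(axis=0)                               # column means x̄_j = (1/I) Σ_i x_ij
    X_star = X - x_bar                                    # x*_ij = x_ij − x̄_j
    print(np.allclose(X_star.mean(axis=0), 0.0))          # True: every column now has zero mean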

Preprocessing of the data
• Mean centering allows one to consider the covariance matrix

  C = (1/(I−1)) X*ᵀ X*

• Indeed, for the element kl:

  c_kl = (1/(I−1)) Σ_i x*_ik x*_il = (1/(I−1)) Σ_i (x_ik − x̄_k)(x_il − x̄_l) = cov(x_k, x_l)

• The diagonal elements of C are the dispersion related to the j-th variable:

  c_jj = (1/(I−1)) Σ_i (x_ij − x̄_j)² = var(x_j)
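
The covariance matrix can then be computed directly from the centered data. The sketch below assumes the 1/(I−1) sample normalization and checks the result against NumPy's np.cov:

    import numpy as np

    rng = np.random.default_rng(0)
    X = rng.normal(size=(50, 4))                           # placeholder data matrix
    X_star = X - X.mean(axis=0)                            # mean-centered data
    I = X_star.shape[0]
    C = X_star.T @ X_star / (I - 1)                        # J×J covariance matrix
    print(np.allclose(C, np.cov(X, rowvar=False)))         # True: matches NumPy's covariance
    print(np.allclose(np.diag(C), X.var(axis=0, ddof=1)))  # True: diagonal = variances of the variables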

PCA – Basic concepts
• Principal Component Analysis is based on the decomposition of the dataset matrix X:

  X  =  T Pᵀ
  (I×J)  (I×J)(J×J)

• T: scores matrix, the artificial variables generated by PCA
• P: loadings matrix, the rotation matrix relating the artificial variables to the original ones

PCA – Basic concepts
• Important properties:
  1. The scores are also mean centered:
     x̄*_j = 0 ∀ j = 1, …, J  ⇒  t̄_j = 0 ∀ j = 1, …, J
  2. The column vectors of the score matrix T are orthogonal:
     t_mᵀ t_n = 0 ∀ m ≠ n
     • consequently, the matrix TᵀT is diagonal
  3. The loadings matrix P is orthonormal:
     PᵀP = I  ⇒  P⁻¹ = Pᵀ

Mathematical background
• PCA scores and loadings can be related to the computation of the eigenvalues and eigenvectors of the J×J covariance matrix C
• Remark: C is a square, symmetric matrix; this leads to the following properties:
  • All the eigenvalues are real and non-negative
  • All the eigenvectors are orthogonal to each other


Mathematical background
• Starting from the definition X = T Pᵀ, one can obtain the following relationship:

  C = (1/(I−1)) XᵀX = (1/(I−1)) P TᵀT Pᵀ = P Λ Pᵀ

• The latter equation corresponds to the eigendecomposition of the square matrix C
  • Λ is a diagonal matrix whose diagonal elements are the eigenvalues of C
  • The m-th element λ_m = var(t_m) is the variance explained by the m-th score
  • P is the J×J square matrix whose m-th column is the eigenvector p_m of C
    • it is a rotation matrix
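
A sketch of this eigendecomposition in NumPy (np.linalg.eigh applies because C is symmetric); the data are synthetic, and the eigenvalues are sorted in decreasing order, as prescribed later in the slides:

    import numpy as np

    rng = np.random.default_rng(0)
    X = rng.normal(size=(50, 4)) @ rng.normal(size=(4, 4))   # placeholder correlated data
    X_star = X - X.mean(axis=0)
    C = X_star.T @ X_star / (X.shape[0] - 1)                  # J×J covariance matrix

    lam, P = np.linalg.eigh(C)            # eigenvalues and eigenvectors (columns of P)
    order = np.argsort(lam)[::-1]         # sort by decreasing eigenvalue (decreasing variance)
    lam, P = lam[order], P[:, order]

    print(np.allclose(C, P @ np.diag(lam) @ P.T))    # C = P Λ Pᵀ
    print(np.allclose(P.T @ P, np.eye(P.shape[1])))  # PᵀP = I: P is orthonormal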

Mathematical background
• Once the eigenvectors p_m are computed, the corresponding scores can be derived:

  X = T Pᵀ  ⇒  X P = T PᵀP  ⇒  T = X P

• In practice, the original variables are projected onto the orthogonal eigenspace defined by the eigenvectors/loadings
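
Continuing the same hypothetical example, the scores follow as T = X P; the check below also confirms that the score columns are orthogonal (TᵀT diagonal), as stated among the basic properties:

    import numpy as np

    rng = np.random.default_rng(0)
    X = rng.normal(size=(50, 4)) @ rng.normal(size=(4, 4))   # placeholder correlated data
    X_star = X - X.mean(axis=0)
    C = X_star.T @ X_star / (X.shape[0] - 1)
    lam, P = np.linalg.eigh(C)
    order = np.argsort(lam)[::-1]
    lam, P = lam[order], P[:, order]

    T = X_star @ P                                    # scores: projection onto the eigenvectors
    print(np.allclose(X_star, T @ P.T))               # the decomposition X = T Pᵀ is recovered
    TtT = T.T @ T
    print(np.allclose(TtT, np.diag(np.diag(TtT))))    # TᵀT is diagonal: score columns are orthogonal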


Mathematical background
• The eigenvalues of the covariance matrix are related to the variance of the scores:

  var(t_j) = (1/(I−1)) t_jᵀ t_j = λ_j ,   j = 1, …, J

• Thus the j-th eigenvalue is the dispersion captured by the j-th score
• The total variance in the original data set is preserved in the T matrix:

  Σ_j var(x_j) = Σ_j λ_j
  (sum of the variances of the original variables = sum of the variances of the scores)
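
The sketch below verifies both statements numerically on the same synthetic data: each score variance equals the corresponding eigenvalue, and the total variance is preserved by the rotation:

    import numpy as np

    rng = np.random.default_rng(0)
    X = rng.normal(size=(50, 4)) @ rng.normal(size=(4, 4))   # placeholder correlated data
    X_star = X - X.mean(axis=0)
    C = X_star.T @ X_star / (X.shape[0] - 1)
    lam, P = np.linalg.eigh(C)
    order = np.argsort(lam)[::-1]
    lam, P = lam[order], P[:, order]
    T = X_star @ P

    print(np.allclose(T.var(axis=0, ddof=1), lam))              # var(t_j) = λ_j
    print(np.isclose(X.var(axis=0, ddof=1).sum(), lam.sum()))   # Σ_j var(x_j) = Σ_j λ_j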

Mathematical background
• In summary, one ends up with two matrices:

  T = [t_1  t_2  …  t_J]   (I×J, columns of size I×1)
  P = [p_1  p_2  …  p_J]   (J×J, columns of size J×1)

• Scores matrix T: the j-th column represents an independent variable obtained by projecting the data onto the j-th eigenvector
• Loadings matrix P: each column is an eigenvector of the covariance matrix
• Reminder: sort the eigenvectors according to their eigenvalue size (that is, their variance)


PCA – Dimension reduction
• The scores and loadings matrices can be approximated by considering only the first A principal components:

  T ≈ T_A (the first A columns, I×A; the remaining I×(J−A) block is discarded)
  P ≈ P_A (the first A columns, J×A; the remaining J×(J−A) block is discarded)

• The retained columns carry the information considered relevant; the discarded columns carry information considered negligible

PCA – Dimension reduction
• Qualitative interpretation of the PCA:

  X  ≈  T_A P_Aᵀ
  (I×J)  (I×A)(A×J),   in general A ≪ J

• Only part of the information collected in the X matrix is relevant
• Only the first A columns of T (the first scores) take into account most of the data variance (see the sketch below)
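
A sketch of the truncation on synthetic data that is essentially two-dimensional plus noise; keeping A = 2 components (an arbitrary choice for this example) reproduces almost all of the centered data:

    import numpy as np

    rng = np.random.default_rng(0)
    X = rng.normal(size=(50, 2)) @ rng.normal(size=(2, 5)) + 0.05 * rng.normal(size=(50, 5))
    X_star = X - X.mean(axis=0)
    C = X_star.T @ X_star / (X.shape[0] - 1)
    lam, P = np.linalg.eigh(C)
    order = np.argsort(lam)[::-1]
    lam, P = lam[order], P[:, order]

    A = 2                                     # number of retained principal components
    T_A, P_A = X_star @ P[:, :A], P[:, :A]    # keep only the first A scores and loadings
    X_hat = T_A @ P_A.T                       # rank-A approximation of the centered data
    print(np.linalg.norm(X_star - X_hat) / np.linalg.norm(X_star))   # small: little information lost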


PCA – A geometrical interpretation
• 2D example - Reduction to 1D
• Samples are strongly correlated
• The first principal component PC1 is the eigenvector direction corresponding to maximum variance (largest eigenvalue) in the coordinate space
• The second principal component PC2 is the orthogonal direction capturing the second-largest variance
[Figure: scatter of correlated samples in the (x1, x2) plane with the PC1 and PC2 directions]

PCA – A geometrical interpretation
• Orthogonal projection onto a specific PC results in a score for each sample
• The loading is the unit vector which defines this direction
[Figure: samples in the (x1, x2) plane; loading 1 is the unit vector along PC1, with its first and second components measured along x1 and x2]


PCA – A geometrical interpretation
• The score is the projection of the point onto the first principal component:

  x_i ≈ x̂_i = t_i1 p_1ᵀ

[Figure: point in the (x1, x2) plane and its score t1 along PC1]

PCA – Working principle – Reduction to 1D
• PCA projects matrix X (I×J) into:
  • a score vector t1 (I×1)
  • a loading vector p1 (J×1), written as p1ᵀ (1×J) in the decomposition
• t1 and p1 are the first components


PCA – A geometrical interpretation
• 3D example (a little bit more complicated)
• Points are mostly aligned along the 2D plane defined by the PC1 and PC2 directions
[Figure: 3D scatter with the PC1, PC2 and PC3 directions; the data lie close to the PC1-PC2 plane]

PCA – Working principle – Reduction to 2D
• If two principal components are required, the matrix is formed by the outer products of t1 and p1, and of t2 and p2:

  X = t1 p1ᵀ + t2 p2ᵀ + E

• Matrix X is decomposed into two rank-1 outer products (2 terms) and the residual matrix E


PCA – Working principle
• Successive components are formed by the outer products of t_a and p_a:

  X = t1 p1ᵀ + t2 p2ᵀ + … + tA pAᵀ + E

• Matrix X is decomposed into a set of rank-1 outer products (A terms) and the residual matrix E, as in the sketch below
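
The same approximation written as an explicit sum of rank-1 outer products; this sketch (np.outer builds each t_a p_aᵀ term) reuses the synthetic data of the previous examples:

    import numpy as np

    rng = np.random.default_rng(0)
    X = rng.normal(size=(50, 2)) @ rng.normal(size=(2, 5)) + 0.05 * rng.normal(size=(50, 5))
    X_star = X - X.mean(axis=0)
    lam, P = np.linalg.eigh(X_star.T @ X_star / (X.shape[0] - 1))
    P = P[:, np.argsort(lam)[::-1]]
    T = X_star @ P

    A = 2
    X_hat = sum(np.outer(T[:, a], P[:, a]) for a in range(A))   # t_1 p_1ᵀ + … + t_A p_Aᵀ
    E = X_star - X_hat                                           # residual matrix
    print(np.allclose(X_star, X_hat + E))                        # X = Σ_a t_a p_aᵀ + E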

PCA – Working principle
• The master equation for PCA is then

  X = t1 p1ᵀ + t2 p2ᵀ + … + tA pAᵀ + E

• or

  X   =   T_A P_Aᵀ   +   E
  (I×J)   (I×A)(A×J)   (I×J)

  original data matrix = score matrix × loading matrix + residual matrix


Estimation of the residuals
• When considering a PCA model with A principal components, one can evaluate the residual E:

  E = X − T_A · P_Aᵀ = X − X̂

Estimation of the components
• How many principal components are needed?
• Possible criterion: cumulative variance explained by the first A principal components
  • The number of principal components is chosen so that it explains most of the variance in the data (e.g., 95%), as in the sketch below
• Alternative possibilities will be discussed in the case studies
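
A sketch of this criterion on synthetic data: the eigenvalues give the explained-variance fractions, and A is taken as the smallest number of components whose cumulative fraction reaches 95%:

    import numpy as np

    rng = np.random.default_rng(0)
    X = rng.normal(size=(50, 3)) @ rng.normal(size=(3, 8)) + 0.1 * rng.normal(size=(50, 8))
    X_star = X - X.mean(axis=0)
    lam = np.linalg.eigvalsh(X_star.T @ X_star / (X.shape[0] - 1))[::-1]   # eigenvalues, descending

    explained = lam / lam.sum()                    # fraction of total variance per component
    cumulative = np.cumsum(explained)
    A = int(np.argmax(cumulative >= 0.95)) + 1     # smallest A explaining at least 95% of the variance
    print(cumulative.round(3), A)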


PCA to predict new data – Projection of the data onto the principal component space
• Single observations (e.g., new data x_new) can be projected onto the space defined by the PCA model:

  t_new = x_new P_A              x̂_new = x_new P_A P_Aᵀ
  (1×A) = (1×J)(J×A)             (1×J) = (1×J)(J×A)(A×J)
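
A sketch of the projection of a new observation; here x_new is centered with the training means before applying P_A, consistent with the preprocessing step (this centering is an assumption, since the slide formula does not write it explicitly):

    import numpy as np

    rng = np.random.default_rng(0)
    X = rng.normal(size=(50, 2)) @ rng.normal(size=(2, 5)) + 0.05 * rng.normal(size=(50, 5))
    x_bar = X.mean(axis=0)
    X_star = X - x_bar
    lam, P = np.linalg.eigh(X_star.T @ X_star / (X.shape[0] - 1))
    P_A = P[:, np.argsort(lam)[::-1]][:, :2]        # loadings of the first A = 2 components

    x_new = rng.normal(size=(1, 5))                 # a hypothetical new observation (1×J)
    t_new = (x_new - x_bar) @ P_A                   # scores of the new sample (1×A)
    x_hat_new = t_new @ P_A.T + x_bar               # back-projection onto the PCA model (1×J)
    print(t_new.shape, x_hat_new.shape)             # (1, 2) (1, 5)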

PCA – Summary
• PCA projects the original data onto an orthogonal eigenspace of smaller dimension
• The space is described by the first A eigenvectors of the covariance matrix
• The scores (i.e. the data projections onto the first eigenvectors) represent a set of independent variables
• New data can be projected onto the PCA model


References
1. Brereton, R.G. Chemometrics: Data Analysis for the Laboratory and Chemical Plant. Wiley, 2003.
2. Brereton, R.G. Chemometrics for Pattern Recognition. Wiley, 2009.
3. Jackson, J.E. A User's Guide to Principal Components. Wiley, New York, 1991.
4. Jolliffe, I.T. Principal Component Analysis. Second Edition. Springer, 2002.
5. Jolliffe, I.T., Cadima, J. (2016). Principal component analysis: a review and recent developments. Phil. Trans. R. Soc. A 374: 20150202.
6. Wold, S., Esbensen, K., Geladi, P. (1987). Principal Component Analysis – A tutorial. Chemom. Intell. Lab. Syst. 2, 37-52.
