02 Principal Components

Uploaded by

Ronit Bhatia


Principal Components Analysis (PCA)

Data Science Modelling (SOST30062)

Week 9

Dr András Vörös, Department of Social Statistics

Motivation
We have a number of correlated variables in our data
• they partly measure similar things
Can the information they carry be expressed in fewer variables?
→ identify principal components

Uses of the approach:

• Explore data
• Visualise data
• Define predictors for supervised learning methods
Finding the first principal component
Aim: find linear combination of p variables that has the largest
variance (~ captures the most information)

→ first principal component

$$Z_1 = \phi_{11} X_1 + \phi_{21} X_2 + \dots + \phi_{p1} X_p \quad \text{where} \quad \sum_{j=1}^{p} \phi_{j1}^2 = 1$$

The $\phi_{j1}$ are called the “loadings” of the variables on the first PC
(their relative weights on the component)
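The slides do not say how the maximising loadings are found. A standard result is that they are the eigenvector of the covariance matrix with the largest eigenvalue. The sketch below illustrates this with NumPy on simulated data; the data, the seed and the variable names are assumptions made for illustration only.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical data: n = 200 observations of p = 3 correlated variables.
latent = rng.normal(size=(200, 1))
X = latent @ np.array([[1.0, 0.8, 0.5]]) + 0.3 * rng.normal(size=(200, 3))

# Centre each variable so that it has mean 0.
Xc = X - X.mean(axis=0)

# Covariance matrix with divisor n, matching the variance formula in the slides.
S = Xc.T @ Xc / Xc.shape[0]

# eigh returns eigenvalues in ascending order, so the last column of
# eigvecs is the eigenvector belonging to the largest eigenvalue.
eigvals, eigvecs = np.linalg.eigh(S)
phi1 = eigvecs[:, -1]          # loadings of the first PC
z1 = Xc @ phi1                 # first principal component

print("loadings:", phi1)
print("sum of squared loadings:", np.sum(phi1 ** 2))
print("variance of Z1:", z1.var())
```

The sign of the loading vector is arbitrary: multiplying all loadings by -1 gives a component with the same variance.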
Maximising the variance
We need to maximise the following across n observations:

$$\mathrm{Var}(Z_1) = \frac{1}{n} \sum_{i=1}^{n} (z_{i1} - \bar{z}_1)^2$$

where $z_{i1} = \phi_{11} x_{i1} + \phi_{21} x_{i2} + \dots + \phi_{p1} x_{ip}$

If the variables are centered (have means of 0), this simplifies:

$$\mathrm{Var}(Z_1) = \frac{1}{n} \sum_{i=1}^{n} z_{i1}^2$$

$z_{i1}$ is called the “score” of the ith observation on the first PC
(its value on the component as a variable)
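The simplification can be checked numerically. The sketch below uses simulated data and an arbitrary unit-length loading vector (both hypothetical, not taken from the slides): once the variables are centred, the scores have mean 0, so the two variance formulas give the same value.

```python
import numpy as np

rng = np.random.default_rng(1)

# Hypothetical data: two variables with non-zero means.
X = rng.normal(loc=[5.0, -2.0], scale=[1.0, 3.0], size=(100, 2))

# Hypothetical loadings; 0.6**2 + 0.8**2 = 1, so the constraint holds.
phi = np.array([0.6, 0.8])

Xc = X - X.mean(axis=0)        # centre the variables
z = Xc @ phi                   # scores of the 100 observations

var_full = np.mean((z - z.mean()) ** 2)   # general formula
var_short = np.mean(z ** 2)               # simplified formula

print("mean of scores:", z.mean())
print("general formula:   ", var_full)
print("simplified formula:", var_short)
```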
The other components
The second principal component is the linear combination with the largest
variance among all those uncorrelated with the first PC

The third is uncorrelated with the first two

And so on…

In a coordinate system: uncorrelated = orthogonal
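Both properties can be checked on simulated data. The sketch below assumes that all components are obtained together as the eigenvectors of the covariance matrix, which is a standard approach but not one stated in the slides. It checks that the loading vectors are orthogonal and that the scores on different components are uncorrelated.

```python
import numpy as np

rng = np.random.default_rng(2)

# Hypothetical data: 300 observations of 4 correlated variables.
A = rng.normal(size=(4, 4))
X = rng.normal(size=(300, 4)) @ A
Xc = X - X.mean(axis=0)

S = Xc.T @ Xc / len(Xc)                    # covariance matrix (divisor n)
eigvals, eigvecs = np.linalg.eigh(S)

# Reorder so that the first column belongs to the largest eigenvalue.
order = np.argsort(eigvals)[::-1]
eigvals = eigvals[order]
Phi = eigvecs[:, order]                    # column k holds the loadings of PC k+1

Z = Xc @ Phi                               # scores on all four PCs
score_cov = Z.T @ Z / len(Z)               # covariance matrix of the scores

print("Phi' Phi (should be the identity matrix):")
print(np.round(Phi.T @ Phi, 6))
print("covariance of the scores (should be diagonal):")
print(np.round(score_cov, 6))
```

The diagonal of the score covariance matrix holds the variances of the components, in decreasing order.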

Interpretation of principal components
Two PCs based on arrests for three crime types and the urban population
of US states

Arrows: loading vectors on 2 PCs

(axes top and right)
• PC1: crime, PC2: urbanisation
State labels: scores on 2 PCs
(axes bottom and left)
• different state profiles by
crime and urbanisation
Scaling the variables matters
Units of measurement matter
• Variables with larger variance will be more important in PCs
Solution: scale them, so all variables have variance = 1
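A small simulation shows the effect. In the sketch below, the data and the helper `first_loadings` are hypothetical. The second variable is recorded in units 1000 times smaller than the first, so its variance is far larger. Without scaling it dominates the first PC; after scaling, both variables get equal weight.

```python
import numpy as np

rng = np.random.default_rng(3)
n = 500

# Two correlated variables; the second is expressed in much smaller units,
# so its numerical variance is about a million times larger.
a = rng.normal(size=n)
b = 0.5 * a + rng.normal(size=n)
X = np.column_stack([a, 1000.0 * b])


def first_loadings(X, scale):
    """Loadings of the first PC, optionally after scaling to variance 1."""
    Xc = X - X.mean(axis=0)
    if scale:
        Xc = Xc / Xc.std(axis=0)
    S = Xc.T @ Xc / len(Xc)
    _, vecs = np.linalg.eigh(S)    # eigenvalues in ascending order
    return vecs[:, -1]             # eigenvector of the largest eigenvalue


raw = first_loadings(X, scale=False)
scaled = first_loadings(X, scale=True)

print("loadings without scaling:", raw)
print("loadings with scaling:   ", scaled)
```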
Choosing the number of components
How many PCs?
• Proportion of variance explained helps to choose
• Scree plots → look for “elbow”: a drop in variance explained
• Subjective choice
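The quantities shown in a scree plot can be computed as follows. This is a sketch on simulated data, with all names and values hypothetical, and it assumes the variables are scaled to variance 1. The proportion of variance explained by each PC is its eigenvalue divided by the sum of all eigenvalues.

```python
import numpy as np

rng = np.random.default_rng(4)

# Hypothetical data: 6 observed variables driven by 2 underlying dimensions.
latent = rng.normal(size=(400, 2))
W = rng.normal(size=(2, 6))
X = latent @ W + 0.2 * rng.normal(size=(400, 6))

# Centre and scale, so every variable has mean 0 and variance 1.
Xs = (X - X.mean(axis=0)) / X.std(axis=0)
R = Xs.T @ Xs / len(Xs)                    # correlation matrix

# Eigenvalues are the variances of the PCs; reverse to get largest first.
eigvals = np.linalg.eigvalsh(R)[::-1]

pve = eigvals / eigvals.sum()              # proportion of variance explained
cum_pve = np.cumsum(pve)                   # cumulative proportion

for k, (p, c) in enumerate(zip(pve, cum_pve), start=1):
    print(f"PC{k}: proportion = {p:.3f}, cumulative = {c:.3f}")
```

Plotting `pve` against the component number gives the scree plot; where to place the elbow remains a judgement call.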
Please continue with the next topic.
