Lecture 12 - Unsupervised Learning: PCA

The document provides an overview of unsupervised learning, focusing on Principal Components Analysis (PCA) as a method for dimensionality reduction and data visualization. It discusses the challenges of unsupervised learning compared to supervised learning, and outlines the applications of PCA in various fields. Additionally, it explains the process of finding principal components, including the use of singular value decomposition and the importance of scaling variables.


2020-12-21

Unsupervised Learning
PCA
Mohamed Elshenawy
Zewail University of Science and Technology

Overview

• Unsupervised Learning
• Principal Components Analysis


Unsupervised Learning
Section 10.1 of the book: An Introduction to Statistical Learning. James, Gareth, Daniela Witten, Trevor Hastie, and Robert Tibshirani, 2013, ISBN: 978-1-4614-7137-0

Supervised Learning

• So far, we discussed supervised learning methods.


• In supervised learning, we have access to a set of $p$ features, $X_1, X_2, \ldots, X_p$, measured on $n$ observations, and a target (response) $T$ measured on those same $n$ observations.
• The goal is to predict the target variable $T$ using the input features $X_1, X_2, \ldots, X_p$.


Unsupervised Learning
• We have only a set of features $X_1, X_2, \ldots, X_p$ measured on $n$ observations.
• We do not have an associated target (response) variable $T$. Therefore, we are not interested in prediction.
• The goal is to discover interesting things about $X_1, X_2, \ldots, X_p$:
• How can we merge the given features ($X_1, X_2, \ldots, X_p$) to produce a smaller set of attributes that encode most of the information contained in the given features (produce $Z_1, Z_2, \ldots, Z_q$ where $q < p$)? Dimensionality Reduction.
• Is there an informative way to visualize the data? Visualization.
• Can we define subgroups, using the given observations, that have similar characteristics (similar values of $X_1, X_2, \ldots, X_p$, for instance)? Clustering.
• Can we learn the probability distribution that generates the data? Density Estimation.

The Challenge of Unsupervised Learning


• In supervised learning, the task is clear. We have:
• 1) a clear goal: predict the target variable using the input features;
• 2) a clear understanding of how to assess the performance of the model (using training and test error, cross-validation, etc.).
• In contrast, unsupervised learning is often much more challenging and the task is less well defined. There is no universally accepted mechanism for assessing the results of unsupervised learning (we do not know the true answer).
• It is typically performed as part of an exploratory data analysis.


Example Applications
• A cancer researcher might assay gene expression levels in 100 patients with breast
cancer.
• A possible approach is to look for subgroups among the genes in order to obtain a
better understanding of the disease.
• Recommendation systems (recommend items based on the purchase histories of similar
shoppers):
• to identify groups of shoppers with similar browsing and purchase histories
• to identify items that are of particular interest to the shoppers within each group.
• Search engines:
• choose what search results to display to a particular individual based on the click
histories of other individuals with similar search patterns.

Principal Components Analysis


Section 10.2 of the book: An Introduction to Statistical Learning. James, Gareth, Daniela Witten, Trevor Hastie, and Robert Tibshirani, 2013, ISBN: 978-1-4614-7137-0


Which dimension has more information: $X_1$ or $X_2$?

[Figure: scatter plot of the observations in the $(X_1, X_2)$ plane.]

Principal Components Analysis

• Principal component analysis (PCA) refers to the process by which principal components
are computed.
• Principal components allow us to summarize a dataset with a large set of correlated
variables using a smaller number of representative variables that collectively explain
most of the variability in the original set.
• Each of the dimensions found by PCA is a linear combination of the input features.
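As a quick illustration of these ideas (my own sketch, not code from the lecture; the synthetic data and names below are invented), scikit-learn's PCA computes the loading vectors and the new low-dimensional representation directly:

```python
# Minimal PCA sketch on synthetic data (assumes NumPy and scikit-learn are installed).
import numpy as np
from sklearn.decomposition import PCA

rng = np.random.default_rng(0)

# Synthetic data: n = 100 observations of p = 3 correlated features.
X = rng.normal(size=(100, 3))
X[:, 2] = 0.8 * X[:, 0] + 0.2 * X[:, 2]   # make the third feature depend on the first

pca = PCA(n_components=2)                 # keep q = 2 < p = 3 representative variables
Z = pca.fit_transform(X)                  # Z holds the new variables Z1, Z2 for each observation

print(pca.components_)                    # each row is a loading vector (a linear combination of the inputs)
print(pca.explained_variance_ratio_)      # fraction of the total variability explained by each component
```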


Example – 2-D

You need:
• The direction of $X'$ (the direction along which the observations are highly variable).
• A representation of the data along the new dimension.

[Figure: 2-D scatter of the observations in the $(X_1, X_2)$ plane, with the new axis $Z_1$ drawn along the direction of greatest variability.]

Visualize the data using 1-D

[Figure: the observations plotted along the single new dimension $Z_1$.]


2-D Example – 2

[Figure omitted.]

PCA - Applications

• Useful for preprocessing (reducing the dimensionality of the dataset).
• Can be used for data visualization: if we can obtain a two-dimensional representation of the data that captures most of the information, then we can plot the observations in this low-dimensional space.


Assumptions

1. Linear relationship between the data and the learned representation
2. Data is assumed to be continuous
3. Variation contains information


Example – 3-D

How to find this plane?

[Figure: 3-D cloud of observations with a 2-D plane fitted through it.]


How to find the first principal component

• The first principal component ($Z_1$) of a set of features $X_1, X_2, \ldots, X_p$ is the normalized linear combination of the features that has the largest variance:
$$Z_1 = \phi_{11} X_1 + \phi_{21} X_2 + \cdots + \phi_{p1} X_p$$
• By normalized, we mean $\sum_{j=1}^{p} \phi_{j1}^2 = 1$.
• We constrain the loadings so that their sum of squares is equal to one.
• $\phi_{11}, \phi_{21}, \ldots, \phi_{p1}$ are referred to as the loadings of $Z_1$ (the first component).
• The loadings make up the principal component loading vector $\phi_1 = (\phi_{11}, \phi_{21}, \ldots, \phi_{p1})^T$.
• To find the loading vector, we choose the values that produce the largest variance (an optimization problem).


How to find the first principal component (cont.)

• The first principal component loading vector solves the optimization problem
$$\max_{\phi_{11}, \phi_{21}, \ldots, \phi_{p1}} \; \frac{1}{n} \sum_{i=1}^{n} (z_{i1} - \bar{z}_1)^2 \quad \text{subject to} \quad \sum_{j=1}^{p} \phi_{j1}^2 = 1$$

• We refer to $z_{11}, \ldots, z_{n1}$ as the scores of the first principal component.


How to find the first principal component (cont.)

$$\bar{z}_1 = \frac{1}{n} \sum_{i=1}^{n} z_{i1} = \frac{1}{n} \sum_{i=1}^{n} \sum_{j=1}^{p} \phi_{j1} x_{ij}$$
• If the data are centered so that $\frac{1}{n} \sum_{i=1}^{n} x_{ij} = 0$ for each feature $j$, then $\bar{z}_1 = 0$ and the problem becomes
$$\max_{\phi_{11}, \phi_{21}, \ldots, \phi_{p1}} \; \frac{1}{n} \sum_{i=1}^{n} z_{i1}^2 \quad \text{subject to} \quad \sum_{j=1}^{p} \phi_{j1}^2 = 1$$
$$\max_{\phi_{11}, \phi_{21}, \ldots, \phi_{p1}} \; \frac{1}{n} \sum_{i=1}^{n} \left( \sum_{j=1}^{p} \phi_{j1} x_{ij} \right)^2 \quad \text{subject to} \quad \sum_{j=1}^{p} \phi_{j1}^2 = 1$$

• We constrain the loadings so that their sum of squares is equal to one, since otherwise setting these elements to be arbitrarily large in absolute value could result in an arbitrarily large variance.


How to find the first principal component (cont.)

$$\max_{\phi_{11}, \phi_{21}, \ldots, \phi_{p1}} \; \frac{1}{n} \sum_{i=1}^{n} \left( \sum_{j=1}^{p} \phi_{j1} x_{ij} \right)^2 \quad \text{subject to} \quad \sum_{j=1}^{p} \phi_{j1}^2 = 1$$

• The problem can be solved using an eigen decomposition, a standard technique in linear algebra.
• The loading vector $\phi_1$ defines a direction in feature space along which the data vary the most. If we project the $n$ data points $x_1, \ldots, x_n$ onto this direction, the projected values are the principal component scores $z_{11}, \ldots, z_{n1}$ themselves.
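As a concrete sketch of this eigen-decomposition route (my own illustration on invented data, not code from the lecture), the first loading vector can be taken as the eigenvector of the sample covariance matrix with the largest eigenvalue:

```python
# Sketch: first principal component via eigen decomposition of the covariance matrix.
import numpy as np

rng = np.random.default_rng(0)
A = np.array([[2.0, 0.5, 0.0],
              [0.5, 1.0, 0.0],
              [0.0, 0.0, 0.3]])
X = rng.normal(size=(100, 3)) @ A         # synthetic correlated data, n = 100, p = 3

Xc = X - X.mean(axis=0)                   # center each feature so that its mean is zero
C = Xc.T @ Xc / Xc.shape[0]               # sample covariance matrix (1/n convention, as in the slides)

eigvals, eigvecs = np.linalg.eigh(C)      # eigen decomposition of the symmetric matrix C
phi1 = eigvecs[:, np.argmax(eigvals)]     # loading vector = eigenvector with the largest eigenvalue

z1 = Xc @ phi1                            # scores of the first principal component
print(phi1)
print(z1.var(), eigvals.max())            # the score variance (1/n) equals the largest eigenvalue
```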


Finding the second principal component

• The second principal component is the linear combination of $X_1, X_2, \ldots, X_p$ that has maximal variance out of all linear combinations that are uncorrelated with $Z_1$.
• The second principal component scores $z_{12}, z_{22}, \ldots, z_{n2}$ take the form
$$Z_2 = \phi_{12} X_1 + \phi_{22} X_2 + \cdots + \phi_{p2} X_p$$
• where $\phi_2 = (\phi_{12}, \phi_{22}, \ldots, \phi_{p2})^T$ is the second principal component loading vector.
• It turns out that constraining $Z_2$ to be uncorrelated with $Z_1$ is equivalent to constraining the direction $\phi_2$ to be orthogonal to the direction $\phi_1$.


Finding additional principal components

• We can define additional principal components in an incremental fashion by choosing a new direction that:
• is orthogonal to the principal components already considered;
• maximizes the projected variance amongst all possible directions.


Singular Value Decomposition

• You can perform PCA by using the singular value decomposition of the (column-centered) data matrix:
$$X = U S V^T$$
• $U$: an $n \times n$ orthogonal matrix
• $S$: an $n \times p$ diagonal matrix (the singular values)
• $V$: a $p \times p$ orthogonal matrix
• The principal component directions (loading vectors) are the columns of $V$.
• The principal component scores are the columns of $US$ (equivalently, the projections $XV$).
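A minimal NumPy sketch of the SVD route (my own illustration; it assumes the data matrix has been column-centered, and the toy data is invented):

```python
# Sketch: PCA via the singular value decomposition of the centered data matrix.
import numpy as np

rng = np.random.default_rng(1)
X = rng.normal(size=(50, 4))              # toy data: n = 50 observations, p = 4 features
Xc = X - X.mean(axis=0)                   # center each column

U, s, Vt = np.linalg.svd(Xc, full_matrices=False)

loadings = Vt.T                           # columns of V: the principal component directions
scores = U * s                            # columns of US: the principal component scores (equals Xc @ Vt.T)

explained_var = s**2 / Xc.shape[0]        # variance captured by each component (1/n convention)
print(loadings[:, 0])
print(explained_var)
```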


USArrests dataset

• For each of the 50 states in the United States (𝑛 = 50), the data set contains the
number of arrests per 100,000 residents for each of three crimes: Assault,
Murder, and Rape. In addition, the dataset has the UrbanPop attribute, which
indicates the percent of the population in each state living in urban areas.
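To reproduce this example in Python, one option (a sketch with assumptions: statsmodels' `get_rdataset` downloads `USArrests` from the public Rdatasets repository, so an internet connection is needed) is:

```python
# Sketch: load the USArrests data and run PCA on the standardized variables.
import statsmodels.api as sm
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

usarrests = sm.datasets.get_rdataset("USArrests").data            # 50 states x 4 variables
usarrests = usarrests[["Murder", "Assault", "UrbanPop", "Rape"]]   # keep the four numeric variables

print(usarrests.var())                     # very different variances (see the scaling slides below)

X = StandardScaler().fit_transform(usarrests)   # scale each variable to mean 0, standard deviation 1
pca = PCA()
scores = pca.fit_transform(X)              # principal component scores for the 50 states

print(pca.components_)                     # the loading vectors, one per row
```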


Principal Components

[Figure/table omitted: the principal components computed for the USArrests data.]

Biplot
• Overlays a score plot (projecting
the observations onto the span of
the first two PCs, shown in blue)
and a loadings plot (shown in
orange) in a single graph.
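A rough matplotlib sketch of such a biplot (my own illustration, continuing from the `usarrests`, `pca`, and `scores` objects in the sketch above; the arrow scaling factor is arbitrary):

```python
# Sketch: biplot overlaying the score plot (blue) and the loadings plot (orange).
import matplotlib.pyplot as plt

fig, ax = plt.subplots(figsize=(7, 7))

# Score plot: each state projected onto the first two principal components.
ax.scatter(scores[:, 0], scores[:, 1], color="blue", s=10)
for name, (z1, z2) in zip(usarrests.index, scores[:, :2]):
    ax.annotate(str(name), (z1, z2), fontsize=7, color="blue")

# Loadings plot: one arrow per original variable, scaled up for visibility.
scale = 3.0
for j, var in enumerate(usarrests.columns):
    ax.arrow(0, 0, scale * pca.components_[0, j], scale * pca.components_[1, j],
             color="orange", head_width=0.05)
    ax.annotate(var, (scale * pca.components_[0, j], scale * pca.components_[1, j]), color="orange")

ax.set_xlabel("First principal component")
ax.set_ylabel("Second principal component")
plt.show()
```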


Biplot (cont.)
• We can see the first loading vector
places approximately equal weight
on Assault, Murder, and Rape, with
much less weight on UrbanPop.
• The second loading vector places
most of its weight on UrbanPop and
much less weight on the other three
features.
• This indicates that the crime-related
variables are correlated with each
other.


Biplot (cont.)
• States with large positive scores on the
first component, such as California,
Nevada and Florida, have high crime rates
• States like North Dakota, with negative
scores on the first component, have low
crime rates.
• California also has a high score on the second component, indicating a high level of urbanization.
• States close to zero on both components,
such as Indiana, have approximately
average levels of both crime and
urbanization.


Scaling the variables

• The results obtained when we perform PCA depend on whether the variables
have been individually scaled (each multiplied by a different constant).
• This is in contrast to some other supervised and unsupervised learning
techniques, such as linear regression.


Scaling the variables


• Murder, Rape, and Assault are reported as the number of occurrences per 100,000 people, and UrbanPop is the percentage of the state’s population that lives in an urban area (different units).
• These four variables have variance 18.97, 87.73, 6945.16, and 209.5, respectively.
• If we perform PCA on the unscaled variables, then the first principal component
loading vector will have a very large loading for Assault, since that variable has by
far the highest variance.


Scaling the variables (Cont.)

[Figure omitted: biplot with the variables scaled to have unit standard deviations.]

Scaling the variables (Cont.)

• Because it is undesirable for the principal components obtained to depend on an arbitrary choice of scaling, we typically scale each variable to have standard deviation one before we perform PCA.
• In certain settings, the variables may be measured in the same units. In this case, we might not wish to scale the variables to have standard deviation one before performing PCA.
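To make the effect of scaling concrete, here is a short sketch (continuing from the `usarrests` DataFrame loaded earlier) that compares the first loading vector with and without standardizing the variables:

```python
# Sketch: first loading vector on the raw vs. standardized USArrests variables.
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

pca_raw = PCA().fit(usarrests)                                    # unscaled variables
pca_std = PCA().fit(StandardScaler().fit_transform(usarrests))    # each variable scaled to sd 1

# Unscaled: the first loading vector is dominated by Assault, the highest-variance variable.
print(dict(zip(usarrests.columns, pca_raw.components_[0].round(3))))

# Scaled: the weight is spread far more evenly across the four variables.
print(dict(zip(usarrests.columns, pca_std.components_[0].round(3))))
```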


Scree Plot

Helps us to decide on the number of principal components required to visualize the data. By examining the plot, we choose the smallest number of principal components that are required in order to explain a sizable amount of the variation in the data.

[Figure: scree plot showing the proportion of variance explained by each principal component.]
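A minimal sketch of such a plot (continuing from the fitted `pca` object above, and assuming matplotlib is available):

```python
# Sketch: scree plot of the proportion of variance explained (PVE) by each component.
import numpy as np
import matplotlib.pyplot as plt

pve = pca.explained_variance_ratio_            # proportion of variance explained per component
components = np.arange(1, len(pve) + 1)

plt.plot(components, pve, marker="o", label="PVE")
plt.plot(components, np.cumsum(pve), marker="s", label="Cumulative PVE")
plt.xlabel("Principal component")
plt.ylabel("Proportion of variance explained")
plt.ylim(0, 1)
plt.legend()
plt.show()
```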


Proportion of variance explained (PVE)

$$\text{PVE of the } m\text{th PC} = \frac{\sum_{i=1}^{n} \left( \sum_{j=1}^{p} \phi_{jm} x_{ij} \right)^2}{\sum_{j=1}^{p} \sum_{i=1}^{n} x_{ij}^2}$$
(assuming the variables have been centered to have mean zero)
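A small sketch that evaluates this formula directly and checks it against scikit-learn (continuing from the standardized matrix `X` and the fitted `pca` object in the USArrests sketch above):

```python
# Sketch: proportion of variance explained, computed directly from the formula.
import numpy as np

Xc = X - X.mean(axis=0)                    # X was standardized earlier; re-center to be safe

total_ss = np.sum(Xc**2)                   # denominator: total sum of squares over all features
pc_scores = Xc @ pca.components_.T         # z_im = sum_j phi_jm * x_ij, for every component m
pve = np.sum(pc_scores**2, axis=0) / total_ss

print(pve)                                 # matches pca.explained_variance_ratio_
```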
