Module12.01 UnsupervisedLearning
Module12.01 UnsupervisedLearning
Learning
Reference Books
35
30
25
Ad Spending
20
15
10
5
0
10 20 30 40 50 60 70
Population
The population size (pop) and ad spending (ad) for 100 different cities are shown as
purple circles. The green solid line indicates the first principal component direction,
and the blue dashed line indicates the second principal component direction.
Computation of Principal Components
∅ …∅
+ + ..… +
UrbanPop
3
2
0.5
Hawaii California
RhodM
e aIslU
saatnacdh useNttesw Jersey
Connecticut
Second Principal Component
Washington Colorado
1
0.0
0
Arkansas Alaska
Alabama
Georgia
VermontWest Virginia Murder
−0.5
South Carolina
−2
North Carolina
Mississippi
−3
−3 −2 −1 0 1 2 3
The PC is given by
+ +… .. + where
• •
•
1.0
• • ••
•
• • • •• • •• •••
• •
•
• •
0.5
Second principal component
•• •
•
• • • •
• • •• • •
0.0
• •
• • • ••• • • •
• • • •• •
•• • • •
• • • •
•• •
−0.5
• •
• •••
• • ••
• • • •
• •
−1.0
1.0
UrbanPop UrbanPop
3
150
2
0.5
100
** **
Second Principal Component
0.5
* **
1
*
* *
50
* * * * * Other
* Other
* * ** * * *
** * ** *
0.0
* *
0
* **
* * * * * * *** ** * * ** * **
* *
0.0
* 0
* *M*urd*er * *
* * A*ssault *
* * ** * * * Assa
* * * * ** *
* *
−1
* * *
* *
M*urder
−50
* *
−0.5
−0.5
*
−2
−100
**
−3