Foundations of Machine Learning: Sudeshna Sarkar IIT Kharagpur
Foundations of Machine Learning: Sudeshna Sarkar IIT Kharagpur
Sudeshna Sarkar
IIT Kharagpur
Feature extraction - definition
•• Given a set of features
the Feature Extraction(“Construction”) problem is
is to map to some feature set that maximizes the
learner’s ability to classify patterns
Feature Extraction
• Find
a projection matrix w from N-dimensional to M-
dimensional vectors that keeps error low
• Assume that N features are linear combination of vectors
x , x ,, x
1 2 p
N
Original
p=16 p=32 p=64 p=100 Image
Is PCA a good criterion for classification?
• Data variation
determines the
projection direction
• What’s missing?
– Class information
What is a good projection?
Two classes
• Similarly, what is a
overlap
good criterion?
– Separating different
classes
• Between-class distance
– Distance between the centroids
of different classes
Between-class distance
What class information may be useful?
• Between-class distance
– Distance between the centroids of
different classes
• Within-class distance
• Accumulated distance of an instance
to the centroid of its class
m1 m2
2
J w
s s
2
1
2
2
Thank You