Feature Construction
Dimensionality reduction – motivation (2)
– May improve performance of the classification algorithm by removing irrelevant
features
– Defying the curse of dimensionality: simpler models result in improved
generalization
– Classification algorithm may not scale up to the size of the full feature set
either in space or time
– Allows us to better understand the domain
– Cheaper to collect and store data based on reduced feature set.
Two approaches for dimensionality reduction
– Feature construction
– Feature selection
Feature construction (a)
• Linear methods
– Principal component analysis (PCA)
– Independent component analysis (ICA)
– ….
• Non-linear methods
– Non-linear component analysis (NLCA)
– Kernel PCA
– Local linear embedding (LLE)
– ….
Principal component analysis
(PCA) (1)
• PCA is mostly used as a tool in exploratory data analysis and for
making predictive models.
• PCA involves the calculation of the eigenvalue decomposition of
a data covariance matrix, usually after mean centering the data
for each attribute.
• PCA is mathematically defined as an orthogonal linear
transformation that transforms the data to a new coordinate
system such that the greatest variance by any projection of the
data comes to lie on the first coordinate (called the first principal
component), the second greatest variance on the second
coordinate, and so on.
• PCA is theoretically the optimal linear scheme, in terms of mean
squared reconstruction error, for compressing a set of high-dimensional
vectors into a set of lower-dimensional vectors and then
reconstructing the original set.
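The procedure described above (mean-centering, covariance eigendecomposition, variance-ordered projection, reconstruction) can be sketched as follows. This is a minimal illustration, not a production implementation; the data, dimensions, and the choice of k = 2 retained components are assumptions for the example.

```python
import numpy as np

# Illustrative data: 5-D points with (nearly) 2-D latent structure.
rng = np.random.default_rng(0)
Z = rng.normal(size=(200, 2))                  # latent 2-D coordinates
A = rng.normal(size=(2, 5))
X = Z @ A + 0.01 * rng.normal(size=(200, 5))   # 5-D observations + small noise

# 1. Mean-center each attribute.
mu = X.mean(axis=0)
Xc = X - mu

# 2. Eigenvalue decomposition of the data covariance matrix
#    (eigh, since the covariance matrix is symmetric).
C = np.cov(Xc, rowvar=False)
eigvals, eigvecs = np.linalg.eigh(C)

# 3. Sort components by decreasing variance: the first principal
#    component carries the greatest variance, and so on.
order = np.argsort(eigvals)[::-1]
eigvals, eigvecs = eigvals[order], eigvecs[:, order]

# 4. Compress to k dimensions, then reconstruct in the original space.
k = 2
W = eigvecs[:, :k]          # projection matrix (top-k components)
Y = Xc @ W                  # lower-dimensional representation
X_rec = Y @ W.T + mu        # linear reconstruction

mse = np.mean((X - X_rec) ** 2)
```

Since the data is nearly 2-dimensional, the reconstruction error `mse` is small: per the optimality property above, it equals the average of the variances along the discarded components.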
Principal component analysis
(PCA) (2)
• The applicability of PCA is limited by the assumptions
made in its derivation:
• 1. Linearity: the observed data are assumed to be
linear combinations of certain basis vectors.
• 2. PCA finds uncorrelated (orthogonal) axes of the
data; these are statistically independent only under
a Gaussian assumption.
• 3. High signal-to-noise ratio: principal components
with larger variance are assumed to correspond to
interesting dynamics, and those with lower variance
to noise.
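Assumption 2 can be illustrated directly: projecting mean-centered data onto the principal axes yields coordinates whose sample covariance is diagonal, i.e. the new coordinates are uncorrelated. The Gaussian data below is an assumption chosen for the example; only in the Gaussian case does uncorrelated also imply statistically independent.

```python
import numpy as np

# Correlated 2-D Gaussian data (illustrative covariance matrix).
rng = np.random.default_rng(1)
X = rng.multivariate_normal([0.0, 0.0],
                            [[2.0, 1.2],
                             [1.2, 1.0]], size=1000)

# Mean-center and project onto the principal axes.
Xc = X - X.mean(axis=0)
eigvals, eigvecs = np.linalg.eigh(np.cov(Xc, rowvar=False))
Y = Xc @ eigvecs

# Covariance after projection: diagonal, so the new axes are uncorrelated.
C_Y = np.cov(Y, rowvar=False)
off_diag = C_Y[0, 1]
```

The off-diagonal entry of `C_Y` vanishes (up to floating-point error) by construction, since the eigenvectors diagonalize the sample covariance exactly.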
The End