Feature Selection
1. Overview
2. Perspectives
3. Aspects
4. Most Representative Methods
5. Related and Advanced Topics
6. Experimental Comparative Analyses
Overview
• Why we need FS:
1. to improve performance (in terms of speed, predictive power,
simplicity of the model).
2. to visualize the data for model selection.
3. to reduce dimensionality and remove noise.
Perspectives:
Search of a Subset of Features
• Search Directions:
• Sequential Backward Generation (SBG): It starts with the full set of features and, iteratively,
removes them one at a time. Here, the criterion must point out the worst or least
important feature. By the end, the subset is composed of a single feature, which is
considered to be the most informative of the whole set. As in the forward case (SFG),
different stopping criteria can be used (a code sketch of SFG and SBG follows this list).
• Bidirectional Generation (BG): It begins the search in both directions, performing SFG and SBG
concurrently. The searches stop in two cases: (1) when one search finds the best subset comprised
of m features before it reaches the exact middle, or (2) when both searches reach the middle of the
search space. It takes advantage of both SFG and SBG.
• Random Generation (RG): It starts the search in a random direction. The choice of adding or
removing a feature is a random decision. RG tries to avoid stagnation in local optima
by not following a fixed path for subset generation. Unlike SFG or SBG, the size of the subset
of features cannot be stipulated.
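A minimal sketch of the greedy forward and backward generation schemes described above. The evaluation function score(subset) is a hypothetical placeholder for any of the selection criteria discussed later; it returns the quality of a candidate feature subset.

def sequential_forward_generation(features, score, m):
    # Greedy SFG: start empty and add the best feature at each step until m are selected.
    selected, remaining = [], list(features)
    while remaining and len(selected) < m:
        best = max(remaining, key=lambda f: score(selected + [f]))
        selected.append(best)
        remaining.remove(best)
    return selected

def sequential_backward_generation(features, score, m):
    # Greedy SBG: start with the full set and drop, at each step, the feature whose
    # removal leaves the highest-scoring subset (i.e., the least important one).
    selected = list(features)
    while len(selected) > m:
        least_important = max(selected, key=lambda f: score([g for g in selected if g != f]))
        selected.remove(least_important)
    return selected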
Perspectives:
Search of a Subset of Features
• Search Strategies:
• Exhaustive Search: It corresponds to exploring all possible subsets to find the optimal ones;
the space complexity is O(2^M). If we establish a threshold m of minimum features to be
selected and fix the direction of search, the search space becomes smaller, independently of
forward or backward generation. Only exhaustive search can guarantee optimality. Nevertheless,
it is impractical in real data sets with a high M (a sketch follows these bullets).
• Heuristic Search: It employs heuristics to carry out the search. Thus, it avoids brute-force
search, but it may find a non-optimal subset of features. It draws a path connecting the
beginning and the end of the search space (see the previous figure), in the manner of a
depth-first search. The maximum length of this path is M and the number of subsets generated
is O(M). The choice of the heuristic is crucial to finding a near-optimal subset of features quickly.
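For illustration, a brute-force exhaustive search over all non-empty feature subsets, again assuming the hypothetical score(subset) evaluation function; the number of candidates grows as O(2^M), which is why this is impractical for large M.

from itertools import combinations

def exhaustive_search(features, score):
    # Evaluate every non-empty subset and keep the best one found.
    best_subset, best_score = None, float("-inf")
    for k in range(1, len(features) + 1):
        for subset in combinations(features, k):
            s = score(list(subset))
            if s > best_score:
                best_subset, best_score = list(subset), s
    return best_subset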
Brute force
A brute force algorithm is a simple and straightforward approach that solves a problem by trying every
possible solution until it finds the best one. It does not use any clever tricks or shortcuts to reduce
the search space or improve efficiency.
Example
Brute force algorithms can be used to solve certain types of problems, such as searching for an
element in a list or array, sorting a list or array, calculating the factorial of a number, or calculating
the nth term of the Fibonacci sequence.
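As a trivial illustration (not from the original slides), a brute-force linear search simply checks every position until it finds the target.

def linear_search(items, target):
    # Brute force: inspect every position in order until the target is found.
    for i, value in enumerate(items):
        if value == target:
            return i
    return -1  # target not present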
Advantages of brute force algorithms
One of the main advantages of brute force algorithms is that they are easy to understand and
implement. You do not need deep knowledge of the problem domain or complex data structures or
techniques; you simply follow a logical and systematic process to check every possible solution.
Disadvantages of brute force algorithms
One of the main disadvantages of brute force algorithms is that they are very inefficient and time-
consuming. They can consume a lot of computational resources, such as memory, CPU, or network
bandwidth, depending on the size and complexity of the problem.
Perspectives:
Search of a Subset of Features
• Search Strategies:
• Nondeterministic Search: A complementary combination of the previous two.
It is also known as the random search strategy; it constantly generates new subsets
and keeps improving the quality of the selected features as time goes
by. In each step, the next subset is obtained at random (sketched after these bullets).
• It is unnecessary to wait until the search ends.
• We do not know when the optimal set is obtained, although we know which subset is better
than the previous one and which one is the best so far.
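A minimal sketch of such a nondeterministic search that keeps the best subset seen so far (anytime behaviour), again assuming the hypothetical score(subset) function.

import random

def random_search(features, score, n_iterations=1000, seed=0):
    # Sample random subsets and remember the best one found so far.
    rng = random.Random(seed)
    best_subset, best_score = None, float("-inf")
    for _ in range(n_iterations):
        k = rng.randint(1, len(features))        # random subset size
        subset = rng.sample(list(features), k)   # random subset of that size
        s = score(subset)
        if s > best_score:                       # the current best is always available
            best_subset, best_score = subset, s
    return best_subset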
Perspectives:
Selection Criteria
• Information Measures.
• Information serves to measure the uncertainty of the receiver when she/he receives a
message.
• Shannon’s Entropy: H(X) = - Σ_x p(x) log2 p(x).
• Information gain: IG(X | A) = H(X) - Σ_v p(A = v) H(X | A = v), i.e., the reduction in the
entropy of the class after observing feature A.
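A small sketch (assuming discrete feature values and class labels) of computing Shannon’s entropy and the information gain of a single feature, which a filter method could use to rank features.

from collections import Counter
from math import log2

def entropy(labels):
    # H(X) = -sum p(x) log2 p(x) over the empirical label distribution.
    n = len(labels)
    return -sum((c / n) * log2(c / n) for c in Counter(labels).values())

def information_gain(feature_values, labels):
    # IG = H(labels) - sum_v p(v) * H(labels | feature = v).
    n = len(labels)
    conditional = 0.0
    for v in set(feature_values):
        subset = [y for x, y in zip(feature_values, labels) if x == v]
        conditional += (len(subset) / n) * entropy(subset)
    return entropy(labels) - conditional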
Perspectives:
Selection Criteria
• Distance Measures.
• Also known as measures of separability, discrimination or divergence. The most typical is
derived from the distance between the class-conditional density functions.
Perspectives:
Selection Criteria
• Dependence Measures.
• Also known as measures of association or correlation.
• Their main goal is to quantify how strongly two variables are correlated or present some
association with each other, in such a way that, knowing the value of one of them, we can
derive the value of the other.
• Pearson correlation coefficient:
r = Σ_i (x_i - mean(x))(y_i - mean(y)) / sqrt( Σ_i (x_i - mean(x))² · Σ_i (y_i - mean(y))² ).
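A short sketch of a dependence-based filter that ranks features by the absolute Pearson correlation with the target; the variable names are illustrative.

from math import sqrt

def pearson(x, y):
    # Pearson correlation coefficient between two equally long numeric sequences.
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)   # assumes neither variable is constant

def rank_by_correlation(columns, target):
    # columns: dict mapping feature name -> list of values; most correlated features first.
    return sorted(columns, key=lambda name: abs(pearson(columns[name], target)), reverse=True)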
Perspectives:
Selection Criteria
• Consistency Measures.
• They attempt to find a minimum number of features that separate classes as consistently as
the full set of features can.
• An inconsistency is defined as the case of two examples with the same inputs
(same feature values) but with different output feature values (classes in
classification).
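A sketch of counting inconsistencies for a candidate subset of discrete features: two examples are inconsistent if they agree on the selected features but carry different class labels.

from collections import defaultdict, Counter

def inconsistency_count(rows, labels, subset):
    # rows: list of dicts mapping feature name -> value.
    # For each pattern over the selected features, count the examples outside the majority class.
    groups = defaultdict(list)
    for row, label in zip(rows, labels):
        key = tuple(row[f] for f in subset)   # projection onto the selected features
        groups[key].append(label)
    return sum(len(g) - max(Counter(g).values()) for g in groups.values())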
Perspectives:
Selection Criteria
• Accuracy Measures.
• This form of evaluation relies on the classifier or learner. Among the various possible subsets
of features, the subset which yields the best predictive accuracy is chosen.
Perspectives
• Filters:
• measuring uncertainty, distances, dependence or consistency is usually
cheaper than measuring the accuracy of a learning process. Thus, filter
methods are usually faster.
• it does not rely on a particular learning bias, in such a way that the selected
features can be used to learn different models from different DM techniques.
• it can handle larger sized data, due to the simplicity and low time complexity
of the evaluation measures.
Perspectives
• Wrappers:
• can achieve the purpose of improving the particular learner’s predictive
performance.
• they allow the usage of internal statistical validation to control overfitting, ensembles of
learners and hybridizations with heuristic learning like Bayesian classifiers or
Decision Tree induction.
• filter models cannot allow a learning algorithm to fully exploit its bias,
whereas wrapper methods do.
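A minimal wrapper-style evaluation sketch, assuming scikit-learn is available: each candidate subset is scored by the cross-validated accuracy of the very learner that will be used afterwards. It can be plugged directly into the SFG/SBG or random-search sketches above as their score function.

from sklearn.model_selection import cross_val_score
from sklearn.neighbors import KNeighborsClassifier

def wrapper_score(X, y, subset, cv=5):
    # Score a feature subset (a list of column indices of X) by cross-validated accuracy.
    model = KNeighborsClassifier(n_neighbors=3)
    return cross_val_score(model, X[:, subset], y, cv=cv, scoring="accuracy").mean()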
Perspectives
• Embedded FS:
• similar to the wrapper approach in the sense that the features are specifically
selected for a certain learning algorithm, but in this approach, the features
are selected during the learning process.
• they can take advantage of the available data by not requiring the training data to be split
into separate training and validation sets, and they can reach a solution faster by avoiding
the re-training of a predictor for each feature subset explored.
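A common embedded example (a sketch, assuming scikit-learn) is L1-regularized logistic regression: the training process itself drives the weights of irrelevant features to zero, so selection happens during learning.

import numpy as np
from sklearn.linear_model import LogisticRegression

def embedded_selection(X, y, C=0.1):
    # Fit an L1-penalized model; the features with non-zero coefficients are the selected ones.
    model = LogisticRegression(penalty="l1", solver="liblinear", C=C)
    model.fit(X, y)
    return np.flatnonzero(np.abs(model.coef_).max(axis=0) > 0)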
Aspects:
Output of Feature Selection
• Feature Ranking Techniques:
• we expect as the output a ranked list of features which are ordered according
to evaluation measures.
• they return the relevance of the features.
• For performing actual FS, the simplest way is to choose the first m features for
the task at hand, whenever we know the most appropriate m value.
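A tiny sketch of turning a feature ranking into an actual selection: order the features by their relevance scores (for instance, the correlation or information-gain scores from the earlier sketches) and keep the first m.

def rank_features(scores):
    # scores: dict mapping feature name -> relevance; most relevant features first.
    return sorted(scores, key=scores.get, reverse=True)

def select_top_m(scores, m):
    # Feature ranking output -> actual selection: keep the m most relevant features.
    return rank_features(scores)[:m]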
Aspects:
Output of Feature Selection
• Minimum Subset Techniques:
• The number of relevant features is a parameter that is often not known by
the practitioner.
• There must be a second category of techniques focused on obtaining the
minimum possible subset without ordering the features.
• Whatever lies inside the returned subset is relevant; everything outside it is considered
irrelevant.
Aspects:
Evaluation
• Goals:
• Inferability: For predictive tasks, considered as an
improvement of the prediction of unseen examples with
respect to the direct usage of the raw training data.
• Interpretability: Given that raw data are hard for humans to
comprehend, DM is also used for generating a more
understandable representation of the structure that can explain
the behavior of the data.
• Data Reduction: It is better and simpler to handle data
with lower dimensions in terms of efficiency and
interpretability.
Aspects:
Evaluation
• We can derive three assessment measures from these
three goals:
• Accuracy
• Complexity