Deep Learning Vocabulary
Generalization refers to the ability of a model to work well on unseen data, an essential requirement for real-world
applications.
https://iq.opengenus.org/key-terms-in-deep-learning/
https://www.simplilearn.com/tutorials/deep-learning-tutorial/deep-learning-algorithm
https://www.springboard.com/blog/data-science/machine-learning-terminology/
https://www.inforly.io/deep-learning-glossary/
Feature engineering is a crucial step in the machine learning pipeline, as it converts raw data
into features that help a model make predictions or classifications. It has a significant impact on the
performance of the resulting model. The goal of feature engineering is to create features that are
informative, uncorrelated, and strongly related to the target variable. The main steps are listed
below, followed by a small sketch in code.
Steps
1. Feature Selection: This step involves selecting the most relevant features from the raw data.
The goal is to choose features that are informative, uncorrelated, and have a strong relationship
with the target variable.
2. Feature Extraction: This step involves creating new features from the raw data. The goal is to
transform the data into a format that is more suitable for the machine learning algorithm.
3. Feature Transformation: This step involves transforming the features into a format that is
suitable for the machine learning algorithm. Common techniques for feature transformation
include normalization, scaling, or log transformations.
4. Feature Augmentation: This step involves adding new features to the dataset that can provide
additional information to the machine learning algorithm. Feature augmentation can involve
adding new features derived from external sources, such as weather data or demographic
information.
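A minimal sketch of these four steps with pandas; the DataFrame, column names, and the external city_population table are all made-up assumptions for illustration:

import numpy as np
import pandas as pd

# Hypothetical raw data, for illustration only
df = pd.DataFrame({
    "income": [32000, 54000, 47000, 61000],
    "age": [25, 41, 37, 52],
    "city": ["Oslo", "Bergen", "Oslo", "Trondheim"],
    "target": [0, 1, 0, 1],
})

# 1. Feature selection: keep only the columns believed to be relevant
features = df[["income", "age", "city"]].copy()

# 2. Feature extraction: derive a new feature from existing ones
features["income_per_year_of_age"] = features["income"] / features["age"]

# 3. Feature transformation: log-transform the skewed income feature
features["log_income"] = np.log1p(features["income"])

# 4. Feature augmentation: join in a (hypothetical) external data source
city_population = pd.DataFrame({
    "city": ["Oslo", "Bergen", "Trondheim"],
    "population": [700_000, 290_000, 210_000],
})
features = features.merge(city_population, on="city", how="left")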
Feature Extraction: Feature extraction aims to reduce the number of features in a dataset by
creating new features from the existing ones (and then discarding the original features). This new,
reduced set of features should be able to summarize most of the information contained in the
original set. In this way, a summarized version of the original features can be created
from a combination of the original set.
Techniques
Feature Encoding: This step involves encoding categorical data into a format that can be used by
the machine learning algorithm. Common techniques for feature encoding include one-hot
encoding, label encoding, and binary encoding.
Feature Scaling: This step involves scaling the features so that they are on the same scale. This
can be important if the features have different units or scales, as it can make it easier for the
machine learning algorithm to compare the features.
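As a sketch, scaling can be applied with scikit-learn; the feature matrix below (square feet and number of rooms) is a made-up assumption:

import numpy as np
from sklearn.preprocessing import MinMaxScaler, StandardScaler

# Two features on very different scales: [square_feet, rooms]
X = np.array([[1500.0, 3.0], [3200.0, 5.0], [800.0, 2.0]])

# Min-max scaling maps each feature into the [0, 1] range
X_minmax = MinMaxScaler().fit_transform(X)

# Standardization rescales each feature to zero mean and unit variance
X_standard = StandardScaler().fit_transform(X)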
One-Hot Encoding: This is a technique used to convert categorical variables into numerical
values by creating a binary column for each category. For example, if there is a categorical
feature like color with categories red, blue, and green, then one-hot encoding will create three
binary columns representing each category.
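For instance, the color example above can be one-hot encoded with pandas (a sketch on a made-up DataFrame):

import pandas as pd

colors = pd.DataFrame({"color": ["red", "blue", "green", "red"]})

# One binary column per category: color_blue, color_green, color_red
one_hot = pd.get_dummies(colors, columns=["color"])
print(one_hot)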
Discretization: Discretization is a technique used to convert continuous variables into discrete
values to simplify the model. For example, age can be discretized into age groups like 0-10, 11-
20, 21-30, etc.
Binning: Binning is a technique used to group continuous variables into bins based on specific
intervals. For example, income can be binned into income ranges like low-income, middle-
income, and high-income.
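Both discretization and binning can be sketched with pandas' cut function; the age groups and income thresholds below are illustrative assumptions:

import pandas as pd

ages = pd.Series([4, 15, 23, 37, 58])
incomes = pd.Series([18000, 45000, 92000])

# Discretize age into fixed-width groups: 0-10, 11-20, 21-30, ...
age_groups = pd.cut(ages, bins=[0, 10, 20, 30, 40, 50, 60],
                    labels=["0-10", "11-20", "21-30", "31-40", "41-50", "51-60"])

# Bin income into named ranges
income_bins = pd.cut(incomes, bins=[0, 30_000, 70_000, float("inf")],
                     labels=["low-income", "middle-income", "high-income"])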
Imputation: Imputation is a technique used to fill in missing values in a dataset. Various
imputation techniques are available, such as mean imputation, median imputation, and mode
imputation.
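A sketch of mean and median imputation with scikit-learn's SimpleImputer, on a small made-up array containing missing values:

import numpy as np
from sklearn.impute import SimpleImputer

X = np.array([[1.0, 7.0],
              [np.nan, 3.0],
              [4.0, np.nan]])

# Replace each missing value with the mean of its column
X_mean_imputed = SimpleImputer(strategy="mean").fit_transform(X)

# Median and most-frequent (mode) imputation work the same way
X_median_imputed = SimpleImputer(strategy="median").fit_transform(X)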
https://www.wallstreetmojo.com/feature-engineering/
What is an optimizer?
Optimizers are algorithms or methods used to minimize an error function (loss function) or,
equivalently, to maximize model performance. They are mathematical functions that depend on the
model's learnable parameters, i.e., its weights and biases. Optimizers determine how the weights
and learning rate of a neural network should be changed in order to reduce the loss.
Learning Rate
The size of the steps gradient descent takes toward the local minimum is determined by the
learning rate, which controls how fast or slow we move toward the optimal weights.
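A bare-bones sketch of a single gradient-descent update, showing how the learning rate scales the step taken on the weights (the weight and gradient values are made up):

import numpy as np

w = np.array([0.5, -1.2])       # current weights
grad = np.array([0.8, -0.3])    # gradient of the loss with respect to the weights
learning_rate = 0.01

# Step against the gradient; the learning rate controls the step size
w = w - learning_rate * grad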
https://medium.com/mlearning-ai/optimizers-in-deep-learning-7bf81fed78a0
Feature selection is a process in machine learning that involves identifying and selecting the
most relevant subset of features out of the original features in a dataset to be used as inputs for
a model. The goal of feature selection is to improve model performance by reducing the number
of irrelevant or redundant features that may introduce noise or bias into the model.
The importance of feature selection lies in its ability to improve model accuracy and
efficiency by reducing the dimensionality of the dataset.
Feature importance techniques fit an estimator such as a Random Forest and then select features
based on an attribute such as feature_importances_. The feature_importances_ attribute of the
Random Forest estimator provides a score for each feature in the dataset, indicating how
important that feature is for making predictions. These scores are calculated from the
reduction in impurity (e.g., Gini impurity or entropy) achieved by splitting the data on that
feature. The feature with the highest score is considered the most important, while features with
low scores can be considered less important or even irrelevant.
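A minimal sketch of this approach with scikit-learn, assuming synthetic data and made-up feature names purely for illustration:

from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier

# Synthetic data, used only to illustrate the idea
X, y = make_classification(n_samples=500, n_features=5, n_informative=3,
                           random_state=0)
feature_names = [f"feature_{i}" for i in range(X.shape[1])]

# Fit a Random Forest and read off impurity-based importance scores
model = RandomForestClassifier(n_estimators=100, random_state=0).fit(X, y)

# feature_importances_ holds one score per feature; the scores sum to 1
for name, score in sorted(zip(feature_names, model.feature_importances_),
                          key=lambda pair: pair[1], reverse=True):
    print(f"{name}: {score:.3f}")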
Feature extraction is about extracting or deriving information from the original feature set to
create a new feature subspace. The primary idea behind feature extraction is to compress the
data while maintaining most of the relevant information. As with feature selection
techniques, these techniques are used to reduce the number of features from the original
feature set, which reduces model complexity and overfitting, enhances computational
efficiency, and lowers generalization error. Principal component analysis (PCA) is one widely
used feature extraction technique.
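A sketch of PCA with scikit-learn on made-up data; the number of samples, features, and components are arbitrary choices for illustration:

import numpy as np
from sklearn.decomposition import PCA

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 10))    # 100 samples, 10 original features

# Project onto 3 derived features (principal components) that capture
# most of the variance in the original 10
pca = PCA(n_components=3)
X_reduced = pca.fit_transform(X)
print(X_reduced.shape)                       # (100, 3)
print(pca.explained_variance_ratio_.sum())   # fraction of variance retained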
The key difference between feature selection and feature extraction techniques used for
dimensionality reduction is that while the original features are maintained in the case of
feature selection algorithms, the feature extraction algorithms transform the data onto a new
feature space.
Feature selection techniques can be used if the requirement is to maintain the original features,
unlike the feature extraction techniques which derive useful information from data to construct a
new feature subspace. Feature selection techniques are used when model explainability is a key
requirement.
Feature extraction techniques can be used to improve the predictive performance of models,
especially in the case of algorithms that don't support regularization.
Unlike feature selection, feature extraction usually needs to transform the original data into
features with strong pattern-recognition ability, whereas the original data can be regarded as
features with weak recognition ability.
https://vitalflux.com/machine-learning-feature-selection-feature-extraction/
Before proceeding, there are a few terms that you should be familiar with; a toy training loop tying
them together is sketched after the list.
Epoch – The number of times the algorithm runs on the whole training dataset.
Sample – A single row of a dataset.
Batch – It denotes the number of samples used for updating the model parameters.
Learning rate – A parameter that controls how much the model weights are updated at each step.
Cost Function/Loss Function – A cost function is used to calculate the cost, which is the
difference between the predicted value and the actual value.
Weights/Bias – The learnable parameters in a model that control the signal between two
neurons.
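A toy training loop showing how these terms fit together: several epochs over the data, mini-batches of samples, a learning rate scaling the weight updates, and a mean-squared-error cost. The data and the linear model are made-up assumptions:

import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 3))                        # 1000 samples, 3 features
y = X @ np.array([2.0, -1.0, 0.5]) + 0.1 * rng.normal(size=1000)

weights = np.zeros(3)     # learnable parameters
bias = 0.0                # learnable parameter
learning_rate = 0.05      # how strongly each update changes the weights
batch_size = 32           # samples used for one parameter update
epochs = 10               # full passes over the training dataset

for epoch in range(epochs):
    for start in range(0, len(X), batch_size):
        xb, yb = X[start:start + batch_size], y[start:start + batch_size]
        pred = xb @ weights + bias
        error = pred - yb                 # predicted value minus actual value
        # Gradients of the mean-squared-error cost for this batch
        grad_w = 2 * xb.T @ error / len(xb)
        grad_b = 2 * error.mean()
        weights -= learning_rate * grad_w
        bias -= learning_rate * grad_b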
https://www.analyticsvidhya.com/blog/2021/10/a-comprehensive-guide-on-deep-learning-optimizers/