ML Lab 3

Uploaded by

zulqarnain

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

2 views

ML Lab 3

Uploaded by

zulqarnain

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 8

AI-3002- Machine Learning

Lab 2
Data Pre-Processing and Feature Engineering
Using Pandas and Numpy

Instructor: Shaina Laraib

1. Introduction
DATA PRE-PROCESSING

Data preprocessing is an integral step in Machine Learning as the quality of data

and the useful information that can be derived from it directly affects the ability of
our model to learn; therefore, it is extremely important that we preprocess our data
before feeding it into our model.
In this lab, we will be covering the following steps of data pre-processing:
 Data Cleaning
- Handling missing values
- Handling Outliers
- Dealing with duplicate values
 Data transformation
- Scaling
- Normalization

FEATURE ENGINEERING

Feature engineering is the process of selecting, manipulating, and transforming raw

data into features that can be used in supervised learning. Feature engineering is a
machine learning technique that leverages data to create new variables that aren’t in
the training set. It can produce new features for both supervised and unsupervised
learning, with the goal of simplifying and speeding up data transformations while
also enhancing model accuracy. Feature Engineering is important because
regardless of the data or architecture, a terrible feature will have a direct impact on
your model.
Data Cleaning
Data cleaning is the key step in machine learning. Data is usually gathered
from multiple sources, resulting in duplicates and redundant values. Such
values need to be dealt with before giving it to the model.
Looking for Missing Values
- Loading and importing the dataset.
- Looking for Null Values across Rows and Columns
- Handling Missing Values
- Outliers Detection and Removal
- Duplicates Removal
LAB TASK:
- Apply data cleaning step by step on House Price
Prediction Dataset
- Apply transformations on the data if required.
- You’re required to remove null values, remove outliers,
handle duplicates, and apply scaling and normalization on
features (if required).
- Bonus Question: Can you apply some feature engineering
to this data?

DM - MOD - 1 Part III
No ratings yet
DM - MOD - 1 Part III
12 pages
Feature Engineering: Short Study: Indian Institute of Space Science and Technology, Department of Mathematics
No ratings yet
Feature Engineering: Short Study: Indian Institute of Space Science and Technology, Department of Mathematics
6 pages
Feature Engineering and Normalization
No ratings yet
Feature Engineering and Normalization
7 pages
week3A
No ratings yet
week3A
18 pages
life lesson
No ratings yet
life lesson
13 pages
Feature Engineering
No ratings yet
Feature Engineering
2 pages
Experiment-3 31
No ratings yet
Experiment-3 31
9 pages
Data Preprocessing in Machine Learning
No ratings yet
Data Preprocessing in Machine Learning
5 pages
UNIT 2 ML
No ratings yet
UNIT 2 ML
14 pages
1635838720082
No ratings yet
1635838720082
35 pages
DS Unit 2
No ratings yet
DS Unit 2
42 pages
20 Questions On Feature Engineering and Eda
No ratings yet
20 Questions On Feature Engineering and Eda
9 pages
Unit 1 Machine Learning
No ratings yet
Unit 1 Machine Learning
26 pages
Deep Learning Vocabulary
No ratings yet
Deep Learning Vocabulary
6 pages
NN-7
No ratings yet
NN-7
26 pages
AIYA SESSION 4
No ratings yet
AIYA SESSION 4
42 pages
50 AI Engineer Interview Questions & Answers [2025] - DigitalDefynd
No ratings yet
50 AI Engineer Interview Questions & Answers [2025] - DigitalDefynd
27 pages
ML_DA
No ratings yet
ML_DA
55 pages
training
No ratings yet
training
2 pages
Unit 2
No ratings yet
Unit 2
22 pages
99+ AI Engineer Interview Questions + Answers
No ratings yet
99+ AI Engineer Interview Questions + Answers
24 pages
AI6322 - Module 4 - Feature Engineering - MODULE
No ratings yet
AI6322 - Module 4 - Feature Engineering - MODULE
25 pages
Research Paper (1)
No ratings yet
Research Paper (1)
5 pages
Machine Learning (Autosaved)
No ratings yet
Machine Learning (Autosaved)
13 pages
Class PPT - Unit2
No ratings yet
Class PPT - Unit2
139 pages
Steps Assignment
No ratings yet
Steps Assignment
6 pages
UNIT 1
No ratings yet
UNIT 1
38 pages
Architecting To Support Machine Learning
No ratings yet
Architecting To Support Machine Learning
47 pages
Data Processing in AI
No ratings yet
Data Processing in AI
7 pages
3.1 Dimensionality Reduction
No ratings yet
3.1 Dimensionality Reduction
24 pages
Data Science Process Stages Lecture 2
No ratings yet
Data Science Process Stages Lecture 2
4 pages
Predicting Credit Card Approvals
100% (1)
Predicting Credit Card Approvals
14 pages
Ads Exp2 C35
No ratings yet
Ads Exp2 C35
9 pages
Data Preprocessing
No ratings yet
Data Preprocessing
9 pages
Building Good Training Sets UNIT 1 PART2
No ratings yet
Building Good Training Sets UNIT 1 PART2
46 pages
UNIT 1
No ratings yet
UNIT 1
28 pages
Data Normalization in Data Mining
No ratings yet
Data Normalization in Data Mining
8 pages
Lecture01 &02 (1)
No ratings yet
Lecture01 &02 (1)
77 pages
Summary Chap 1 & 2
No ratings yet
Summary Chap 1 & 2
5 pages
python_TUM
No ratings yet
python_TUM
3 pages
Cse3001 Ai Ml m2
No ratings yet
Cse3001 Ai Ml m2
118 pages
Take Home Assignment - CCS3342-Business Intelligence (1)
No ratings yet
Take Home Assignment - CCS3342-Business Intelligence (1)
2 pages
CSC407_Chapter 2-3
No ratings yet
CSC407_Chapter 2-3
46 pages
Case Study 8
No ratings yet
Case Study 8
9 pages
DSF - UNIT III Notes
No ratings yet
DSF - UNIT III Notes
17 pages
Data Prep
No ratings yet
Data Prep
5 pages
Unit-II
No ratings yet
Unit-II
119 pages
230208 MLOps Getting From Good to Great
No ratings yet
230208 MLOps Getting From Good to Great
41 pages
Artificial Intelligancy Architecture
No ratings yet
Artificial Intelligancy Architecture
13 pages
Ch5 5 Data Preprocessing
No ratings yet
Ch5 5 Data Preprocessing
39 pages
3-Data Considerations
No ratings yet
3-Data Considerations
46 pages
06 Feature Engineering
No ratings yet
06 Feature Engineering
24 pages
Machine Learning Chapter 2
No ratings yet
Machine Learning Chapter 2
37 pages
Faculty Development Program ON Artificial Intelligence & Machine Learning For Engineering Applications
No ratings yet
Faculty Development Program ON Artificial Intelligence & Machine Learning For Engineering Applications
70 pages
ML - Unit-2 FULL - Feature Engineering Theory-13!09!24-1
No ratings yet
ML - Unit-2 FULL - Feature Engineering Theory-13!09!24-1
29 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
1 page
Segmentation Dataset
No ratings yet
Segmentation Dataset
41 pages
Mdcm Sagar Assignment
No ratings yet
Mdcm Sagar Assignment
15 pages
Data Preprocessing
No ratings yet
Data Preprocessing
4 pages
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
From Everand
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
César Pérez López
No ratings yet