U19ADS2035-Python For Data Science Laboratory Page No:17

Uploaded by

sailesh lal

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

12 views

U19ADS2035-Python For Data Science Laboratory Page No:17

Uploaded by

sailesh lal

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 5

Register Number:61781922110041

Ex. No:4
PERFORM ALL BASIC DATA PREPROCESSING STEPS ON THE GIVEN DATASET
Date:

U19ADS2035- PYTHON FOR DATA SCIENCE LABORATORY Page No:17

PROGRAM:
#4.to perform all basic data pre.processing steps on the given data set.
import pandas as pd
train=pd.read_csv("C://Users//admin//Downloads//train.csv")
df=train.copy()
print("The first 5 rows:\n")
print(df.head())
print("last 5 rows:\n")
print(df.tail())
print("n_samples x n_features\n")
print(df.shape)
print("List of all the columns\n")
print(df.columns)
print("Rows index\n")
print(df.index)
print("General description of dataset.\n")
print(df.describe())
print("Counting null values in whole dataset:\n")
print(df.isnull().sum())
print("Counting null value on a particular column:\n")
df['Age'].isnull().sum()
"""Handling Missing Values"""
df['Age'].fillna(df['Age'].mean(),inplace=True)
print("After Handling Null values:\n")
print(df['Age'].isnull().sum())
print(df.head())

U19ADS2035- PYTHON FOR DATA SCIENCE LABORATORY Page No:18

OUTPUT:
The first 5 rows:
PassengerId Survived Pclass ... Fare Cabin Embarked
0 1 0 3 ... 7.2500 NaN S
1 2 1 1 ... 71.2833 C85 C
2 3 1 3 ... 7.9250 NaN S
3 4 1 1 ... 53.1000 C123 S
4 5 0 3 ... 8.0500 NaN S

[5 rows x 12 columns]
last 5 rows:
PassengerId Survived Pclass ... Fare Cabin Embarked
886 887 0 2 ... 13.00 NaN S
887 888 1 1 ... 30.00 B42 S
888 889 0 3 ... 23.45 NaN S
889 890 1 1 ... 30.00 C148 C
890 891 0 3 ... 7.75 NaN Q

[5 rows x 12 columns]
n_samples x n_features
(891, 12)
List of all the columns
Index(['PassengerId', 'Survived', 'Pclass', 'Name', 'Sex', 'Age', 'SibSp',
'Parch', 'Ticket', 'Fare', 'Cabin', 'Embarked'],
dtype='object')
Rows index
RangeIndex(start=0, stop=891, step=1)
General description of dataset.

U19ADS2035- PYTHON FOR DATA SCIENCE LABORATORY Page No:19

PassengerId Survived Pclass ... SibSp Parch Fare

count 891.000000 891.000000 891.000000 ... 891.000000 891.000000 891.000000
mean 446.000000 0.383838 2.308642 ... 0.523008 0.381594 32.204208
std 257.353842 0.486592 0.836071 ... 1.102743 0.806057 49.693429
min 1.000000 0.000000 1.000000 ... 0.000000 0.000000 0.000000
25% 223.500000 0.000000 2.000000 ... 0.000000 0.000000 7.910400
50% 446.000000 0.000000 3.000000 ... 0.000000 0.000000 14.454200
75% 668.500000 1.000000 3.000000 ... 1.000000 0.000000 31.000000
max 891.000000 1.000000 3.000000 ... 8.000000 6.000000 512.329200

[8 rows x 7 columns]
Counting null values in whole dataset:

PassengerId 0
Survived 0
Pclass 0
Name 0
Sex 0
Age 177
SibSp 0
Parch 0
Ticket 0
Fare 0
Cabin 687
Embarked 2
dtype: int64
Counting null value on a particular column:
After Handling Null values:
0
PassengerId Survived Pclass ... Fare Cabin Embarked

U19ADS2035- PYTHON FOR DATA SCIENCE LABORATORY Page No:20

0 1 0 3 ... 7.2500 NaN S

1 2 1 1 ... 71.2833 C85 C
2 3 1 3 ... 7.9250 NaN S
3 4 1 1 ... 53.1000 C123 S
4 5 0 3 ... 8.0500 NaN S

[5 rows x 12 columns]

U19ADS2035- PYTHON FOR DATA SCIENCE LABORATORY Page No:21

Only Pandas
No ratings yet
Only Pandas
8 pages
The International Association of Lions Clubs (Lions Clubs International) District 325 A 1, Multiple District 325, Nepal LY 2018-2019
100% (1)
The International Association of Lions Clubs (Lions Clubs International) District 325 A 1, Multiple District 325, Nepal LY 2018-2019
69 pages
Kami Export - Benjamin Ratin - 1.6 Photoelectron Spectroscopy Student
No ratings yet
Kami Export - Benjamin Ratin - 1.6 Photoelectron Spectroscopy Student
3 pages
PRAC3_23BME053
No ratings yet
PRAC3_23BME053
5 pages
Titanic Data
No ratings yet
Titanic Data
5 pages
7 Questions To Ask Before EDA
100% (1)
7 Questions To Ask Before EDA
2 pages
Ai Tools and Applications-Lab
No ratings yet
Ai Tools and Applications-Lab
33 pages
Assignment Data Science
No ratings yet
Assignment Data Science
2 pages
Pyt Manual 1
No ratings yet
Pyt Manual 1
85 pages
Assignment 5
No ratings yet
Assignment 5
14 pages
Titanic Survival Prediction 1692609491
No ratings yet
Titanic Survival Prediction 1692609491
15 pages
Titanic Classification
100% (1)
Titanic Classification
7 pages
Loading The Dataset: ## The Matplotlib and Seaborn Library For Result Visualization and Analysis
No ratings yet
Loading The Dataset: ## The Matplotlib and Seaborn Library For Result Visualization and Analysis
13 pages
assignment1
No ratings yet
assignment1
2 pages
Rajat DM
No ratings yet
Rajat DM
54 pages
TITANIC CLASSIFICATION - Task1
No ratings yet
TITANIC CLASSIFICATION - Task1
2 pages
Data Cleaning by Manish Batra 1697684636
No ratings yet
Data Cleaning by Manish Batra 1697684636
30 pages
Assignment 2
No ratings yet
Assignment 2
5 pages
23L-2589 Lab 10
No ratings yet
23L-2589 Lab 10
17 pages
Data Cleaning
No ratings yet
Data Cleaning
13 pages
dspracticalexternak23aug
No ratings yet
dspracticalexternak23aug
8 pages
Python for Machine Learning
No ratings yet
Python for Machine Learning
33 pages
ML File 211173
No ratings yet
ML File 211173
19 pages
Assign9.Ipynb - Colab
No ratings yet
Assign9.Ipynb - Colab
4 pages
Dataset Visualization Basic Ml-1
No ratings yet
Dataset Visualization Basic Ml-1
12 pages
AI Final PDF
No ratings yet
AI Final PDF
38 pages
vertopal.com_homework1
No ratings yet
vertopal.com_homework1
17 pages
7 8 - Missing Value Handling
No ratings yet
7 8 - Missing Value Handling
4 pages
Data Cleaning and Manipulation in Python
No ratings yet
Data Cleaning and Manipulation in Python
33 pages
FDS Practical 2
No ratings yet
FDS Practical 2
8 pages
Titanic
100% (2)
Titanic
13 pages
ml dataset performance
No ratings yet
ml dataset performance
11 pages
LOGISTIC_REGRESSION - Jupyter Notebook
No ratings yet
LOGISTIC_REGRESSION - Jupyter Notebook
18 pages
Python For Data Analysis
No ratings yet
Python For Data Analysis
28 pages
TitanicFeatureEngineering Handout
No ratings yet
TitanicFeatureEngineering Handout
26 pages
Titanic Survival Prediction Ml
No ratings yet
Titanic Survival Prediction Ml
36 pages
Passengerid Survived Pclass Name Sex Age Sibsp Parch Ticket
No ratings yet
Passengerid Survived Pclass Name Sex Age Sibsp Parch Ticket
16 pages
Titanic Data Analysis
No ratings yet
Titanic Data Analysis
14 pages
FDA EXP3 E0323040
No ratings yet
FDA EXP3 E0323040
2 pages
Machine Learning Notebook
No ratings yet
Machine Learning Notebook
19 pages
BD WPS2
No ratings yet
BD WPS2
11 pages
seaborn ploting in titanic
No ratings yet
seaborn ploting in titanic
18 pages
Exp 3 Data Wrangling Sdk Ok
No ratings yet
Exp 3 Data Wrangling Sdk Ok
8 pages
Overview of Data Cleaning
No ratings yet
Overview of Data Cleaning
17 pages
Data Preprocessing - Ipynb - Colaboratory
No ratings yet
Data Preprocessing - Ipynb - Colaboratory
7 pages
PreguntaB
No ratings yet
PreguntaB
50 pages
2524c225-2e58-4d21-8bba-8fda084be465_Programs_Week_10
No ratings yet
2524c225-2e58-4d21-8bba-8fda084be465_Programs_Week_10
11 pages
MTA Project
No ratings yet
MTA Project
1 page
The Titanic dataset
No ratings yet
The Titanic dataset
6 pages
Data cleaning and exploratory analysis on a public dataset
No ratings yet
Data cleaning and exploratory analysis on a public dataset
11 pages
12212221 (1) copy
No ratings yet
12212221 (1) copy
9 pages
Day 20
No ratings yet
Day 20
5 pages
SN Travel Jupyter Notebook PDF
No ratings yet
SN Travel Jupyter Notebook PDF
28 pages
Homework2
No ratings yet
Homework2
12 pages
✌️???? ????????????✌️???? ??????
No ratings yet
✌️???? ????????????✌️???? ??????
63 pages
Python pandas library
No ratings yet
Python pandas library
10 pages
Atividade Fabricio Rezende Luz - Colab
No ratings yet
Atividade Fabricio Rezende Luz - Colab
2 pages
Aiml Lab04&5 - Output
No ratings yet
Aiml Lab04&5 - Output
18 pages
Logistic Regression On Titanic Dataset
No ratings yet
Logistic Regression On Titanic Dataset
6 pages
Import As: Pandas PD Titanic - Data PD - Read - CSV Titanic - Data - Head
No ratings yet
Import As: Pandas PD Titanic - Data PD - Read - CSV Titanic - Data - Head
12 pages
Mathematical Functions
From Everand
Mathematical Functions
Oliver Linton
No ratings yet
Profound Python Data Science
From Everand
Profound Python Data Science
Onder Teker
No ratings yet
Grammar lesson 11
No ratings yet
Grammar lesson 11
6 pages
PEHV Question Bank 2024
No ratings yet
PEHV Question Bank 2024
3 pages
MGT Circular 2022-23
No ratings yet
MGT Circular 2022-23
1 page
Syllabus
No ratings yet
Syllabus
2 pages
MATH 10 Midterms Review MHE2 1
No ratings yet
MATH 10 Midterms Review MHE2 1
36 pages
Environmental sanitation-1
No ratings yet
Environmental sanitation-1
15 pages
Ifm Bop
No ratings yet
Ifm Bop
18 pages
PLLT: Chapter 1 Language, Learninh and Teaching
No ratings yet
PLLT: Chapter 1 Language, Learninh and Teaching
3 pages
Laboratory 13
No ratings yet
Laboratory 13
13 pages
Israel Research Proposal-1
No ratings yet
Israel Research Proposal-1
27 pages
Discovering World Prehistory Interpreting the Past through Archaeology 1st Edition Mark Q. Sutton All Chapters Instant Download
No ratings yet
Discovering World Prehistory Interpreting the Past through Archaeology 1st Edition Mark Q. Sutton All Chapters Instant Download
55 pages
AIA2013 Hands-On InventorCAM 01
No ratings yet
AIA2013 Hands-On InventorCAM 01
34 pages
Holiday Treats Recipes
No ratings yet
Holiday Treats Recipes
10 pages
Long Phrase Tab
No ratings yet
Long Phrase Tab
6 pages
00 Cost+Estimation Basic+Course Introduction
No ratings yet
00 Cost+Estimation Basic+Course Introduction
6 pages
CY1001-2015 Inorganic Lecture Notes
No ratings yet
CY1001-2015 Inorganic Lecture Notes
16 pages
PB-2022-02-01-01872MPK-HERO Passion X Pro Splendor 100 BS6 MPK - Piston and Ring Kit
No ratings yet
PB-2022-02-01-01872MPK-HERO Passion X Pro Splendor 100 BS6 MPK - Piston and Ring Kit
2 pages
Iraqi Drilling Company Daily Report Integrated Mangement System (IDCP08F4)
No ratings yet
Iraqi Drilling Company Daily Report Integrated Mangement System (IDCP08F4)
1 page
A Walk Through Darkness Pathworking Guide to the Goetic Demons
No ratings yet
A Walk Through Darkness Pathworking Guide to the Goetic Demons
178 pages
Company Profile - HAJ Corporation
No ratings yet
Company Profile - HAJ Corporation
26 pages
Example Criteria
No ratings yet
Example Criteria
1 page
Sekaran Lyrica Phase 2
No ratings yet
Sekaran Lyrica Phase 2
12 pages
Stay Hungry Stay Foolish
No ratings yet
Stay Hungry Stay Foolish
60 pages
The Shadow Line
No ratings yet
The Shadow Line
4 pages
Games of The General
No ratings yet
Games of The General
2 pages
222 Ways To Avoid Very
100% (1)
222 Ways To Avoid Very
21 pages
Experiment No 2 Object:: To Determine The API Gravity of Given Sample
No ratings yet
Experiment No 2 Object:: To Determine The API Gravity of Given Sample
4 pages
Sec 03
No ratings yet
Sec 03
22 pages
Dimensional Analysis
No ratings yet
Dimensional Analysis
27 pages
Banana Cake Recipe (Cream Cheese Frosting)
No ratings yet
Banana Cake Recipe (Cream Cheese Frosting)
3 pages
Diss DLL 2.1
No ratings yet
Diss DLL 2.1
4 pages

U19ADS2035-Python For Data Science Laboratory Page No:17

Uploaded by

U19ADS2035-Python For Data Science Laboratory Page No:17

Uploaded by

Register Number:61781922110041

U19ADS2035- PYTHON FOR DATA SCIENCE LABORATORY Page No:17

U19ADS2035- PYTHON FOR DATA SCIENCE LABORATORY Page No:18

U19ADS2035- PYTHON FOR DATA SCIENCE LABORATORY Page No:19

PassengerId Survived Pclass ... SibSp Parch Fare

U19ADS2035- PYTHON FOR DATA SCIENCE LABORATORY Page No:20

0 1 0 3 ... 7.2500 NaN S

U19ADS2035- PYTHON FOR DATA SCIENCE LABORATORY Page No:21

You might also like