PPT Pandas(Assignment 3)

Pandas is a Python library designed for data manipulation and analysis, providing tools for cleaning, exploring, and analyzing datasets. It features DataFrames, which are 2-dimensional structures similar to tables, and allows users to perform operations like locating rows, reading CSV files, and deleting rows or columns. The library is essential for data science, enabling users to derive insights from large and complex datasets.

Uploaded by

agarwalkrishna0634


Cognitive Computing UCS420

Pandas
• Pandas is a Python library used for working
with data sets.
• It has functions for analyzing, cleaning,
exploring, and manipulating data.
Why Use Pandas?
• Pandas allows us to analyze large data sets and draw
conclusions based on statistical theory.
• Pandas can clean messy data sets, and make
them readable and relevant.
• Relevant data is very important in data
science.
What Can Pandas Do?
• Pandas gives you answers about the data, such as:
– Is there a correlation between two or more columns?
– What is the average value?
– What is the maximum value?
– What is the minimum value?
• Pandas can also delete rows that are not
relevant or that contain wrong values, such as empty or
NULL values. This is called cleaning the data.
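The questions and the cleaning step above can be sketched as follows. This is a minimal illustration, not the deck's own example: the sample values (including the deliberately missing entry) and the variable names are assumptions.

```python
import pandas as pd

# Hypothetical sample data; None marks a missing (empty) value.
df = pd.DataFrame({
    "calories": [420, 380, 390, None],
    "duration": [50, 40, 45, 60],
})

# Cleaning: drop every row that contains an empty (NaN) value.
clean = df.dropna()

# Answers about the data.
correlation = clean["calories"].corr(clean["duration"])  # correlation of two columns
average = clean["calories"].mean()
maximum = clean["calories"].max()
minimum = clean["calories"].min()

print(clean)
print(correlation, average, maximum, minimum)
```

Note that dropna() returns a new DataFrame here, so df itself still contains the incomplete row.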
Pandas DataFrames
• A Pandas DataFrame is a 2-dimensional data
structure, like a 2-dimensional array, or a table
with rows and columns.
• Pandas DataFrame is a two-dimensional, size-
mutable, and heterogeneous data structure
(similar to a table in a relational database or
an Excel spreadsheet).
Pandas DataFrames
• Example:
• Create a simple Pandas DataFrame:
import pandas as pd

data = {
    'calories': [420, 380, 390],
    'duration': [50, 40, 45]
}

# load data into a DataFrame object
df = pd.DataFrame(data)
print(df)
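The "size-mutable" and "heterogeneous" properties mentioned earlier can be seen with a small sketch. The "activity" column and its values are hypothetical and are added only for illustration.

```python
import pandas as pd

df = pd.DataFrame({
    "calories": [420, 380, 390],
    "duration": [50, 40, 45],
})

# Size-mutable: a new column can be added to an existing DataFrame.
# Heterogeneous: the new column holds text while the others hold numbers.
df["activity"] = ["run", "walk", "swim"]  # hypothetical values

print(df.shape)   # (rows, columns)
print(df.dtypes)  # one data type per column
```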
Pandas DataFrames
• Locate Row:
• As you can see from the result above, the
DataFrame is like a table with rows and
columns.
• Pandas uses the loc attribute to return one or
more specified rows.
Pandas DataFrames
• Example
• Return row 0:
# refer to the row index:
print(df.loc[0])
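As a short sketch of "one or more specified rows": passing a single label to loc returns that row as a Series, while passing a list of labels returns a DataFrame. The DataFrame is rebuilt here so the snippet runs on its own.

```python
import pandas as pd

df = pd.DataFrame({
    "calories": [420, 380, 390],
    "duration": [50, 40, 45],
})

row = df.loc[0]        # a single label returns one row as a Series
rows = df.loc[[0, 1]]  # a list of labels returns a DataFrame

print(row)
print(rows)
```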
Example
import pandas as pd

mydataset = {
'cars': ["BMW", "Volvo", "Ford"],
'passings': [3, 7, 2]
}

myvar = pd.DataFrame(mydataset)

print(myvar)
Pandas Series
• A Pandas Series is like a column in a table.
• It is a one-dimensional array holding data of any type.
• Example:
• Create a simple Pandas Series from a list:
import pandas as pd

a = [1, 7, 2]

data1 = pd.Series(a)

print(data1)
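To illustrate the statement that a Series is like a column in a table, the sketch below selects one column of a DataFrame and checks its type. It reuses the calories/duration data from the DataFrame example.

```python
import pandas as pd

df = pd.DataFrame({
    "calories": [420, 380, 390],
    "duration": [50, 40, 45],
})

column = df["calories"]  # selecting a single column yields a Series

print(type(column))
print(column)
```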
Labels
• If nothing else is specified, the values are
labeled with their index number. First value
has index 0, second value has index 1 etc.
• This label can be used to access a specified
value.
• With the index argument, you can name your
own labels.
Labels
• Example
• Create your own labels:
import pandas as pd

a = [1, 7, 2]

myvar = pd.Series(a, index = ["x", "y", "z"])

print(myvar)
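Once labels exist, they can be used to access individual values. A minimal sketch, using the same list as above; the name "default" is an assumption introduced for the Series without custom labels.

```python
import pandas as pd

a = [1, 7, 2]

# Custom labels: access a value by the label you assigned.
myvar = pd.Series(a, index=["x", "y", "z"])
print(myvar["y"])

# Default labels: values are labeled 0, 1, 2, ...
default = pd.Series(a)
print(default[0])
```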
Navigating Data Frame
• iloc uses integer positions exclusively to access data.
• This makes it particularly useful when dealing with
data where labels might be unknown or irrelevant.

• df.iloc[row number/slice]
• df.iloc[4], df.iloc[1:4], df.iloc[:], df.iloc[1:4, 5:8]
Navigating Data Frame
• df.iloc[4]: This command selects the 5th row
(position 4) from the DataFrame df. It returns a
single row as a Series.
• df.iloc[1:4]: This command selects a slice of
rows from position 1 to 3 (excluding position 4)
from df. It returns multiple rows as a
DataFrame.
Navigating Data Frame
• df.iloc[:]: This command selects all rows and columns
from df. It is essentially the same as df,
returning the entire DataFrame.
• df.iloc[1:4, 5:8]: This command selects rows
from position 1 to 3 (excluding 4) and columns
from position 5 to 7 (excluding 8). It returns the
specified subset as a DataFrame.
Navigating Data Frame
• df.iloc[:, 2]: This selects all rows (:) for the
column at position 2 (the 3rd column) and returns
that entire column as a Series.
• This is how to extract a whole column
with .iloc without targeting individual rows.
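The iloc forms described above can be tried on a small table. This is a sketch under an assumption: the 6-row, 8-column DataFrame is made up so that every slice in the slides is valid, and the value at row r, column c is r * 10 + c so each cell shows its own position.

```python
import pandas as pd

# Hypothetical 6x8 table: the value at row r, column c is r * 10 + c.
df = pd.DataFrame([[r * 10 + c for c in range(8)] for r in range(6)])

one_row = df.iloc[4]        # 5th row, returned as a Series
some_rows = df.iloc[1:4]    # rows at positions 1, 2, 3 as a DataFrame
everything = df.iloc[:]     # all rows and columns
block = df.iloc[1:4, 5:8]   # rows 1-3 and columns 5-7
one_column = df.iloc[:, 2]  # 3rd column, returned as a Series

print(one_row.tolist())
print(some_rows.shape, everything.shape, block.shape)
print(one_column.tolist())
```

As with ordinary Python slices, the end position of each iloc slice is excluded.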
Pandas Read CSV
• A simple way to store big data sets is to use
CSV files (comma-separated values files).
• CSV files contain plain text and are a well-known
format that can be read by almost any tool, including
Pandas.
• In our examples we will be using a CSV file
called 'data.csv'.
Pandas Read CSV
• Example:
• Load the CSV into a DataFrame:
import pandas as pd

df = pd.read_csv('data.csv')

# show only the first 5 rows
print(df.head())

# show all the rows
print(df.to_string())

# show the last 5 rows
print("\nLast 5 rows:")
print(df.tail(5))
Pandas Read CSV
• The pd.read_csv() function is used to read the
data from the data.csv file.
• df.to_string() converts the entire DataFrame df
into a string representation, showing all rows
and columns.
• If you print a large DataFrame with many
rows without to_string(), Pandas by default shows
only the first 5 rows and the last 5 rows.
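The reading and display behaviour described above can be sketched without a file on disk. Assumption: the CSV text below is an in-memory stand-in for 'data.csv', whose real contents are not shown in the slides; the column names "Duration" and "Pulse" and all values are made up.

```python
import io

import pandas as pd

# Hypothetical stand-in for 'data.csv': a header line plus 100 data rows.
csv_text = "Duration,Pulse\n" + "\n".join(
    f"{30 + i},{100 + i}" for i in range(100)
)

# read_csv accepts a file path or a file-like object such as StringIO.
df = pd.read_csv(io.StringIO(csv_text))

print(df.head())       # first 5 rows
print(df.tail(5))      # last 5 rows
print(df)              # long DataFrame: a shortened view is printed
print(df.to_string())  # every row and column
```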
Delete a column from Dataset
• You can delete a column (feature) from a
dataset:
– df.drop(df.columns[1], axis=1, inplace=True)

• Column selection: df.columns[1] selects the second column (positions start at 0).
• Axis parameter: axis=1 specifies that you are dropping a column. For
rows, use axis=0.
• inplace=True: the DataFrame is modified in place and drop() returns None.
• inplace=False (the default): the DataFrame is left unchanged and drop() returns a new DataFrame without the column.
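The difference between the two inplace settings can be seen in a short sketch. The "pulse" column and its values are hypothetical, added so that a column remains on either side of the one being dropped.

```python
import pandas as pd

df = pd.DataFrame({
    "calories": [420, 380, 390],
    "duration": [50, 40, 45],
    "pulse": [110, 117, 103],  # hypothetical extra column
})

# inplace=False (the default): df is unchanged, a new DataFrame is returned.
smaller = df.drop(df.columns[1], axis=1)
columns_before = list(df.columns)

# inplace=True: df itself loses the column and the call returns None.
result = df.drop(df.columns[1], axis=1, inplace=True)

print(columns_before)
print(list(smaller.columns))
print(list(df.columns), result)
```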
Delete a row from Dataset
• You can delete a row (record) from a
dataset:
– df.drop(1, axis=0, inplace=True)

• Row selection: the first parameter, 1, is the index label of the row to drop. With the default index (0, 1, 2, ...), this is the second row, not the first.
• Axis parameter: axis=0 specifies that you are dropping a row.
• inplace=True: the DataFrame is modified in place and drop() returns None.
• inplace=False (the default): the DataFrame is left unchanged and drop() returns a new DataFrame without the row.
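A minimal sketch of dropping a row by its index label, using the calories/duration data from the earlier DataFrame example. The name "remaining" is an assumption introduced here.

```python
import pandas as pd

df = pd.DataFrame({
    "calories": [420, 380, 390],
    "duration": [50, 40, 45],
})

# Drop the row whose index label is 1 (the second row with the default index).
remaining = df.drop(1, axis=0)

print(remaining)
print(remaining.index.tolist())  # the surviving labels are not renumbered
```

Because drop() works on labels, the remaining rows keep their original labels; the original df is untouched since inplace was not set.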
Pandas-Some other useful commands