100% found this document useful (2 votes)

257 views

Python Pandas Demo PDF

This document provides an overview of pandas and pandas DataFrames. It defines what a DataFrame is, lists some of its key characteristics like having rows and columns, and describes common DataFrame methods like selecting columns, filtering rows, and handling missing data. It also gives examples of creating a DataFrame from a dictionary of data and using methods like isnull() and dropna().

Uploaded by

Rakshit Kukreja

Available Formats

Download as PDF, TXT or read online on Scribd

100% found this document useful (2 votes)

257 views

Python Pandas Demo PDF

Uploaded by

Rakshit Kukreja

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 23

DoyPyEdu PLAY WITH PYTHON

PYTHON PANDAS

 Python pandas

 Pandas data frame

 Characteristics of data frame

 Selecting or accessing a column

 Data frame using rows / column names

 Program all mix

 Practical programs of pandas

Pandas Data Frame is two-dimensional size-mutable, potentially heterogeneous tabular data

structure with labeled axes (rows and columns). A Data frame is a two-dimensional data structure, i.e., data is
aligned in a tabular fashion in rows and columns. Pandas DataFrame consists of three principal components,
the data, rows, and columns.

Page 1 of 23 EDUCATION FOR EVERYONE

DoyPyEdu PLAY WITH PYTHON

CHARACTERISTICS OF DATA FRAME:

1. has two axes – row index or column index
2. like a spreadsheet where each value is identifiable with the combination of row index and colimn index
3. indices can be of number or letter or strings
4. can easily change its value
For working in pandas generally we import both numpy and pandas. We import numpy because sometimes
numpy function are also needed by giving import statements.
Import numpy as np
Import pandas as pd
DataFrame Methods:
FUNCTION DESCRIPTION
index() Method returns index (row labels) of the DataFrame
insert() Method inserts a column into a DataFrame
add() Method returns addition of dataframe and other, element-wise (binary operator add)
sub() Method returns subtraction of dataframe and other, element-wise (binary operator sub)
mul() Method returns multiplication of dataframe and other, element-wise (binary operator mul)
div() Method returns floating division of dataframe and other, element-wise (binary operator
truediv)
unique() Method extracts the unique values in the dataframe
nunique() Method returns count of the unique values in the dataframe
value_counts() Method counts the number of times each unique value occurs within the Series
columns() Method returns the column labels of the DataFrame
axes() Method returns a list representing the axes of the DataFrame
isnull() Method creates a Boolean Series for extracting rows with null values
notnull() Method creates a Boolean Series for extracting rows with non-null values
between() Method extracts rows where a column value falls in between a predefined range
isin() Method extracts rows from a DataFrame where a column value exists in a predefined
collection
dtypes() Method returns a Series with the data type of each column. The result’s index is the original
DataFrame’s columns
astype() Method converts the data types in a Series
values() Method returns a Numpy representation of the DataFrame i.e. only the values in the
DataFrame will be returned, the axes labels will be removed
sort_values()- Set1 Method sorts a data frame in Ascending or Descending order of passed Column
, Set2
sort_index() Method sorts the values in a DataFrame based on their index positions or labels instead of
their values but sometimes a data frame is made out of two or more data frames and hence
later index can be changed using this method
loc[] Method retrieves rows based on index label
iloc[] Method retrieves rows based on index position
ix[] Method retrieves DataFrame rows based on either index label or index position. This method
combines the best features of the .loc[] and .iloc[] methods
rename() Method is called on a DataFrame to change the names of the index labels or column names
columns() Method is an alternative attribute to change the coloumn name
drop() Method is used to delete rows or columns from a DataFrame
pop() Method is used to delete rows or columns from a DataFrame
sample() Method pulls out a random sample of rows or columns from a DataFrame
nsmallest() Method pulls out the rows with the smallest values in a column
nlargest() Method pulls out the rows with the largest values in a column
shape() Method returns a tuple representing the dimensionality of the DataFrame
ndim() Method returns an ‘int’ representing the number of axes / array dimensions.
Returns 1 if Series, otherwise returns 2 if DataFrame
dropna() Method allows the user to analyze and drop Rows/Columns with Null values in different ways
fillna() Method manages and let the user replace NaN values with some value of their own
rank() Values in a Series can be ranked in order with this method
query() Method is an alternate string-based syntax for extracting a subset from a DataFrame
copy() Method creates an independent copy of a pandas object

Page 2 of 23 EDUCATION FOR EVERYONE

DoyPyEdu PLAY WITH PYTHON

duplicated() Method creates a Boolean Series and uses it to extract rows that have duplicate values
drop_duplicates() Method is an alternative option to identifying duplicate rows and removing them through
filtering
set_index() Method sets the DataFrame index (row labels) using one or more existing columns
reset_index() Method resets index of a Data Frame. This method sets a list of integer ranging from 0 to
length of data as index
where() Method is used to check a Data Frame for one or more condition and return the result
accordingly. By default, the rows not satisfying the condition are filled with NaN value

Attribute Description
Columns The column label of the data frame
Index The index (row labels) of the data frame
Axes Returns a list representing both the axes of the data frame
dtypes Return the data type of the dataFrame
size Returns the int representing the number of element in this object
shape Returns a tuple representing the dimensionally of the dat frame
values Returns a numpy representation of the data frame
empty Indicator whether dataframe is empty
ndim Returns a int representing the number of axes
T Transpose index and columns
SELECTING OR ACCESSING A COLUMN:
<data frame object [<column name>] {USE SQUARE BRACKETS}
Or
<Data frame object>.<column name> {USE DOT NOTATION}
Selecting a subset from a data frame Using Rows / Column Names
1. To Access A Row <DF OBJECT>.loc[<row label>,:]

2. To Access Multiple Rows <DF OBJECT>.loc[<start row> : <end row>,:]

3. To Access Selective Column <DF OBJECT>.loc[<start row >:<end row>,:]

4. To Access Range Of Column <df object>.loc[<start row>:<endrow>,<start column>:<end column>]

import pandas as p Data:

print("Data: \n") Empty DataFrame
data1=p.DataFrame() Columns: []
print(data1) Index: []
Working With Pandas DataFrama
Case 1
Example 1:
import pandas as pd
import numpy as np
dict = {'Info 1':[20,30,np.nan,60,34,33],
'Info 2': [50,20,70,np.nan,14,23],
'Info 3':[70,np.nan,40,50,40,13],
'Info 3':[40,70,10,20,24,np.nan],
'Info 4':[np.nan,np.nan,np.nan,np.nan,np.nan,np.nan]}
df = pd.DataFrame(dict)
print(df)

Page 3 of 23 EDUCATION FOR EVERYONE

DoyPyEdu PLAY WITH PYTHON

print("-"*30)
print(df.isnull())
print("-"*30)
print(df.dropna(1))
print("-"*30)
for i, j in df.iterrows():
print(i, j)
print()
Output:
Info 1 Info 2 Info 3 Info 4 ------------------------------ 3 Info 1 60.0
0 20.0 50.0 40.0 NaN 0 Info 1 20.0 Info 2 NaN
1 30.0 20.0 70.0 NaN Info 2 50.0 Info 3 20.0
2 NaN 70.0 10.0 NaN Info 3 40.0 Info 4 NaN
3 60.0 NaN 20.0 NaN Info 4 NaN Name: 3, dtype: float64
4 34.0 14.0 24.0 NaN Name: 0, dtype: float64
5 33.0 23.0 NaN NaN 4 Info 1 34.0
------------------------------ 1 Info 1 30.0 Info 2 14.0
Info 1 Info 2 Info 3 Info 4 Info 2 20.0 Info 3 24.0
0 False False False True Info 3 70.0 Info 4 NaN
1 False False False True Info 4 NaN Name: 4, dtype: float64
2 True False False True Name: 1, dtype: float64
3 False True False True 5 Info 1 33.0
4 False False False True 2 Info 1 NaN Info 2 23.0
5 False False True True Info 2 70.0 Info 3 NaN
------------------------------ Info 3 10.0 Info 4 NaN
Empty DataFrame Info 4 NaN Name: 5, dtype: float64
Columns: [] Name: 2, dtype: float64
Index: [0, 1, 2, 3, 4, 5]
Example 2:
import pandas as pd
import numpy as np
DATA = {'Stu_Name':['Mohan', 'Rahul', 'Jeevin', 'Pawan'],
'Total_Marks':[250, 210, 319, 218]}
df = pd.DataFrame(DATA)
print(df)
print("-"*30)
print(df[['Stu_Name', 'Total_Marks']])

Output:
Stu_Name Total_Marks Stu_Name Total_Marks
0 Mohan 250 0 Mohan 250
1 Rahul 210 1 Rahul 210
2 Jeevin 319 2 Jeevin 319
3 Pawan 218 3 Pawan 218
------------------------------
Example 3:
import pandas as pd
import numpy as np
dict = {'Info 1':[20,30,np.nan,60,34,33],
'Info 2': [50,20,70,np.nan,14,23],
'Info 3':[70,np.nan,40,50,40,13],
'Info 3':[40,70,10,20,24,np.nan],
'Info 4':[np.nan,np.nan,np.nan,np.nan,np.nan,np.nan]}
df = pd.DataFrame(dict)
Page 4 of 23 EDUCATION FOR EVERYONE
DoyPyEdu PLAY WITH PYTHON

print(df)
print("-"*30)
columns = list(df)
print(columns)
print("-"*30)
for i in columns:
print (df[i][2])
print("-"*30)
Output:
Info 1 Info 2 Info 3 Info 4 ------------------------------
0 20.0 50.0 40.0 NaN nan
1 30.0 20.0 70.0 NaN ------------------------------
2 NaN 70.0 10.0 NaN 70.0
3 60.0 NaN 20.0 NaN ------------------------------
4 34.0 14.0 24.0 NaN 10.0
5 33.0 23.0 NaN NaN ------------------------------
------------------------------ nan
['Info 1', 'Info 2', 'Info 3', 'Info 4'] ------------------------------
Example 4:
import pandas as pd for i in columns:
import numpy as np print (df[i],end=", ")
dict = {'Info 1':[20,30,np.nan,60,34,33], print("-"*30)
'Info 2': [50,20,70,np.nan,14,23], for i in columns:
'Info 3':[70,np.nan,40,50,40,13], print (df[i][2],end=", ")
'Info 3':[40,70,10,20,24,np.nan]} print()
df = pd.DataFrame(dict) print("-"*30)
print(df) for i in columns:
print("-"*30) for j in range(0,len(df[i])):
columns = list(df) print (df[i][j],end=", ")
print(columns) print()
print("-"*30) print("-"*30)
Output:
Info 1 Info 2 Info 3 4 34.0 4 24.0
0 20.0 50.0 40.0 5 33.0 5 NaN
1 30.0 20.0 70.0 Name: Info 1, dtype: float64, 0 Name: Info 3, dtype: float64,
2 NaN 70.0 10.0 50.0 ------------------------------
3 60.0 NaN 20.0 1 20.0 nan,
4 34.0 14.0 24.0 2 70.0 70.0,
5 33.0 23.0 NaN 3 NaN 10.0,
------------------------------ 4 14.0 ------------------------------
['Info 1', 'Info 2', 'Info 3'] 5 23.0 20.0, 30.0, nan, 60.0, 34.0, 33.0,
------------------------------ Name: Info 2, dtype: float64, 0 50.0, 20.0, 70.0, nan, 14.0, 23.0,
0 20.0 40.0 40.0, 70.0, 10.0, 20.0, 24.0, nan,
1 30.0 1 70.0 ------------------------------
2 NaN 2 10.0
3 60.0 3 20.0
Case 2
Example 1:
import pandas as pd df = pd.DataFrame(dict)
import numpy as np print(df)
dict = {'Student 1':[20,30,60,34,33], print("-"*30)
'Student 2': [50,20,70,14,23], print(df.fillna(0))
'Student 3':[70,40,50,40,13]} print("-"*30)
Page 5 of 23 EDUCATION FOR EVERYONE
DoyPyEdu PLAY WITH PYTHON

Output:
Student 1 Student 2 Student 3 Student 1 Student 2 Student 3
0 20 50 70 0 20 50 70
1 30 20 40 1 30 20 40
2 60 70 50 2 60 70 50
3 34 14 40 3 34 14 40
4 33 23 13 4 33 23 13
------------------------------ ------------------------------
Example 2:
import pandas as pd
import numpy as np
dict = { 'Info 1':[20,30,np.nan,60,34,33],
'Info 2': [50,20,70,np.nan,14,23],
'Info 3':[70,np.nan,40,50,40,13],
'Info 3':[40,70,10,20,24,np.nan],
'Info 4':[np.nan,np.nan,np.nan,np.nan,np.nan,np.nan]}
df = pd.DataFrame(dict)
print(df)
print("-"*30)
print(df.fillna(0))
print("-"*30)
print(df.fillna(1))
print("-"*30)

Output:
Info 1 Info 2 Info 3 Info 4 Info 1 Info 2 Info 3 Info 4 Info 1 Info 2 Info 3 Info 4
0 20.0 50.0 40.0 NaN 0 20.0 50.0 40.0 0.0 0 20.0 50.0 40.0 1.0
1 30.0 20.0 70.0 NaN 1 30.0 20.0 70.0 0.0 1 30.0 20.0 70.0 1.0
2 NaN 70.0 10.0 NaN 2 0.0 70.0 10.0 0.0 2 1.0 70.0 10.0 1.0
3 60.0 NaN 20.0 NaN 3 60.0 0.0 20.0 0.0 3 60.0 1.0 20.0 1.0
4 34.0 14.0 24.0 NaN 4 34.0 14.0 24.0 0.0 4 34.0 14.0 24.0 1.0
5 33.0 23.0 NaN NaN 5 33.0 23.0 0.0 0.0 5 33.0 23.0 1.0 1.0
------------------------------ ------------------------------ ------------------------------
Example 3:
import pandas as pd
import numpy as np
dict = {'Info 1':[20,30,np.nan,60,34,33],
'Info 2': [50,20,70,np.nan,14,23],
'Info 3':[70,np.nan,40,50,40,13],
'Info 3':[40,70,10,20,24,np.nan],
'Info 4':[np.nan,np.nan,np.nan,np.nan,np.nan,np.nan]}
df = pd.DataFrame(dict)
print(df)
print("-"*30)
print(df.isnull())
print("-"*30)
Output:
Info 1 Info 2 Info 3 Info 4 Info 1 Info 2 Info 3 Info 4
0 20.0 50.0 40.0 NaN 0 False False False True
1 30.0 20.0 70.0 NaN 1 False False False True
2 NaN 70.0 10.0 NaN 2 True False False True
3 60.0 NaN 20.0 NaN 3 False True False True
4 34.0 14.0 24.0 NaN 4 False False False True
5 33.0 23.0 NaN NaN 5 False False True True
------------------------------ ------------------------------
Page 6 of 23 EDUCATION FOR EVERYONE
DoyPyEdu PLAY WITH PYTHON

Working With loc()/iloc()/at()/del methods of Pandas in DataFrama

Example 1:
import pandas as pd print("df.loc['R2', 'Stu2'] : \n",R)
import numpy as np print("-"*30)
dict = {'Stu1':[20,30,60,34,33], R = df.loc[:, ['Stu1', 'Stu3']]
'Stu2': [50,20,70,14,23], print("df.loc[ : , ['Stu1', 'Stu3']] : \n",R)
'Stu3':[70,40,50,40,13]} print("-"*30)
df = pd.DataFrame(dict) R = df.iloc[0:2, :]
print(df) print("df.iloc[0:2, :] : \n",R)
print("-"*30) print("-"*30)
ind = ['R1', 'R2', 'R3','R4','R5'] R = df.iloc[2:2, :]
print(ind) print("df.iloc[2:2, :] : \n",R)
df.index = ind print("-"*30)
print(df) R = df.iloc[2:3, :]
print("-"*30) print("df.iloc[2:3, :] : \n",R)
R = df.loc['R2', 'Stu2'] print("-"*30)
Output:
Stu1 Stu2 Stu3 R5 33 23 13 Stu1 Stu2 Stu3
0 20 50 70 ------------------------------ R1 20 50 70
1 30 20 40 df.loc['R2', 'Stu2'] : R2 30 20 40
2 60 70 50 20 ------------------------------
3 34 14 40 ------------------------------ df.iloc[2:2, :] :
4 33 23 13 df.loc[ : , ['Stu1', 'Stu3']] : Empty DataFrame
------------------------------ Stu1 Stu3 Columns: [Stu1, Stu2, Stu3]
['R1', 'R2', 'R3', 'R4', 'R5'] R1 20 70 Index: []
Stu1 Stu2 Stu3 R2 30 40 ------------------------------
R1 20 50 70 R3 60 50 df.iloc[2:3, :] :
R2 30 20 40 R4 34 40 Stu1 Stu2 Stu3
R3 60 70 50 R5 33 13 R3 60 70 50
R4 34 14 40 ------------------------------ ------------------------------
df.iloc[0:2, :] :
Example 2:
import pandas as pd r=df.at[2, 'C1']
import numpy as np print(r)
#pd.DataFrame(DATA in list/Dictionary,Rows in print("-"*30)
list[string/integer],columns in list[string/integer] print("Display Before update :")
df = pd.DataFrame([[1, 2, 3], [5, 4, 6], [11, 120, print(df)
310]], index=[1, 2, 3], columns=['C1', 'C2', 'C3']) print("P/Insert at df.at[2, 'C1']=41:")
print("All Data :") df.at[2, 'C1']=41
print(df) print("Display After update :")
print("-"*30) print(df)
print("Display at :") print("-"*30)
Output:
All Data : C1 C2 C3
C1 C2 C3 1 1 2 3
1 1 2 3 2 5 4 6
2 5 4 6 3 11 120 310
3 11 120 310 P/Insert at df.at[2, 'C1']=41:
------------------------------ Display After update :
Display at : C1 C2 C3
5 1 1 2 3
------------------------------ 2 41 4 6
Display Before update : 3 11 120 310
Page 7 of 23 EDUCATION FOR EVERYONE
DoyPyEdu PLAY WITH PYTHON

------------------------------
Example 3:
import pandas as pd print(r)
import numpy as np print("-"*30)
DATA =[[1, 2, 3], [5, 4, 6], [11, 120, 310]] print("Display Before update :")
Rows=[1, 2, 3] print(df)
Columns =['C1', 'C2', 'C3'] print("P/Insert at df.at[2, 'C1']=41:")
df = pd.DataFrame(DATA, Rows, Columns) df.at[2, 'C2']=41
print("All Data :") df.at[1, 'C1']=39
print(df) df.at[0, 'C3']=51
print("-"*30) print("Display After update :")
print("Display at :") print(df)
r=df.at[1, 'C2'] print("-"*30)
Output:
All Data : ------------------------------ C1 C2 C3
C1 C2 C3 Display Before update : 1 39.0 2.0 3.0
1 1 2 3 C1 C2 C3 2 5.0 41.0 6.0
2 5 4 6 1 1 2 3 3 11.0 120.0 310.0
3 11 120 310 2 5 4 6 0 NaN NaN 51.0
------------------------------ 3 11 120 310 ------------------------------
Display at : P/Insert at df.at[2, 'C1']=41:
2 Display After update :
Example 4:
import pandas as pd print(df)
import numpy as np print("-"*30)
DATA =[[1, 2, 3], [5, 4, 6], [11, 120, 310]] print("Del Column C2:")
Rows=[1, 2, 3] del df['C2']
Columns =['C1', 'C2', 'C3'] print(df)
df = pd.DataFrame(DATA, Rows, Columns) print("-"*30)
print("All Data :")
Output: ------------------------------
All Data : Del Column C2:
C1 C2 C3 C1 C3
1 1 2 3 1 1 3
2 5 4 6 2 5 6
3 11 120 310 3 11 310
------------------------------
Working With min()/max()/mode()/mean()/median()
Methods in DataFrama
Example 1:
import pandas as pd print("-"*30)
import numpy as np print("df.min():")
DATA =[[11, 2, 3], [5, 0, 6], [101, 20, 310]] print(df.min())
Rows=[1, 2, 3] print("-"*30)
Columns =['C1', 'C2', 'C3'] print("df.max():")
df = pd.DataFrame(DATA, Rows, Columns) print(df.max())
print("All Data :") print("-"*30)
print(df)
Output:
All Data : ------------------------------ dtype: int64
C1 C2 C3 df.min(): ------------------------------
1 11 2 3 C1 5 df.max():
2 5 0 6 C2 0 C1 101
3 101 20 310 C3 3 C2 20

Page 8 of 23 EDUCATION FOR EVERYONE

DoyPyEdu PLAY WITH PYTHON

C3 310 dtype: int64 ------------------------------

Example 2:
import pandas as pd print("df.max(axis = 0) : ")
data=[[10, 41, 51, 17, 12], [15, 12, 4, 10, 2], print(df.max(axis = 0) )
[2, 6, 17, 13, 15], [24, 13, 17, 17, 6]] print("*"*30)
rows=["Round 1","Round 2","Round 3","Round 4"] print("df.min(axis = 0) : ")
columns=["Game 1","Game 2","Game 3","Game 4","Game 5"] print(df.min(axis = 0) )
print(data) print("*"*30)
print(rows) print("df.max(axis = 1) : ")
print(columns) print(df.max(axis = 1) )
df = pd.DataFrame(data,rows,columns) print("*"*30)
print("*"*30) print("df.min(axis = 1) : ")
print("All Data:") print(df.min(axis = 1) )
print(df) print("*"*30)
print("*"*30) print("df.max(axis = 1,skipna=True) : ")
print("df.min() : ") print(df.max(axis = 1,skipna=True) )
print(df.min()) print("*"*30)
print("*"*30) print("df.min(axis = 1,skipna=True) : ")
print("df.max() :") print(df.min(axis = 1,skipna=True) )
print(df.max())
print("*"*30)
Output:
[[10, 41, 51, 17, 12], [15, 12, 4, 10, 2], Game 4 17
[2, 6, 17, 13, 15], [24, 13, 17, 17, 6]] Game 5 15
['Round 1', 'Round 2', 'Round 3', 'Round 4'] dtype: int64
['Game 1', 'Game 2', 'Game 3', 'Game 4', 'Game 5'] ******************************
****************************** df.min(axis = 0) :
All Data: Game 1 2
Game 1 Game 2 Game 3 Game 4 Game 5 Game 2 6
Round 1 10 41 51 17 12 Game 3 4
Round 2 15 12 4 10 2 Game 4 10
Round 3 2 6 17 13 15 Game 5 2
Round 4 24 13 17 17 6 dtype: int64
****************************** ******************************
df.min() : df.max(axis = 1) :
Game 1 2 Round 1 51
Game 2 6 Round 2 15
Game 3 4 Round 3 17
Game 4 10 Round 4 24
Game 5 2 dtype: int64
dtype: int64 ******************************
****************************** df.min(axis = 1) :
df.max() : Round 1 10
Game 1 24 Round 2 2
Game 2 41 Round 3 2
Game 3 51 Round 4 6
Game 4 17 dtype: int64
Game 5 15 ******************************
dtype: int64 df.max(axis = 1,skipna=True) :
****************************** Round 1 51
df.max(axis = 0) : Round 2 15
Game 1 24 Round 3 17
Game 2 41 Round 4 24
Game 3 51 dtype: int64
Page 9 of 23 EDUCATION FOR EVERYONE
DoyPyEdu PLAY WITH PYTHON

****************************** Round 3 2
df.min(axis = 1,skipna=True) : Round 4 6
Round 1 10 dtype: int64
Round 2 2
Example 3:
import pandas as pd print("df.mode(axis = 0) : ")
data=[[10, 41, 51, 17, 12], [15, 12, 4, 10, 2], print(df.mode(axis = 0))
[2, 6, 17, 13, 15], [24, 13, 17, 17, 6]] print("df.median(axis = 0) : ")
rows=["Round 1","Round 2","Round 3","Round 4"] print(df.median(axis = 0))
columns=["Game 1","Game 2","Game 3","Game 4","Game 5"] print("*"*30)
print(data) print("df.mean(axis = 1) : ")
print(rows) print(df.mean(axis = 1))
print(columns) print("df.mode(axis = 1) : ")
df = pd.DataFrame(data,rows,columns) print(df.mode(axis = 1))
print("*"*30) print("df.median(axis = 1) : ")
print("All Data:") print(df.median(axis = 1))
print(df) print("*"*30)
print("*"*30) print("df.mean(axis = 0,skipna=True) : ")
print("df.mean() : ") print(df.mean(axis = 0,skipna=True) )
print(df.mean()) print("df.median(axis = 0,skipna=True) : ")
print("df.mode() : ") print(df.median(axis = 0,skipna=True) )
print(df.mode()) print("*"*30)
print("df.median() : ") print("df.mean(axis = 1,skipna=True) : ")
print(df.median()) print(df.mean(axis = 1,skipna=True) )
print("*"*30) print("df.median(axis = 1,skipna=True) : ")
print("df.mean(axis = 0) : ") print(df.median(axis = 1,skipna=True) )
print(df.mean(axis = 0))
Output:
[[10, 41, 51, 17, 12], [15, 12, 4, 10, 2], Game 2 12.5
[2, 6, 17, 13, 15], [24, 13, 17, 17, 6]] Game 3 17.0
['Round 1', 'Round 2', 'Round 3', 'Round 4'] Game 4 15.0
['Game 1', 'Game 2', 'Game 3', 'Game 4', 'Game 5'] Game 5 9.0
****************************** dtype: float64
All Data: ******************************
Game 1 Game 2 Game 3 Game 4 Game 5 df.mean(axis = 0) :
Round 1 10 41 51 17 12 Game 1 12.75
Round 2 15 12 4 10 2 Game 2 18.00
Round 3 2 6 17 13 15 Game 3 22.25
Round 4 24 13 17 17 6 Game 4 14.25
****************************** Game 5 8.75
df.mean() : dtype: float64
Game 1 12.75 df.mode(axis = 0) :
Game 2 18.00 Game 1 Game 2 Game 3 Game 4 Game 5
Game 3 22.25 0 2 6 17.0 17.0 2
Game 4 14.25 1 10 12 NaN NaN 6
Game 5 8.75 2 15 13 NaN NaN 12
dtype: float64 3 24 41 NaN NaN 15
df.mode() : df.median(axis = 0) :
Game 1 Game 2 Game 3 Game 4 Game 5 Game 1 12.5
0 2 6 17.0 17.0 2 Game 2 12.5
1 10 12 NaN NaN 6 Game 3 17.0
2 15 13 NaN NaN 12 Game 4 15.0
3 24 41 NaN NaN 15 Game 5 9.0
df.median() : dtype: float64
Game 1 12.5 ******************************

Page 10 of 23 EDUCATION FOR EVERYONE

DoyPyEdu PLAY WITH PYTHON

df.mean(axis = 1) : Game 4 14.25

Round 1 26.2 Game 5 8.75
Round 2 8.6 dtype: float64
Round 3 10.6 df.median(axis = 0,skipna=True) :
Round 4 15.4 Game 1 12.5
dtype: float64 Game 2 12.5
df.mode(axis = 1) : Game 3 17.0
0 1 2 3 4 Game 4 15.0
Round 1 10.0 12.0 17.0 41.0 51.0 Game 5 9.0
Round 2 2.0 4.0 10.0 12.0 15.0 dtype: float64
Round 3 2.0 6.0 13.0 15.0 17.0 ******************************
Round 4 17.0 NaN NaN NaN NaN df.mean(axis = 1,skipna=True) :
df.median(axis = 1) : Round 1 26.2
Round 1 17.0 Round 2 8.6
Round 2 10.0 Round 3 10.6
Round 3 13.0 Round 4 15.4
Round 4 17.0 dtype: float64
dtype: float64 df.median(axis = 1,skipna=True) :
****************************** Round 1 17.0
df.mean(axis = 0,skipna=True) : Round 2 10.0
Game 1 12.75 Round 3 13.0
Game 2 18.00 Round 4 17.0
Game 3 22.25 dtype: float64
Working With sum()/count() Methods in DataFrama
Example 1:
import pandas as pd print(df.sum())
data=[[10, 41, 51, 17, 12], [15, 12, 4, 10, 2], print("df.count() : ")
[2, 6, 17, 13, 15], [24, 13, 17, 17, 6]] print(df.count())
rows=["Round 1","Round 2","Round 3","Round 4"] print("*"*30)
columns=["Game 1","Game 2","Game 3","Game 4","Game 5"]
print("df.sum(axis = 0) : ")
print(data)
print(df.sum(axis = 0))
print(rows)
print("df.count(axis = 0) : ")
print(columns)
print(df.count(axis = 0))
df = pd.DataFrame(data,rows,columns)
print("*"*30)
print("*"*30)
print("df.sum(axis = 1) : ")
print("All Data:")
print(df.sum(axis = 1))
print(df)
print("df.count(axis = 1) : ")
print("*"*30)
print(df.count(axis = 1))
print("df.sum() : ")
print("*"*30)
Output:
[[10, 41, 51, 17, 12], [15, 12, 4, 10, 2], Game 2 72
[2, 6, 17, 13, 15], [24, 13, 17, 17, 6]] Game 3 89
['Round 1', 'Round 2', 'Round 3', 'Round 4'] Game 4 57
['Game 1', 'Game 2', 'Game 3', 'Game 4', 'Game 5'] Game 5 35
****************************** dtype: int64
All Data: df.count() :
Game 1 Game 2 Game 3 Game 4 Game 5 Game 1 4
Round 1 10 41 51 17 12 Game 2 4
Round 2 15 12 4 10 2 Game 3 4
Round 3 2 6 17 13 15 Game 4 4
Round 4 24 13 17 17 6 Game 5 4
****************************** dtype: int64
df.sum() : ******************************
Game 1 51 df.sum(axis = 0) :
Page 11 of 23 EDUCATION FOR EVERYONE
DoyPyEdu PLAY WITH PYTHON

Game 1 51 df.sum(axis = 1) :
Game 2 72 Round 1 131
Game 3 89 Round 2 43
Game 4 57 Round 3 53
Game 5 35 Round 4 77
dtype: int64 dtype: int64
df.count(axis = 0) : df.count(axis = 1) :
Game 1 4 Round 1 5
Game 2 4 Round 2 5
Game 3 4 Round 3 5
Game 4 4 Round 4 5
Game 5 4 dtype: int64
dtype: int64 ******************************
******************************
Working With var()/quantile() Methods in DataFrama
Example 1:
import pandas as pd print(df)
data=[[10, 41, 51, 17, 12], [15, 12, 4, 10, 2], print("*"*30)
[2, 6, 17, 13, 15], [24, 13, 17, 17, 6]] print("df.quantile(.2, axis = 0): ")
rows=["Round 1","Round 2","Round 3","Round 4"] print(df.quantile(.2, axis = 0))
columns=["Game 1","Game 2","Game 3","Game 4","Game 5"]
print("*"*30)
print(data)
print("df.quantile(.2, axis = 1): ")
print(rows)
print(df.quantile(.2, axis = 1))
print(columns)
print("*"*30)
df = pd.DataFrame(data,rows,columns)
print("df.quantile(.4, axis = 0): ")
print("*"*30)
print(df.quantile(.4, axis = 0))
print("All Data:")
print("*"*30)
Output:
[[10, 41, 51, 17, 12], [15, 12, 4, 10, 2], Name: 0.2, dtype: float64
[2, 6, 17, 13, 15], [24, 13, 17, 17, 6]] ******************************
['Round 1', 'Round 2', 'Round 3', 'Round 4'] df.quantile(.2, axis = 1):
['Game 1', 'Game 2', 'Game 3', 'Game 4', 'Game 5'] Round 1 11.6
****************************** Round 2 3.6
All Data: Round 3 5.2
Game 1 Game 2 Game 3 Game 4 Game 5 Round 4 11.6
Round 1 10 41 51 17 12 Name: 0.2, dtype: float64
Round 2 15 12 4 10 2 ******************************
Round 3 2 6 17 13 15 df.quantile(.4, axis = 0):
Round 4 24 13 17 17 6 Game 1 11.0
****************************** Game 2 12.2
df.quantile(.2, axis = 0): Game 3 17.0
Game 1 6.8 Game 4 13.8
Game 2 9.6 Game 5 7.2
Game 3 11.8 Name: 0.4, dtype: float64
Game 4 11.8 ******************************
Game 5 4.4
Example 2:
import pandas as pd df = pd.DataFrame(data,rows,columns)
data=[[10, 41, 51, 17, 12], [15, 12, 4, 10, 2], print("*"*30)
[2, 6, 17, 13, 15], [24, 13, 17, 17, 6]] print("All Data:")
rows=["Round 1","Round 2","Round 3","Round 4"] print(df)
columns=["Game 1","Game 2","Game 3","Game 4","Game 5"] print("*"*30)
print(data) print("df.quantile(.2, axis = 0): ")
print(rows) print(df.quantile(.2, axis = 0))
print(columns) print("*"*30)
Page 12 of 23 EDUCATION FOR EVERYONE
DoyPyEdu PLAY WITH PYTHON

print("df.quantile(.2, axis = 1): ") print(""30)

print(df.quantile(.2, axis = 1)) print("df.quantile([.1, .25, .5, .75], axis = 1): ")
print("*"*30) print(df.quantile([.1, .25, .5, .75], axis = 1) )
print("df.quantile([.1, .25, .5, .75], axis = 0): ") print("*"*30)
print(df.quantile([.1, .25, .5, .75], axis = 0) )
Output:
[[10, 41, 51, 17, 12], [15, 12, 4, 10, 2], [2, 6, 17, 13, Round 1 11.6
15], [24, 13, 17, 17, 6]] Round 2 3.6
['Round 1', 'Round 2', 'Round 3', 'Round 4'] Round 3 5.2
['Game 1', 'Game 2', 'Game 3', 'Game 4', 'Game 5'] Round 4 11.6
****************************** Name: 0.2, dtype: float64
All Data: ******************************
Game 1 Game 2 Game 3 Game 4 Game 5 df.quantile([.1, .25, .5, .75], axis = 0):
Round 1 10 41 51 17 12 Game 1 Game 2 Game 3 Game 4 Game 5
Round 2 15 12 4 10 2 0.10 4.40 7.8 7.90 10.90 3.20
Round 3 2 6 17 13 15 0.25 8.00 10.5 13.75 12.25 5.00
Round 4 24 13 17 17 6 0.50 12.50 12.5 17.00 15.00 9.00
****************************** 0.75 17.25 20.0 25.50 17.00 12.75
df.quantile(.2, axis = 0): ******************************
Game 1 6.8 df.quantile([.1, .25, .5, .75], axis = 1):
Game 2 9.6 Round 1 Round 2 Round 3 Round 4
Game 3 11.8 0.10 10.8 2.8 3.6 8.8
Game 4 11.8 0.25 12.0 4.0 6.0 13.0
Game 5 4.4 0.50 17.0 10.0 13.0 17.0
Name: 0.2, dtype: float64 0.75 41.0 12.0 15.0 17.0
****************************** ******************************
df.quantile(.2, axis = 1):
Working With pivot()/pivot_table() Methods in DataFrama
Example 1:
import pandas as pd
df = pd.DataFrame({'R1': ['Rohit', 'Ravi', 'Rohan','Ramesh'],
'R2': ['10th', '10th', '11th','10th'],
'R3': [270, 230, 201,223]})
print("*"*30)
print("All Data:")
print(df)
print("*"*30)
print("Values can be an object or a list:")
print(df.pivot('R1', 'R2', 'R3') )
print("*"*30)
print("Value is a list:")
print(df.pivot(index ='R1', columns ='R2', values =['R3', 'R1']) )
Output:
******************************
All Data: Ravi 230.0 NaN
R1 R2 R3 Rohan NaN 201.0
0 Rohit 10th 270 Rohit 270.0 NaN
1 Ravi 10th 230 ******************************
2 Rohan 11th 201 Value is a list:
3 Ramesh 10th 223 R3 R1
****************************** R2 10th 11th 10th 11th
Values can be an object or a list: R1
R2 10th 11th Ramesh 223 NaN Ramesh NaN
R1 Ravi 230 NaN Ravi NaN
Ramesh 223.0 NaN Rohan NaN 201 NaN Rohan
Page 13 of 23 EDUCATION FOR EVERYONE
DoyPyEdu PLAY WITH PYTHON

Rohit 270 NaN Rohit NaN

Example 2:
import pandas as pd
import numpy as np
df = pd.DataFrame({ 'R1': ['Rohit', 'Ravi', 'Rohan','Ramesh'],
'R2': ['10th', '10th', '11th','10th'],
'R3': [270, 230, 201,223]})
print("*"*30)
print("All Data:")
print(df)
print("*"*30)
print("Simplest pivot table must have a dataframe ")
print("and an index/list of index. :")
table = pd.pivot_table(df, index =['R1', 'R2'])
print(table)
print("*"*30)
print("Creates a pivot table dataframe :")
table = pd.pivot_table(df, values ='R2', index =['R1', 'R2'],
columns =['R3'], aggfunc = np.sum)
print(table)
print("*"*30)
Output:
****************************** Ramesh 10th 223
All Data: Ravi 10th 230
R1 R2 R3 Rohan 11th 201
0 Rohit 10th 270 Rohit 10th 270
1 Ravi 10th 230 ******************************
2 Rohan 11th 201 Creates a pivot table dataframe :
3 Ramesh 10th 223 Empty DataFrame
****************************** Columns: []
Simplest pivot table must have a dataframe Index: [(Ramesh, 10th), (Ravi, 10th),
and an index/list of index. : (Rohan, 11th), (Rohit, 10th)]
R3 ******************************
R1 R2
More Examples of pivot() method:
>>> df = pd.DataFrame(
{ 'foo': ['one', 'one', 'one', 'two', 'two','two'], 'bar': ['A', 'B', 'C', 'A', 'B', 'C'],
'baz': [1, 2, 3, 4, 5, 6], 'zoo': ['x', 'y', 'z', 'q', 'w', 't']})
>>> df
foo bar baz zoo
0 one A 1 x
1 one B 2 y
2 one C 3 z
3 two A 4 q
4 two B 5 w
5 two C 6 t
>>> df.pivot(index='foo', columns='bar', values='baz')
bar A B C
foo
one 1 2 3
two 4 5 6
>>> df.pivot(index='foo', columns='bar')['baz']
bar A B C

Page 14 of 23 EDUCATION FOR EVERYONE

DoyPyEdu PLAY WITH PYTHON

foo
one 1 2 3
two 4 5 6
>>> df.pivot(index='foo', columns='bar', values=['baz', 'zoo'])
baz zoo
bar A B C A B C
foo
one 1 2 3 x y z
two 4 5 6 q w t
A ValueError is raised if there are any duplicates.
>>> df = pd.DataFrame({"foo": ['one', 'one', 'two', 'two'],"bar": ['A', 'A', 'B', 'C'],"baz": [1, 2, 3, 4]})
>>> df
foo bar baz
0 one A 1
1 one A 2
2 two B 3
3 two C 4
More Examples of pivot_table() method:
>>> df = pd.DataFrame({"A": ["foo", "foo", "foo", "foo", "foo",
... "bar", "bar", "bar", "bar"],
... "B": ["one", "one", "one", "two", "two",
... "one", "one", "two", "two"],
... "C": ["small", "large", "large", "small",
... "small", "large", "small", "small",
... "large"],
... "D": [1, 2, 2, 3, 3, 4, 5, 6, 7],
... "E": [2, 4, 5, 5, 6, 6, 8, 9, 9]})
>>> df
A B C D E
0 foo one small 1 2
1 foo one large 2 4
2 foo one large 2 5
3 foo two small 3 5
4 foo two small 3 6
5 bar one large 4 6
6 bar one small 5 8
7 bar two small 6 9
8 bar two large 7 9
This first example aggregates values by taking the sum.

>>> table = pd.pivot_table(df, values='D', index=['A', 'B'],

... columns=['C'], aggfunc=np.sum)
>>> table
C large small
A B
bar one 4.0 5.0
two 7.0 6.0
foo one 4.0 1.0
two NaN 6.0
We can also fill missing values using the fill_value parameter.
>>> table = pd.pivot_table(df, values='D', index=['A', 'B'],
... columns=['C'], aggfunc=np.sum, fill_value=0)
>>> table
C large small
A B
Page 15 of 23 EDUCATION FOR EVERYONE
DoyPyEdu PLAY WITH PYTHON

bar one 4 5
two 7 6
foo one 4 1
two 0 6
The next example aggregates by taking the mean across multiple columns.

>>> table = pd.pivot_table(df, values=['D', 'E'], index=['A', 'C'],

... aggfunc={'D': np.mean,
... 'E': np.mean})
>>> table
D E
A C
bar large 5.500000 7.500000
small 5.500000 8.500000
foo large 2.000000 4.500000
small 2.333333 4.333333
We can also calculate multiple types of aggregations for any given value column.

>>> table = pd.pivot_table(df, values=['D', 'E'], index=['A', 'C'],

... aggfunc={'D': np.mean,
... 'E': [min, max, np.mean]})
>>> table
D E
mean max mean min
A C
bar large 5.500000 9.0 7.500000 6.0
small 5.500000 9.0 8.500000 8.0
foo large 2.000000 5.0 4.500000 4.0
small 2.333333 6.0 4.333333 2.0
List:
Example:
import pandas as pd print("-------------------------")
import numpy as np print("Multi Type Data: \n")
print("Single list Type Data: \n") data = [['Ravi',13],['Jeevin',17],['Kunal',10]]
l1=[1,2,3] df = pd.DataFrame(data,columns=['Name','Age'])
data2=pd.DataFrame(l1) print (df)
print(data2) print("-------------------------")
print("-------------------------") print("Float type Data: \n")
print("Nested Type Data: \n") data = [['Ravi',13],['Jeevin',17],['Kunal',10]]
l2=[[1,2,3],[4,5,6]] df = pd.DataFrame(data,columns=['Name','Age'],dtype=float)
data3=pd.DataFrame(l2) print (df)
print(data3) print("-------------------------")
Output:
Single list Type Data: 0 1 2 2 Kunal 10
0 0 1 2 3 -------------------------
0 1 1 4 5 6 Float type Data:
1 2 ------------------------- Name Age
2 3 Multi Type Data: 0 Ravi 13.0
------------------------- Name Age 1 Jeevin 17.0
Nested Type Data: 0 Ravi 13 2 Kunal 10.0
1 Jeevin 17 -------------------------

Dictionary:
import pandas as p
print("Data Dictionary 1: \n")
Page 16 of 23 EDUCATION FOR EVERYONE
DoyPyEdu PLAY WITH
PYTHON

dict = {'Name':['Sumit', 'Ravi', 'Kali', 'Tarun'],'Age':[17,17,18,13]}

df = p.DataFrame(dict)
print (df)
print("----------------------------")
print("Data Dictionary 2: \n")
data = {'Name':['Raju', 'Jatin', 'Rahul', 'Sachin'],'Age':[15,14,17,13] }
df = p.DataFrame(data, index=['R1','R2','R3','R4'])
print (df)
print("----------------------------")
print("DataFrame from List of Dicts, Dictionary 3: \n")
data = [{'a': 1, 'b': 2},{'a': 5, 'b': 10, 'c': 20}]
df = p.DataFrame(data)
print (df)
print("----------------------------")
print("DataFrame from List of Dicts, Dictionary 4: \n")
data = [{'a': 1, 'b': 2},{'a': 5, 'b': 10, 'c': 20}]
df = p.DataFrame(data, index=['first', 'second'])
print (df)
print("----------------------------")

Output:
Data Dictionary 1: R4 Sachin 13
Name Age ----------------------------
0 Sumit 17 DataFrame from List of Dicts, Dictionary 3:
1 Ravi 17 a b c
2 Kali 18 0 1 2 NaN
3 Tarun 13 1 5 10 20.0
---------------------------- ----------------------------
Data Dictionary 2: DataFrame from List of Dicts, Dictionary 4:
Name Age a b c
R1 Raju 15 first 1 2 NaN
R2 Jatin 14 second 5 10 20.0
R3 Rahul 17 ----------------------------

Program All All Different Tipe Of Methods Of Dataframe:

import pandas as p
print("Data Dictionary 1: \n")
dict = {'Name':['Sumit', 'Ravi', 'Kali', 'Tarun'],'Age':[17,17,18,13]}
df = p.DataFrame(dict)
print (df)
print("="*30)
print("Data Dictionary 2: \n")
data = {'Name':['Raju', 'Jatin', 'Rahul', 'Sachin'],'Age':[15,14,17,13]}
df = p.DataFrame(data, index=['R1','R2','R3','R4'])
print (df)
print("="*30)
print("DataFrame from List of Dicts, Dictionary 3: \n")
data = [{'a': 1, 'b': 2},{'a': 5, 'b': 10, 'c': 20}]
df = p.DataFrame(data)
print (df)
print("----------------------------")
print("DataFrame from List of Dicts, Dictionary 4: \n")
Page 17 of 23 EDUCATION FOR
EVERYONE
DoyPyEdu PLAY WITH
PYTHON

data = [{'a': 1, 'b': 2},{'a': 5, 'b': 10, 'c': 20}]

df = p.DataFrame(data, index=['first', 'second'])
print (df)
print("="*30)
print("DataFrame from List of Dicts, Dictionary 5: \n")
data = [{'a': 1, 'b': 2},{'a': 5, 'b': 10, 'c': 20}]
df1 = p.DataFrame(data, index=['first', 'second'], columns=['a', 'b'])
df2 = p.DataFrame(data, index=['first', 'second'], columns=['a', 'b1'])
print (df1)
print (df2)
print("----------------------------")
print("DataFrame from List of Dicts, Dictionary 6: \n")
d = {'one' : p.Series([1, 2, 3], index=['a', 'b', 'c']),
'two' : p.Series([1, 2, 3, 4], index=['a', 'b', 'c', 'd'])}
df = p.DataFrame(d)
print (df)
print("="*30)
print("DataFrame from List of Dicts, Dictionary 7: \n")
d = {'one' : p.Series([1, 2, 3], index=['a', 'b', 'c']),
'two' : p.Series([1, 2, 3, 4], index=['a', 'b', 'c', 'd'])}
df = p.DataFrame(d)
print (df ['one'])
print("----------------------------")
print("DataFrame from List of Dicts, Dictionary 8: \n")
d = {'one' : p.Series([1, 2, 3], index=['a', 'b', 'c']),
'two' : p.Series([1, 2, 3, 4], index=['a', 'b', 'c', 'd'])}
df = p.DataFrame(d)
print("="*30)
# Adding a new column to an existing DataFrame object
#with column label by passing new series
print ("Adding a new column by passing as Series:")
df['three']=p.Series([10,20,30],index=['a','b','c'])
print (df)
print("----------------------------")
print ("Adding a new column using the existing columns in DataFrame:")
df['four']=df['one']+df['three']
print (df)
print("----------------------------")
print("DataFrame from List of Dicts, Dictionary 9: \n")

print("="*30)
import pandas as pd
d = {'one' : pd.Series([1, 2, 3], index=['a', 'b', 'c']),
'two' : pd.Series([1, 2, 3, 4], index=['a', 'b', 'c', 'd']),
'three' : pd.Series([10,20,30], index=['a','b','c'])}
df = pd.DataFrame(d)
print ("Our dataframe is:")
print (df)
print("="*30)
# using del function
print ("Deleting the first column using DEL function:")
del df['one']
print (df)

Page 18 of 23 EDUCATION FOR

EVERYONE
DoyPyEdu PLAY WITH
PYTHON

print("="*30)
# using pop function
print ("Deleting another column using POP function:")
df.pop('two')
print (df)
print("="*30)
print("Row Selection, Addition, and Deletion")
print("DataFrame from List ofSelection by Label 10: \n")
d = {'one' : pd.Series([1, 2, 3], index=['a', 'b', 'c']),
'two' : pd.Series([1, 2, 3, 4], index=['a', 'b', 'c', 'd'])}
df = pd.DataFrame(d)
print (df.loc['b'])
print("="*30)
print("DataFrame from List of Selection by integer location 11: \n")
d = {'one' : pd.Series([1, 2, 3], index=['a', 'b', 'c']),
'two' : pd.Series([1, 2, 3, 4], index=['a', 'b', 'c', 'd'])}
df = pd.DataFrame(d)
print (df.iloc[2])
print("="*30)
print("DataFrame from List of Slice Rows 12: \n")
d = {'one' : pd.Series([1, 2, 3], index=['a', 'b', 'c']),
'two' : pd.Series([1, 2, 3, 4], index=['a', 'b', 'c', 'd'])}
df = pd.DataFrame(d)
print (df[2:4])
print("="*30)
print("DataFrame from List of Addition of Rows 13: \n")
df = pd.DataFrame([[1, 2], [3, 4]], columns = ['a','b'])
df2 = pd.DataFrame([[5, 6], [7, 8]], columns = ['a','b'])
df = df.append(df2)
print (df)
print("="*30)
print("DataFrame from List of Deletion of Rows 14: \n")
df = pd.DataFrame([[1, 2], [3, 4]], columns = ['a','b'])
df2 = pd.DataFrame([[5, 6], [7, 8]], columns = ['a','b'])
df = df.append(df2)
# Drop rows with label 0
df = df.drop(0)
print (df)
print("----------------------------")
Output:
Data Dictionary 1: ==============================
Name Age DataFrame from List of Dicts, Dictionary 3:
0 Sumit 17 a b c
1 Ravi 17 0 1 2 NaN
2 Kali 18 1 5 10 20.0
3 Tarun 13 ----------------------------
============================== DataFrame from List of Dicts, Dictionary 4:
Data Dictionary 2: a b c
Name Age first 1 2 NaN
R1 Raju 15 second 5 10 20.0
R2 Jatin 14 ==============================
R3 Rahul 17 DataFrame from List of Dicts, Dictionary 5:
R4 Sachin 13
Page 19 of 23 EDUCATION FOR
EVERYONE
DoyPyEdu PLAY WITH
PYTHON

a b ==============================
first 1 2 Deleting the first column using DEL function:
second 5 10 two three
a b1 a 1 10.0
first 1 NaN b 2 20.0
second 5 NaN c 3 30.0
---------------------------- d 4 NaN
DataFrame from List of Dicts, Dictionary 6: ==============================
one two Deleting another column using POP function:
a 1.0 1 three
b 2.0 2 a 10.0
c 3.0 3 b 20.0
d NaN 4 c 30.0
============================== d NaN
DataFrame from List of Dicts, Dictionary 7: ==============================
a 1.0 Row Selection, Addition, and Deletion
b 2.0 DataFrame from List ofSelection by Label 10:
c 3.0 one 2.0
d NaN two 2.0
Name: one, dtype: float64 Name: b, dtype: float64
---------------------------- ==============================
DataFrame from List of Dicts, Dictionary 8: DataFrame from List of Selection by integer
============================== location 11:
Adding a new column by passing as Series:
one two three one 3.0
a 1.0 1 10.0 two 3.0
b 2.0 2 20.0 Name: c, dtype: float64
c 3.0 3 30.0 ==============================
d NaN 4 NaN DataFrame from List of Slice Rows 12:
---------------------------- one two
Adding a new column using the existing c 3.0 3
columns in DataFrame: d NaN 4
one two three four ==============================
a 1.0 1 10.0 11.0 DataFrame from List of Addition of Rows 13:
b 2.0 2 20.0 22.0 a b
c 3.0 3 30.0 33.0 0 1 2
d NaN 4 NaN NaN 1 3 4
---------------------------- 0 5 6
DataFrame from List of Dicts, Dictionary 9: 1 7 8
============================== ==============================
Our dataframe is: DataFrame from List of Deletion of Rows 14:
one two three a b
a 1.0 1 10.0 1 3 4
b 2.0 2 20.0 1 7 8
c 3.0 3 30.0 ----------------------------
d NaN 4 NaN

NOTES

Page 20 of 23 EDUCATION FOR

EVERYONE
DoyPyEdu PLAY WITH PYTHON

PRACTICAL PROGRAMS OF PANDAS

#*************pandaDict1.py****************
import pandas as p
d={'NAME':['Tom','Jack','Steve','Ricky'],'AGE':[28,34,29,42]}
data=p.DataFrame(d)
print(data)
#*************pandaDict2.py****************
import pandas as p
d={'NAME':['Tom','Jack','Steve','Ricky'],'AGE':[28,34,29,42]}
data=p.DataFrame(d,index=['rank1','rank2','rank3','rank4'])
print(data)
#*************pandaDict3.py****************
import pandas as p
d=[{'a':1,'b':2,'c':3},{'a':5,'b':10,'c':20}]
data=p.DataFrame(d)
print(data)
#*************pandaDict4.py****************
import pandas as p
d=[{'a':1,'b':2,'c':3},{'a':5,'b':10,'c':20}]
data=p.DataFrame(d,index=['first','second'])
print(data)
#*************pandaDict5.py****************
import pandas as p
d=[{'a':1,'b':2,'c':3},{'a':5,'b':10,'c':20}]
data=p.DataFrame(d,index=['first','second'],columns=['a','b1'])
print(data)
data=p.DataFrame(d,index=['first','second'],columns=['a','b'])
print(data)
#*************pandaDict6.py*************
import pandas as p
d={'one':p.Series([1,2,3],index=['a','b','c']),'two':p.Series([1,2,3,4],index=['a','b','c','d'])}
data=p.DataFrame(d)
print(data)
#*************pandaDict7.py****************
import pandas as p
d={'one':p.Series([1,2,3],index=['a','b','c']),'two':p.Series([1,2,3,4],index=['a','b','c','d'])}
data=p.DataFrame(d)
print(data['one'])
#*************pandaDict8.py****************
import pandas as p
d={'one':p.Series([1,2,3],index=['a','b','c']),'two':p.Series([1,2,3,4],index=['a','b','c','d'])}
data=p.DataFrame(d)
print(data)
data['three']=p.Series([10,20,30],index=['a','b','c'])
print(data)
#*************pandaDict9.py****************
import pandas as p
Page 21 of 23 EDUCATION FOR EVERYONE
DoyPyEdu PLAY WITH PYTHON

d={'one':p.Series([1,2,3],index=['a','b','c']),'two':p.Series([1,2,3,4],index=['a','b','c','d'])}
data=p.DataFrame(d)
print(data)
data['three']=p.Series([10,20,30],index=['a','b','c'])
print(data)
data['four']=data['one']+data['three']
print(data)
#****pandaIMPORTANTfunctions10.py********
import pandas as p
d={'one':p.Series([1,2,5],index=['a','b','c']),'two':p.Series([1,9,3],index=['a','b','c'])}
data=p.DataFrame(d)
print(data)
print("SUM : \n",data.sum())
print("MIN :\n",data.min())
print("MAX :\n",data.max())
print("MIN INDEX :\n",data.idxmin())
print("MAX INDEX :\n",data.idxmax())
print(data.describe())
#*************pandalist11.py****************
import pandas as p
l=[['Alex',10],['bob',12],['Clark',13]]
data1=p.DataFrame(l,columns=['NAME','AGE'],dtype=float)
print(data1)
#*************pandalist12.py****************
import pandas as p
l=[1,2,3]
data1=p.DataFrame(l)
print(data1)
#*************pandalist13.py****************
import pandas as p
l=[[1,2,3],[4,5,6]]
data1=p.DataFrame(l)
print(data1)
#*************pandalist14.py****************
import pandas as p
l=[['Alex',10],['bob',12],['Clark',13]]
data1=p.DataFrame(l,columns=['NAME','AGE'])
print(data1)
#*************pandaDict15.py****************
import pandas as p
d={'one':p.Series([1,2,3],index=['a','b','c']),'two':p.Series([1,2,3,4],index=['a','b','c','d'])}
data=p.DataFrame(d)
print(data)
data['three']=p.Series([10,20,30],index=['a','b','c'])
print(data)
data['four']=data['one']+data['three']
print(data)
data.pop('three')
Page 22 of 23 EDUCATION FOR EVERYONE
DoyPyEdu PLAY WITH PYTHON

print(data)
#*************pandaDict16.py**************
import pandas as p
d={'one':p.Series([1,2,3],index=['a','b','c']),'two':p.Series([1,9,3,4],index=['a','b','c','d'])}
data=p.DataFrame(d)
print(data.loc['b'])
#*************pandaDict17.py****************
import pandas as p
d={'one':p.Series([1,2,5],index=['a','b','c']),'two':p.Series([1,9,3,4],index=['a','b','c','d'])}
data=p.DataFrame(d)
print(data.iloc[2])
#*************pandaDict18.py****************
import pandas as p
d={'one':p.Series([1,2,5,6],index=['a','b','c','e']),
'two':p.Series([1,9,3,4],index=['a','b','c','d'])
}
data=p.DataFrame(d)
print(data[2:4])
#***********pandaDict19.py************
import pandas as p
d1=p.DataFrame([[1,2],[3,4]],columns=['a','b'])
d2=p.DataFrame([[5,6],[7,8]],columns=['a','b'])
d1=d1.append(d2)
print(d1)
#*************pandaDict20.py****************
import pandas as p
d1=p.DataFrame([[1,2],[3,4]],columns=['a','b'])
d2=p.DataFrame([[5,6],[7,8]],columns=['a','b'])
d1=d1.append(d2)
print(d1)
d1=d1.drop(0)
print(d1)
NOTES

Page 23 of 23 EDUCATION FOR EVERYONE

12 Comp Sci 1 Revision Notes Pythan Advanced Prog
No ratings yet
12 Comp Sci 1 Revision Notes Pythan Advanced Prog
5 pages
Bank Management System V.B
55% (31)
Bank Management System V.B
73 pages
Image Processing With MATLAB Graphical User Interface (GUI)
No ratings yet
Image Processing With MATLAB Graphical User Interface (GUI)
17 pages
Hvac Programming Guide - Doc 0
No ratings yet
Hvac Programming Guide - Doc 0
181 pages
Basics of CPP Objective Questions MCQs
No ratings yet
Basics of CPP Objective Questions MCQs
23 pages
Python Data Science 101
100% (1)
Python Data Science 101
41 pages
Pandas Visualisation
No ratings yet
Pandas Visualisation
27 pages
Data Analysis With Pandas - Introduction To Pandas Cheatsheet - Codecademy PDF
No ratings yet
Data Analysis With Pandas - Introduction To Pandas Cheatsheet - Codecademy PDF
3 pages
Pandas Cheat Sheet
No ratings yet
Pandas Cheat Sheet
2 pages
List Comprehension in Python
No ratings yet
List Comprehension in Python
8 pages
Tutorial Pytorch Best Commands
No ratings yet
Tutorial Pytorch Best Commands
8 pages
Pandas
100% (1)
Pandas
1,131 pages
Pandas Python
100% (2)
Pandas Python
115 pages
Natural Language Toolkit NLTK PDF
No ratings yet
Natural Language Toolkit NLTK PDF
23 pages
Python3 Data Structures Cheat Sheet: by Via
No ratings yet
Python3 Data Structures Cheat Sheet: by Via
1 page
Chapter 10 Python Pandas
No ratings yet
Chapter 10 Python Pandas
40 pages
Introduction To Data Visualization in Python
No ratings yet
Introduction To Data Visualization in Python
16 pages
API Reference - Scikit-Learn 0.19.2 Documentation
No ratings yet
API Reference - Scikit-Learn 0.19.2 Documentation
21 pages
Python For Finance - The Complete Beginner's Guide - by Behic Guven - Jul, 2020 - Towards Data Science PDF
100% (1)
Python For Finance - The Complete Beginner's Guide - by Behic Guven - Jul, 2020 - Towards Data Science PDF
12 pages
8 Best Python Cheat Sheets For Beginners and Intermediate Learners
100% (1)
8 Best Python Cheat Sheets For Beginners and Intermediate Learners
17 pages
#Barebones App #Access Request Data #Useful Plugins: Rin Ted .Co M
No ratings yet
#Barebones App #Access Request Data #Useful Plugins: Rin Ted .Co M
1 page
12 Useful Pandas Techniques in Python For Data Manipulation
100% (2)
12 Useful Pandas Techniques in Python For Data Manipulation
19 pages
Numpy Complete Material
No ratings yet
Numpy Complete Material
19 pages
Python Pandas
100% (1)
Python Pandas
35 pages
Pandas Cheat Sheet
No ratings yet
Pandas Cheat Sheet
2 pages
Panda Python
100% (1)
Panda Python
398 pages
Advanced Python Tips
No ratings yet
Advanced Python Tips
50 pages
Python Programming Notes
No ratings yet
Python Programming Notes
144 pages
6 XG Boost - Jupyter Notebook
100% (1)
6 XG Boost - Jupyter Notebook
3 pages
Acceleo User Guide
No ratings yet
Acceleo User Guide
56 pages
Python Seaborn Tutorial - Jupyter Notebook
No ratings yet
Python Seaborn Tutorial - Jupyter Notebook
19 pages
Python Pandas2 PDF
No ratings yet
Python Pandas2 PDF
38 pages
13.file Handling
No ratings yet
13.file Handling
66 pages
Solution of Practical
No ratings yet
Solution of Practical
47 pages
Class XII (As Per CBSE Board) : Informatics Practices
No ratings yet
Class XII (As Per CBSE Board) : Informatics Practices
43 pages
Figure Style and Scale: Darkgrid Whitegrid Dark White Ticks Darkgrid
No ratings yet
Figure Style and Scale: Darkgrid Whitegrid Dark White Ticks Darkgrid
15 pages
Python Data Analysis Visualization
No ratings yet
Python Data Analysis Visualization
34 pages
Python Full
100% (1)
Python Full
59 pages
Pandas Guide
No ratings yet
Pandas Guide
64 pages
Python Pandas Cheatsheety
No ratings yet
Python Pandas Cheatsheety
7 pages
Matplotlib and Seaborn PDF
100% (1)
Matplotlib and Seaborn PDF
29 pages
Data Visualization - Getting Started With Plotly
No ratings yet
Data Visualization - Getting Started With Plotly
37 pages
Pandas Cheat Sheet CN
No ratings yet
Pandas Cheat Sheet CN
4 pages
Scikit-Learn: Scikit-Learn Is An Open Source Python Library That
100% (1)
Scikit-Learn: Scikit-Learn Is An Open Source Python Library That
1 page
Python programms
No ratings yet
Python programms
8 pages
Python Pandas
No ratings yet
Python Pandas
96 pages
Python Functions
No ratings yet
Python Functions
29 pages
Python Quick Guide - Tutorialspoint
No ratings yet
Python Quick Guide - Tutorialspoint
199 pages
StatisticsMachineLearningPythonDraft PDF
100% (1)
StatisticsMachineLearningPythonDraft PDF
219 pages
Duckdb Docs
No ratings yet
Duckdb Docs
721 pages
Columbia Seaborn Tutorial
No ratings yet
Columbia Seaborn Tutorial
12 pages
Python Guide Documentation: Release 0.0.1
No ratings yet
Python Guide Documentation: Release 0.0.1
167 pages
OOP Using Python Hands-On Assessment
No ratings yet
OOP Using Python Hands-On Assessment
10 pages
Python Pandas Tutorial
No ratings yet
Python Pandas Tutorial
45 pages
Python Logging Module PDF
No ratings yet
Python Logging Module PDF
17 pages
Essentials of Machine Learning Algorithms (With Python and R Codes) PDF
100% (1)
Essentials of Machine Learning Algorithms (With Python and R Codes) PDF
20 pages
Java How to Program Early Objects 10th Edition Deitel Test Bank - 2025 Scribd Download Full Chapters
No ratings yet
Java How to Program Early Objects 10th Edition Deitel Test Bank - 2025 Scribd Download Full Chapters
33 pages
Python - Programming
No ratings yet
Python - Programming
9 pages
Effective Amazon Machine Learning
From Everand
Effective Amazon Machine Learning
Alexis Perrier
No ratings yet
Mastering IPython 4.0
From Everand
Mastering IPython 4.0
Thomas Bitterman
No ratings yet
New Learning of Python by Practical Innovation and Technology
From Everand
New Learning of Python by Practical Innovation and Technology
Sudhir Pathania
No ratings yet
Learn R By Coding
From Everand
Learn R By Coding
Thomas Kurnicki
No ratings yet
Python Unleashed: Mastering the Art of Efficient Coding
From Everand
Python Unleashed: Mastering the Art of Efficient Coding
James Livingston
No ratings yet
Pandas
No ratings yet
Pandas
4 pages
6 Manual Completo de Dibujo y Pintura
No ratings yet
6 Manual Completo de Dibujo y Pintura
17 pages
Operator Overloading & Data Conversion
50% (2)
Operator Overloading & Data Conversion
39 pages
NCERT Solutions Class 12 Computer Science Chapter - Arrays
No ratings yet
NCERT Solutions Class 12 Computer Science Chapter - Arrays
26 pages
2023-24 CS 12 Links Swati Chawla
100% (1)
2023-24 CS 12 Links Swati Chawla
7 pages
Kohonen Neural Network
No ratings yet
Kohonen Neural Network
3 pages
Monitoring IDocs With The SAP Application Interface Framework
No ratings yet
Monitoring IDocs With The SAP Application Interface Framework
11 pages
Linux Command Line For You and Me Documentation: Release 0.1
No ratings yet
Linux Command Line For You and Me Documentation: Release 0.1
108 pages
Filter Rules For Central Finance 20221012
No ratings yet
Filter Rules For Central Finance 20221012
16 pages
bca-6-sem-asp-dot-net-paper-3-summer-2018
No ratings yet
bca-6-sem-asp-dot-net-paper-3-summer-2018
1 page
Week-01 Assignment
No ratings yet
Week-01 Assignment
7 pages
Assignments
No ratings yet
Assignments
3 pages
Matrix.org - Clients
No ratings yet
Matrix.org - Clients
8 pages
Software Testing Notes
No ratings yet
Software Testing Notes
30 pages
TIBCO Installation Guidelines
No ratings yet
TIBCO Installation Guidelines
14 pages
202 Network Response
No ratings yet
202 Network Response
2 pages
Positioning Module Type A1SD71-S2
No ratings yet
Positioning Module Type A1SD71-S2
192 pages
s23 Solution
No ratings yet
s23 Solution
26 pages
Quiz 2 (Seri 1)
No ratings yet
Quiz 2 (Seri 1)
3 pages
Atc Checks
No ratings yet
Atc Checks
15 pages
Yash Resume
No ratings yet
Yash Resume
1 page
Question: What Is MDS & Why We Use MDS in Oracle SOA?
No ratings yet
Question: What Is MDS & Why We Use MDS in Oracle SOA?
4 pages
Operating Systems Class Notes
No ratings yet
Operating Systems Class Notes
2 pages
Chandra Kirana J.I - P6 - Transformation
No ratings yet
Chandra Kirana J.I - P6 - Transformation
27 pages
Joseph Malafronte Resume Fall 2017
No ratings yet
Joseph Malafronte Resume Fall 2017
1 page
Sweetalert2 All Min
No ratings yet
Sweetalert2 All Min
16 pages
Dbms
No ratings yet
Dbms
31 pages