0% found this document useful (0 votes)

472 views1 page

Solutions To Pandas Basic Questions

The document describes operations performed on a pandas DataFrame called 'birds' created from dictionary data and list labels. The DataFrame contains bird observation data with columns for bird name, age, number of visits, and observation priority. A number of data selection, sorting, and aggregation operations are demonstrated including filtering rows by criteria, calculating group means, counting unique bird types, and sorting.

Uploaded by

Jason Shax

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

472 views1 page

Solutions To Pandas Basic Questions

Uploaded by

Jason Shax

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 1

Consider the following Python dictionary data and Python list labels:

data = {'birds': ['Cranes', 'Cranes', 'plovers', 'spoonbills', 'spoonbills', 'Cranes', 'plovers', 'Cranes', 'spoonbills',
'spoonbills'], 'age': [3.5, 4, 1.5, np.nan, 6, 3, 5.5, np.nan, 8, 4], 'visits': [2, 4, 3, 4, 3, 4, 2, 2, 3, 2], 'priority': ['yes',
'yes', 'no', 'yes', 'no', 'no', 'no', 'yes', 'no', 'no']}
labels = ['a', 'b', 'c', 'd', 'e', 'f', 'g', 'h', 'i', 'j']

1. Create a DataFrame birds from this dictionary data which has the index labels.

In [1]: import pandas as pd

import numpy as np
data = {'birds': ['Cranes', 'Cranes', 'plovers', 'spoonbills', 'spoonbills', 'Cranes', 'p
lovers', 'Cranes', 'spoonbills', 'spoonbills'], 'age': [3.5, 4, 1.5, np.nan, 6, 3, 5.5, n
p.nan, 8, 4], 'visits': [2, 4, 3, 4, 3, 4, 2, 2, 3, 2], 'priority': ['yes', 'yes', 'no',
'yes', 'no', 'no', 'no', 'yes', 'no', 'no']}
labels = ['a', 'b', 'c', 'd', 'e', 'f', 'g', 'h', 'i', 'j']

df = pd.DataFrame(data,index=labels)
df
Out[1]:
birds age visits priority
a Cranes 3.5 2 yes
b Cranes 4.0 4 yes
c plovers 1.5 3 no
d spoonbills NaN 4 yes
e spoonbills 6.0 3 no
f Cranes 3.0 4 no
g plovers 5.5 2 no
h Cranes NaN 2 yes
i spoonbills 8.0 3 no
j spoonbills 4.0 2 no

2. Display a summary of the basic information about birds DataFrame and its data.

In [2]: df.describe()
Out[2]:
age visits
count 8.000000 10.000000
mean 4.437500 2.900000
std 2.007797 0.875595
min 1.500000 2.000000
25% 3.375000 2.000000
50% 4.000000 3.000000
75% 5.625000 3.750000
max 8.000000 4.000000

3. Print the first 2 rows of the birds dataframe

In [3]: df.head(2)
Out[3]:
birds age visits priority
a Cranes 3.5 2 yes
b Cranes 4.0 4 yes

4. Print all the rows with only 'birds' and 'age' columns from the dataframe

In [4]: df[['birds','age']]
Out[4]:
birds age
a Cranes 3.5
b Cranes 4.0
c plovers 1.5
d spoonbills NaN
e spoonbills 6.0
f Cranes 3.0
g plovers 5.5
h Cranes NaN
i spoonbills 8.0
j spoonbills 4.0

5. select [2, 3, 7] rows and in columns ['birds', 'age', 'visits']

In [5]: df.iloc[[2,3,7], :3]

Out[5]:
birds age visits
c plovers 1.5 3
d spoonbills NaN 4
h Cranes NaN 2

6. select the rows where the number of visits is less than 4

In [6]: df[df['visits'] < 4]

Out[6]:
birds age visits priority
a Cranes 3.5 2 yes
c plovers 1.5 3 no
e spoonbills 6.0 3 no
g plovers 5.5 2 no
h Cranes NaN 2 yes
i spoonbills 8.0 3 no
j spoonbills 4.0 2 no

7. select the rows with columns ['birds', 'visits'] where the age is missing i.e NaN
In [7]: df[df['age'].isnull()]
Out[7]:
birds age visits priority
d spoonbills NaN 4 yes
h Cranes NaN 2 yes

8. Select the rows where the birds is a Cranes and the age is less than 4

In [8]: df[(df['birds']=='Cranes') & (df['age'] < 4 )]

Out[8]:
birds age visits priority
a Cranes 3.5 2 yes
f Cranes 3.0 4 no

9. Select the rows the age is between 2 and 4(inclusive)

In [9]: df[(df['age'] >= 2) & (df['age'] <= 4 )]

Out[9]:
birds age visits priority
a Cranes 3.5 2 yes
b Cranes 4.0 4 yes
f Cranes 3.0 4 no
j spoonbills 4.0 2 no

10. Find the total number of visits of the bird Cranes

In [10]: df[df['birds']=='Cranes']['visits'].sum()
Out[10]:
12

11. Calculate the mean age for each different birds in dataframe.

In [11]: x = df[df['birds']=='Cranes']['age'].mean()
y = df[df['birds']=='plovers']['age'].mean()
z = df[df['birds']=='spoonbills']['age'].mean()

print(x,'\n', y,'\n',z)

3.5
3.5
6.0

12. Append a new row 'k' to dataframe with your choice of values for each column. Then delete that row
to return the original DataFrame.

In [12]: data = {'birds':'egret','age':4,'visits':2,'priority':'yes'}

x = pd.DataFrame(data, index=['k'], columns=['birds','age','visits','priority'])
df.append(x)
Out[12]:
birds age visits priority
a Cranes 3.5 2 yes
b Cranes 4.0 4 yes
c plovers 1.5 3 no
d spoonbills NaN 4 yes
e spoonbills 6.0 3 no
f Cranes 3.0 4 no
g plovers 5.5 2 no
h Cranes NaN 2 yes
i spoonbills 8.0 3 no
j spoonbills 4.0 2 no
k egret 4.0 2 yes

In [13]: drop_k = df.drop(df.tail(0).index) #deleting k

df = drop_k
df
Out[13]:
birds age visits priority
a Cranes 3.5 2 yes
b Cranes 4.0 4 yes
c plovers 1.5 3 no
d spoonbills NaN 4 yes
e spoonbills 6.0 3 no
f Cranes 3.0 4 no
g plovers 5.5 2 no
h Cranes NaN 2 yes
i spoonbills 8.0 3 no
j spoonbills 4.0 2 no

13. Find the number of each type of birds in dataframe (Counts)

In [14]: df['birds'].value_counts()
Out[14]:
spoonbills 4
Cranes 4
plovers 2
Name: birds, dtype: int64

14. Sort dataframe (birds) first by the values in the 'age' in decending order, then by the value in the
'visits' column in ascending order.

In [15]: age_sorting = df.sort_values('age', ascending=False)

print(age_sorting,'\n\n')
visit_ascend = df.sort_values('visits',ascending=True )
print(visit_ascend)

birds age visits priority

i spoonbills 8.0 3 no
e spoonbills 6.0 3 no
g plovers 5.5 2 no
b Cranes 4.0 4 yes
j spoonbills 4.0 2 no
a Cranes 3.5 2 yes
f Cranes 3.0 4 no
c plovers 1.5 3 no
d spoonbills NaN 4 yes
h Cranes NaN 2 yes

birds age visits priority

a Cranes 3.5 2 yes
g plovers 5.5 2 no
h Cranes NaN 2 yes
j spoonbills 4.0 2 no
c plovers 1.5 3 no
e spoonbills 6.0 3 no
i spoonbills 8.0 3 no
b Cranes 4.0 4 yes
d spoonbills NaN 4 yes
f Cranes 3.0 4 no

15. Replace the priority column values with'yes' should be 1 and 'no' should be 0
In [16]: x = df.replace(to_replace=['yes','no'], value=[1,0])
x
Out[16]:
birds age visits priority
a Cranes 3.5 2 1
b Cranes 4.0 4 1
c plovers 1.5 3 0
d spoonbills NaN 4 1
e spoonbills 6.0 3 0
f Cranes 3.0 4 0
g plovers 5.5 2 0
h Cranes NaN 2 1
i spoonbills 8.0 3 0
j spoonbills 4.0 2 0

16. In the 'birds' column, change the 'Cranes' entries to 'trumpeters'.

In [17]: z = df.replace(to_replace=['Cranes'], value=['trumpeters'])

z
Out[17]:
birds age visits priority
a trumpeters 3.5 2 yes
b trumpeters 4.0 4 yes
c plovers 1.5 3 no
d spoonbills NaN 4 yes
e spoonbills 6.0 3 no
f trumpeters 3.0 4 no
g plovers 5.5 2 no
h trumpeters NaN 2 yes
i spoonbills 8.0 3 no
j spoonbills 4.0 2 no

In [ ]:

2 ASSIGNMENT 2 (Beginning Superstore)
0% (1)
2 ASSIGNMENT 2 (Beginning Superstore)
1 page
Vogue Stitctionary Vol 1 Index
0% (1)
Vogue Stitctionary Vol 1 Index
8 pages
Pandas
No ratings yet
Pandas
43 pages
Basic Principles of SIM Construction
No ratings yet
Basic Principles of SIM Construction
22 pages
Pandas - Basics - Practice: Consider The Following Python Dictionary Data and Python List Labels
No ratings yet
Pandas - Basics - Practice: Consider The Following Python Dictionary Data and Python List Labels
6 pages
Pandas PDF
No ratings yet
Pandas PDF
3,021 pages
Data Visualization - Getting Started With Plotly
No ratings yet
Data Visualization - Getting Started With Plotly
37 pages
Appendix B DAX Reference
100% (1)
Appendix B DAX Reference
174 pages
Mastering SQL Window Functions - 01
No ratings yet
Mastering SQL Window Functions - 01
39 pages
Salary Prediction LinearRegression
100% (1)
Salary Prediction LinearRegression
7 pages
Creating Data Visualizations Using Tableau Desktop (Beginner) _ Map and Data Library
No ratings yet
Creating Data Visualizations Using Tableau Desktop (Beginner) _ Map and Data Library
48 pages
ENG 202: Computers and Engineering Object Oriented Programming in PYTHON
No ratings yet
ENG 202: Computers and Engineering Object Oriented Programming in PYTHON
56 pages
1 - Interactive Data Visualization With Bokeh
No ratings yet
1 - Interactive Data Visualization With Bokeh
31 pages
Cleaning Dirty Data With Pandas & Python - DevelopIntelligence Blog PDF
No ratings yet
Cleaning Dirty Data With Pandas & Python - DevelopIntelligence Blog PDF
8 pages
Working With Dates in Pandas: Prepared by Asif Bhat
No ratings yet
Working With Dates in Pandas: Prepared by Asif Bhat
13 pages
Customer Segmentation Clustering
No ratings yet
Customer Segmentation Clustering
35 pages
Python Date Time
No ratings yet
Python Date Time
6 pages
Yeungnam University School of Mechanical Engineering Syllabus For 0993 Tribology
No ratings yet
Yeungnam University School of Mechanical Engineering Syllabus For 0993 Tribology
42 pages
Midsem Regular MFDS 22-12-2019 Answer Key PDF
No ratings yet
Midsem Regular MFDS 22-12-2019 Answer Key PDF
5 pages
MongoDB CheatSheet
No ratings yet
MongoDB CheatSheet
9 pages
Pandas DataFrame Notes
100% (1)
Pandas DataFrame Notes
6 pages
Python Variables Cheatsheet
No ratings yet
Python Variables Cheatsheet
2 pages
Cheat Sheet: Tableau-Desktop
No ratings yet
Cheat Sheet: Tableau-Desktop
1 page
Getting Started With Tableau Prep
No ratings yet
Getting Started With Tableau Prep
3 pages
Get Data With Power BI Desktop: Angeles University Foundation College of Computer Studies
No ratings yet
Get Data With Power BI Desktop: Angeles University Foundation College of Computer Studies
35 pages
Data Visualisation With Tableau
No ratings yet
Data Visualisation With Tableau
26 pages
Proc SQL
100% (1)
Proc SQL
7 pages
Pandas-Creating Series & Dataframes (DR V Gowri, Srmist)
No ratings yet
Pandas-Creating Series & Dataframes (DR V Gowri, Srmist)
47 pages
CSE-Machine Learning & Big Data - WSS Source Book
No ratings yet
CSE-Machine Learning & Big Data - WSS Source Book
181 pages
Data Visualization R Programming Power Bi Lab Record
No ratings yet
Data Visualization R Programming Power Bi Lab Record
29 pages
Excel - Module 2 (Formulas, Functions, and Formatting)
No ratings yet
Excel - Module 2 (Formulas, Functions, and Formatting)
3 pages
1 Lect - 1.2 - 12 - August 2022 PDF
No ratings yet
1 Lect - 1.2 - 12 - August 2022 PDF
59 pages
Data Preparation With Tableau
No ratings yet
Data Preparation With Tableau
11 pages
Introduction To Spreadsheet Modeling - Winston Albright
No ratings yet
Introduction To Spreadsheet Modeling - Winston Albright
46 pages
Numpy
No ratings yet
Numpy
15 pages
Python Pandas
No ratings yet
Python Pandas
96 pages
Infosys Placement Paper at Vaddeswara1
No ratings yet
Infosys Placement Paper at Vaddeswara1
5 pages
DAX Cheat Sheet
No ratings yet
DAX Cheat Sheet
10 pages
Python Pandas Demo PDF
100% (2)
Python Pandas Demo PDF
23 pages
XL Wings
No ratings yet
XL Wings
214 pages
Chapter 8 B - Trendlines and Regression Analysis
No ratings yet
Chapter 8 B - Trendlines and Regression Analysis
73 pages
Data Science Theory: Analysis and Analytics
No ratings yet
Data Science Theory: Analysis and Analytics
14 pages
List Comprehension in Python
No ratings yet
List Comprehension in Python
8 pages
Python Libraries Cheat Sheets
No ratings yet
Python Libraries Cheat Sheets
6 pages
Choice of Charts Power BI
No ratings yet
Choice of Charts Power BI
14 pages
DVC - All Questions and Answers - CT 1, CT 2 and Model - Final
No ratings yet
DVC - All Questions and Answers - CT 1, CT 2 and Model - Final
114 pages
Pandas Plotting Capabilities
No ratings yet
Pandas Plotting Capabilities
27 pages
100 SQL Formulas Each Student Should Know
No ratings yet
100 SQL Formulas Each Student Should Know
10 pages
100 Pandas Exercises
No ratings yet
100 Pandas Exercises
6 pages
SAS Presentation
No ratings yet
SAS Presentation
49 pages
Pandas Guide
No ratings yet
Pandas Guide
64 pages
Data Visualization Ebook
No ratings yet
Data Visualization Ebook
15 pages
Introduction To Data Visualization in Python
No ratings yet
Introduction To Data Visualization in Python
16 pages
Machine Learning Basics: 1. General Introduction
No ratings yet
Machine Learning Basics: 1. General Introduction
46 pages
Financial Analytics With Python
100% (1)
Financial Analytics With Python
40 pages
Data Wrangling
No ratings yet
Data Wrangling
13 pages
DAX Functions For Data Analysis
No ratings yet
DAX Functions For Data Analysis
9 pages
Pandas Cheat Sheet
No ratings yet
Pandas Cheat Sheet
2 pages
Python Interview Questions
No ratings yet
Python Interview Questions
61 pages
Microsoft Excel Statistical and Advanced Functions for Decision Making
From Everand
Microsoft Excel Statistical and Advanced Functions for Decision Making
Palani Murugappan
No ratings yet
SQLite Complete Self-Assessment Guide
From Everand
SQLite Complete Self-Assessment Guide
Gerardus Blokdyk
No ratings yet
Microsoft Certified: Power BI Data Analyst Associate PL 300 Practice Tests
From Everand
Microsoft Certified: Power BI Data Analyst Associate PL 300 Practice Tests
CertSquad Professional Trainers
No ratings yet
Quantifying Determiners
No ratings yet
Quantifying Determiners
7 pages
User Guide
No ratings yet
User Guide
1,657 pages
Communication Options
No ratings yet
Communication Options
1 page
JHS Calendar of Activities 2022-23
No ratings yet
JHS Calendar of Activities 2022-23
4 pages
Radiant Apostolate
No ratings yet
Radiant Apostolate
2 pages
Treasures Grammar and Writing Handbook Gr 1 Teachers Edition Mcgraw-Hill [Mcgraw-Hill] - Download the ebook today to explore every detail
100% (1)
Treasures Grammar and Writing Handbook Gr 1 Teachers Edition Mcgraw-Hill [Mcgraw-Hill] - Download the ebook today to explore every detail
49 pages
STULZ C7000-CC2 57D 0813 en
No ratings yet
STULZ C7000-CC2 57D 0813 en
67 pages
Lab Experiment ME 9
No ratings yet
Lab Experiment ME 9
3 pages
Instant download Mozart Studies 2 Simon P. Keefe (Ed.) pdf all chapter
100% (1)
Instant download Mozart Studies 2 Simon P. Keefe (Ed.) pdf all chapter
65 pages
Bugreport Olive QKQ1.191014.001 2021 06 30 17 14 42 Dumpstate - Log 10154
No ratings yet
Bugreport Olive QKQ1.191014.001 2021 06 30 17 14 42 Dumpstate - Log 10154
23 pages
PST Eucl
No ratings yet
PST Eucl
45 pages
ISC Class 11 Mathematics Syllabus 2023 24
No ratings yet
ISC Class 11 Mathematics Syllabus 2023 24
10 pages
Office Automation Tools
No ratings yet
Office Automation Tools
38 pages
Going On A Business Trip British English Intermediate Group
No ratings yet
Going On A Business Trip British English Intermediate Group
3 pages
Book 2
No ratings yet
Book 2
1 page
Volume XLVIII February 2012: Interim Edition of Roman Pontifical To Be Published
No ratings yet
Volume XLVIII February 2012: Interim Edition of Roman Pontifical To Be Published
4 pages
DLL-Q1-W1-DAY1.ENGLISH- MATATAG
No ratings yet
DLL-Q1-W1-DAY1.ENGLISH- MATATAG
13 pages
TTCT InterpMOD.2018
No ratings yet
TTCT InterpMOD.2018
16 pages
Cirilo Bautista
No ratings yet
Cirilo Bautista
30 pages
Grade Xi
No ratings yet
Grade Xi
69 pages
Science 4: Quarter 3 Module 1 Week 1 Learning Competencies (Essential Competencies)
100% (1)
Science 4: Quarter 3 Module 1 Week 1 Learning Competencies (Essential Competencies)
4 pages
Nature and Definition of Language
100% (1)
Nature and Definition of Language
4 pages
PME-500-TR User's Manual
100% (1)
PME-500-TR User's Manual
18 pages
ICT J1 AND J2 and J3
No ratings yet
ICT J1 AND J2 and J3
14 pages
Umrah Guide
No ratings yet
Umrah Guide
7 pages
Phrasal Verbs Exercise 1
0% (1)
Phrasal Verbs Exercise 1
2 pages
Coding Barang Keluar
No ratings yet
Coding Barang Keluar
11 pages
Kinetic AppStudioUserGuide
No ratings yet
Kinetic AppStudioUserGuide
272 pages