0% found this document useful (0 votes)

22 views

PDF Notebook

Case stdy solution

Uploaded by

Krunal Kalariya

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

22 views

PDF Notebook

Case stdy solution

Uploaded by

Krunal Kalariya

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 5

09/12/2023, 21:33 Aerofit 9th dec - Jupyter Notebook

In [1]: import pandas as pd

In [2]: df=pd.read_csv('aerofit.csv')

In [3]: df.head(10)

Out[3]:
Product Age Gender Education MaritalStatus Usage Fitness Income Miles

0 KP281 18 Male 14 Single 3 4 29562 112

1 KP281 19 Male 15 Single 2 3 31836 75

2 KP281 19 Female 14 Partnered 4 3 30699 66

3 KP281 19 Male 12 Single 3 3 32973 85

4 KP281 20 Male 13 Partnered 4 2 35247 47

5 KP281 20 Female 14 Partnered 3 3 32973 66

6 KP281 21 Female 14 Partnered 3 3 35247 75

7 KP281 21 Male 13 Single 3 3 32973 85

8 KP281 21 Male 15 Single 5 4 35247 141

9 KP281 21 Female 15 Partnered 2 3 37521 85

In [4]: df.info()

<class 'pandas.core.frame.DataFrame'>
RangeIndex: 180 entries, 0 to 179
Data columns (total 9 columns):
# Column Non-Null Count Dtype
--- ------ -------------- -----
0 Product 180 non-null object
1 Age 180 non-null int64
2 Gender 180 non-null object
3 Education 180 non-null int64
4 MaritalStatus 180 non-null object
5 Usage 180 non-null int64
6 Fitness 180 non-null int64
7 Income 180 non-null int64
8 Miles 180 non-null int64
dtypes: int64(6), object(3)
memory usage: 12.8+ KB

In [5]: df.isnull().sum()

Out[5]: Product 0
Age 0
Gender 0
Education 0
MaritalStatus 0
Usage 0
Fitness 0
Income 0
Miles 0
dtype: int64

In [6]: df['Product'].value_counts()

Out[6]: KP281 80
KP481 60
KP781 40
Name: Product, dtype: int64

In [7]: import seaborn as sbn

localhost:8888/notebooks/Desktop/DSML/dsml-case-studies/Aerofit/Aerofit 9th dec.ipynb 1/5

09/12/2023, 21:33 Aerofit 9th dec - Jupyter Notebook

In [8]: sbn.boxplot(x='Product', y='Income', data=df)

Out[8]: <AxesSubplot:xlabel='Product', ylabel='Income'>

In [9]: sbn.boxplot(x='Product', y='Miles', data=df)

Out[9]: <AxesSubplot:xlabel='Product', ylabel='Miles'>

In [10]: sbn.boxplot(x='Product', y='Education', data=df)

Out[10]: <AxesSubplot:xlabel='Product', ylabel='Education'>

In [12]: df.groupby('Product')['Education'].median()

Out[12]: Product
KP281 16.0
KP481 16.0
KP781 18.0
Name: Education, dtype: float64

localhost:8888/notebooks/Desktop/DSML/dsml-case-studies/Aerofit/Aerofit 9th dec.ipynb 2/5

09/12/2023, 21:33 Aerofit 9th dec - Jupyter Notebook

In [13]: df.groupby('Product')['Education'].describe()

Out[13]:
count mean std min 25% 50% 75% max

Product

KP281 80.0 15.037500 1.216383 12.0 14.0 16.0 16.0 18.0

KP481 60.0 15.116667 1.222552 12.0 14.0 16.0 16.0 18.0

KP781 40.0 17.325000 1.639066 14.0 16.0 18.0 18.0 21.0

In [15]: sbn.boxplot(x='Product', y='Miles', data=df)

Out[15]: <AxesSubplot:xlabel='Product', ylabel='Miles'>

In [17]: sbn.scatterplot(x='Miles', y='Income',hue='Product', data=df)

Out[17]: <AxesSubplot:xlabel='Miles', ylabel='Income'>

In [18]: sbn.boxplot(x='Gender', y='Miles', hue='Product', data=df)

Out[18]: <AxesSubplot:xlabel='Gender', ylabel='Miles'>

localhost:8888/notebooks/Desktop/DSML/dsml-case-studies/Aerofit/Aerofit 9th dec.ipynb 3/5

09/12/2023, 21:33 Aerofit 9th dec - Jupyter Notebook

In [19]: sbn.heatmap(df.corr(), annot=True)

Out[19]: <AxesSubplot:>

In [ ]: #insights- visual analysis

In [ ]: #200 products of KP781. Who's gonna buy more? What's the number?

In [ ]: #100 Women, what's of percentage of women buying 281

In [ ]: #2300 PRODUCTS OF TYPE 281, WHO'S GONNA BUY MORE Married/ Unmarried and by what percentage

In [22]: pd.crosstab(index=df['Gender'], columns=df['Product'])

Out[22]:
Product KP281 KP481 KP781

Gender

Female 40 29 7

Male 40 31 33

In [25]: pd.crosstab(index=df['Gender'], columns=df['Product'], normalize='columns')*100

Out[25]:
Product KP281 KP481 KP781

Gender

Female 50.0 48.333333 17.5

Male 50.0 51.666667 82.5

In [ ]: #1200 pieces of KP281 are sold, how many of them are bought by females = 600x

In [26]: pd.crosstab(index=df['Gender'], columns=df['Product'], margins=True)

Out[26]:
Product KP281 KP481 KP781 All

Gender

Female 40 29 7 76

Male 40 31 33 104

All 80 60 40 180

In [ ]: # % of females - 76/180 - Marginal Prob

#if there are 200 females, how many of them will buy kp281- 40/76*200

In [ ]: #In my whole data, what's the contribution of males buying kp481 -- Joint probability - 17.2%

localhost:8888/notebooks/Desktop/DSML/dsml-case-studies/Aerofit/Aerofit 9th dec.ipynb 4/5

09/12/2023, 21:33 Aerofit 9th dec - Jupyter Notebook

In [28]: pd.crosstab(index=df['Gender'], columns=df['Product'], margins=True, normalize=True)*100

Out[28]:
Product KP281 KP481 KP781 All

Gender

Female 22.222222 16.111111 3.888889 42.222222

Male 22.222222 17.222222 18.333333 57.777778

All 44.444444 33.333333 22.222222 100.000000

In [ ]: #insights-----

localhost:8888/notebooks/Desktop/DSML/dsml-case-studies/Aerofit/Aerofit 9th dec.ipynb 5/5

Solution - Data Analysis With Python-Project-2 - v1.0
No ratings yet
Solution - Data Analysis With Python-Project-2 - v1.0
14 pages
Mavenir UPF Solution Brief
No ratings yet
Mavenir UPF Solution Brief
9 pages
Aerofit CaseStudy
No ratings yet
Aerofit CaseStudy
30 pages
Aerofit_business_Case - JupyterLab
No ratings yet
Aerofit_business_Case - JupyterLab
36 pages
Aerofit_Case_Study
No ratings yet
Aerofit_Case_Study
16 pages
Business Case Aerofit Descriptive Statistics & Probability
No ratings yet
Business Case Aerofit Descriptive Statistics & Probability
12 pages
Aerofit Case Study analysis.ipynb - Colaboratory
No ratings yet
Aerofit Case Study analysis.ipynb - Colaboratory
6 pages
Aerofit
No ratings yet
Aerofit
7 pages
aerofit_eda
No ratings yet
aerofit_eda
25 pages
CardioGoodFitness - Descriptive Statistics (2) (1) - Jupyter Notebook
No ratings yet
CardioGoodFitness - Descriptive Statistics (2) (1) - Jupyter Notebook
14 pages
CardioGoodFitness - Jupyter Notebook
No ratings yet
CardioGoodFitness - Jupyter Notebook
12 pages
aerofit_case_study1
No ratings yet
aerofit_case_study1
56 pages
Cardio Good Fitness Dataset
No ratings yet
Cardio Good Fitness Dataset
27 pages
indexdw (1)
No ratings yet
indexdw (1)
34 pages
Practical7 Python Programming
No ratings yet
Practical7 Python Programming
6 pages
M pdf
No ratings yet
M pdf
13 pages
EDAusingpython_SAlaruri
No ratings yet
EDAusingpython_SAlaruri
50 pages
Clothes Size Prediction with KNN (1)
No ratings yet
Clothes Size Prediction with KNN (1)
11 pages
asfasdas
No ratings yet
asfasdas
36 pages
String (Pandas) - Removing $ After Int Sales ( Revenue') Sales ( Revenue') .STR - Strip ( $') #Convert String To Int
No ratings yet
String (Pandas) - Removing $ After Int Sales ( Revenue') Sales ( Revenue') .STR - Strip ( $') #Convert String To Int
12 pages
Bio-Signal Analysis For Smoking
No ratings yet
Bio-Signal Analysis For Smoking
1 page
Data Science Practical Book - Ipynb
No ratings yet
Data Science Practical Book - Ipynb
21 pages
Assignment 1 - LP1
No ratings yet
Assignment 1 - LP1
14 pages
Data Science
No ratings yet
Data Science
8 pages
K-Nearest Neighbors For Diabetes Prediction: Malik Yousaf (F2020019038) Ahsan Rauf (F2020019057)
No ratings yet
K-Nearest Neighbors For Diabetes Prediction: Malik Yousaf (F2020019038) Ahsan Rauf (F2020019057)
15 pages
PRJ Sales Forecasting
No ratings yet
PRJ Sales Forecasting
22 pages
Major project - Colab
No ratings yet
Major project - Colab
15 pages
PythonForMachineLearning
No ratings yet
PythonForMachineLearning
66 pages
Asset-V1 VIT+MBA109+2020+type@asset+block@Introductio To ML Using Python
No ratings yet
Asset-V1 VIT+MBA109+2020+type@asset+block@Introductio To ML Using Python
7 pages
Analysing NBA DATA
No ratings yet
Analysing NBA DATA
13 pages
Pandas cheat sheet
No ratings yet
Pandas cheat sheet
19 pages
Aerofit Case Study
No ratings yet
Aerofit Case Study
1 page
Machine Learning Lab Manual (1) (1)
No ratings yet
Machine Learning Lab Manual (1) (1)
26 pages
Fds SLOT 2
No ratings yet
Fds SLOT 2
12 pages
Nikita Prasad - Exploratory Data Analysis (EDA)
No ratings yet
Nikita Prasad - Exploratory Data Analysis (EDA)
18 pages
Python Class 6 Assignment Solution
No ratings yet
Python Class 6 Assignment Solution
9 pages
Exemplar - Perform Feature Engineering
No ratings yet
Exemplar - Perform Feature Engineering
14 pages
Pandas
No ratings yet
Pandas
21 pages
Machine Learning
No ratings yet
Machine Learning
30 pages
Assignment 3
No ratings yet
Assignment 3
7 pages
Pandas Notes
No ratings yet
Pandas Notes
5 pages
E-Commerce Product Delivery Prediction
No ratings yet
E-Commerce Product Delivery Prediction
13 pages
BPP Business School - Applied Modelling and Visualisation
No ratings yet
BPP Business School - Applied Modelling and Visualisation
19 pages
pp DWDM 4 5
No ratings yet
pp DWDM 4 5
26 pages
About The Dataset - Car Evaluation Dataset (UCI Machine Learning Repository
No ratings yet
About The Dataset - Car Evaluation Dataset (UCI Machine Learning Repository
5 pages
Exercise 3
No ratings yet
Exercise 3
12 pages
05 Pandas (1)
No ratings yet
05 Pandas (1)
12 pages
Astros
No ratings yet
Astros
20 pages
BigMart Sales Data Analysis
No ratings yet
BigMart Sales Data Analysis
16 pages
ml_labmanual (3)
No ratings yet
ml_labmanual (3)
33 pages
Mini Project Rathin
No ratings yet
Mini Project Rathin
6 pages
Python For Data Sceince l1 Hands On
No ratings yet
Python For Data Sceince l1 Hands On
5 pages
Experiment 3 ML
No ratings yet
Experiment 3 ML
4 pages
Big Sales Mart Final Script PDF
No ratings yet
Big Sales Mart Final Script PDF
36 pages
Lecture 2 Annotated
No ratings yet
Lecture 2 Annotated
70 pages
Date Preparation and Exploration:: Titanic Data - CSV
No ratings yet
Date Preparation and Exploration:: Titanic Data - CSV
5 pages
ML Proj Diabetes.pptx
No ratings yet
ML Proj Diabetes.pptx
51 pages
batch1 ds
No ratings yet
batch1 ds
15 pages
Final
No ratings yet
Final
15 pages
data science programs
No ratings yet
data science programs
11 pages
150+ C Pattern Programs
From Everand
150+ C Pattern Programs
Hernando Abella
No ratings yet
Q6
No ratings yet
Q6
1 page
Q5
No ratings yet
Q5
1 page
Download
No ratings yet
Download
17 pages
Note Chisquare
No ratings yet
Note Chisquare
19 pages
5 Steps To Designing An Embedded Software Architecture, Step 2
No ratings yet
5 Steps To Designing An Embedded Software Architecture, Step 2
4 pages
Sharjah Regulations
86% (121)
Sharjah Regulations
129 pages
1974008731
No ratings yet
1974008731
95 pages
Wo Wie
No ratings yet
Wo Wie
2 pages
9.an Integrated System For Regional
No ratings yet
9.an Integrated System For Regional
3 pages
Overexpression of Snail Is Associated With Lymph Node Metastasis and Poor Prognosis in Patients With Gastric Cancer
No ratings yet
Overexpression of Snail Is Associated With Lymph Node Metastasis and Poor Prognosis in Patients With Gastric Cancer
24 pages
Chapter 1 - Physics-2
No ratings yet
Chapter 1 - Physics-2
6 pages
h._geo_-_midterm_review_packet_-_1718
No ratings yet
h._geo_-_midterm_review_packet_-_1718
18 pages
Innovations in Logistics Management As A Direction For Improving The Logistics Activities of Enterprises
No ratings yet
Innovations in Logistics Management As A Direction For Improving The Logistics Activities of Enterprises
9 pages
Group 3 Tests and Adjustments: 1. Clutch Cut-Off Pressure Switch Test
No ratings yet
Group 3 Tests and Adjustments: 1. Clutch Cut-Off Pressure Switch Test
3 pages
Java - Understanding The Workings of Equals and Hashcode in A HashMap - Stack Overflow
No ratings yet
Java - Understanding The Workings of Equals and Hashcode in A HashMap - Stack Overflow
10 pages
GATE 2014 2015 Exam Syllabus For Electrical Engineering - EEE PDF Download
No ratings yet
GATE 2014 2015 Exam Syllabus For Electrical Engineering - EEE PDF Download
2 pages
Innoseis 04 Uaceg v02 01 - Case Study 4sb r10 Final
No ratings yet
Innoseis 04 Uaceg v02 01 - Case Study 4sb r10 Final
24 pages
Tosca Notes
No ratings yet
Tosca Notes
14 pages
BMEn 3301 Spring 2013 Syllabus
No ratings yet
BMEn 3301 Spring 2013 Syllabus
13 pages
The Common Ion Effect
No ratings yet
The Common Ion Effect
24 pages
HELLA DR 820 Instruction Manual
No ratings yet
HELLA DR 820 Instruction Manual
1 page
FA-Joining-oxide-ceramics
No ratings yet
FA-Joining-oxide-ceramics
9 pages
Percentage (NerdsJobPortal - Com)
No ratings yet
Percentage (NerdsJobPortal - Com)
13 pages
Write 15 applications of statistics and probability on engineering
No ratings yet
Write 15 applications of statistics and probability on engineering
8 pages
Lab 10 DB
No ratings yet
Lab 10 DB
3 pages
AN004 - Data Analysis Techniques
No ratings yet
AN004 - Data Analysis Techniques
4 pages
Design and Fabrication of Solar Tracker
No ratings yet
Design and Fabrication of Solar Tracker
7 pages
Excel Macro Examples
No ratings yet
Excel Macro Examples
23 pages
CEPEsample
No ratings yet
CEPEsample
5 pages
A2 Chem Repeated Ques
No ratings yet
A2 Chem Repeated Ques
34 pages
Student Performance Prediction and Analysis: Ijarcce
No ratings yet
Student Performance Prediction and Analysis: Ijarcce
4 pages
Xxgpu90-15 16-TV V
No ratings yet
Xxgpu90-15 16-TV V
1 page
01.electric Charge and Fields
No ratings yet
01.electric Charge and Fields
54 pages

PDF Notebook

Uploaded by

PDF Notebook

Uploaded by

09/12/2023, 21:33 Aerofit 9th dec - Jupyter Notebook

In [1]: import pandas as pd

0 KP281 18 Male 14 Single 3 4 29562 112

1 KP281 19 Male 15 Single 2 3 31836 75

2 KP281 19 Female 14 Partnered 4 3 30699 66

3 KP281 19 Male 12 Single 3 3 32973 85

4 KP281 20 Male 13 Partnered 4 2 35247 47

5 KP281 20 Female 14 Partnered 3 3 32973 66

6 KP281 21 Female 14 Partnered 3 3 35247 75

7 KP281 21 Male 13 Single 3 3 32973 85

8 KP281 21 Male 15 Single 5 4 35247 141

9 KP281 21 Female 15 Partnered 2 3 37521 85

In [7]: import seaborn as sbn

localhost:8888/notebooks/Desktop/DSML/dsml-case-studies/Aerofit/Aerofit 9th dec.ipynb 1/5

In [8]: sbn.boxplot(x='Product', y='Income', data=df)

Out[8]: <AxesSubplot:xlabel='Product', ylabel='Income'>

In [9]: sbn.boxplot(x='Product', y='Miles', data=df)

Out[9]: <AxesSubplot:xlabel='Product', ylabel='Miles'>

In [10]: sbn.boxplot(x='Product', y='Education', data=df)

Out[10]: <AxesSubplot:xlabel='Product', ylabel='Education'>

localhost:8888/notebooks/Desktop/DSML/dsml-case-studies/Aerofit/Aerofit 9th dec.ipynb 2/5

KP281 80.0 15.037500 1.216383 12.0 14.0 16.0 16.0 18.0

KP481 60.0 15.116667 1.222552 12.0 14.0 16.0 16.0 18.0

KP781 40.0 17.325000 1.639066 14.0 16.0 18.0 18.0 21.0

In [15]: sbn.boxplot(x='Product', y='Miles', data=df)

Out[15]: <AxesSubplot:xlabel='Product', ylabel='Miles'>

In [17]: sbn.scatterplot(x='Miles', y='Income',hue='Product', data=df)

Out[17]: <AxesSubplot:xlabel='Miles', ylabel='Income'>

In [18]: sbn.boxplot(x='Gender', y='Miles', hue='Product', data=df)

Out[18]: <AxesSubplot:xlabel='Gender', ylabel='Miles'>

localhost:8888/notebooks/Desktop/DSML/dsml-case-studies/Aerofit/Aerofit 9th dec.ipynb 3/5

In [19]: sbn.heatmap(df.corr(), annot=True)

In [ ]: #insights- visual analysis

In [ ]: #100 Women, what's of percentage of women buying 281

In [22]: pd.crosstab(index=df['Gender'], columns=df['Product'])

In [25]: pd.crosstab(index=df['Gender'], columns=df['Product'], normalize='columns')*100

Female 50.0 48.333333 17.5

Male 50.0 51.666667 82.5

In [26]: pd.crosstab(index=df['Gender'], columns=df['Product'], margins=True)

In [ ]: # % of females - 76/180 - Marginal Prob

localhost:8888/notebooks/Desktop/DSML/dsml-case-studies/Aerofit/Aerofit 9th dec.ipynb 4/5

In [28]: pd.crosstab(index=df['Gender'], columns=df['Product'], margins=True, normalize=True)*100

Female 22.222222 16.111111 3.888889 42.222222

Male 22.222222 17.222222 18.333333 57.777778

All 44.444444 33.333333 22.222222 100.000000

localhost:8888/notebooks/Desktop/DSML/dsml-case-studies/Aerofit/Aerofit 9th dec.ipynb 5/5

You might also like