0% found this document useful (0 votes)
61 views

Import As Import As: "Iris - CSV"

This document analyzes iris data from a CSV file with 150 rows and 5 columns containing sepal length, sepal width, petal length, petal width, and species information for each iris sample. It loads and inspects the data, calculates summary statistics, and generates histograms and boxplots to visualize the distributions of each variable.

Uploaded by

avinash
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
61 views

Import As Import As: "Iris - CSV"

This document analyzes iris data from a CSV file with 150 rows and 5 columns containing sepal length, sepal width, petal length, petal width, and species information for each iris sample. It loads and inspects the data, calculates summary statistics, and generates histograms and boxplots to visualize the distributions of each variable.

Uploaded by

avinash
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 4

Untitled http://localhost:8888/nbconvert/html/Assignment%2010/Untitled.ipynb...

In [1]:
import numpy as np
import pandas as pd

In [10]:
df = pd.read_csv("iris.csv")

In [11]:
df

Out[11]: sepal_length sepal_width petal_length petal_width species

0 5.1 3.5 1.4 0.2 setosa

1 4.9 3.0 1.4 0.2 setosa

2 4.7 3.2 1.3 0.2 setosa

3 4.6 3.1 1.5 0.2 setosa

4 5.0 3.6 1.4 0.2 setosa

... ... ... ... ... ...

145 6.7 3.0 5.2 2.3 virginica

146 6.3 2.5 5.0 1.9 virginica

147 6.5 3.0 5.2 2.0 virginica

148 6.2 3.4 5.4 2.3 virginica

149 5.9 3.0 5.1 1.8 virginica

150 rows × 5 columns

In [12]:
column = len(list(df))
column

Out[12]: 5

In [13]:
df.info()

<class 'pandas.core.frame.DataFrame'>
RangeIndex: 150 entries, 0 to 149
Data columns (total 5 columns):
# Column Non-Null Count Dtype
--- ------ -------------- -----
0 sepal_length 150 non-null float64
1 sepal_width 150 non-null float64
2 petal_length 150 non-null float64
3 petal_width 150 non-null float64
4 species 150 non-null object
dtypes: float64(4), object(1)
memory usage: 6.0+ KB

1 of 4 19-04-2022, 01:06 pm
Untitled http://localhost:8888/nbconvert/html/Assignment%2010/Untitled.ipynb...

In [14]:
df.describe()

Out[14]: sepal_length sepal_width petal_length petal_width

count 150.000000 150.000000 150.000000 150.000000

mean 5.843333 3.054000 3.758667 1.198667

std 0.828066 0.433594 1.764420 0.763161

min 4.300000 2.000000 1.000000 0.100000

25% 5.100000 2.800000 1.600000 0.300000

50% 5.800000 3.000000 4.350000 1.300000

75% 6.400000 3.300000 5.100000 1.800000

max 7.900000 4.400000 6.900000 2.500000

In [15]:
import seaborn as sns
import matplotlib
import matplotlib.pyplot as plt
%matplotlib inline

In [17]:
fig, axes = plt.subplots(2, 2, figsize=(16, 8))

axes[0,0].set_title("Distribution of First Column")


axes[0,0].hist(df["sepal_length"]);

axes[0,1].set_title("Distribution of Second Column")


axes[0,1].hist(df["sepal_width"]);

axes[1,0].set_title("Distribution of Third Column")


axes[1,0].hist(df["petal_length"]);

axes[1,1].set_title("Distribution of Fourth Column")


axes[1,1].hist(df["petal_width"]);

2 of 4 19-04-2022, 01:06 pm
Untitled http://localhost:8888/nbconvert/html/Assignment%2010/Untitled.ipynb...

3 of 4 19-04-2022, 01:06 pm
Untitled http://localhost:8888/nbconvert/html/Assignment%2010/Untitled.ipynb...

In [19]:
data_to_plot = [df["sepal_length"],df["sepal_width"],df["petal_length"],df["petal_width"

sns.set_style("whitegrid")
# Creating a figure instance
fig = plt.figure(1, figsize=(12,8))

# Creating an axes instance


ax = fig.add_subplot(111)

# Creating the boxplot


bp = ax.boxplot(data_to_plot);

In [ ]:

4 of 4 19-04-2022, 01:06 pm

You might also like