0% found this document useful (0 votes)
9 views

Data Mining-3

The document analyzes a dataset to check for missing values, duplicate data, and performs exploratory data analysis including descriptive statistics. Graphs and tables are used to summarize the distribution, skewness and kurtosis of variables in the dataset.

Uploaded by

Deepanjali b.n
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
9 views

Data Mining-3

The document analyzes a dataset to check for missing values, duplicate data, and performs exploratory data analysis including descriptive statistics. Graphs and tables are used to summarize the distribution, skewness and kurtosis of variables in the dataset.

Uploaded by

Deepanjali b.n
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 2

CHECK FOR MISSING VALUE

Table 3. Missing value checking

Table 3 confirms that there is no missing value present in dataset given.

CHECK FOR DUPLICATE DATA

Table 4. Duplicate data checking

We can interfere from above table that there is no duplicate data in the dataset.

EXPLORATORY DATA ANALYSIS

From Table 2,
We can conclude that there are total 210 rows and 7 columns in the dataset.
And all the variables are in float64 Data type, where data are in Decimal numbers format.
and there is no missing value present in the dataset.

Table.5 – Data information

4
DESCRIPTIVE STATISTICS TO SUMMARIZE DATA

From the descriptive statistics, we can see that there are 120 counts in all the variables.
There is no Nan values present in data.

Table.6 – Descriptive Summary of the data

• Distribution is symmetric as the mean is almost equal to the median and the distribution have
zero skewness.
• Maximum of 91% of customer has done payment in full to the bank.

UNIVARIATE ANALYSIS

1. SPENDING

Refer Fig1,

Data distribution is Right or Positively skewed. Looks like a bimodal distribution, as there are two
peaks. In a bimodal distribution, the data should be separated and analyzed as separate normal
distributions.

Fig 1: Histogram of Spending

You might also like