Data Mining-3
Data Mining-3
We can interfere from above table that there is no duplicate data in the dataset.
From Table 2,
We can conclude that there are total 210 rows and 7 columns in the dataset.
And all the variables are in float64 Data type, where data are in Decimal numbers format.
and there is no missing value present in the dataset.
4
DESCRIPTIVE STATISTICS TO SUMMARIZE DATA
From the descriptive statistics, we can see that there are 120 counts in all the variables.
There is no Nan values present in data.
• Distribution is symmetric as the mean is almost equal to the median and the distribution have
zero skewness.
• Maximum of 91% of customer has done payment in full to the bank.
UNIVARIATE ANALYSIS
1. SPENDING
Refer Fig1,
Data distribution is Right or Positively skewed. Looks like a bimodal distribution, as there are two
peaks. In a bimodal distribution, the data should be separated and analyzed as separate normal
distributions.