Bank Marketing Data
Bank Marketing Data
BY
Kotha Bala Venkata Naga
PavanKumar
Problem Statement
• logistic regression does work better when you remove attributes that are
unrelated to the output variable as well as attributes that are very similar
(correlated) to each other
• logistic regression model is that the interpretation is more difficult because
the interpretation of the weights is multiplicative and not additive.
• The problem of complete separation can be solved by introducing
penalization of the weights or defining a prior probability distribution of
weights.
Decision tree classification
• As we mentioned earlier a single decision tree tends to overfit the data. The process of
averaging or combining the results of different decision trees helps to overcome the
problem of overfitting.
• Random forests also have less variance than a single decision tree. It means that it
works correctly for a large range of data items than single decision trees.
• Random forests are extremely flexible and have very high accuracy.
• They also do not require preparation of the input data. You do not have to scale the
data.
• It also maintains accuracy even when a large proportion of the data are missing.
Disadvantage