0% found this document useful (0 votes)
33 views

Chapter 12 - Logistic Regression For Classification and Prediction

Logistic regression is used to predict the likelihood of categorical dependent variables, such as purchase/non-purchase. It is preferred over discriminant analysis when there are two categories of the dependent variable. Logistic regression uses maximum likelihood estimation to estimate the odds of predicting dependent variable values from independent variables. It iterates to improve coefficient estimates based on log likelihood until improvements are insignificant. While both techniques can predict groups, logistic regression has fewer assumptions and can handle different variable scales. An example is provided to illustrate logistic regression analysis in SPSS.

Uploaded by

alfin luanmasa
Copyright
© © All Rights Reserved
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
33 views

Chapter 12 - Logistic Regression For Classification and Prediction

Logistic regression is used to predict the likelihood of categorical dependent variables, such as purchase/non-purchase. It is preferred over discriminant analysis when there are two categories of the dependent variable. Logistic regression uses maximum likelihood estimation to estimate the odds of predicting dependent variable values from independent variables. It iterates to improve coefficient estimates based on log likelihood until improvements are insignificant. While both techniques can predict groups, logistic regression has fewer assumptions and can handle different variable scales. An example is provided to illustrate logistic regression analysis in SPSS.

Uploaded by

alfin luanmasa
Copyright
© © All Rights Reserved
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 13

Mandagi, Pinkan Ni Komang Yuli Oroh, Yana

Learning Objectives

In this chapter, we will

 a Introduce the concept of logistic regression


 a Compare it with discriminant analysis
 a Explain its working through an example
 a Discuss pros and cons of its usage in the real world
APPLICATION AREAS

Like Discriminant Analysis, Logistic Regression is used to distinguish between two or more groups. While
either technique can be used for such applications, Logistic Regression is generally preferred when there
are only two categories of the dependent variable, while Discriminant Analysis is the method of choice
when there are more than two. Typical application areas are cases where one wishes to predict the
likelihood of an entity belonging to one group or another, such as in response to a marketing effort
(likelihood of purchase/non-purchase), creditworthiness (high/low risk of default), insurance (high/low
risk of accident claim), medicine (high/low risk of some ailment like heart attack), sports (likelihood of
victory/loss).
METHODS

(12.1 (12.4)
)
(12.2) (12.5)

(12.3) (12.6)
The Algorithm

While Linear models use the Ordinary Least Squares (OLS) estimation of coefficients, Logistic
regression uses the Maximum Likelihood Estimation (MLE) technique. In other words, it tries to
estimate the odds that the dependent variable values can be predicted using the independent variable
values. This is done by starting out with a random set of coefficients and then iteratively improving them
based on improvements to the log likelihood measure. After a few iterations, the process stops when
further improvement is insignificant, based on some predetermined criteria.
Logistic Regression Versus Linear Discriminant Analysis

In terms of predicting power, there is a debate over which technique performs better and there is no
clear winner. As stated before, the general view is that Logistic Regression is preferred for binomial
dependent variables, while discriminant is better when there are more than 2 values of the dependent. Mathematically too,
Logistic Regression is less encumbered by the assumptions of Discriminant Analysis. The independent variables in Logistic
Regression may be anything from Nominal to Ratio scaled and there are no distribution assumptions.
NUMERICAL EXAMPLE WITH SPSS
SPSS
Interpretation of Output
Statistical Significance
Predictors
Classification of New Customer

The values of the independent variables are used to compute a score, which is then transformed
to get a probability of disloyalty. If this probability is greater than 0.5, the customer will be
classified as disloyal (1). If less than 0.5, then he/she will be classified as loyal (0).
THANK YOU!

You might also like