Business ANalytics - Logistic Regression
Business ANalytics - Logistic Regression
Analytics
OUR ANALYSIS- USING LOGISTIC
REGRESSION
1650 movies
Dichotomous scale
Data collection
& streamlining
Running logistic
regression
Analyzing
results
Prediction
Accuracy check
ANALYSIS
Using the
regression
coefficients to
predict the
values on test
data and then
Divided the
checking
data into
accuracy of the
training and
model
test data
On the
training data-
logistic
regression
analysis
using R
STEP 1: Collecting the data set
STEP 2 Logistic regression on Training Data
Step 3- Prediction on test data
ACCURACY OF THE MODEL
USING ONLY SIGNIFICANT VARIABLES
ACCURACY OF THE MODEL
ERROR 0.043378995
ACCURACY 0.956621005
PREDICTED PROBABILITY OF WINNING USING LOGISTIC
REGRESSION
Classification
1.00
0.90
0.80
0.70
0.60
Probability
0.50
0.40
0.30
0.20
0.10
0.00
0 1 2 3 4 5 6 7 8 9 10
IMDB Rating
PREDICTED PROBABILITY OF WINNING USING LOGISTIC
REGRESSION
Classification
1
0.9
0.8
0.7
0.6
Probability
0.5
0.4
0.3
0.2
0.1
0
0 20 40 60 80 100 120 140
Number of Nominations
INTERPRETATION- SIGNIFICANT
VARIABLES
Significant Variables:
IMDb rating
Number of nominations
Insignificant
variables:
Rating count
Number of photos
Number of news
articles
p(x)= -9.008+(1.091*imdbRating)+(-
0.000003268*ratingCount)+(1.810*nrOfNomination)+(- Number of user
0.001*nrOfPhotos)+(- reviews
0.000049*nrOfNewsArticles)+(0.002*nrOfUserReviews Number of genres
)+(-0.066*nrOfGenre)
p(x)= --8.9192+(1.076*imdbRatin)+(1.8920*nrOfNomination)
APPLICATIONS