05b Logistic Regression
05b Logistic Regression
Logistic Regression
p
z log logit function
1 p
p
e z
1 p
ez 1
p logistic function
1 e z
1 e z
Scenario:
– A multidimensional feature space (features
can be categorical or continuous).
– Outcome is discrete, not continuous.
We’ll focus on case of two classes.
– It seems plausible that a linear decision
boundary (hyperplane) will give good
predictive accuracy.
matlab_demo_05.m
Advantages:
– Makes no assumptions about distributions of classes in feature
space
– Easily extended to multiple classes (multinomial regression)
– Natural probabilistic view of class predictions
– Quick to train
– Very fast at classifying unknown records
– Good accuracy for many simple data sets
– Resistant to overfitting
– Can interpret model coefficients as indicators of feature
importance
Disadvantages:
– Linear decision boundary