MIS410 Lecture8toLecture10
(Course Code: MIS 410, Prerequisite: BUS 173, MIS 210/MIS 310)
• Cross-Validation
• Bootstrapping
• Multiple regression
• Logistic regression
Cross-Validation
• Avoid overfitting
Cross-Validation Methods
• K-fold cross-validation
• Bootstrapping
Hold Out Cross-Validation
• Which of the previous two models performs better under 25% hold-out
cross-validation? The model with the lowest average MSE, computed
over 500 replications.
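The comparison above can be sketched in R roughly as follows. This is only an illustration: the two models compared in the lecture are not shown here, so the built-in mtcars data and the two formulas (mpg ~ wt and mpg ~ wt + hp) are assumptions made for the example.

```r
# Hold-out cross-validation sketch on the built-in mtcars data.
# The data set and both model formulas are illustrative assumptions.
set.seed(1)                        # make the random splits reproducible
n.rep  <- 500                      # number of replications, as in the slide
n      <- nrow(mtcars)
n.test <- round(0.25 * n)          # 25% of the rows are held out for testing

mse1 <- numeric(n.rep)
mse2 <- numeric(n.rep)
for (i in 1:n.rep) {
  test.idx <- sample(n, n.test)    # random hold-out rows for this replication
  train <- mtcars[-test.idx, ]
  test  <- mtcars[test.idx, ]

  fit1 <- lm(mpg ~ wt, data = train)        # assumed model 1
  fit2 <- lm(mpg ~ wt + hp, data = train)   # assumed model 2

  # Test-set MSE of each model
  mse1[i] <- mean((test$mpg - predict(fit1, newdata = test))^2)
  mse2[i] <- mean((test$mpg - predict(fit2, newdata = test))^2)
}

avg.mse <- c(model1 = mean(mse1), model2 = mean(mse2))
avg.mse                            # average hold-out MSE of each model
names(which.min(avg.mse))          # model with the lowest average MSE
```

Because both models are evaluated on the same random splits in every replication, the difference in average MSE reflects the models rather than the luck of a particular split.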
K-fold Cross-Validation
Usually, k=10 is used and we call it 10-fold cross-validation.
Let us explain this with k=5 as follows.
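A minimal R sketch of one 5-fold cross-validation run is given here. The data set (mtcars) and the model formula are assumptions chosen for illustration, not the lecture's own example. Each row is randomly assigned to one of five folds; each fold is used once as the test set while the other four folds form the training set.

```r
# 5-fold cross-validation sketch (data set and model are assumptions).
set.seed(1)
k <- 5
n <- nrow(mtcars)
fold <- sample(rep(1:k, length.out = n))   # random fold label for every row

fold.mse <- numeric(k)
for (j in 1:k) {
  train <- mtcars[fold != j, ]             # the other k - 1 folds
  test  <- mtcars[fold == j, ]             # fold j is held out
  fit <- lm(mpg ~ wt + hp, data = train)
  fold.mse[j] <- mean((test$mpg - predict(fit, newdata = test))^2)
}

MSE <- mean(fold.mse)                      # average of the k fold-level MSEs
MSE
```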
K-fold Cross-Validation
Replicate the whole computation 100 times and calculate the
average MSE:
MSE.rep = NULL
for (i in 1:100)
{
  # MSE is the k-fold cross-validated MSE computed in replication i
  MSE.rep[i] = MSE
}
mean(MSE.rep)
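One way to make the replication loop fully runnable is to wrap a single k-fold run in a function and call it inside the loop. The helper name kfold.mse, the mtcars data, and the model formula are hypothetical choices for this sketch.

```r
# Replicated k-fold cross-validation sketch.
# kfold.mse() is a hypothetical helper; data set and formula are assumptions.
kfold.mse <- function(formula, data, k = 10) {
  fold <- sample(rep(1:k, length.out = nrow(data)))  # random fold labels
  response <- all.vars(formula)[1]                   # name of the dependent variable
  fold.mse <- sapply(1:k, function(j) {
    fit  <- lm(formula, data = data[fold != j, ])    # train on the other folds
    test <- data[fold == j, ]                        # test on fold j
    mean((test[[response]] - predict(fit, newdata = test))^2)
  })
  mean(fold.mse)                                     # cross-validated MSE
}

set.seed(1)
MSE.rep <- numeric(100)
for (i in 1:100) {
  MSE.rep[i] <- kfold.mse(mpg ~ wt + hp, mtcars, k = 10)
}
mean(MSE.rep)                                        # average MSE over 100 replications
```

Each replication draws a new random fold assignment, so averaging over replications reduces the dependence of the result on any single partition of the data.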
Bootstrapping
• (1) Fit regression model
• (2) Calculate fitted values and residuals, and save these values
• (5) Fit regression model with dependent variable in (4) and save coefficients
• (6) Repeat steps (3) to (5) 100 times and calculate the average of the saved
coefficients. These averages are known as bootstrap estimates of the coefficients.
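The procedure can be sketched in R as follows. Steps (3) and (4) do not appear in the list above, so this sketch assumes the usual residual bootstrap: (3) resample the saved residuals with replacement, and (4) add them to the fitted values to form a new dependent variable. The mtcars data and the model formula are also assumptions made for illustration.

```r
# Residual bootstrap sketch for regression coefficients.
# Steps (3) and (4) are assumed; data set and formula are illustrative.
set.seed(1)
B <- 100                                         # number of bootstrap repetitions

fit <- lm(mpg ~ wt + hp, data = mtcars)          # (1) fit the regression model
fitted.val <- fitted(fit)                        # (2) save fitted values
res <- residuals(fit)                            #     and residuals

boot.coef <- matrix(NA, nrow = B, ncol = length(coef(fit)),
                    dimnames = list(NULL, names(coef(fit))))
boot.data <- mtcars
for (b in 1:B) {
  res.star <- sample(res, replace = TRUE)        # (3) assumed: resample residuals
  boot.data$mpg <- fitted.val + res.star         # (4) assumed: new dependent variable
  boot.fit <- lm(mpg ~ wt + hp, data = boot.data)  # (5) refit the model
  boot.coef[b, ] <- coef(boot.fit)               #     and save the coefficients
}

colMeans(boot.coef)                              # (6) bootstrap estimates of coefficients
```

The column standard deviations, apply(boot.coef, 2, sd), give bootstrap standard errors for the same coefficients.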
Bootstrapping