Assignment 2 Due Date: Sep 29, 2020
Assignment 2 Due Date: Sep 29, 2020
1. Build a logistic regression model based on training data set to identify good customers and
bad customers. A good customer is one who has never delayed the payment, whereas a bad
customer is one who has delayed the payment even once. Use the variables “AGE”,
“NOOFDEPE”, “MTHINCTH”, “SALDATFR”, “TENORYR”, “DWNPMFR”, “PROFBUS”,
“QUALHSC”, “QUAL_PG”, “SEXCODE”, “FULLPDC”, “FRICODE” and “WASHCODE” as predictors
in your logistic model. Clearly interpret the output of the model.
2. Judge the performance of the model based on validation data set. Is the performance of the
model satisfactory? Consider at least two criteria.
3. Include the variable “Region” as an additional predictor in your logistic model. Note that you
have to create appropriate dummy variables for “Region”. Does inclusion of “Region”
improves the performance of the model?
4. Suppose Auto Finance Ltd. provides loan for a 2-year period. The management of the Auto
Finance Ltd. has estimated that the profit associated with a “True Positive” case is Rs. 6360.
Furthermore, they also estimated that the losses associated with a “False Negative” case and
a “False Positive” case are Rs. 12500 and Rs. 6360, respectively. Based on confusion matrix
obtained for the validation data set, calculate the total profit for the company.
5. How will the fitted model be helpful in taking managerial decisions?