Data Science and ML - End Term
Data Science and ML - End Term
Part B
Use of Internet: Yes
Use of Laptop : Allowed
Open book : Yes
Calculator : Yes
Duration : 75 Minutes
Part B Marks : 20 Marks
Instructions
Submit
Note
1. If your .ipnyb does not run, you may not get any marks.
2. It is your responsibility to ensure that the files you submit
are complete in all aspects.
3. Do not forget to mention your name and rollnumber in the file
name
4. You should use your computer to run the code. Sharing of code
files is strictly prohibited.
Datasets Supplied
1. Use the Use the german rec.t dataset German Credit Data csv'
L. Divide the data into train and test using last 3 digits of your
roll number as random state [2 Marks]
Build three classification models using the train dataset by
applying (a) 1ogistic regression and (b) SVM- RBF Kernel and (c)
Decision Trees [2 Marks]
Get the predicted probabilities for the test data set for each
of the above three models. Store the three predicted
probabilities in a dataframe named "result". [2 Marks]
PART A
Q1 State True or False [Total Marks 09]
PART B
Q1. Find the Gini Index of the following nodes. Show the steps in deriving the answe.
(3 Marks]
Q2.Find the Entropy of the following nodes. Show the steps in deriving the answer. (3
Marks]
Q6. Answer the following questions from the following figure (3 Marks]
1.What is the probability of being good credit if Checkin_acc_Al4 =0and duration is 20?
2. What is the probability of being good credit if Checkin_acc_Al4 = 0 and duration is 35 ?
3. What is the probability of being good credit if Checkin_acc_Al4 = 1 and inst_plans_A143
= 0?