0% found this document useful (0 votes)
54 views

Data Mining Quiz 3 - Random Forest: Course Content

This document describes the results of a random forest machine learning quiz on data mining. It shows that the student scored 5 out of 5 points on the quiz. The quiz consisted of 6 multiple choice questions about building and evaluating a random forest model on a cardiac dataset to predict patient mortality. Based on the model's performance on the training and test sets, it achieved an accuracy of 99% and an AUC of 79% on the test set. The model precision for predicting died patients was 71% and recall was 45%. [END SUMMARY]

Uploaded by

AbhijitSinha
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
54 views

Data Mining Quiz 3 - Random Forest: Course Content

This document describes the results of a random forest machine learning quiz on data mining. It shows that the student scored 5 out of 5 points on the quiz. The quiz consisted of 6 multiple choice questions about building and evaluating a random forest model on a cardiac dataset to predict patient mortality. Based on the model's performance on the training and test sets, it achieved an accuracy of 99% and an AUC of 79% on the test set. The model precision for predicting died patients was 71% and recall was 45%. [END SUMMARY]

Uploaded by

AbhijitSinha
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 8

Go Back to Data Mining

Course Content

Data Mining Quiz 3 - Random Forest

Type : Graded Quiz

Attempts : 1/1

Questions : 6

Time : 45m

Due Date : Dec 05, 2021, 11:59 PM

Your Score : 5/5

Instructions

Attempt History

Attempt #1
Dec 05, 2021, 7:38
PM
Marks: 5

Q No: 1 Correct Answer


Marks:
0.50/0.50

Please use the dataset azcabgptca.csv for answering the

following question. Which of the following plot is the 'countplot'

of the target variable?

[Hint: use seaborn]

This study source was downloaded by 100000841338173 from CourseHero.com on 05-07-2022 03:34:25 GMT -05:00

https://www.coursehero.com/file/134041773/Data-Mining-Quiz-3-Random-Forest-Data-Mining-Great-Learningdocx/ 1/7
None of the mentioned

You Selected

Correct Answer
Q No: 2
Marks:
1/1
This study source was downloaded by 100000841338173 from CourseHero.com on 05-07-2022 03:34:25 GMT -05:00

https://www.coursehero.com/file/134041773/Data-Mining-Quiz-3-Random-Forest-Data-Mining-Great-Learningdocx/
Make a RF model using random state=0 and n estimators=500( and using the
remaining

This study source was downloaded by 100000841338173 from CourseHero.com on 05-07-2022 03:34:25 GMT -05:00

https://www.coursehero.com/file/134041773/Data-Mining-Quiz-3-Random-Forest-Data-Mining-Great-Learningdocx/
default parameters)

Splitting: X_train, X_test, train_labels, test_labels = train_test_split(X, y,


test_size=.30, random_state=0)
Fit the model on train data
Predict on the train and test both.

What is the model score for the training &

testing set? [Round of up to 2 decimal

places]

Training Set Score 0.85 Testing Set Score 0.88

Training Set Score 0.91 Testing Set Score 0.92

You Selected
Training Set Score 0.99
Testing Set Score 0.99

Training Set Score 1.0 Testing Set Score 1.0

Q No: 3 Correct Answer


Marks:
1/1

Which one of the following is the confusion Matrix (plotted) for the

testing set? [Ignore the aesthetics, focus on the values]

This study source was downloaded by 100000841338173 from CourseHero.com on 05-07-2022 03:34:25 GMT -05:00

https://www.coursehero.com/file/134041773/Data-Mining-Quiz-3-Random-Forest-Data-Mining-Great-Learningdocx/
You Selected

All of the mentioned

None of the mentioned

Q No: 4 Correct Answer


Marks:
1/1
What is the ROC Area Under the Curve Score for the
testing set? [Upto 2 decimal places]

This study source was downloaded by 100000841338173 from CourseHero.com on 05-07-2022 03:34:25 GMT -05:00

https://www.coursehero.com/file/134041773/Data-Mining-Quiz-3-Random-Forest-Data-Mining-Great-Learningdocx/
ROC AUC Score for Testing Set 0.89

ROC AUC Score for Testing Set 0.21

You Selected
ROC AUC Score for Testing Set 0.79

ROC AUC Score for Testing Set 0.99

Q No: 5 Correct Answer


Marks:
1/1
What is the Precision and Recall for the patients who
have died? [Use Testing Set]

Precision: 0.86 Recall: 0.73

You Selected
Precision: 0.71
Recall: 0.45

Precision: 1:00
Recall: 0.99

Precision: 0.99
Recall: 0.56

This study source was downloaded by 100000841338173 from CourseHero.com on 05-07-2022 03:34:25 GMT -05:00

https://www.coursehero.com/file/134041773/Data-Mining-Quiz-3-Random-Forest-Data-Mining-Great-Learningdocx/
Patients who have died are marked as 1 in the variable 'died'. This means you need to see precision & recall f

precision recall f1-score support

0 0.99 1.00 0.99 577

1 0.71 0.45 0.56 11

accuracy 0.99 588


macro avg 0.85 0.73 0.77 588
weighted 0.98 0.99 0.98 588
avg

Q No: 6 Correct
Answer Marks:
0.50/0.50
As per the predictions on the testing set, how many patients actually died but
we predicted that they won't?

6 You Selected

The False Negatives, often termed as 'missed alarm'.

This study source was downloaded by 100000841338173 from CourseHero.com on 05-07-2022 03:34:25 GMT -05:00

https://www.coursehero.com/file/134041773/Data-Mining-Quiz-3-Random-Forest-Data-Mining-Great-Learningdocx/
Comments:

+ Add comments

This study source was downloaded by 100000841338173 from CourseHero.com on 05-07-2022 03:34:25 GMT -05:00

https://www.coursehero.com/file/134041773/Data-Mining-Quiz-3-Random-Forest-Data-Mining-Great-Learningdocx/
Powered by TCPDF (www.tcpdf.org)

You might also like