This document contains a summary of an assignment for a Data Mining and Business Intelligence course. It includes 5 multiple choice questions and 1 short answer question related to key concepts in data mining and machine learning such as classification accuracy, true/false positives and negatives, evaluating classification algorithms, and decision tree pruning. It provides instructions on computing metrics like true positive rate from given classification probability data and plotting an ROC curve.
DMBI Assignment 6
MGM’s College of Engineering and Technology, Kamothe, Navi Mumbai
Department of Information Technology
Course Code: ITC602
Course Name: Data Mining & Business Intelligence
Assignment VI
Class: TE
AY: 2018-2019
Date of Issue: 22/2/19
Date of Submission: 25/2/19
Date of Return: 28/2/19
Ques. No.   Question   (Module, Level*, CO)

1. Classification accuracy is   (Module 3, Level 1, CO4)
   A) A subdivision of a set of examples into a number of classes
   B) Measure of the accuracy of the classification of a concept that is given by a certain theory
   C) The task of assigning a classification to a set of examples
   D) None of these
2. The positive tuples that are incorrectly classified as negative.   (Module 3, Level 1, CO4)
   A) TP   B) TN   C) FP   D) FN
3. The negative tuples that are incorrectly classified as positive.   (Module 3, Level 1, CO4)
   A) TP   B) TN   C) FP   D) FN
4. How can you evaluate and compare the accuracy of different classification algorithms?   (Module 3, Level 2, CO4)
5. The data tuples in the following table are sorted by decreasing probability value, as returned by a classifier. For each tuple, compute the number of true positives (TP), false positives (FP), true negatives (TN), and false negatives (FN). Compute the true positive rate (TPR) and false positive rate (FPR). Plot the ROC curve for the data.   (Module 3, Level 2, CO4)
Tuple# Class Probability
1 P 0.95
2 N 0.85
3 P 0.78
4 P 0.66
5 N 0.60
6 P 0.55
7 N 0.53
8 N 0.52
9 N 0.51
10 P 0.40
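The counting asked for in the ROC question can be sketched in Python as below. This is only an illustrative sketch, not a model answer from the course. It assumes that row i uses the probability of tuple i as the decision threshold, so tuples 1 to i are predicted positive and the rest negative. The names TUPLES, roc_rows, ROWS and ROC_POINTS are hypothetical and do not come from the assignment.

```python
# Tuples from the table above: (class label, classifier probability),
# already sorted by decreasing probability.
TUPLES = [
    ("P", 0.95), ("N", 0.85), ("P", 0.78), ("P", 0.66), ("N", 0.60),
    ("P", 0.55), ("N", 0.53), ("N", 0.52), ("N", 0.51), ("P", 0.40),
]


def roc_rows(tuples):
    """Return one (tp, fp, tn, fn, tpr, fpr) row per tuple.

    Assumption: row i uses the probability of tuple i as the threshold,
    so tuples 1..i are predicted positive and the rest negative.
    """
    total_pos = sum(1 for label, _ in tuples if label == "P")
    total_neg = len(tuples) - total_pos
    rows = []
    tp = fp = 0
    for label, _prob in tuples:
        # Lowering the threshold to this tuple adds it to the predicted positives.
        if label == "P":
            tp += 1
        else:
            fp += 1
        fn = total_pos - tp  # positives still predicted negative
        tn = total_neg - fp  # negatives still predicted negative
        rows.append((tp, fp, tn, fn, tp / total_pos, fp / total_neg))
    return rows


ROWS = roc_rows(TUPLES)
# ROC points as (FPR, TPR), starting from the origin.
ROC_POINTS = [(0.0, 0.0)] + [(row[5], row[4]) for row in ROWS]

if __name__ == "__main__":
    print("Tuple  TP  FP  TN  FN   TPR   FPR")
    for i, (tp, fp, tn, fn, tpr, fpr) in enumerate(ROWS, start=1):
        print(f"{i:>5} {tp:>3} {fp:>3} {tn:>3} {fn:>3} {tpr:>5.1f} {fpr:>5.1f}")
```

The ROC curve is the line through ROC_POINTS with FPR on the x-axis and TPR on the y-axis. Plotting is left out so that the sketch needs only the standard library; the points can be passed to any plotting tool or drawn by hand.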
6. Why is tree pruning useful in decision tree induction? What is a drawback of using a separate set of tuples to evaluate pruning?   (Module 3, Level 4, CO4)