191CSC503T - Data Mining-Cat 2-Question Bank

Question bank for the subject Data Mining

Uploaded by

harisiva062005

CONTINUOUS ASSESSMENT TEST – 2

Regulations R 2019 - V21

Department of Computer Science and Engineering

Third Year / Fifth Semester

191CSC503T - DATA MINING

CO1: To understand data mining principles and techniques and introduce DM as cutting-edge business intelligence
CO2: To study the overview of developing areas – web mining, text mining and ethical aspects of data mining
CO3: To study algorithms for finding hidden and interesting patterns in data
CO4: To understand and apply various classification and clustering techniques using tools
CO5: To identify business applications and trends of data mining

Unit – III CLASSIFICATION (2nd half)

PART A
1. Define Support vector machine. CO3 K1
2. Define back propagation. CO3 K1
3. What are K-nearest neighbor classifiers? CO3 K1
4. Differentiate lazy learners and Eager learners. CO3 K2
5. Illustrate support vector machines with an example. CO3 K2
6. How would you show your understanding of rule-based classification? CO3 K2
7. Discuss why pruning is needed in a decision tree. CO3 K2
8. Define Lazy learners with an example. CO3 K2
9. What are eager learners? CO3 K1

Q.No  Questions  CO’s  Bloom’s Level
Part – B
1. Illustrate in detail the Bayesian classification methods with an example. CO3 K3
2. Discuss constraint-based association rule mining with an example. CO3 K3
3. Outline the working principle of the support vector machine with a neat sketch. CO3 K4
4. Illustrate in detail the backpropagation classification method with an example. CO3 K3
5. Elucidate the different techniques used to improve classification accuracy. CO3 K4

Q.No  Questions  CO’s  Bloom’s Level
Part C
1. Evaluate the following dataset using the Naive Bayes classification algorithm. CO3 K5

   Sl. No.  Color  Legs  Height  Smelly  Species
   1        White  3     Short   Yes     M
   2        Green  2     Tall    No      M
   3        Green  3     Short   Yes     M
   4        White  3     Short   Yes     M
   5        Green  2     Short   No      H
   6        White  2     Tall    No      H
   7        White  2     Tall    No      H
   8        White  2     Short   Yes     H

2. Justify your answer: For a university dataset, assume the necessary features required for model evaluation and selection. CO3 K5
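
For the Naive Bayes question above, the counting involved can be sketched in a few lines of Python. This is only an illustrative sketch: the question does not give an instance to classify, so the query (Green, 2, Tall, No) is an assumption made here, and no Laplace smoothing is applied. The function name `naive_bayes_scores` is likewise hypothetical.

```python
from collections import Counter
from fractions import Fraction

# Training table from Part C, Question 1: (Color, Legs, Height, Smelly, Species)
ROWS = [
    ("White", 3, "Short", "Yes", "M"),
    ("Green", 2, "Tall", "No", "M"),
    ("Green", 3, "Short", "Yes", "M"),
    ("White", 3, "Short", "Yes", "M"),
    ("Green", 2, "Short", "No", "H"),
    ("White", 2, "Tall", "No", "H"),
    ("White", 2, "Tall", "No", "H"),
    ("White", 2, "Short", "Yes", "H"),
]


def naive_bayes_scores(rows, query):
    """Return the unnormalised score P(class) * prod_i P(feature_i | class) per class."""
    class_counts = Counter(row[-1] for row in rows)
    total = len(rows)
    scores = {}
    for label, n_label in class_counts.items():
        score = Fraction(n_label, total)  # prior P(class)
        for i, value in enumerate(query):
            # Count rows of this class whose i-th feature equals the query value
            matches = sum(1 for row in rows if row[-1] == label and row[i] == value)
            score *= Fraction(matches, n_label)  # conditional likelihood, no smoothing
        scores[label] = score
    return scores


# Hypothetical query instance (an assumption; not part of the question)
QUERY = ("Green", 2, "Tall", "No")
scores = naive_bayes_scores(ROWS, QUERY)
prediction = max(scores, key=scores.get)
print(scores, prediction)
```

Exact fractions are used so the intermediate values can be compared directly with a hand calculation. A different query only requires changing `QUERY`.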

UNIT IV : CLUSTERING TECHNIQUES

Q.No  Questions  CO’s  Bloom’s Level
Part A
1. What is cluster analysis? CO4 K1
2. Define clustering. CO4 K1
3. How is the quality of a cluster represented? CO4 K2
4. Define K-means partitioning. CO4 K1
5. List the major clustering methods. CO4 K2
6. Define outlier. How will you determine outliers in the data? CO4 K1
7. Discuss the challenges of outlier detection. CO4 K2
8. Explain the typical phases of outlier detection methods. CO4 K2
9. Distinguish between Classification and clustering. CO4 K2
10. Give the methods of clustering high dimensional data. CO4 K2
11. How is the goodness of clusters measured? CO4 K2
12. Classify hierarchical clustering methods. CO4 K2
13. Define grid-based method in clustering. CO4 K1
14. What are the applications of cluster analysis? CO4 K1
15. What is the concept of partitioning methods? CO4 K1
16. Define hierarchical method in clustering. CO4 K1
17. Define density-based method in clustering. CO4 K1
18. What are the types of outliers? CO4 K1
19. Mention the applications of outlier detection. CO4 K2
20. What is outlier analysis? CO4 K1
21. Given two objects represented by the tuples (22, 1, 42, 10) and (20, 0, 36, 8): CO4 K2
    a) Compute the Euclidean distance.
    b) Compute the Manhattan distance.
    c) Compute the Minkowski distance, q = 3.
22. Given 5-dimensional numeric samples A = (1, 0, 2, 5, 3) and B = (2, 1, 0, 3, -1), find the Euclidean distance between the points. CO4 K2
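
The distance computations in Questions 21 and 22 all follow from the Minkowski formula, d(a, b) = (sum_i |a_i - b_i|^q)^(1/q), with q = 1 giving the Manhattan distance and q = 2 the Euclidean distance. A minimal sketch for checking hand calculations (the function and variable names are illustrative, not part of the question bank):

```python
def minkowski(a, b, q):
    """Minkowski distance of order q; q=1 is Manhattan, q=2 is Euclidean."""
    return sum(abs(x - y) ** q for x, y in zip(a, b)) ** (1.0 / q)


# Question 21: coordinate differences are (2, 1, 6, 2)
P = (22, 1, 42, 10)
Q = (20, 0, 36, 8)
euclidean = minkowski(P, Q, 2)    # square root of 4 + 1 + 36 + 4 = 45
manhattan = minkowski(P, Q, 1)    # 2 + 1 + 6 + 2 = 11
minkowski3 = minkowski(P, Q, 3)   # cube root of 8 + 1 + 216 + 8 = 233

# Question 22: coordinate differences are (-1, -1, 2, 2, 4)
A = (1, 0, 2, 5, 3)
B = (2, 1, 0, 3, -1)
euclidean_ab = minkowski(A, B, 2)  # square root of 1 + 1 + 4 + 4 + 16 = 26

print(euclidean, manhattan, minkowski3, euclidean_ab)
```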

Q.No  Questions  CO’s  Bloom’s Level
Part – B
1. Consider that the data mining task is to cluster the following eight points A1, A2, A3, B1, B2, B3, C1 and C2 (with (X, Y) representing location) into three clusters: A1(2,10), A2(2,5), A3(8,4), B1(5,8), B2(7,5), B3(6,4), C1(1,2), C2(4,9). The distance function is Euclidean distance. Suppose initially we assign A1, B1 and C1 as the center of each cluster, respectively. Use the K-means algorithm to show the three cluster centers after the first round of execution and the final three clusters. CO4 K3
2. Use the K-medoid algorithm to determine clusters for the following points with k = 2. CO4 K3

   Point  X  Y
   P1     2  6
   P2     3  4
   P3     3  8
   P4     4  7
   P5     6  2
   P6     6  4
   P7     7  3
   P8     7  4
   P9     8  5
   P10    7  6
3. Outline the steps involved in the DBSCAN algorithm. Determine the core, border, and noise points from the following data using DBSCAN, with minpts = 4 and eps = 1.9. CO4 K4

   Point  X  Y
   P1     2  10
   P2     2  5
   P3     8  4
   P4     5  8
   P5     7  5
   P6     6  4
   P7     1  2
   P8     4  9

4. Discuss the requirements of clustering in data mining. CO4 K3

5. Consider four points (X1, X2, X3, X4) with the following coordinates as two-dimensional samples for clustering: CO4 K3

   X1 = (1, 0), X2 = (0, 1), X3 = (2, 1), X4 = (3, 3)

   Initial clusters: C1 = (X1, X3), C2 = (X2, X4)

   a) Apply one iteration of the K-means partition clustering algorithm.
   b) What is the change in the total square error?
   c) Apply the second iteration of the K-means algorithm.
6. Analyze the different clustering techniques used in data mining. CO4 K4
7. Give an insight into the various outlier detection methods used in data mining. CO4 K3
8. Analyze the various constraints while clustering high dimensional data. CO4 K4
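
The K-means rounds asked for in Question 1 of this part (points A1 to C2, initial centers A1, B1 and C1) can be traced with a small pure-Python sketch. The helper names `assign`, `update` and `kmeans` are illustrative choices, and the sketch assumes no cluster ever becomes empty, which holds for this data.

```python
import math

# Points from Part B, Question 1
POINTS = {
    "A1": (2, 10), "A2": (2, 5), "A3": (8, 4),
    "B1": (5, 8), "B2": (7, 5), "B3": (6, 4),
    "C1": (1, 2), "C2": (4, 9),
}


def assign(points, centers):
    """Assign each point to the index of its nearest center (Euclidean distance)."""
    return {
        name: min(range(len(centers)), key=lambda k: math.dist(p, centers[k]))
        for name, p in points.items()
    }


def update(points, labels, k):
    """Recompute each center as the mean of the points assigned to it."""
    centers = []
    for c in range(k):
        members = [points[n] for n, lab in labels.items() if lab == c]
        centers.append(tuple(sum(v) / len(members) for v in zip(*members)))
    return centers


def kmeans(points, centers, max_rounds=100):
    """Repeat assign/update until assignments stop changing.

    Returns the list of centers after each round and the final assignment.
    """
    history, labels = [], None
    for _ in range(max_rounds):
        new_labels = assign(points, centers)
        if new_labels == labels:
            break
        labels = new_labels
        centers = update(points, labels, len(centers))
        history.append(centers)
    return history, labels


history, labels = kmeans(POINTS, [POINTS["A1"], POINTS["B1"], POINTS["C1"]])
print("centers after round 1:", history[0])
print("final centers:", history[-1])
print("final assignment:", labels)
```

`history[0]` holds the centers after the first round and `history[-1]` the centers at convergence. The same helpers can be called with other initial centers, or `update` can be called first when a question supplies an initial partition instead of initial centers.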

Q.No  Questions  CO’s  Bloom’s Level
Part C
1. Cluster the following eight points (with (x, y) representing locations) into three clusters: (1, 2), (2, 5), (2, 10), (4, 9), (5, 8), (6, 4), (7, 5), (8, 4). Initial cluster centers are: (8, 4), (5, 8), (1, 2). Use the K-means algorithm to find the three cluster centers till the second iteration. CO4 K5
2. Outline the steps involved in the DBSCAN algorithm. Determine the core, border, and noise points from the following data using DBSCAN, with minpts = 4 and eps = 1.9. CO4 K4

   Point  X  Y
   P1     3  7
   P2     4  6
   P3     5  5
   P4     6  4
   P5     7  3
   P6     6  2
   P7     7  2
   P8     8  4
   P9     3  3
   P10    2  6
   P11    3  5
   P12    2  4
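
The core, border, and noise labelling asked for in the DBSCAN question above can be sketched as follows. Two assumptions are made here: a point counts as its own neighbour when comparing against minpts (the usual convention, but check the one used in class), and distance is Euclidean. The function name `classify` is illustrative. The sketch only labels points and does not build the clusters themselves.

```python
import math

# Points from Part C, Question 2
POINTS = {
    "P1": (3, 7), "P2": (4, 6), "P3": (5, 5), "P4": (6, 4),
    "P5": (7, 3), "P6": (6, 2), "P7": (7, 2), "P8": (8, 4),
    "P9": (3, 3), "P10": (2, 6), "P11": (3, 5), "P12": (2, 4),
}
EPS, MIN_PTS = 1.9, 4


def classify(points, eps, min_pts):
    """Label every point as core, border, or noise under DBSCAN's definitions."""
    # eps-neighbourhood of each point (includes the point itself: assumption)
    neighbours = {
        a: {b for b, q in points.items() if math.dist(p, q) <= eps}
        for a, p in points.items()
    }
    # Core: at least min_pts points in the eps-neighbourhood
    core = {a for a, nb in neighbours.items() if len(nb) >= min_pts}
    # Border: not core, but within eps of at least one core point
    border = {a for a, nb in neighbours.items() if a not in core and nb & core}
    # Noise: everything else
    noise = set(points) - core - border
    return core, border, noise


core, border, noise = classify(POINTS, EPS, MIN_PTS)
print("core:", sorted(core))
print("border:", sorted(border))
print("noise:", sorted(noise))
```

The same function can be applied to any other point set by passing a different dictionary with the same eps and minpts.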

UNIT V : WEKA TOOL

Q.No  Questions  CO’s  Bloom’s Level
Part A
1. Why is data preprocessing needed? Name any four preprocessing filters used in the WEKA tool. CO5 K2
2. What are the foundations of data mining? CO5 K1
3. Name some specific application-oriented databases. CO5 K2
4. Explain how data mining is used in health care analysis. CO5 K1
5. Explain data mining applications for biomedical and DNA data analysis. CO5 K1
6. Differentiate between data mining and data warehousing. CO5 K2
7. What are the applications of data mining? CO5 K1
8. List out the various data mining tools. CO5 K2
9. What is a dataset? Give an example. CO5 K1
10. What is an association-rule learner? CO5 K1
11. Draw the layout of the Weka tool. CO5 K1
12. List out the limitations of the Weka tool. CO5 K2
13. Write down the functionalities of the Weka tool. CO5 K1
14. What is auto import? Give an example. CO5 K1
15. List out various data warehouse tools. CO5 K2
Q.No  Questions  CO’s  Bloom’s Level
Part B
1. Discuss in detail the WEKA tool and its functionalities. CO5 K3
2. Outline the features involved in the Iris plant database in detail. CO5 K4
3. Outline the features involved in the breast cancer database in detail. CO5 K3
4. Give a detailed note on the association rule learner. CO5 K3
5. Evaluate the performance measures of the different classification algorithms for the Iris plant dataset using the WEKA tool. CO5 K4
6. Evaluate the performance measures of the different clustering algorithms for the breast cancer dataset using the WEKA tool. CO5 K4
7. Evaluate the performance measures of the different clustering algorithms for the Iris plant dataset using the WEKA tool. CO5 K4
8. Evaluate the performance measures of the different classification algorithms for the breast cancer dataset using the WEKA tool. CO5 K4

Q.No  Questions  CO’s  Bloom’s Level
Part C
1. Illustrate the steps involved in loading and classifying the Iris plant database in the WEKA tool. CO5 K5
2. Elucidate the steps involved in loading and classifying the breast cancer database in the WEKA tool. CO5 K5
