
Question Bank

Year & Semester: III & V


Subject Code & Subject Name: U14CS518 & Data Warehousing and Data Mining
Unit – IV
PART – A
1. What is true of the multidimensional model?
a. It typically requires less disk storage
b. It typically requires more disk storage
c. Typical business queries requiring aggregate functions take more time
d. Increasing the size of a dimension is difficult
2. Learning is,

a. The process of finding the right formal representation of a certain body of knowledge
in order to represent it in a knowledge-based system
b. It automatically maps an external signal space into a system's internal
representational space. They are useful in the performance of classification tasks.
c. A process where an individual learns how to carry out a certain task when making a
transition from a situation in which the task cannot be carried out to a situation in
which the same task under the same circumstances can be carried out.
d. None of these
3. The Apriori property means
a. If a set cannot pass a test, all of its supersets will fail the same test as well
b. To improve the efficiency of the level-wise generation of frequent itemsets
c. If a set can pass a test, all of its supersets will fail the same test as well
d. To decrease the efficiency of the level-wise generation of frequent itemsets
4. Which is the technique used for classification in data mining?
a. Descriptive pattern c. Decision tree classifiers
b. Associations d. Regression
5. Which algorithm is used to build a decision tree classifier from a given set of training
instances?
a. Greedy algorithm c. ETL algorithm
b. Bayes algorithm d. None of the above
6. ____________ deals with the prediction of a value rather than a class.
a. Regression c. Recall
b. Precision d. Multiway splits
7. Which is a type of classifier that has been found to give very accurate classification
across a range of applications?

a. Binary split c. Overfitting


b. Multiway split d. Support vector machine
8. Which of the following is/are mined in frequent pattern analysis?

a. set of items b. subsequences

c. substructures d. All of the above

9. Which of the following techniques does not improve the efficiency of the Apriori algorithm?

a. Transaction reduction b. Partitioning

c. Hash-based itemset counting d. FP growth

10. Which of the following is an example for frequent itemset mining?

a. Market basket analysis c. Clustering


b. Cross-marketing d. All of the above
11. Mining frequent itemsets without candidate generation is called _______________.
a. Market basket analysis c. Frequent pattern growth
b. Apriori algorithm d. None of the above
12. In a data cube, the base cuboid is called as,
a. Dimension cuboids c. Apex cuboids
b. Dimension cuboids d. 3-Dimension cuboids
13. Prediction is,
a. The result of the application of a theory or a rule in a specific case
b. One of several possible enters within a database table that is chosen by the designer as
the primary means of accessing the data in the table.
c. Discipline in statistics that studies ways to find the most interesting projections of multi-
dimensional spaces.
d. None of these
14. Frequent pattern growth adopts a divide-and-conquer strategy; it consists of,
a. Conditional databases & Frequent-pattern tree
b. Frequent-pattern tree & Conditional databases
c. FP-tree d. Set of conditional databases
15. Which type of data format is adopted by Apriori and FP-growth for frequent pattern mining?
a. Vertical data format c. Both (a) & (b)
b. Horizontal data format d. None of the above
16. A single-dimensional association rule is also called as _________________
a. Intradimensional association rule
b. Interdimensional association rules
c. Hybrid-dimensional association rules
d. None of the above
17. What is ARCS?
a. Association Regression Classification System
b. Association Rule Classification System
c. Association Rule Clustering System
d. Algorithm for Rule Classification System
18. When an itemset S satisfies the constraint, it is called ____________
a. Monotonic c. Succinctness
b. Anti-monotonic d. Optimization
19. The data classification process consists of,
a. Learning & Classification c. Supervised & classification
b. Unsupervised & clustering d. Learning & Clustering
20. What are the two components of a belief network,
a. Probabilistic networks & probability tables
b. Bayesian networks & Directed acyclic graph
c. Directed acyclic graph & set of conditional probability tables
d. None of the above
21. ________________ is the process of finding a model that describes and distinguishes data
classes or concepts.
a. Data Characterization c. Data discrimination
b. Data Classification d. Data selection
22. What classifiers are normally considered to be easy to interpret?
a. SVM c. Decision trees
b. Linear Regression d. k-Nearest Neighbor
23. Disjoint training and test datasets are required to estimate the classification
performance on . . .
a. The training dataset c. The entire population
b. The test dataset d. None of The Above
24. The confidence of the estimate of classification performance increases with . . .
a. increasing training dataset size
b. decreasing training dataset size
c. increasing test dataset size
d. decreasing test dataset size
25. A common weakness of association rule mining is that . . .
a. it is too inefficient

b. it produces too many rules
c. it does not produce enough interesting rules
d. it produces too many frequent itemsets
26. Which of the following are interestingness measures for association rules?
a. accuracy c. compactness
b. recall d. lift
27. The rule age(X, "youth") AND income(X, "low") -> class(X, "B") is an example of?

a. Decision tree c. Neural network


b. If-then d. All of the above
28. If confidence(A => B) = P(B|A), what is the confidence equation?
a. Support_count(A∪B) / Support_count(A) b. Support_count(A∩B) / Support_count(B)
c. Support_count(A∪B) / Support_count(B) d. Support_count(A) / Support_count(A∪B)
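The confidence equation in question 28 can be evaluated with a tiny sketch (the function name and the counts below are illustrative, not from the question bank):

```python
def confidence(support_count_a_union_b, support_count_a):
    """Confidence of rule A => B: support_count(A U B) / support_count(A)."""
    return support_count_a_union_b / support_count_a

# Example: A U B occurs in 2 transactions, A occurs in 4 -> confidence 0.5.
print(confidence(2, 4))  # 0.5
```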

29. If the number of transactions is five and the minimum support threshold is 60%, then
what is the minimum support count?

a. 2 c. 4
b. 3 d. 1
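The arithmetic behind question 29 can be sketched directly (`min_support_count` is an illustrative helper, not a standard API):

```python
import math

def min_support_count(num_transactions, relative_threshold):
    """Absolute support count implied by a relative support threshold."""
    return math.ceil(num_transactions * relative_threshold)

# Five transactions at a 60% threshold require a count of at least 3.
print(min_support_count(5, 0.60))  # 3
```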

30. If the resulting value of the lift equation, lift(A, B) = P(A∪B) / (P(A)P(B)), is less
than 1, then what is the occurrence of A & B?
a. Independent and there is no correlation
b. Negatively correlated c. Positively correlated
d. Dependent and there is no correlation
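The lift measure, lift(A, B) = P(A∪B) / (P(A) P(B)), is the standard correlation test behind question 30; a minimal sketch with made-up probabilities:

```python
def lift(p_a_union_b, p_a, p_b):
    """lift(A, B) = P(A U B) / (P(A) * P(B))."""
    return p_a_union_b / (p_a * p_b)

# lift < 1: negatively correlated; lift = 1: independent; lift > 1: positively correlated.
print(lift(0.10, 0.50, 0.50))  # 0.4 -> A and B are negatively correlated
```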

31. What is the hash bucket address if the values of two frequent itemset (x & y) are 2 and 3?
a. 4 b. 2 c. 0 d. 5

32. When an itemset S satisfies the constraint, so does any of its supersets; sum(S.Price) ≥ v is?

a. Anti Monotonicity c. Monotonicity


b. Succinctness d. Convertible
33. How do you find the midpoint between each pair of adjacent values ai and ai+1, if it is
considered as a possible split point?
a. (ai+ai+1)/2 c. (ai+ai+2)/4
b. (ai+ai-1)/2 d. (ai+ai+2)/4
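The midpoint rule in question 33 yields one candidate split point per pair of adjacent sorted values; a short sketch (the function name is illustrative):

```python
def candidate_split_points(values):
    """Midpoints (a_i + a_{i+1}) / 2 between adjacent sorted values of a
    continuous attribute, each a possible decision-tree split point."""
    vals = sorted(values)
    return [(vals[i] + vals[i + 1]) / 2 for i in range(len(vals) - 1)]

print(candidate_split_points([30, 40, 50]))  # [35.0, 45.0]
```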

34. IF age = youth AND student = yes THEN buys computer = yes; in this rule, what are the IF
part and THEN part?

a. Rule consequent & Rule condition


b. Rule precondition & Rule antecedent
c. Rule consequent & Rule antecedent
d. Rule antecedent & Rule consequent
35. How do you compute the error propagated backward for a network unit's prediction?
a. Errj = Oj (1 − Oj)(Tj − Oj) b. Δwij = (l) Errj Oi
c. wij = wij + Δwij d. Δθj = (l) Errj
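The error term for an output unit in option (a), Errj = Oj(1 − Oj)(Tj − Oj), can be evaluated directly (sigmoid output units assumed; the values are illustrative):

```python
def output_unit_error(o_j, t_j):
    """Backpropagation error of output unit j with sigmoid activation:
    Err_j = O_j * (1 - O_j) * (T_j - O_j)."""
    return o_j * (1 - o_j) * (t_j - o_j)

# Actual output 0.8 against target 1.0: 0.8 * 0.2 * 0.2 = 0.032.
print(output_unit_error(0.8, 1.0))
```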

36. The rule is IF income = high THEN loan decision = accept. If we add the attribute test
credit rating = excellent to the rule, what is the current rule?
a. IF income = high AND loan decision = accept THEN credit rating = excellent
b. IF loan decision = accept AND credit rating = excellent THEN income = high
c. IF income = high AND credit rating = excellent THEN loan decision = accept
d. None of the above
37. What does Apriori algorithm do?
a. It mines all frequent patterns through pruning rules with lesser support
b. It mines all frequent patterns through pruning rules with higher support
c. Both a and b
d. None of the above
38. What does FP growth algorithm do?
a. It mines all frequent patterns through pruning rules with lesser support
b. It mines all frequent patterns through pruning rules with higher support
c. It mines all frequent patterns by constructing a FP tree
d. All of the above
39. What techniques can be used to improve the efficiency of the Apriori algorithm?
a. Hash-based techniques
b. Transaction reduction
c. Support(A∪B) / Support(A)
d. Support(A∪B) / Support(B)
40. Which of the following is direct application of frequent itemset mining?
a. Social Network Analysis
b. Market Basket Analysis
c. Outlier Detection
d. Intrusion Detection
41. What is not true about FP growth algorithms?
a. It mines frequent itemsets without candidate generation.
b. There are chances that FP trees may not fit in the memory
c. FP trees are very expensive to build
d. It expands the original database to build FP trees.
42. When do you consider an association rule interesting?
a. If it only satisfies min_support
b. If it only satisfies min_confidence
c. If it satisfies both min_support and min_confidence
d. There are other measures to check as well
43. What is the difference between absolute and relative support?
a. Absolute - Minimum support count threshold and Relative - Minimum support threshold
b. Absolute - Minimum support threshold and Relative - Minimum support count threshold
c. Both mean the same
44. What is the relation between candidate and frequent itemsets?
a. A candidate itemset is always a frequent itemset
b. A frequent itemset must be a candidate itemset
c. No relation between the two
d. Both are same
45. Which technique finds the frequent itemsets in just two database scans?
a. Partitioning
b. Sampling
c. Hashing
d. Dynamic itemset counting
46. Which of the following is true?
a. Both Apriori and FP-Growth use horizontal data format
b. Both Apriori and FP-Growth use vertical data format
c. Apriori uses horizontal and FP-Growth uses vertical data format
d. Apriori uses vertical and FP-Growth uses horizontal data format
47. What is the principle on which the Apriori algorithm works?
a. If a rule is infrequent, its specialized rules are also infrequent
b. If a rule is infrequent, its generalized rules are also infrequent
c. Both a and b
d. None of the above
48. Which of these is not a frequent pattern mining algorithm?
a. Apriori
b. FP growth
c. Decision trees
d. Eclat
49. Which algorithm requires fewer scans of data?
a. Apriori
b. FP growth
c. Both a and b
d. None of the above
50. What are closed itemsets?
a. An itemset for which at least one proper super-itemset has the same support
b. An itemset whose no proper super-itemset has the same support
c. An itemset for which at least one super-itemset has the same confidence
d. An itemset whose no proper super-itemset has the same confidence
51. What are closed frequent itemsets?
a. A closed itemset
b. A frequent itemset
c. An itemset which is both closed and frequent

d. None of the above
52. What are maximal frequent itemsets?
a. A frequent itemset whose no super-itemset is frequent
b. A frequent itemset whose super-itemset is also frequent
c. A non-frequent itemset whose super-itemset is frequent
d. None of the above
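The definitions in questions 50-52 can be made concrete with a brute-force sketch over an assumed toy dataset (not from the question bank):

```python
from itertools import combinations

# Assumed toy dataset: supports are a:3, b:3, c:2, ab:3, ac:2, bc:2, abc:2.
transactions = [{"a", "b", "c"}, {"a", "b", "c"}, {"a", "b"}]
min_count = 2

items = sorted(set().union(*transactions))
support = {}
for k in range(1, len(items) + 1):
    for cand in combinations(items, k):
        s = frozenset(cand)
        count = sum(s <= t for t in transactions)
        if count >= min_count:
            support[s] = count

# Closed frequent itemset: no proper super-itemset has the same support.
closed = {s for s in support
          if not any(s < t and support[t] == support[s] for t in support)}
# Maximal frequent itemset: no proper super-itemset is frequent at all.
maximal = {s for s in support if not any(s < t for t in support)}

print(sorted(map(sorted, closed)))   # [['a', 'b'], ['a', 'b', 'c']]
print(sorted(map(sorted, maximal)))  # [['a', 'b', 'c']]
```

Note that every maximal frequent itemset is closed, but not vice versa: {a, b} is closed (its only super-itemset {a, b, c} has lower support) yet not maximal.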
53. Why is correlation analysis important?
a. To make apriori memory efficient
b. To weed out uninteresting frequent itemsets
c. To find large number of interesting itemsets
d. To restrict the number of database iterations
54. For questions given below consider the data
Transactions :
1. I1, I2, I3, I4, I5, I6
2. I7, I2, I3, I4, I5, I6
3. I1, I8, I4, I5
4. I1, I9, I10, I4, I6
5. I10, I2, I4, I11, I5
With support as 0.6 find all frequent itemsets?
a. <I1>, <I2>, <I4>, <I5>, <I6>, <I1, I4>, <I2, I4>, <I2, I5>, <I4, I5>, <I4, I6>, <I2, I4,
I5>
b. <I2>, <I4>, <I5>, <I2, I4>, <I2, I5>, <I4, I5>, <I2, I4, I5>
c. <I11>, <I4>, <I5>, <I6>, <I1, I4>, <I5, I4>, <I11, I5>, <I4, I6>, <I2, I4, I5>
d. None of above
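The frequent itemsets in question 54 can be verified by brute-force counting (a sketch, not an efficient miner; it uses the five transactions given above):

```python
from itertools import combinations

transactions = [
    {"I1", "I2", "I3", "I4", "I5", "I6"},
    {"I7", "I2", "I3", "I4", "I5", "I6"},
    {"I1", "I8", "I4", "I5"},
    {"I1", "I9", "I10", "I4", "I6"},
    {"I10", "I2", "I4", "I11", "I5"},
]
min_count = 0.6 * len(transactions)  # relative support 0.6 -> count of 3

items = sorted(set().union(*transactions))
frequent = []
for k in range(1, len(items) + 1):
    level = [set(c) for c in combinations(items, k)
             if sum(set(c) <= t for t in transactions) >= min_count]
    if not level:
        break  # by the Apriori property, no larger itemset can be frequent
    frequent.extend(level)

# 11 frequent itemsets: I1, I2, I4, I5, I6, their five frequent pairs,
# and the triple {I2, I4, I5} -- matching option (a).
print(len(frequent))  # 11
```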
55. What will happen if support is reduced?
a. Number of frequent itemsets remains same
b. Some itemsets will add to the current set of frequent itemsets
c. Some itemsets will become infrequent while others will become frequent
d. Can not say
56. Find all strong association rules given the support is 0.6 and confidence is 0.8.
a. <I2, I4> → I5, <I2, I5> → I4, <I5, I4> → I2
b. <I2, I4> → I5, <I2, I5> → I4
c. Null rule set
d. Cannot be determined
57. What is the effect of reducing min confidence criteria on the same?
a. Number of association rules remains same
b. Some association rules will add to the current set of association rules
c. Some association rules will become invalid while others might become a rule.
d. Can not say
58. Can FP growth algorithm be used if FP tree cannot be fit in memory?

a. Yes
b. No
c. Both a and b
d. None of the above
59. What is association rule mining?
a. Same as frequent itemset mining
b. Finding of strong association rules using frequent itemsets
c. Using association to analyse correlation rules
d. None of the above
60. What is frequent pattern growth?
a. Same as frequent itemset mining
b. Use of hashing to make discovery of frequent itemsets more efficient
c. Mining of frequent itemsets without candidate generation
d. None of the above
61. When is sub-itemset pruning done?
a. A frequent itemset 'P' is a proper subset of another frequent itemset 'Q'
b. Support (P) = Support(Q)
c. When both a and b is true
d. When a is true and b is not
62. Which of the following is not a null-invariant measure (i.e., a measure whose value is
unaffected by null transactions)?
a. all_confidence
b. max_confidence
c. cosine measure
d. lift
63. The Apriori algorithm works in a ________ and ________ fashion?
a. top-down and depth-first
b. top-down and breadth-first
c. bottom-up and depth-first
d. bottom-up and breadth-first
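The level-wise behavior asked about in question 63 can be sketched as a minimal Apriori loop (the prune step is omitted and the transactions are assumed toy data):

```python
from itertools import combinations

def apriori(transactions, min_count):
    """Level-wise Apriori sketch: frequent k-itemsets are joined to form
    candidate (k + 1)-itemsets, one breadth-first level at a time."""
    items = sorted(set().union(*transactions))
    level = [frozenset([i]) for i in items]
    frequent = []
    k = 1
    while level:
        freq_k = [s for s in level
                  if sum(s <= t for t in transactions) >= min_count]
        frequent.extend(freq_k)
        # Join step: unions of frequent k-itemsets that have size k + 1.
        level = list({a | b for a, b in combinations(freq_k, 2)
                      if len(a | b) == k + 1})
        k += 1
    return frequent

txns = [{"a", "b"}, {"a", "b", "c"}, {"a", "c"}]
print(sorted(map(sorted, apriori(txns, 2))))
# [['a'], ['a', 'b'], ['a', 'c'], ['b'], ['c']]
```

Each pass over the candidate `level` is one database scan, growing itemsets from singletons upward, which is why the algorithm is described as bottom-up and breadth-first.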
