Data Mining Algorithms MCQs
Data Mining Algorithms MCQs
Questions
1. 1. What is the main purpose of association rules in data mining?
✅ Answer: C
✅ Answer: D
✅ Answer: C
4. 4. What is the measure called that shows how frequently items appear together in the
dataset?
A. Confidence
B. Support
C. Lift
D. Strength
✅ Answer: B
5. 5. A rule appears frequently in the dataset but rarely holds true when applied. What
does this indicate?
A. High support, high confidence
B. Low support, low confidence
C. High support, low confidence
D. Low support, high confidence
✅ Answer: C
A. Negative correlation
B. Positive correlation
C. No correlation
D. High support
✅ Answer: C
7. 7. Which algorithm generates candidate itemsets by joining large itemsets from the
previous pass with themselves?
A. AIS
B. SETM
C. Apriori
D. K-means
✅ Answer: C
A. AIS
B. Apriori
C. Decision Tree
D. SETM
✅ Answer: D
9. 9. Why is the Apriori algorithm more efficient than AIS and SETM?
✅ Answer: B
10. 10. In which business application is market basket analysis commonly used?
A. Employee training
B. Product recommendation
C. Supply chain management
D. Retail sales
✅ Answer: D
11. 11. In the diaper and beer example, what percentage of transactions included both
items?
A. 2.75%
B. 2%
C. 1.75%
D. 0.5%
✅ Answer: C
12. 12. What does a high confidence and low support in an association rule imply?
✅ Answer: B
13. 13. Which of the following is the correct form of an association rule?
✅ Answer: A
✅ Answer: C
✅ Answer: C
A. A numerical estimate
B. A set of decision rules
C. A category label
D. A list of all itemsets
✅ Answer: C
17. 17. What is the major difference between classification and prediction?
✅ Answer: C
A. Linear Regression
B. DBSCAN
C. K-Means
D. Naive Bayes
✅ Answer: D
19. 19. In data preparation, which method helps scale values into a small specified range?
A. Generalization
B. Noise Reduction
C. Normalization
D. Aggregation
✅ Answer: C
20. 20. Which algorithm is best suited for predicting continuous values?
A. Logistic Regression
B. K-Nearest Neighbors
C. Linear Regression
D. Decision Tree Classification
✅ Answer: C
A. Storage
B. Forecasting
C. Sharing
D. Publication
✅ Answer: B
22. 22. Which of the following algorithms is used in both classification and prediction?
A. Logistic Regression
B. Apriori
C. DBSCAN
D. K-Means
✅ Answer: A
23. 23. What does a decision tree output when used for classification?
✅ Answer: D
24. 24. What technique is used to identify whether two attributes are related?
A. Clustering
B. Correlation analysis
C. Filtering
D. Regression
✅ Answer: B
✅ Answer: C
26. 26. What fundamental principle does the Naive Bayes algorithm rely on?
A. Decision boundaries
B. Baye’s theorem
C. Distance measures
D. Gaussian distribution
✅ Answer: B
A. Supervised
B. Reinforcement
C. Unsupervised
D. Semi-supervised
✅ Answer: C
28. 28. In a shopping mall, grouping similar items together like t-shirts or vegetables is an
example of:
A. Supervised learning
B. Data labeling
C. Clustering
D. Classification
✅ Answer: C
A. Data encryption
B. Market segmentation
C. Linear regression
D. Data normalization
✅ Answer: B
30. 30. Which clustering method does NOT require the number of clusters to be specified in
advance?
A. K-Means
B. Partitioning Clustering
C. Hierarchical Clustering
D. Fuzzy Clustering
✅ Answer: C
31. 31. Which clustering method allows data points to belong to more than one cluster?
A. Hard clustering
B. K-Means
C. Agglomerative Hierarchical
D. Fuzzy Clustering
✅ Answer: D
A. DBSCAN
B. K-Means
C. Naive Bayes
D. Decision Tree
✅ Answer: A
✅ Answer: C
34. 34. What type of model does the Expectation-Maximization (EM) algorithm use in
clustering?
A. Centroid-based
B. Rule-based
C. Distribution-based
D. Tree-based
✅ Answer: C
A. Density-based
B. Distribution-based
C. Partitioning
D. Hierarchical
✅ Answer: C
A. K-Means
B. DBSCAN
C. Agglomerative Hierarchical
D. Expectation-Maximization
✅ Answer: C
37. 37. Which algorithm avoids the need to define the number of clusters and works by
message passing?
A. K-Means
B. Affinity Propagation
C. DBSCAN
D. Mean-shift
✅ Answer: B
✅ Answer: B
✅ Answer: B