Data Mining Questions

Data Mining Important Questions

All the best. Do well.

The questions in bold are very important for clearing the subject. To get a good score,
practice more problems as well.

Units 1, 3, and 4 are problem-based; Units 2 and 5 are entirely theory. Clustering is also
theory, except for k-means and PAM.

Unit-1

1 Define data mining.
2 List out the KDD process steps.
3 What are the types of data?
4 Compare descriptive and predictive data mining.
5 What is classification?
6 What is prediction?
7 Why do we need to pre-process the data?
8 List out the data pre-processing steps.
9 What is data cleaning?
10 What is data integration?
11 Illustrate data transformation functions.
12 List out the major issues in data mining.
13 What is data selection?
15 Define outlier analysis.
16 Define clustering analysis.
17 Define evolution analysis.
18 What is data redundancy?
19 Define data discretization.
20 What is a categorical attribute?

Long Questions

1 Identify the need for Data Mining.
2 Show, with a diagrammatic illustration, the steps involved in the process of
  Knowledge Discovery from Data. (10M)
3 (a) Classify the different types of data on which mining can be performed. (5M)
  (b) Illustrate the architecture of a typical data mining system. (5M)
4 Explain various Data Mining Functionalities with an example. (10M)
5 (a) Illustrate with a diagram the Data Mining Task Primitives.
  (b) Discuss the major issues in Data Mining.
6 What is Data Cleaning? Describe various methods of Data Cleaning.
7 Suppose that the data for analysis includes the attribute age. The age
  values for the data tuples are (in increasing order):
  13, 15, 16, 16, 19, 20, 23, 29, 35, 41, 44, 53, 62, 69, 72
  (i) Use min-max normalization to transform the value 45 for age
      onto the range [0,1].
  (ii) Use z-score normalization to transform the value 45 for age,
      where the standard deviation of age is 20.64 years.
8 (a) List the issues to be considered during Data Integration. (4M)
  (b) Discuss detecting data redundancy using correlation analysis. (6M)
9 Explain the Data Transformation method with a suitable example.
10 Explain the different Data Reduction techniques.
11 Explain Data Preprocessing techniques. (10M)
12 Similarity & dissimilarity problems
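
For practice on question 7, the two normalizations can be checked with a short script. This is only a sketch: the function names are made up for illustration, the mean is assumed to be computed from the 15 listed age values, and the standard deviation is taken as given in the question (20.64) rather than recomputed.

```python
# Age values exactly as listed in question 7.
ages = [13, 15, 16, 16, 19, 20, 23, 29, 35, 41, 44, 53, 62, 69, 72]

# Standard deviation as stated in the question (assumed, not recomputed).
GIVEN_STD = 20.64


def min_max_normalize(value, data, new_min=0.0, new_max=1.0):
    """Map value from [min(data), max(data)] onto [new_min, new_max]."""
    lo, hi = min(data), max(data)
    return (value - lo) / (hi - lo) * (new_max - new_min) + new_min


def z_score_normalize(value, data, std):
    """Return (value - mean) / std, with the mean taken from data."""
    mean = sum(data) / len(data)
    return (value - mean) / std


min_max_45 = min_max_normalize(45, ages)
z_score_45 = z_score_normalize(45, ages, GIVEN_STD)

print(f"min-max(45) = {min_max_45:.3f}")
print(f"z-score(45) = {z_score_45:.3f}")
```

In the exam, write out the formulas and the substituted values (min, max, mean) by hand; the script is only for verifying the arithmetic.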

Unit-2

1. Difference between database and data warehouse

2. Define Data Warehouse

3. Define ETL, Data mart

4. Briefly explain the multidimensional model

5. Define data extraction, data cleaning, data integration, data aggregation

6. Difference between ER Model and Dimensional Model

7. Define Surrogate key

8. Difference between fact table and dimension table

9. Dimensional modeling: additive measures

Long Questions

1. Architecture of Data warehouse

2. Data warehouse and its characteristics

3. Data warehouse components (1 and 3 are the same; they can be asked either way)

4. Schema Design in Dimensional Modeling

5. Dimensional table characteristics

6. Explain OLAP

Unit-3

1 What is meant by an association rule?
2 What is meant by Market basket analysis?
3 State and explain the Apriori property.
4 What is meant by Mining Multilevel Association Rules?
5 Define Uniform Minimum Support.
6 What is meant by Reduced Minimum Support?
7 What is meant by multidimensional association rules?
8 What is meant by an intradimensional association rule?
9 What is meant by interdimensional association rules?
10 What is meant by Quantitative association rules?
11 What is meant by Partition Algorithms?
12 State and explain the FP-growth algorithm.
13 What is meant by a Frequent Item Set?
14 What is meant by a Maximal Frequent Item Set?
15 What is meant by a Closed Frequent Item Set?
16 Explain the join and prune steps in the Apriori algorithm.
17 Draw and explain the conditional FP-tree.
18 How will you measure support and confidence? Give an example.
19 How can the efficiency of the Apriori algorithm be improved?
20 What is meant by a conditional pattern base?

Long Questions

1 Explain the various kinds of Association rule Mining.
2 Discuss the FP-growth algorithm for the following example:
  {M,O,N,K,E,Y} {D,O,N,K,E,Y} {M,A,K,E} {M,U,C,K,Y} {C,O,O,K,I,E},
  Support = 60%, Confidence = 80%.
3 State and explain the Apriori Algorithm with an example. Consider the
  following data set to generate association rules:
  {M,O,N,K,E,Y} {D,O,N,K,E,Y} {M,A,K,E} {M,U,C,K,Y} {C,O,O,K,I,E},
  Support = 60%, Confidence = 80%.
4 Explain in detail about partitional algorithms with an example.
5 Explain the steps involved in the Apriori Algorithm.
6 Write short notes on Maximal Frequent Item Set & Closed Frequent Item Set.
7 Explain in detail about Support and Confidence measures with an example.
8 Discuss Quantitative association mining.
9 Explain in detail about Multidimensional association rules.
10 State and explain the Apriori algorithm for the following given example.
   Support = 60%, Confidence = 80%.
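
To check hand-worked answers for the Apriori problems above (questions 2 and 3 use the same five transactions), a small level-wise sketch such as the following may help. It is not a textbook-exact implementation: the function names are illustrative, each transaction is treated as a set (so the repeated O in {C,O,O,K,I,E} counts once), and the 60% threshold is assumed to mean a minimum count of 3 out of 5 transactions.

```python
from itertools import combinations

# The five transactions from the question, as sets of single-letter items.
transactions = [set("MONKEY"), set("DONKEY"), set("MAKE"),
                set("MUCKY"), set("COOKIE")]

MIN_COUNT = 3          # assumed: 60% of 5 transactions
MIN_CONFIDENCE = 0.8   # 80%, as given in the question


def count(itemset):
    """Number of transactions that contain every item of itemset."""
    return sum(itemset <= t for t in transactions)


def apriori(min_count):
    """Return {frequent itemset: support count}, found level by level."""
    items = sorted(set().union(*transactions))
    current = [frozenset([i]) for i in items
               if count(frozenset([i])) >= min_count]
    frequent = {}
    k = 1
    while current:
        for itemset in current:
            frequent[itemset] = count(itemset)
        k += 1
        # Join step: merge frequent (k-1)-itemsets into k-item candidates.
        candidates = {a | b for a in current for b in current
                      if len(a | b) == k}
        # Prune step (Apriori property): every (k-1)-subset must be frequent.
        previous = set(current)
        candidates = {c for c in candidates
                      if all(frozenset(s) in previous
                             for s in combinations(c, k - 1))}
        current = [c for c in candidates if count(c) >= min_count]
    return frequent


def confidence(antecedent, consequent):
    """confidence(A -> B) = count(A union B) / count(A)."""
    a, b = frozenset(antecedent), frozenset(consequent)
    return count(a | b) / count(a)


frequent_itemsets = apriori(MIN_COUNT)
for itemset, n in sorted(frequent_itemsets.items(),
                         key=lambda kv: (len(kv[0]), sorted(kv[0]))):
    print("".join(sorted(itemset)), n)
```

A rule A -> B is kept when `confidence(A, B) >= MIN_CONFIDENCE`; for example, `confidence("O", "KE")` gives the confidence of the rule O -> {K, E}.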

Unit-4

1. Where are decision trees mainly used?
2. What do you mean by concept hierarchies?
3. How will you solve a classification problem using decision trees?
4. Explain ID3.
5. What is a "decision tree"?
6. Define Data Classification.
7. Define Prediction.
8. What is the difference between "supervised" and "unsupervised" learning
   schemes?
9. What is clustering?
10. What are the requirements of clustering?
11. State the categories of clustering methods.
12. Difference between K-Means and K-Medoids algorithms.
13. What do you mean by Hierarchical Clustering?
14. What do you mean by Agglomerative Clustering?
15. What do you mean by Outlier Detection?

Long Questions

1 Discuss the Decision tree induction algorithm with an example.
2 Explain Attribute Subset Selection Measures with an example.
3 Explain the backpropagation algorithm with an example.
4 Explain grid-based clustering.
5 Explain density-based clustering algorithms (DBSCAN and OPTICS).
  (DBSCAN is very important.)
6 What do you mean by Clustering? Explain the requirements used in
  Clustering.
7 Explain in detail about Hierarchical Clustering.
8 Explain in detail about the partitioning Clustering method.
9 Discuss Outlier Detection.
10 Explain in detail about Clustering methods with an example.

Unit-5

1 What are the foundations of data mining?
2 Name some specific application-oriented databases.
3 What are the goals of time series analysis?
4 Name some conventional visualization techniques.
5 How can data visualization help in decision making?
6 What is the use of DB Miner?
7 What is the scope of data mining?
8 Differentiate between data mining and data warehousing.
9 Which problems, in general, can data mining solve?
10 Short notes on data mining for financial data analysis.
11 List out the types of data mining.
12 What is meant by information retrieval?
13 What is meant by a content-based retrieval system?
14 What is meant by similarity search in multimedia?
15 List out the techniques in audio mining.
16 List out the techniques in video mining.
17 Mention the types of information retrieval systems.
18 What is meant by text mining?

Long Questions

1 Explain relational data.
2 Explain transactional data.
3 Explain multi-dimensional data.
4 Explain distributed data.
5 What are the social impacts of Data Mining?
6 Discuss Spatial Data Mining.
7 Discuss Multimedia data mining.
8 Discuss Text and Web data.

Have a glance at all the topics of Unit 5.
