0% found this document useful (0 votes)
46 views

Data-Mining-Question-bank

The document is a question bank for a 6th semester course on Data Mining and Warehousing, covering various topics such as data mining tasks, the relationship between data warehousing and data mining, and differences between OLTP and OLAP systems. It includes questions on data processing, data cleaning, visualization techniques, and statistical analysis of datasets. Additionally, it features practical problems related to data analysis and normalization techniques.

Uploaded by

miteshjoshi5565
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
46 views

Data-Mining-Question-bank

The document is a question bank for a 6th semester course on Data Mining and Warehousing, covering various topics such as data mining tasks, the relationship between data warehousing and data mining, and differences between OLTP and OLAP systems. It includes questions on data processing, data cleaning, visualization techniques, and statistical analysis of datasets. Additionally, it features practical problems related to data analysis and normalization techniques.

Uploaded by

miteshjoshi5565
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 3

DATA MINING & WAREHOUSING QUESTION BANK

Semester :6th
Q.1 Explain different data mining tasks.

Q.2 What is the relation between data warehousing and data mining?

Q.3 Explain the differences between “Explorative Data Mining” and “Predictive Data Mining” and give one
example of each.

Q.4 What are the application areas of data Mining?

Q.5 What is Data Mining? Explain KDD process with diagram. How is data warehouse different from a database?
How are they similar?

Q.13 Compare OLTP and OLAP Systems.

Q.14 What do you mean by Data Processing?

Q.15 Explain data cleaning.

Q.16 Describe different data cleaning approaches.

Q.17 Describe different types of attributes with example.

Q.22 Give Brief description of following:

(a) Binning

(b) regression

(c) Clustering

(d) Smoothing
Q.24 A data set for analysis includes only one attribute X:

X={ 7,12,5,8,5,9,13,12,19,7,12,12,13,3,4,5,13,8,7,6}

(a) What is the mean of the data set X?


(b) What is the median?
(c) Find the standard deviation for X.

Q.25 Explain the different visualization techniques with example.

Q.26 Suppose we have a dataset representing the ages of individuals collected for analysis.
The recorded age values (in increasing order) are:

Age Data:
14, 17, 18, 18, 20, 22, 22, 23, 24, 24, 26, 26, 26, 26, 31, 34, 34, 36, 36, 36, 36,
38, 42, 47, 48, 54, 72

Solve the following:


(a) What is the mean of the data? What is the median?
(b) What is the mode of the data? Comment on the data’s modality.
(c) What is the midrange of the data?
(d) Can you find (roughly) the first quartile (Q1) and the third quartile (Q3) of the
data?
(e) Give the five-number summary of the data.
(f) Show a boxplot of the data.

Q28 Explain Binning process with proper example

Q.29 A survey was conducted with 35 participants, and their responses were recorded and sorted as
follows:

Survey Data:
8, 12, 14, 14, 17, 18, 18, 21, 21, 21, 23, 24, 24, 24, 27, 27, 27, 27, 32, 32, 34, 34, 34, 37, 37, 37, 37, 38,
42, 47, 48, 53, 53, 72, 88

Draw histograms for this data taking bin size as 5 and 8. Examine the effect of bin
size on the histograms you obtain.

Q. 30 A sales firm has reported following sales figures for FY 23-24 (i.e. March 23 to Feb
24)

3100, 512, 780, 630, 490, 8650, 6100, 400, 2900, 780, 680, 590
Analyze this data series to normalize using Min-Max scaling

You might also like