0% found this document useful (0 votes)
75 views

BDA Quiz 2 Help

The document contains a quiz with 30 multiple choice questions testing knowledge of data mining and machine learning concepts. Some of the key concepts covered include support vectors, social network analysis, data visualization, logistic regression, kernel functions in support vector machines, decision trees, artificial neural networks, naive Bayes, k-means clustering, and regression analysis. The majority of the questions can be answered with a single term or short phrase.

Uploaded by

Spider
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
75 views

BDA Quiz 2 Help

The document contains a quiz with 30 multiple choice questions testing knowledge of data mining and machine learning concepts. Some of the key concepts covered include support vectors, social network analysis, data visualization, logistic regression, kernel functions in support vector machines, decision trees, artificial neural networks, naive Bayes, k-means clustering, and regression analysis. The majority of the questions can be answered with a single term or short phrase.

Uploaded by

Spider
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 4

QUIZ – 2 Answers

1) Support vectors are the data points that lie closest to the decision surface.

True

============================================================================

2) The graphical representation of an SNA is made up of links and _____

Nodes

============================================================================

3) Data visualization helps in making large data set coherent. True/false

True

============================================================================

4) Which of the following statements is true regarding binary logistic regression?

The dependent variable consists of two categories

============================================================================

5) ------------ is not a functionality of datamining

Selection and interpretation

============================================================================

6) ______________ is/are the way/s to represent uncertainty.

All (?)

============================================================================

7) Where can Bayes Rule be used?

Answering probabilistic query

============================================================================

8) Advantages of Kernel func in SVM

All of the above

============================================================================
9) __________predicts future trends & behaviors, allowing business managers to make
proactive,knowledge-driven decisions

Data Mining

============================================================================

10) Study the chart and answer the questions: Pie chart given here represents the domestic
expenditure of a family in percent. Study the chart and answer the following questions if the total
monthly income of the family is Rs. 33,650 House rent per month is _________

6057

============================================================================

11) Which among the following statements is false regarding decision trees

Decision trees will never suffer from overfitting

============================================================================

12) If an ANN receives n inputs x1, x2, x3… xn with weights w1, w2, w3… wn.The weighted sum ------ is
calculated and passed to an activation function

Σ wi * xi

============================================================================

13) If A, B are two sets of items, and A⊆ B. Which of the following statement is always true?

support(A) >= support(B)

============================================================================

14) Which among the following statements is false regarding Artificial Neural Networks

ANN can handle only categorical values

============================================================================

15) TDM in text mining:

Term Document Matrix

============================================================================

16) Considering the K-means algorithm, if points (-3, 2), (-4, 2), and (-2, -1) are the only points which
are assigned to the first cluster now, what is the new centroid for this cluster
(-3,1)

============================================================================

17) Which of the following is true about Naive Bayes

Assumes that all the features in a dataset are independent

============================================================================

18) SVM less effective when

The data is noisy and contains overlapping points

============================================================================

19) The splitting criteria used in C4.5 algorithm is

Information Gain

============================================================================

20) Match the following

a-3, b-1, c-2, d-4

============================================================================

21) Real world application of SVM

Text and hypertext categorization

============================================================================

22) ____ is also known as Market Basket Analysis

Association Rule Mining

============================================================================

23) Given the regression equation Y=22.4 - 53.9X, then 22.4 is the ------- while -53.9 is the ---- of the
regression line

Intercept, slope

============================================================================

24) Applications of Text mining

All
============================================================================

25) The main purpose for structure mining is to extract previously unknown relationships between

Web pages

============================================================================

26) _____ can be used to describe nodes that contain the most amount of information about a
network

Degree centrality

============================================================================

27) Best choice of number of clusters

============================================================================

28) In regression analysis, if all the values fall on the same straight line and the line has a positive
slope then , correlation coefficient ‘r’ is

r = +1

============================================================================

29) Confusion matrix predictive accuracy

0.806

============================================================================

30) Which of the following statements regarding k-means clustering is wrong

It is suitable for discovering cluster shapes that are not hyperellipsoids

============================================================================

You might also like