Data-Mining (Set 1)
1 of 4 sets
2. Bayesian classifiers are
A. a class of learning algorithms that try to find an optimum classification of a set of examples
using probability theory.
B. any mechanism employed by a learning system to constrain the search space of a hypothesis.
C. an approach to the design of learning algorithms that is inspired by the fact that when people
encounter new situations, they often explain them by reference to familiar experiences, adapting
the explanations to fit the new situation.
D. none of these
Answer:A
3. Algorithm is
A. it uses machine-learning techniques; here a program can learn from past experience and adapt
itself to new situations.
B. a computational procedure that takes some value as input and produces some value as output.
C. the science of making machines perform tasks that would require intelligence when performed by
humans.
D. none of these
Answer:B
4. Bias is
A. a class of learning algorithms that try to find an optimum classification of a set of examples
using probability theory.
B. any mechanism employed by a learning system to constrain the search space of a hypothesis.
C. an approach to the design of learning algorithms that is inspired by the fact that when people
encounter new situations, they often explain them by reference to familiar experiences, adapting
the explanations to fit the new situation.
D. none of these
Answer:B
6. Case-based learning is
A. a class of learning algorithms that try to find an optimum classification of a set of examples
using probability theory.
B. any mechanism employed by a learning system to constrain the search space of a hypothesis.
C. an approach to the design of learning algorithms that is inspired by the fact that when people
encounter new situations, they often explain them by reference to familiar experiences, adapting
the explanations to fit the new situation.
D. none of these
Answer:C
7. Classification is
A. a subdivision of a set of examples into a number of classes.
B. a measure of the accuracy of the classification of a concept that is given by a certain theory.
C. the task of assigning a classification to a set of examples
D. none of these
Answer:A
9. Classification accuracy is
A. a subdivision of a set of examples into a number of classes
B. a measure of the accuracy of the classification of a concept that is given by a certain theory.
C. the task of assigning a classification to a set of examples
D. none of these
Answer:B
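As a rough illustration of the measure described in option B, accuracy is commonly computed as the fraction of examples whose predicted class matches the actual class. The sketch below assumes that reading; the function name and the sample labels are hypothetical and not part of the question set.

```python
def classification_accuracy(predicted, actual):
    """Return the fraction of examples whose predicted class equals the actual class."""
    if len(predicted) != len(actual):
        raise ValueError("predicted and actual must have the same length")
    if not actual:
        raise ValueError("need at least one example")
    correct = sum(p == a for p, a in zip(predicted, actual))
    return correct / len(actual)


# Hypothetical labels: three of the four predictions match.
actual = ["spam", "ham", "spam", "ham"]
predicted = ["spam", "ham", "ham", "ham"]
accuracy = classification_accuracy(predicted, actual)
print(accuracy)  # 0.75
```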
11. Cluster is
A. group of similar objects that differ significantly from other objects
B. operations on a database to transform or simplify data in order to prepare it for a machine-
learning algorithm
C. symbolic representation of facts or ideas from which information can potentially be extracted
D. none of these
Answer:A
13. A definition of a concept is _________ if it recognizes all the instances of that concept
A. complete
B. consistent
C. constant
D. none of these
Answer:A
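A small sketch may help separate options A and B. It treats a definition as a yes/no test and checks completeness exactly as the question states it. The meaning used for "consistent" (the definition accepts no non-instance) is an assumption that the question itself does not spell out, and the even-number concept and all names are hypothetical.

```python
def is_complete(definition, instances):
    """Complete: the definition recognizes every instance of the concept."""
    return all(definition(x) for x in instances)


def is_consistent(definition, non_instances):
    """Consistent (assumed meaning): the definition accepts no non-instance."""
    return not any(definition(x) for x in non_instances)


# Hypothetical concept: "even number".
instances = [2, 4, 6, 8]
non_instances = [1, 3, 5]


def divisible_by_two(n):
    return n % 2 == 0


def divisible_by_four(n):
    return n % 4 == 0


def any_integer(n):
    return True


print(is_complete(divisible_by_two, instances), is_consistent(divisible_by_two, non_instances))
print(is_complete(divisible_by_four, instances), is_consistent(divisible_by_four, non_instances))
print(is_complete(any_integer, instances), is_consistent(any_integer, non_instances))
```

Under these assumptions the first definition is both complete and consistent, the second misses 2 and 6 and so is not complete, and the third is complete but accepts every non-instance.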
18. Discovery is
A. it is hidden within a database and can only be recovered if one is given certain clues (an
example is encrypted information).
B. the process of extracting implicit, previously unknown and potentially useful information from
data.
C. an extremely complex molecule that occurs in human chromosomes and that carries genetic
information in the form of genes.
D. none of these
Answer:B
21. Enrichment is
24. Heuristic is
A. a set of databases from different vendors, possibly using different database paradigms
B. an approach to a problem that is not guaranteed to work but performs well in most cases
C. information that is hidden in a database and that cannot be recovered by a simple SQL query.
D. none of these
Answer:B
33. Learning is
A. the process of finding the right formal representation of a certain body of knowledge in order to
represent it in a knowledge-based system
B. it automatically maps an external signal space into a system's internal representational space.
they are useful in the performance of classification tasks.
37. Node is
A. a component of a network
B. in the context of KDD and data mining, this refers to random errors in a database table.
C. one of the defining aspects of a data warehouse
43. Prediction is
A. the result of the application of a theory or a rule in a specific case
B. one of several possible keys within a database table that is chosen by the designer as the
primary means of accessing the data in the table.
C. discipline in statistics that studies ways to find the most interesting projections of multi-
dimensional spaces.
D. none of these
Answer:A
49. ............................. is a comparison of the general features of the target class data
objects against the general features of objects from one or multiple contrasting
classes.
A. data characterization
B. data classification
C. data discrimination
D. data selection
Answer:C
59. The ............................ exposes the information being captured, stored, and
managed by operational systems.
A. top-down view
B. data warehouse view
C. data source view
D. business query view
Answer:C
61. The .................. allows the selection of the relevant information necessary for
the data warehouse.
A. top-down view
B. data warehouse view
C. data source view
D. business query view
Answer:A
65. .......................... supports basic OLAP operations, including slice and dice,
drill-down, roll-up and pivoting.
A. information processing
B. analytical processing
66. The core of the multidimensional model is the ....................... , which consists of
a large set of facts and a number of dimensions.
A. multidimensional cube
B. dimensions cube
C. data cube
D. data model
Answer:C
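To make the "facts plus dimensions" structure from question 66 and the OLAP operations named in question 65 more concrete, here is a minimal sketch. It assumes a tiny in-memory fact list with made-up sales figures. The dimension names, the `roll_up` and `slice_cube` helpers, and the data are all hypothetical, and a real warehouse would use a dedicated OLAP engine instead.

```python
from collections import defaultdict

# Each fact carries dimension values (year, region, product) and one measure (units).
facts = [
    {"year": 2023, "region": "east", "product": "tea", "units": 10},
    {"year": 2023, "region": "west", "product": "tea", "units": 5},
    {"year": 2024, "region": "east", "product": "tea", "units": 7},
    {"year": 2024, "region": "east", "product": "coffee", "units": 3},
]


def roll_up(rows, keep, measure="units"):
    """Sum the measure over every dimension that is not listed in keep."""
    totals = defaultdict(int)
    for row in rows:
        key = tuple(row[dim] for dim in keep)
        totals[key] += row[measure]
    return dict(totals)


def slice_cube(rows, dimension, value):
    """Keep only the facts where one dimension has a fixed value."""
    return [row for row in rows if row[dimension] == value]


by_year = roll_up(facts, ["year"])
east_by_product = roll_up(slice_cube(facts, "region", "east"), ["product"])
print(by_year)          # {(2023,): 15, (2024,): 10}
print(east_by_product)  # {('tea',): 17, ('coffee',): 3}
```

Rolling up to `year` collapses the region and product dimensions, while the slice fixes `region` to one value before aggregating by product.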
67. The data from the operational environment enter ........................ of data
warehouse.
A. current detail data
B. older detail data
C. lightly summarized data
D. highly summarized data
Answer:A
70. Data warehouse contains ................ data that is never found in the operational
environment.
71. ................... are responsible for running queries and reports against data
warehouse tables.
A. hardware
B. software
C. end users
D. middle ware
Answer:C
72. The biggest drawback of the level indicator in the classic star schema is that it
limits ............
A. flexibility
B. quantify
C. qualify
D. ability
Answer:A
77. Which one of the following is not a part of empirical cycle in scientific
research?
A. Observation
B. Theory.
C. Self learning.
D. Prediction.
Answer:C
79. Redundancy refers to the elements of a message that can be derived from other
parts of _________.
A. different message.
B. irrelevant message.
C. same message.
D. complete message.
83. ___________ multiprocessing machines share the same hard disk and internal
memory.
A. Massively parallel.
B. Symmetric.
C. Parallel.
D. Asymmetric.
Answer:B
96. In the _________ approach, the data warehouse is built first and all the information
needed is selected.
A. top-down.
B. client/server.
C. bottom-up.
D. DSS.
Answer:A