0% found this document useful (0 votes)
307 views

Data Mining

The document discusses key concepts in data science and machine learning including outlier detection methods, descriptive and inferential statistics, data exploration and analysis stages, classification and regression algorithms, association rule mining, clustering, and ensemble classifiers. It provides definitions and identifies examples of these terms.

Uploaded by

Leslie Diaz
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as TXT, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
307 views

Data Mining

The document discusses key concepts in data science and machine learning including outlier detection methods, descriptive and inferential statistics, data exploration and analysis stages, classification and regression algorithms, association rule mining, clustering, and ensemble classifiers. It provides definitions and identifies examples of these terms.

Uploaded by

Leslie Diaz
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as TXT, PDF, TXT or read online on Scribd
You are on page 1/ 3

__________ outlier significantly deviates based on the context selected.

Contextual outlier

__________statistics provides inferences on population.


Inferential

__________ stage of data science process helps in converting raw data into a
machine-readable format.
Exploring Data analysis

Classification predicts the value of __________ variable.


Continous

Derived relationships in Association Rule Mining are represented in the form of


__________.
Rules

Inferential statistics is used in __________ datasets.


Population

__________association measure compares the confidence with the expected confidence.


Lift

Regression can be used in predicting/forecasting Applications.


True

Identify the algorithm that works based on the concept of clustering.


K means

Which of the following association measure helps in identifying how frequently the
item appears in a dataset?
Support

Which among the following is/are (an) outlier detection method(s)?


all of the options

__________ parameter of regression helps in identifying the direction of


relationship between variables.
Measure discrepancy

Which of the following helps in measuring the dispersion range of the data?
Standar deviation

__________statistics provides the summary statistics of the data.


Descritpive

Which among the following is/are (an) Ensemble Classifier?


all of the options

Identify the Unsupervised Learning method.


Clustering

In Association Rules, the Antecedent and Consequent form a disjoint set.


True

Jacard Index distance measure is used on __________.


non numeric

__________ aids in identifying associations, correlations, and frequent patterns in


data.
Association rule mining

Identify the algorithm that works based on the concept of classification.


All of the options

__________ step of KDD process helps in identifying valuable patterns.


Data mining

Descriptive statistics is used in __________ datasets.


Sample

__________ stage of data science process helps in exploring and determining the
patterns from the data.
Exploratory

Classification is a __________ task.


Data analysis

The science of collecting, interpreting, and analyzing data is known as __________.


Statistics

Which of the following helps in measuring the central tendency of the dataset?
All of the options

Which among the following is/are (an) outlier detection method(s)?


all of the options
Clustering process works on _________ measure.
Distance

You might also like