
Data Mining Nostos

The document discusses key concepts in data science and machine learning, including:
- Dependent (response) variables and explanatory variables, the latter also known as predictor variables
- Classification, a data analysis task that predicts categorical variables
- Clustering, an unsupervised learning method that works on distance measures
- Association rule mining, which identifies associations and correlations in data and expresses them as rules
- Descriptive and inferential statistics and their use on sample and population datasets

Consider an example of an apartment: The number of bedrooms, bathrooms, and the floor of
an apartment determine its price. Which is the dependent variable in this example?
Price

Which process of KDD aids in unifying data from different sources?


Data Integration

Consider an example of an apartment: The number of bedrooms, bathrooms, and the floor of
an apartment determine its price.
Which is/are the predictor variable(s) in this example?
All the options

__________ term portrays the process of discovering small pieces from a large volume of raw
material
Mining

__________ step of classification contributes to the construction of the learning model.


Learning Step

Response variable is a __________.


Dependent Variable

__________ is the problem of identifying the category to which a new observation belongs,
based on a training set of data containing observations whose categories are already known.
Classification

Explanatory variable is a __________.


Predictor Variable

Which of the following helps in measuring the central tendency of the dataset?
All the options
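
The answer options are not reproduced in this copy. As a minimal sketch, assuming the usual measures of central tendency are meant (mean, median, and mode), Python's standard statistics module computes all three. The prices list is made-up illustration data, not part of the original quiz.

```python
import statistics

# Hypothetical sample: apartment prices in thousands (illustrative values only)
prices = [120, 150, 150, 180, 400]

mean_price = statistics.mean(prices)      # arithmetic average
median_price = statistics.median(prices)  # middle value of the sorted data
mode_price = statistics.mode(prices)      # most frequent value

print(mean_price, median_price, mode_price)
```

The single large value pulls the mean above the median, which is one reason the median is often preferred for skewed data.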

Classification predicts the value of __________ variable.


Categorical

Inferential statistics is used in __________ datasets.


Population

__________ stage of the data science process helps in exploring and determining the patterns
from the data.
Exploratory Data Analysis

Identify the algorithm that works based on the concept of clustering.


K-Means
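
To make the idea concrete, here is a minimal sketch of K-Means (Lloyd's algorithm) in plain Python. It assumes Euclidean distance, fixed starting centroids, and a fixed number of iterations; the function name kmeans and the sample points are hypothetical, chosen only for illustration.

```python
import math

def kmeans(points, centroids, iterations=10):
    """Alternate between assigning points and updating centroids."""
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest centroid
        clusters = [[] for _ in centroids]
        for p in points:
            nearest = min(range(len(centroids)),
                          key=lambda i: math.dist(p, centroids[i]))
            clusters[nearest].append(p)
        # Update step: move each centroid to the mean of its cluster
        # (an empty cluster keeps its previous centroid)
        centroids = [
            tuple(sum(c) / len(cluster) for c in zip(*cluster)) if cluster
            else centroids[i]
            for i, cluster in enumerate(clusters)
        ]
    return centroids, clusters

# Illustrative data: two visually separate groups of 2-D points
points = [(1, 1), (1, 2), (2, 1), (8, 8), (9, 8), (8, 9)]
centroids, clusters = kmeans(points, centroids=[(0, 0), (10, 10)])
print(centroids)
print(clusters)
```

Real implementations usually pick the starting centroids at random and stop when assignments no longer change; both are left out here to keep the sketch deterministic.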

Classification is a __________ task.


Data Analysis

Descriptive statistics is used in __________ datasets.


Sample

__________ statistics provides the summary statistics of the data.


Descriptive

In Association Rules, the Antecedent and Consequent form a disjoint set.


True

_______ association measure compares the confidence with the expected confidence.


Lift
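
As a worked illustration, lift can be computed from support and confidence. The sketch below assumes transactions are stored as Python sets and takes the expected confidence to be the support of the consequent; the function names and the basket data are hypothetical.

```python
def support(transactions, items):
    """Fraction of transactions that contain every item in `items`."""
    items = set(items)
    return sum(items <= t for t in transactions) / len(transactions)

def confidence(transactions, antecedent, consequent):
    """Support of the whole rule divided by support of the antecedent."""
    both = set(antecedent) | set(consequent)
    return support(transactions, both) / support(transactions, antecedent)

def lift(transactions, antecedent, consequent):
    """Confidence divided by expected confidence (support of the consequent)."""
    return (confidence(transactions, antecedent, consequent)
            / support(transactions, consequent))

# Illustrative market baskets (made-up data)
transactions = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk"},
]

rule_confidence = confidence(transactions, {"bread"}, {"milk"})
rule_lift = lift(transactions, {"bread"}, {"milk"})
print(rule_confidence, rule_lift)
```

A lift above 1 suggests the antecedent and consequent occur together more often than expected if they were independent; a lift below 1, as for the rule in this toy data, suggests the opposite.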

The science of collecting, interpreting, and analyzing data is known as __________.


Statistics

__________ outlier significantly deviates based on the context selected.


Contextual Outlier

Regression can be used in prediction/forecasting applications.


True

Identify the application of the Outlier Detection Method.


Intrusion Detection

Identify the Unsupervised Learning method.


Clustering

Which of the following helps in measuring the dispersion range of the data?
All the options
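
The answer options are not reproduced in this copy. As a minimal sketch, assuming the common dispersion measures are meant (range, variance, and standard deviation), they can be computed with Python's standard statistics module. The data list is made-up illustration data.

```python
import statistics

# Hypothetical dataset (illustrative values only)
data = [2, 4, 4, 4, 5, 5, 7, 9]

data_range = max(data) - min(data)        # spread between the extremes
variance = statistics.pvariance(data)     # population variance
std_dev = statistics.pstdev(data)         # population standard deviation

print(data_range, variance, std_dev)
```

The population versions (pvariance, pstdev) divide by n; for a sample drawn from a larger population, statistics.variance and statistics.stdev divide by n - 1 instead.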

__________ aids in identifying associations, correlations, and frequent patterns in data.


Association Rule Mining

Collective outlier significantly deviates from the entire dataset.


False

_______ stage of the data science process helps in converting raw data into a machine-readable
format.
Data Gathering

Jaccard Index distance measure is used on __________.


Non-numeric dataset
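
A minimal sketch of why the Jaccard Index suits non-numeric data: it only needs set membership, not arithmetic on the values. The function names and the baskets below are hypothetical, and treating two empty sets as identical is a convention assumed here.

```python
def jaccard_index(a, b):
    """Size of the intersection divided by size of the union."""
    a, b = set(a), set(b)
    if not a and not b:
        return 1.0  # assumed convention: two empty sets count as identical
    return len(a & b) / len(a | b)

def jaccard_distance(a, b):
    """Dissimilarity: 0 for identical sets, 1 for disjoint sets."""
    return 1.0 - jaccard_index(a, b)

# Illustrative categorical data (made-up baskets)
basket1 = {"bread", "milk", "eggs"}
basket2 = {"milk", "eggs", "tea"}

print(jaccard_index(basket1, basket2), jaccard_distance(basket1, basket2))
```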

Derived relationships in Association Rule Mining are represented in the form of __________.
Rules

Identify the algorithm that works based on the concept of classification.


None of the options

Which of the following association measures helps in identifying how frequently the item
appears in a dataset?
Support

_________ parameter of regression helps in identifying the direction of relationship between
variables.
Regression Coefficient
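
To illustrate, the sign of the coefficient (slope) in a simple linear regression shows the direction of the relationship. This is a minimal sketch using ordinary least squares with one predictor; the function name and the apartment figures are hypothetical.

```python
def slope_and_intercept(xs, ys):
    """Ordinary least squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Co-variation of x and y, and variation of x, around their means
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

# Made-up data: price rises with bedrooms, falls with distance from the centre
bedrooms = [1, 2, 3, 4]
price_by_bedrooms = [100, 150, 200, 250]
distance_km = [1, 2, 3, 4]
price_by_distance = [250, 200, 150, 100]

up_slope, up_intercept = slope_and_intercept(bedrooms, price_by_bedrooms)
down_slope, down_intercept = slope_and_intercept(distance_km, price_by_distance)
print(up_slope, down_slope)
```

A positive coefficient means the response increases with the predictor; a negative coefficient means it decreases.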

Distance measure(s) used in clustering process of Numeric Dataset is/are __________.


All the options
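
The answer options are not reproduced in this copy. As a minimal sketch, assuming two widely used numeric distance measures (Euclidean and Manhattan) are among them, they can be written as follows. The function names and the sample points are hypothetical.

```python
import math

def euclidean(p, q):
    """Straight-line distance: square root of summed squared differences."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))

def manhattan(p, q):
    """City-block distance: sum of absolute differences per coordinate."""
    return sum(abs(a - b) for a, b in zip(p, q))

# Illustrative points
p, q = (1, 2), (4, 6)
print(euclidean(p, q), manhattan(p, q))
```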

Clustering process works on _________ measure.


Distance

_______ step of KDD process helps in identifying valuable patterns.


Pattern Evaluation

Which among the following is/are (an) outlier detection method(s)?


All the options
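
The answer options are not reproduced in this copy. As one illustrative statistical approach, a z-score test flags values that lie far from the mean in units of standard deviation. This is a minimal sketch; the function name, the threshold of 2.0, and the readings are assumptions made for the example.

```python
import statistics

def zscore_outliers(values, threshold=2.0):
    """Return values whose absolute z-score exceeds `threshold`."""
    mean = statistics.mean(values)
    stdev = statistics.pstdev(values)
    if stdev == 0:
        return []  # all values identical, so nothing deviates
    return [v for v in values if abs(v - mean) / stdev > threshold]

# Made-up readings with one obvious extreme value
readings = [10, 11, 9, 10, 12, 10, 11, 50]
outliers = zscore_outliers(readings)
print(outliers)
```

Because an extreme value inflates both the mean and the standard deviation, this simple test can miss outliers in small samples; it is shown only to make the idea concrete.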

________ statistics provides inferences on population.


Inferential
