0% found this document useful (0 votes)

2 views16 pages

Handout 2 Data Mining

Data mining is the process of extracting knowledge from large datasets using various statistical and analytical tools to identify patterns and relationships among variables. It is essential for organizations to manage and analyze vast amounts of data, enabling informed decision-making and uncovering hidden insights. Techniques include supervised and unsupervised learning, clustering, classification, and association rule mining, with applications across marketing, finance, and manufacturing.

Uploaded by

mishhra.shailja

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

2 views16 pages

Handout 2 Data Mining

Uploaded by

mishhra.shailja

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 16

11/30/2024

Slide - 1

What is Data Mining

Slide - 2

What is Data Mining

It is the process of mining knowledge

from large amount of data

Data Mining
Techniques

Useful Data

Slide - 3

1
11/30/2024

Data Mining
• Data mining is focused on better understanding of
characteristics and patterns among variables in large
databases using a variety of statistical and analytical tools.
– It is used to identify relationships among variables in
large data sets and understand hidden patterns that
they may contain.

Slide - 4

Why do we use Data Mining

• Companies and organizations get huge amount of data
from different sources and platforms.
• As size of database increases it becomes difficult to
manually search for useful information in it.
• Data mining techniques are used which include AI and
mathematical complex algorithms for getting specific and
useful data.
• This specific data helps in decision making

Slide - 5

Why do we use Data Mining

• We also get trends, patterns, insights of collected data.
• Data Mining is also called as “Knowledge Discovery in
Database (KDD).”
• This data mining term was introduced in 1990

Slide - 6

2
11/30/2024

Data Mining
• Data mining can be considered part descriptive and part
prescriptive analytics.
• In descriptive analytics, data-mining tools help analysts to
identify patterns in data.
• Excel charts and PivotTables, for example, are useful tools
for describing patterns and analyzing data sets; however,
they require manual intervention.
• Regression analysis and forecasting models help us to
predict relationships or future values of variables of
interest.

Slide - 7

Data Mining Overview (1/12)

• The terms ‘artificial intelligence,’ ‘machine learning,’ and ‘data
mining’ are all used interchangeably.
• Their definitions overlap with no clear boundaries.
• They describe applications of computer software used to obtain
insightful solutions that traditional data analysis techniques may
not be able to achieve.
• In a very broad sense, artificial intelligence is used to describe
computer systems that demonstrate human-like intelligence and
cognitive abilities
– Deduction
– Pattern recognition
– Interpretation of complex data
• Examples: Deep Blue playing chess, Watson

11-8

Slide - 8

Data Mining Overview (2/12)

• Machine learning describes techniques that integrate self-
learning algorithms. (Coined by Arthur Samuel, IBM,1959)
• Its an application of artificial intelligence that allows the
computer to learn automatically without human intervention
or assistance.
• Designed to evaluate results and to improve performance
over time.
• Machine learning techniques can uncover hidden patterns
and relationships in data.
• Use self-learning algorithms to evaluate results and improve
performance over time.
• Examples: Predict rider demand to strategically dispatch 11-9

drivers for Uber Slide - 9

3
11/30/2024

Data Mining Overview (3/12)

• Data mining describes the process of applying a set of
analytical techniques necessary for the development of
machine learning and artificial intelligence.
• Data mining is often recognized as a building block of
machine learning and artificial intelligence.
– Uncover hidden patterns and relationships in data
– Gain insights and derive relevant information to help make
decisions

• Data mining techniques are used for data segmentation,

pattern recognition, classification, and prediction.
• Example: Group customers into segments for customized
promotions. 11-10

Slide - 10

Data Mining Overview (4/12) : Process

• Data mining is a complex process of examining data and
applying analytical techniques to gain valuable insights.
• Requires a systematic approach to managing and
conducting data mining projects.
• A popular approach is based on the Cross-Industry
Standard Process for Data Mining (CRISP-DM)
methodology.
• Although there are other data mining methodologies, many
practitioners prefer CRISP-DM.
• It emphasizes business goals and objectives prior to
preparing the data and choosing analysis techniques.
11-11

Slide - 11

Data Mining Overview (5/12)

• CRISP-DM was developed in the 1990s by a group of five
companies: SPSS, TeraData, Daimler AG, NCR, and OHRA.
• CRISP-DM consists of six major phases.
1. Business understanding: situational context, specific objectives, project
schedule, deliverables
2. Data understanding: collecting raw data, preliminary results, potential
hypotheses
3. Data preparation: record and variable selection, wrangling, cleaning
4. Modeling: selection and execution of data mining techniques, convert or
transform data to formats/types needed for certain analyses, document
assumptions, cross-validation
5. Evaluation: evaluate performance of competing models, select best
models, review and interpret results, develop recommendations
6. Deployment: develop a set of actionable insights and a strategy for
deployment/monitoring/feedback 11-12

Slide - 12

4
11/30/2024

Data Mining Overview (6/12)

11-13

Slide - 13

Data Mining Overview (7/12)

• It is important to note that not every step of the CRISP-DM framework is
needed for all data mining applications.
• The data preparation phase plays a significant role in the data mining
process.
• An analyst or analytics team tends to spend a sizable portion of the
project time (often 80%) on understanding, cleansing, transforming, and
preparing, data leading up to the modeling activities.
• The CRISP-DM methodology is popular among data mining
practitioners because it offers a holistic approach to data mining with
detailed phases, tasks, and activities.
• Other data mining methodologies include SEMMA (for Sample, Explore,
Modify, Model, and Assess) and KDD (Knowl- edge Discovery in
Databases).
11-14

Slide - 14

Data Mining Overview (8/12)

• Data mining algorithms are classified into two types of techniques
depending on the way they learn about data.
– Supervised data mining techniques are use for developing predictive models.
– Unsupervised data mining techniques are effective for data exploration,
dimension reduction, and pattern recognition.

• The key distinction between supervised and unsupervised techniques is

that, in supervised data mining, the target variable is identified.
– In regression models, the target variable is the response variable.
– The historical values of the target variable exist in the data set.
• Data mining algorithms can examine the impact of the predictor
variables on the target variable.
• On the contrary, in unsupervised data mining, no target variable is
identified.
11-15

Slide - 15

5
11/30/2024

Data Mining Overview (9/12)

• Some of the most commonly used supervised data mining
algorithms are based on classic statistical techniques.
• Examples include the linear regression model and the logistic
regression model.
• Use information on the predictor variables (𝑥 , 𝑥 , … , 𝑥 ) to
predict and/or describe changes in the target variable (𝑦) .
• A regression model is therefore “trained” or “supervised” because
the known values of the target variable are used to build the
model.
• The performance of the model can be evaluated based on how
the predicted values deviate from the actual values.
11-16

Slide - 16

Data Mining Overview (10/12)

• Common applications of supervised data mining include
classification and prediction models.
• In a classification model, the target variable is categorical.
– Predict the class memberships of new cases
– Example: example: classify stock buy, hold, or sale
• In a prediction model, the target variable is numerical.
– Predict the target for a new case
– Example: spending of a customer
• Other machine learning algorithms: k-Nearest Neighbors,
naïve Bayes, Decision Trees

11-17

Slide - 17

Data Mining Overview (11/12)

• Unsupervised data mining requires no knowledge of the
target variable.
• The algorithms allow the computer to identify patterns and
relationships in the data without any specific guidance from
the analyst.
• Unsupervised learning is considered to be an important part
of exploratory data analysis and descriptive analytics.
• Used prior to conducting supervised learning in order to
understand the data set, formulate questions, or summarize
data.
• Common applications of unsupervised learning include
dimension reduction and pattern recognition. 11-18

Slide - 18

6
11/30/2024

Data Mining Overview (12/12)

• Dimension reduction converts a set of high-dimensional
data (large number of variables) into data with lesser
dimensions without losing much of the information.
– Deploy before other data mining methods
– Reduce information redundancy, improve model stability
– Relevant for big data to bring out important patterns and build more
stable models

• Pattern recognition recognizing patterns using machine

learning.
– Recurring sequences
– Frequent combinations
– Recognizable features
– Common characteristics 11-19

Slide - 19

Data Mining Techniques include

Statistics AI ML
It include: Different AI algo’s It include:
1. Cluster Techniques 1. KNN algo
2. Regression 2. Apriori algo
3. Classification 3. K mean algo
4. Segmentation 4. Naïve bayes algo

Shopping on Amazon Slide - 20

Shopping on Amazon

Slide - 21

7
11/30/2024

The Scope of Data Mining

• Cluster Analysis
– identifying groups in which elements are in some way similar
• Classification
– analyzing data to predict how to classify a new data element
• Association
– analyzing databases to identify natural associations among
variables and create rules for target marketing or buying
recommendations
• Cause-and-effect Modeling
– developing analytic models to describe relationships between
metrics that drive business performance

Slide - 22

Cluster Analysis
• Cluster analysis, also called data segmentation, is a
collection of techniques that seek to group or segment a
collection of objects (observations or records) into subsets
or clusters, such that those within each cluster are more
closely related to one another than objects assigned to
different clusters.
– The objects within clusters should exhibit a high
amount of similarity, whereas those in different clusters
will be dissimilar.

Slide - 23

Clustering Methods
• Hierarchical clustering
– Agglomerative
clustering methods,
which proceed by series
of fusions of the n
objects into groups.
– Divisive clustering
methods, which
separate n objects
successively into finer
groupings.

Slide - 24

8
11/30/2024

Single Linkage Clustering

• An agglomerative method that keeps forming clusters from
the individual objects until only one cluster is left.
• In the single linkage method, the distance between two
clusters r and s, is defined as the minimum
distance between any object in cluster r and any object in
cluster s.

Slide - 28

Dendogram
• Visualization of the clustering process. The y-axis
measures the intercluster distance. A dendogram shows
the sequence in which clusters are formed as you move up
the diagram.

Slide - 33

Classification
• Classification methods seek to classify a categorical
outcome into one of two or more categories based on
various data attributes.
• For each record in a database, we have a categorical
variable of interest and a number of additional predictor
variables.
• For a given set of predictor variables, we would like to
assign the best value of the categorical variable.

Slide - 34

9
11/30/2024

Classification Techniques
• k-Nearest Neighbors (k-NN) Algorithm
– Finds records in a database that have similar numerical
values of a set of predictor variables.
• Discriminant Analysis
– Uses predefined classes based on a set of linear
discriminant functions of the predictor variables.

Slide - 42

k-Nearest Neighbors (k-NN)

• The k-nearest neighbors (k-NN) algorithm is a
classification scheme that attempts to find records in a
database that are similar to one we wish to classify.
Similarity is based on the “closeness” of a record to
numerical predictors in the other records, using normalized
Euclidean distances.

Slide - 43

k-Nearest Neighbor Rules

• The nearest neighbor to a record is the one that that has
the smallest distance from it.
– If k = 1, then the 1-NN rule classifies a record in the
same category as its nearest neighbor.
– k-NN rule finds the k-Nearest Neighbors to each record
we want to classify and then assigns the classification
as the classification of majority of the k nearest
neighbors.
• Typically, various values of k are used and then results
inspected to determine which is best.

Slide - 44

10
11/30/2024

Discriminant Analysis
• Discriminant analysis is a technique for classifying a set
of observations into predefined classes. The purpose is to
determine the class of an observation based on a set of
predictor variables.
• With only two classification groups, we can apply
regression analysis. Unfortunately, when there are more
than two, linear regression cannot be applied, and special
software must be used.

Slide - 47

Association Rule Mining

• Association rule mining, often called affinity analysis,
seeks to uncover associations and/or correlation
relationships in large data sets.
– Association rules identify attributes that occur together
frequently in a given data set.
– Market basket analysis, for example, is used to
determine groups of items consumers tend to purchase
together.
• Association rules provide information in the form of if-then
(antecedent-consequent) statements.

Slide - 51

Cause-and-Effect Modeling
• Correlation analysis can help us develop cause-and-effect
models that relate lagging and leading measures.
– Lagging measures tell us what has happened and are
often external business results such as profit, market
share, or customer satisfaction.
– Leading measures predict what will happen and are
usually internal metrics such as employee satisfaction,
productivity, and turnover.

Slide - 57

11
11/30/2024

Data Mining Advantages

• Marketing/Retailing:
• Direct marketers can benefit from data mining by providing
precise and helpful trends regarding their target audience's
purchase habits. These trends enable marketers to target
their target market more precisely with their marketing efforts.
For consumers with a long history of purchasing software, a
software company's marketing may promote its new product.
• Data mining can aid marketers in making predictions about
the goods their target customers may be interested in buying.
Marketers can surprise consumers and enhance the
shopping experience by making this forecast.

Slide - 60

Data Mining Advantages

• Banking/Crediting:
• Financial companies can benefit from data mining in areas
like credit documentation and loan records.
• A bank, for instance, can determine the degree of risk
associated with each specific loan by assessing prior
consumers who share comparable features.
• Data mining can also assist credit card issuers in alerting
customers to possibly fraudulent credit card transactions.
Credit card issuers can cut their losses even though data
mining technology only sometimes predicts fraudulent
charges with 100% accuracy
Slide - 61

Data Mining Advantages

• Manufacturing:
• Manufacturers can spot defective equipment and establish the
best control parameters by using data mining on operational
engineering data.
• For instance, semiconductor manufacturers face a dilemma
since even in diverse wafer production facilities' manufacturing
environments, the quality of the wafers is generally the same,
and some even have faults for unexplained reasons.
• Data mining has been used to identify the control parameter
ranges that result in the fabrication of the golden wafer. The
desired grade wafers are then produced using those ideal
control settings Slide - 62

12
11/30/2024

Data Mining Advantages

• Customer Identification:
• Every consumer in the market is unique in their ways. Their
fundamental behavior and traits differ.
• As a result, it is easier to comprehend their preferences with
the right methodology. Businesses may better identify their
clients with data mining, increasing the likelihood that they will
buy their products

Slide - 63

Data Mining Advantages

• Detecting Criminal Activities:
• Governments and other institutions can use market analysis
data to identify criminals.
• For instance, the data can be structured to make it easier to
analyze a customer's prior transactions. As a result, it might
quickly reveal any fraudulent activity.

Slide - 64

Data Mining Advantages

• Marketing Techniques:
• Businesses can build data models using data mining
approaches.
• They could quickly determine which people would be interested
in their products using these models. As a result, the firms may
be sure that the products they introduce will be profitable.
• Therefore, whatever new products are presented will help the
company's profits expand.

Slide - 65

13
11/30/2024

Data Mining Advantages

• Criminal Justice:
• By discovering patterns in location, crime type, habit, and other
behavior patterns, data mining can help law enforcement locate
and apprehend criminal offenders.

Slide - 66

Data Mining Disadvantages

• Privacy Issues:
• Businesses gather data about their customers in various ways
to understand the trends in their buying habits. Particularly now
that the internet is booming with social networks, e-commerce,
forums, and blogs, concerns about personal privacy have been
growing significantly.
• People worry that their personal information will be collected
and used unethically, which could get them into a lot of trouble
due to privacy concerns.
• However, businesses don't last forever; on occasion, they
might be bought out by another company or go out of business
entirely. At this time, they likely sell or leak the personal
information they possess Slide - 67

Data Mining Disadvantages

• Safety Concerns:
• A major concern is security. Social Security numbers, birthdays,
salary information, and other details about customers and
employees are owned by businesses. But it still needs to be
determined how well this information is protected.
• Many large corporations like Ford Motor Credit Company and
Sony Pictures have seen hackers access and steal large
amounts of consumer data.
• The credit card was stolen, and identity theft became a major
issue because so much financial and personal information was
available.

Slide - 68

14
11/30/2024

Data Mining Disadvantages

• Information that has been misused or is erroneous:
• Data mining techniques can be used improperly to gather
information for unethical objectives.
• Using this information to their advantage, unethical individuals
or organizations could discriminate against a certain group of
people or take advantage of the weak.
• A further drawback of data mining is its imperfect accuracy.
Inaccurate information will have major repercussions if used to
make decisions.

Slide - 69

Data Mining Disadvantages

• Expensive:
• A particularly expensive procedure is data mining. For instance,
businesses need to hire more staff and technical experts to
ensure that data mining is done properly. Advanced data mining
software is necessary for many firms but may be expensive.
Because they need to yield more useful insights, data mining
often costs more than it saves for most small enterprises.

Slide - 70

Data Mining Disadvantages

• Technical Knowledge:
• Depending on how they should be used, various mining tools
are available. They each have a distinctive algorithm and
design.
• Selecting the appropriate tool will only be possible with the
required technical knowledge. Therefore, it is necessary to send
out a competent specialist to handle the tool selection

Slide - 71

15
11/30/2024

Data Mining Disadvantages

• Accuracy:
• Even though data mining has created a framework for simple
data collection with its techniques, its accuracy is still
constrained. Making decisions can be complicated by
erroneous information that has been acquired.

Slide - 72

Data Mining Disadvantages

• Large databases are needed for data mining:
• Although data mining is one of the most effective tools in a
marketer's arsenal, it has its challenges.
• One such disadvantage is that huge datasets are necessary for
data mining to be effective.
• For instance, if an email list contains just 100 subscribers,
more than the data from those emails will be required for data
mining.
• On the other hand, more information will be available, and data
mining will be more successful if the list has 100,000 persons

Slide - 73

Data Mining Disadvantages

• Data mining methods are not perfect:
• Accurate information is only sometimes produced through data
mining. There are numerous methods for analyzing data, some
of which are more precise than others.
• Predictive models, for instance, rely on the expectation that
particular data patterns will be discovered. When only some
facts back a forecast, this can result in overestimating how
accurate it will turn out.
• Another problem arises when a database contains missing
data that must be considered to produce an accurate analysis.

Slide - 74

Internship Report at Red Cross
83% (6)
Internship Report at Red Cross
37 pages
Biological Science Term 2 Learning Sequence 2013
No ratings yet
Biological Science Term 2 Learning Sequence 2013
5 pages
The Black Sun, Refers in Its Sub-Title To "The Alchemy and Art of Darkness." It Is An Apt Description
No ratings yet
The Black Sun, Refers in Its Sub-Title To "The Alchemy and Art of Darkness." It Is An Apt Description
12 pages
Chapter 4 SR2023
No ratings yet
Chapter 4 SR2023
58 pages
Chapter 3-IB
No ratings yet
Chapter 3-IB
69 pages
Data Mining and IBM SPSS Modeler
No ratings yet
Data Mining and IBM SPSS Modeler
20 pages
Introduction To Data Mining & Business Intelligence
No ratings yet
Introduction To Data Mining & Business Intelligence
25 pages
Data Mining Concepts
100% (3)
Data Mining Concepts
122 pages
Chapter 6_Data Mining
No ratings yet
Chapter 6_Data Mining
62 pages
DataMining and Warehousing - chapter1
No ratings yet
DataMining and Warehousing - chapter1
23 pages
datamining&warehousing
No ratings yet
datamining&warehousing
65 pages
Lecture 2 Data Mining Functions
No ratings yet
Lecture 2 Data Mining Functions
40 pages
PPT4 W3 S4 R0 Predictive Analytics I Data Mining Process
No ratings yet
PPT4 W3 S4 R0 Predictive Analytics I Data Mining Process
50 pages
Data Analysis-2
No ratings yet
Data Analysis-2
41 pages
PredictiveAnalysis U1 U2
No ratings yet
PredictiveAnalysis U1 U2
7 pages
07 DataMining
No ratings yet
07 DataMining
37 pages
01Intro1
No ratings yet
01Intro1
33 pages
Concepts and Techniques: - Chapter 1
No ratings yet
Concepts and Techniques: - Chapter 1
37 pages
Lecture_01_11jan
No ratings yet
Lecture_01_11jan
29 pages
5 Data Mining Proccess and Techniques - Week 7
No ratings yet
5 Data Mining Proccess and Techniques - Week 7
61 pages
Week 02 PDF
No ratings yet
Week 02 PDF
39 pages
turban_dss9e_ch05
No ratings yet
turban_dss9e_ch05
54 pages
DWDM-LS1-Fall-24-25
No ratings yet
DWDM-LS1-Fall-24-25
42 pages
BIDW Lecture 2
No ratings yet
BIDW Lecture 2
33 pages
DM Chapter 1
No ratings yet
DM Chapter 1
37 pages
Data Mining Overview: by Dr. Sunil D. Lakdawala
No ratings yet
Data Mining Overview: by Dr. Sunil D. Lakdawala
52 pages
Unit-1
No ratings yet
Unit-1
148 pages
Chapter 4 - IS 466 - Fall Semester 24-25
No ratings yet
Chapter 4 - IS 466 - Fall Semester 24-25
57 pages
Data Mining - An Overview
No ratings yet
Data Mining - An Overview
40 pages
Chapter 4 - IS 466 - Spring Semester 23-24 Final
No ratings yet
Chapter 4 - IS 466 - Spring Semester 23-24 Final
57 pages
Introduction To Data Mining-Week1
No ratings yet
Introduction To Data Mining-Week1
43 pages
Concepts and Techniques: - Chapter 1
No ratings yet
Concepts and Techniques: - Chapter 1
39 pages
Concepts and Techniques: - Chapter 1
No ratings yet
Concepts and Techniques: - Chapter 1
39 pages
01 Introduction
No ratings yet
01 Introduction
36 pages
Lecture 1
No ratings yet
Lecture 1
37 pages
Lecture 1.1.1 1.1.2
No ratings yet
Lecture 1.1.1 1.1.2
32 pages
Chapter Five Data Mining for Healthcare Analytics
No ratings yet
Chapter Five Data Mining for Healthcare Analytics
77 pages
Chapter 6 Data Mining
No ratings yet
Chapter 6 Data Mining
39 pages
LECTURE 1 data mining
No ratings yet
LECTURE 1 data mining
41 pages
Module 1
No ratings yet
Module 1
40 pages
01Intro
No ratings yet
01Intro
41 pages
Cse5243 Intro. To Data Mining: Chapter 1. Introduction
No ratings yet
Cse5243 Intro. To Data Mining: Chapter 1. Introduction
56 pages
intro data mining
No ratings yet
intro data mining
51 pages
Presentation 1
No ratings yet
Presentation 1
28 pages
Chapter - 1
No ratings yet
Chapter - 1
22 pages
Data Mining
No ratings yet
Data Mining
63 pages
Business Intelligence Data Mining: (John Naisbett)
No ratings yet
Business Intelligence Data Mining: (John Naisbett)
60 pages
Data Mining: Concepts and Techniques: - Chapter 1
No ratings yet
Data Mining: Concepts and Techniques: - Chapter 1
37 pages
dm 1
No ratings yet
dm 1
47 pages
Presentation On Data Mining
100% (1)
Presentation On Data Mining
51 pages
1 - Introduction To DM
No ratings yet
1 - Introduction To DM
59 pages
01Intro (2)
No ratings yet
01Intro (2)
45 pages
introduction to Data Mining
No ratings yet
introduction to Data Mining
48 pages
1712060004 (1)
No ratings yet
1712060004 (1)
25 pages
Data Mining
No ratings yet
Data Mining
30 pages
4 Datamining
No ratings yet
4 Datamining
90 pages
Data Mining
No ratings yet
Data Mining
43 pages
DM Introduction
No ratings yet
DM Introduction
32 pages
Week 01 Chapt01
No ratings yet
Week 01 Chapt01
49 pages
Lecture 7 & 8 Data Mining
No ratings yet
Lecture 7 & 8 Data Mining
21 pages
Chapter 5- Data Mining
No ratings yet
Chapter 5- Data Mining
29 pages
BIS 541 Ch01 20-21 S
No ratings yet
BIS 541 Ch01 20-21 S
129 pages
Mastering Data Mining Techniques
From Everand
Mastering Data Mining Techniques
Dhaanyalakshmi Ahuja
No ratings yet
ppt
No ratings yet
ppt
54 pages
UNIT 1&2 notes
No ratings yet
UNIT 1&2 notes
40 pages
SYLLABUS
No ratings yet
SYLLABUS
1 page
All Unit Notes
No ratings yet
All Unit Notes
21 pages
e Book-1 (IT 2.0 Operation Module)
No ratings yet
e Book-1 (IT 2.0 Operation Module)
216 pages
ATP QUESTIONS IX FEB 2025
No ratings yet
ATP QUESTIONS IX FEB 2025
7 pages
Road Map All Areas Booklet
No ratings yet
Road Map All Areas Booklet
4 pages
Chapter 1
No ratings yet
Chapter 1
31 pages
8051 Ub Programmer
No ratings yet
8051 Ub Programmer
22 pages
Fundamental of MATLAB: Part: I Introduction To MATLAB
100% (1)
Fundamental of MATLAB: Part: I Introduction To MATLAB
3 pages
Past Paper - Moment- Physical Quantities
No ratings yet
Past Paper - Moment- Physical Quantities
2 pages
01-Irrigation Engg
No ratings yet
01-Irrigation Engg
16 pages
Rdso - SPN - 200 - 2010 Rev 2.0 - Flashing Tail Lamp
100% (1)
Rdso - SPN - 200 - 2010 Rev 2.0 - Flashing Tail Lamp
21 pages
Reading June 20
No ratings yet
Reading June 20
6 pages
Price List Exam
No ratings yet
Price List Exam
1 page
1LA7063-4AB12 Datasheet en
No ratings yet
1LA7063-4AB12 Datasheet en
1 page
Untitled Document (27)
No ratings yet
Untitled Document (27)
5 pages
Wjec Gcse English Language Specification 2015 24-10-14 Branded
No ratings yet
Wjec Gcse English Language Specification 2015 24-10-14 Branded
23 pages
How-To Reverse Shipment & Delivery
No ratings yet
How-To Reverse Shipment & Delivery
10 pages
Managing Mental Health in The Workplace
100% (2)
Managing Mental Health in The Workplace
7 pages
TUCANO SUL COMPLETE English 30 Nov 18
No ratings yet
TUCANO SUL COMPLETE English 30 Nov 18
58 pages
Galene-sphalerite Solubilité Barrett1988
No ratings yet
Galene-sphalerite Solubilité Barrett1988
8 pages
Adj and Adv Academic
No ratings yet
Adj and Adv Academic
4 pages
SBA #13 - Paper Chromatography
100% (1)
SBA #13 - Paper Chromatography
4 pages
Apply An Architectural Framework To Stratifying Warehouse Management Systems
No ratings yet
Apply An Architectural Framework To Stratifying Warehouse Management Systems
17 pages
Two Sample Statistical Inference:: Formulae
No ratings yet
Two Sample Statistical Inference:: Formulae
2 pages
Performance Management and Its Challenges
No ratings yet
Performance Management and Its Challenges
5 pages
Light Vehicle Driving NC II: Automotive
No ratings yet
Light Vehicle Driving NC II: Automotive
6 pages
ANZSCO
No ratings yet
ANZSCO
17 pages
Engine Removal and Installation: General
No ratings yet
Engine Removal and Installation: General
4 pages
Social Capital and Participation Theories PDF
No ratings yet
Social Capital and Participation Theories PDF
56 pages

Handout 2 Data Mining

Uploaded by

Handout 2 Data Mining

Uploaded by

11/30/2024

What is Data Mining

What is Data Mining

It is the process of mining knowledge

Why do we use Data Mining

Why do we use Data Mining

Data Mining Overview (1/12)

Data Mining Overview (2/12)

drivers for Uber Slide - 9

Data Mining Overview (3/12)

• Data mining techniques are used for data segmentation,

Data Mining Overview (4/12) : Process

Data Mining Overview (5/12)

Data Mining Overview (6/12)

Data Mining Overview (7/12)

Data Mining Overview (8/12)

• The key distinction between supervised and unsupervised techniques is

Data Mining Overview (9/12)

Data Mining Overview (10/12)

Data Mining Overview (11/12)

Data Mining Overview (12/12)

• Pattern recognition recognizing patterns using machine

Data Mining Techniques include

Shopping on Amazon Slide - 20

The Scope of Data Mining

Single Linkage Clustering

k-Nearest Neighbors (k-NN)

k-Nearest Neighbor Rules

Association Rule Mining

Data Mining Advantages

Data Mining Advantages

Data Mining Advantages

Data Mining Advantages

Data Mining Advantages

Data Mining Advantages

Data Mining Advantages

Data Mining Disadvantages

Data Mining Disadvantages

Data Mining Disadvantages

Data Mining Disadvantages

Data Mining Disadvantages

Data Mining Disadvantages

Data Mining Disadvantages

Data Mining Disadvantages

You might also like