Data Classification

Data classification involves organizing raw data into distinct classes for better visualization, particularly in choropleth maps. Various methods of classification include equal intervals, quantiles, mean standard deviation, natural breaks, optimal classification, and head/tail breaks, each with its own approach to grouping data. The document discusses the principles and algorithms behind these methods, emphasizing the importance of spatial distribution in data classification.

Uploaded by

Mariam Kariam


Data Classification
• Data classification involves grouping raw data into
classes, with each resulting class depicted by a different
symbol. Data classification is particularly appropriate for
choropleth maps because of the difficulty of
differentiating areal symbols (e.g., lightnesses of a single
hue) on an unclassed map.
A. Equal Intervals
• In the equal intervals (or equal steps) method of
classification, each class occupies an equal interval along
the number line.
The steps for computing a six-class equal-interval map are
as follows:
• Determine the class interval (or width): the range of the
data divided by the number of classes.
• Determine the lower limit of each class.
• Determine the upper limit of each class.
• Specify the class limits actually shown in the legend.
• Determine which observations fall in each class.
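The steps above can be sketched in Python as follows. This is a minimal illustration, not the procedure's official form: the function names `equal_interval_limits` and `assign_class` are hypothetical, and the sketch assumes that a value lying exactly on a shared limit belongs to the higher class. It does not adjust the limits for display in the legend.

```python
def equal_interval_limits(values, n_classes):
    """Return a (lower, upper) limit pair for each equal-interval class."""
    low, high = min(values), max(values)
    width = (high - low) / n_classes  # class interval (width)
    limits = []
    for i in range(n_classes):
        lower = low + i * width
        # Use the data maximum for the last class to avoid rounding drift.
        upper = high if i == n_classes - 1 else low + (i + 1) * width
        limits.append((lower, upper))
    return limits


def assign_class(value, limits):
    """Return the 0-based class index of a value (assumed within the data range)."""
    for i, (_, upper) in enumerate(limits):
        # A value on a shared limit goes to the higher class (assumption);
        # the last class also includes its upper limit.
        if value < upper or i == len(limits) - 1:
            return i


limits = equal_interval_limits([0, 10, 20, 30, 40, 50, 60], 6)
print(limits)
print(assign_class(10, limits), assign_class(60, limits))
```

Because the width depends only on the minimum and maximum, a single outlier can leave several classes empty, which is the usual criticism of this method.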
B. Quantiles
• In the quantiles method of classification, data are rank
ordered and the same number of observations is placed
in each class. The name depends on the number of classes:
• Quartiles (four classes)
• Quintiles (five classes)
• Sextiles (six classes)
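A minimal Python sketch of the quantiles method is shown below. The function name `quantile_classes` is hypothetical. The sketch assumes that when the observations cannot be divided evenly, class sizes may differ by one, and it makes no attempt to keep tied values in the same class.

```python
def quantile_classes(values, n_classes):
    """Rank-order the data and split it into classes of (nearly) equal size."""
    ranked = sorted(values)
    n = len(ranked)
    classes = []
    for i in range(n_classes):
        # Integer arithmetic keeps the class boundaries deterministic.
        start = i * n // n_classes
        end = (i + 1) * n // n_classes
        classes.append(ranked[start:end])
    return classes


# Quartiles of twelve observations: three observations per class.
print(quantile_classes([7, 3, 12, 1, 9, 5, 11, 2, 8, 4, 10, 6], 4))
```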
C. Mean–Standard Deviation
• The mean–standard deviation method is one of several
classification techniques that consider how data are
distributed along the number line. In this method, classes
are formed by repeatedly adding the standard deviation to,
or subtracting it from, the mean of the data.
Distribution
• If the data are normally distributed (or nearly so), the mean serves
as a useful dividing point, enabling a contrast of values above
and below it.

• For our six-class map, Calculated Limits are computed using the
mean and standard deviation values listed under Normal
Distribution Limits.

• For a five-class map, the two middle classes could be combined,
and the mean would fall in the middle of the resulting class.
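As a rough illustration, class limits for this method can be computed as below. The function name `mean_sd_breaks` is hypothetical, and the use of the population standard deviation (`statistics.pstdev`) is an assumption; the slides do not say which form is intended. With breaks at the mean and at one and two standard deviations on either side, the five limits define six classes.

```python
import statistics


def mean_sd_breaks(values, n_sd=2):
    """Return class limits at mean + k * sd for k = -n_sd .. n_sd."""
    mean = statistics.mean(values)
    sd = statistics.pstdev(values)  # population SD (assumption)
    return [mean + k * sd for k in range(-n_sd, n_sd + 1)]


# Five limits, hence six classes (the outer two are open-ended).
print(mean_sd_breaks([2, 4, 4, 4, 5, 5, 7, 9]))
```

Dropping the middle limit (the mean itself) merges the two middle classes and gives the five-class variant described above.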
D. Natural Breaks
• In natural breaks classification, graphs (e.g., a dispersion
graph or histogram) are examined visually to determine
logical breaks (or, alternatively, clusters) in the data.
• The goal is to minimize differences between data values in
the same class and to maximize differences between classes.
E. Optimal Classification
• The optimal classification method is a solution to the
subjectivity of natural breaks. The optimal method
places similar data values in the same class by
minimizing an objective measure of classification error.
Jenks–Caspall algorithm
• The Jenks–Caspall algorithm, developed by George Jenks
and Fred Caspall (1971), is an empirical solution to the
problem of determining optimal classes.
• Here we assume that the goal is to minimize the total map
error, measured as the sum of absolute deviations about class
medians (ADCM).
Fisher–Jenks algorithm
• The Fisher–Jenks algorithm has a mathematical
foundation that guarantees an optimal solution. Walter
Fisher (1958) was responsible for developing the
mathematical foundation, and George Jenks (1977)
introduced the idea to cartographers—hence the term
Fisher-Jenks algorithm.
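The idea of a guaranteed optimum can be illustrated with a small dynamic-programming sketch. This is not Fisher's or Jenks's published procedure, only an illustration of exhaustive optimal partitioning of sorted data. It assumes the error measure is the sum of absolute deviations about class medians, and the names `class_error` and `optimal_classes` are hypothetical. The simple triple loop is practical only for small data sets.

```python
import statistics


def class_error(members):
    """Sum of absolute deviations about the class median."""
    med = statistics.median(members)
    return sum(abs(v - med) for v in members)


def optimal_classes(values, n_classes):
    """Split sorted data into contiguous classes with minimum total error."""
    data = sorted(values)
    n = len(data)
    if not 1 <= n_classes <= n:
        raise ValueError("n_classes must be between 1 and len(values)")
    inf = float("inf")
    # best[k][j]: minimum error when the first j values form k classes.
    best = [[inf] * (n + 1) for _ in range(n_classes + 1)]
    split = [[0] * (n + 1) for _ in range(n_classes + 1)]
    best[0][0] = 0.0
    for k in range(1, n_classes + 1):
        for j in range(k, n + 1):
            for i in range(k - 1, j):  # the last class is data[i:j]
                err = best[k - 1][i] + class_error(data[i:j])
                if err < best[k][j]:
                    best[k][j] = err
                    split[k][j] = i
    # Walk the stored split points backwards to recover the classes.
    classes, j = [], n
    for k in range(n_classes, 0, -1):
        i = split[k][j]
        classes.append(data[i:j])
        j = i
    classes.reverse()
    return classes, best[n_classes][n]


classes, total_error = optimal_classes([1, 2, 3, 10, 11, 12, 50, 51, 52], 3)
print(classes, total_error)
```

Because every contiguous split is considered, the result is the best possible for the chosen error measure, unlike an empirical search that can stop at a local optimum.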
Optimal Classification (Median)
• ADCM is the sum of absolute deviations about class medians
for a particular number of classes, and ADAM is the sum of
absolute deviations about the median for the entire data set.
• The goodness of absolute deviation fit (GADF) compares the
two: GADF = 1 − (ADCM / ADAM).
• GADF ranges from 0 to 1, with 0 representing the lowest
accuracy (a one-class map) and 1 representing the highest
accuracy.
• An analogous measure, known as the goodness of variance fit
(GVF), can be computed when the mean is used as the measure
of central tendency (and the error in a class is the sum of
squared deviations about the mean).
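A short Python sketch of this accuracy measure follows, using the standard definition GADF = 1 − ADCM / ADAM. The function names are hypothetical, and the sketch assumes the data are not all identical (otherwise ADAM is zero and the ratio is undefined).

```python
import statistics


def abs_dev_about_median(values):
    """Sum of absolute deviations about the median of the values."""
    med = statistics.median(values)
    return sum(abs(v - med) for v in values)


def gadf(classes):
    """Goodness of absolute deviation fit for a list of classes."""
    all_values = [v for members in classes for v in members]
    adcm = sum(abs_dev_about_median(members) for members in classes)
    adam = abs_dev_about_median(all_values)
    return 1 - adcm / adam


print(gadf([[1, 2, 3], [10, 11, 12], [50, 51, 52]]))  # close to 1
print(gadf([[1, 2, 3, 10]]))  # a one-class map scores 0
```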
F. Head/Tail Breaks
• Bin Jiang (2013) has developed a new data classification
method known as head/tail breaks, which Jiang argues is
appropriate for data that are heavy-tailed (they have a
strong positive skew).
• The method uses the mean of the data to divide it into a
head (values above the mean) and a tail (values below it),
and repeats the division on the head as long as the data
above the mean remain heavy-tailed.
• In addition to automatically determining an appropriate
number of classes, head/tail breaks is unique in the sense
that each lower-valued class should be viewed as a
background for higher-valued classes.
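The recursion can be sketched in Python as below. The function name `head_tail_breaks` and the parameter `head_limit` are hypothetical. The stopping rule used here, that the head must hold no more than 40% of the current values for the division to continue, is an assumption based on the threshold commonly quoted for this method rather than something stated in these slides.

```python
def head_tail_breaks(values, head_limit=0.4):
    """Return class breaks found by repeatedly splitting the data at its mean."""
    breaks = []
    data = list(values)
    while len(data) > 1:
        mean = sum(data) / len(data)
        head = [v for v in data if v > mean]
        if not head:  # all remaining values are identical
            break
        breaks.append(mean)
        # Stop once the head is no longer a small minority (assumed 40% rule).
        if len(head) / len(data) > head_limit:
            break
        data = head
    return breaks


# Strongly skewed data: two breaks, hence three classes.
print(head_tail_breaks([1, 1, 1, 1, 1, 1, 2, 2, 2, 5, 5, 20]))
```

The number of classes is one more than the number of breaks returned, so it follows from the data rather than being chosen in advance.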
Criteria
CONSIDERING THE SPATIAL DISTRIBUTION OF THE DATA
