Organisaton of Data
Organisaton of Data
Chapter - 3
ORGANIZATION OF DATA
Characteristics of classification:
1. Homogeneity
2.Suitability
3. Clarity
4. Flexibility
5. Diversification
Basis of classification:
Raw data can be classified as:
1. Chronological classification: In such a classification data are classified
either in ascending or in descending order with reference to time such as
years, quarters, months weeks etc.
2. Geographical/Spatial classification: The data are classified with
reference to geographical location/place such as countries, states , cities,
districts, block etc.
3. Qualitative classification: Data are classified with reference to
descriptive characteristics like sex, caste, religion literacy etc.
4. Quantitative classification: Data are classified on the basis of some
measurable characteristics such as height, age, weight, income, marks of
students.
5. conditional classification: When data are classified with respect to
condition, the type of classification is called conditional classification.
A mass of data in its original form is called raw data. It is an unorganized
mass of various items.
A characteristic which is capable of being measured and changes its value
overtime is called a variable. It is of two types.
(a) Discrete
(b) Continuous
The class mid-point or class mark is the middle value of a class. It lies
halfway between the lower class limit and the upper class limit of a class
and can be ascertained in the following manner.
Class Limits:- There are two ends of a class. The lowest value is called
lower class limit and highest value is called upper class limit.
The classes, by the exclusive method is formed in such a way that the
upper class limit of one class equals the lower class limit of the next class.
eg 0-10, 10-20.
In comparison to the exclusive method, the inclusive method does not
excludes the upper class limit in a class interval. It includes the upper class
in a class. Thus both class limits are parts of the class intervals e.g., 0-9,
10-19.
The classification of data as a frequency distribution has an inherent short
coming. While it summarizes the raw data making it concise and
comprehensible. It does not show the details that are found in raw data. So
there is a loss of information in classifying raw data.
Classification of data implies conversion of raw data in to statistical series.
The difference between
Univariate and Bivariate Frequency distribution
Univariate Frequency
Basis Bivariate Frequency distribution
distribution
When data is classified on
when data is classified on the basis
the basis of single
of two variables, the distribution
Meaning variable,the distribution is
is known as bivariate frequency
known as univariate
distribution.
frequency distribution.
Alternate
One-way frequency Two-way frequency
Name
Height of students in a Height and weight of students in a
Example
class class
Individual series are those series in which the items are listed singly. For
example:
Marks Frequency
10-20 4
20-30 5
30-40 8
40-50 5
50-60 4
60-70 3