Organisation of Data Notes
Organisation of Data Notes
● It condenses the raw data into a form suitable for statistical analysis.
Basis of Classification
Simple manifold
Class 11th - Statistics - Organisation of Data
Basis of Classification:
● Geographical (or Spatial) Classification:
This classification of data is based on the geographical or locational
differences of the data.
Punjab 30
Haryana 20
UP 25
Class 11th - Statistics - Organisation of Data
Basis of Classification:
● Chronological Classification:
When data are classified on the basis of time, it is known as
chronological classification.
Year Sales(₹)
2016 80 lakh
2017 90 lakh
2018 95 lakh
Class 11th - Statistics - Organisation of Data
Basis of Classification:
Qualitative Classification: In qualitative classification,data is classified on
the basis of descriptive characterstics or on the basis of attributes like
sex,literacy,region,caste,education,etc. Which cannot be quantified.
Simple Classification: When facts are classified into two classes according
to one attribute only, then the classification is said to be simple.
Population
Males females
10-12 150
12-14 130
14-16 100
16-18 120
Example:
Variable
0 5
1 7
2 5
3 3
Total children 20
Class 11th - Statistics - Organisation of Data
Continuous Variable:
● Those variables which can take all the possible values(integral
as well as fractional) in a given specified range are termed as
continuous variables.
40-45 22
45-50 6
50-55 9
55-60 10
60-65 3
Total Students 50
Class 11th - Statistics - Organisation of Data
Differentiate Between Discrete Variable v/s Continuous Variable:
0 5
1 7
2 5
3 3
Total children 20
Class 11th - Statistics - Organisation of Data
Frequency Distribution:
● Continuous Frequency Distribution:
40-45 22
45-50 6
50-55 9
55-60 10
60-65 3
Total Students 50
Class 11th - Statistics - Organisation of Data
Statistical Series:
● The arrangement of classified data in some logical order, like
according to the size, according to the time of occurence or
according to some other measureable or non-measurable
characteristics, is known as Statistical Series.
Spatial Condition
Time-Series Individual Series Frequency series
Series Series
Class 11th - Statistics - Organisation of Data
On the Basis of Characteristics:
● Time Series:
If the different values that a variable has taken in a period of time are
arranged in a chronological order,the series so obtained is called a
time series.
● When the investigator has collected the data and he has not
arranged the same in a systematic manner,it is called raw data
or unorganized series.
35 40 38 17 25
45 36 29 42 22
Class 11th - Statistics - Organisation of Data
Organized Individual Series:
● Organized series is an orderly arrangement of raw data.
For Example:
1 35
2 40
3 38
4 17
5 25
6 45
7 36
8 29
9 42
10 22
Class 11th - Statistics - Organisation of Data
Organized Individual Series:
According to order of magnitude:
17 45
22 42
25 40
29 38
35 36
36 35
38 29
40 25
42 22
45 17
Class 11th - Statistics - Organisation of Data
Frequency Series:
● Frequency Series may be of two types:
● Frequency Distribution
Class 11th - Statistics - Organisation of Data
Important Terms:
● Frequency : Frequency is the number of times an item occurs(or
repeats itself) in the series.
● The classes should be clearly defined and should not lead to any
confusion.
For Example: Mid-point of class 10-20 will be: Mid-Point = 10+20 =15
2
Class 11th - Statistics - Organisation of Data
Important Terms:
Frequency Distribution:
Inclusive Series
(Classes of type 10-19,20-29,etc)
Mid-Value Series
(Middle values of a class-interval are given)
Class 11th - Statistics - Organisation of Data
Exclusive Series:
● Exclusive series is that series in which every class interval
excludes items corresponding to its upper limit.
● In this series the upper limit of one interval is the lower limit of
the next class interval.
10-15 4
15-20 5
20-25 8
25-30 5
30-35 4
Total 26
Class 11th - Statistics - Organisation of Data
Inclusive Series:
● An inclusive Series is that series which includes all items upto its
upper limit.
● In such series, the upper limit of class interval does not repeat
itself as a lower limit of the next class interval.
● There is a gap between the upper limit of a class interval and the
lower limit of the next class interval.
Marks Frequency
10-14 4
15-19 5
20-24 8
25-29 5
30-34 4
Total =26
Class 11th - Statistics - Organisation of Data
Conversion of Inclusive Series into Exclusive Series:
● Sometimes, it becomes necessary to have exclusive classes to
apply some statistical tools.If the given classes are inclusive
type,then they need to be transformed into exclusive classes.
● This procedure fills up the gap between two classes and thereby
we get the exclusive classes .The upper and lower class limits of
the new ‘exclusive type’ classes as called class boundaries.
Class 11th - Statistics - Organisation of Data
Conversion of Inclusive Series into Exclusive Series:
Ques. Convert the following inclusive series into exclusive series.
10-19 4
20-29 2
30-39 12
40-49 10
50-59 9
60-69 3
Total 40
Class 11th - Statistics - Organisation of Data
Conversion of Inclusive Series into Exclusive Series:
Exclusive Series
Ans. The difference between the upper limit of
a class -interval and lower limit of the next Marks No. of Students(f)
class-interval is 1.
9.5-19.5 4
So,0.5(i.e.half of 1) will be subtracted from the
lower limits of all the classes and added to the 19.5-29.5 2
upper limits.
29.5-39.5 12
39.5-49.5 10
49.5-59.5 9
59.5-69.5 3
Total 40
Class 11th - Statistics - Organisation of Data
Conversion of Inclusive Series into Exclusive Series:
Ques. Following marks were obtained by 25 students of a class in
mathematics paper carrying 50 marks.
19 13 12 25 32 12 31 10 21 23 27 41 29
30 45 39 33 40 17 11 20 26 14 41 15
Total Students 25
Class 11th - Statistics - Organisation of Data
Conversion of Inclusive Series into Exclusive Series:
Ans. (ii)Frequency Distribution by Exclusive Method
10-20 |||||||| 9
20-30 |||| || 7
30-40 |||| 5
40-50 |||| 4
Total Students 25
Class 11th - Statistics - Organisation of Data
Difference Between Exclusive Method and Inclusive Method:
1 The upper limit of a class-interval is Both the limits of a class-interval is counted in the
counted in the next immediate class. same class
2 The upper limit of a class -interval and The upper limit of a class-interval and lower limit
the lower limit of next class are the of next class are different .The difference is
same generally of one.
Marks Frequency
Below 5 1
5-10 3
10-15 4
15-20 6
20 and above 1
Class 11th - Statistics - Organisation of Data
Open - End Distribution:
● If possible,open-end classes should be avoided because:
Less than 10 12
10-20 15
20-30 4
30-40 16
Above 40 3
Total 50
Lower limit of first class = 0 and upper limit of last class =50
Class 11th - Statistics - Organisation of Data
Conversion of open-end series into a continuous series
● Case2: Uniform Trend of class width of known class - intervals:
If the given class-intervals(except the first and last class-interval) are
showing a uniform trend, then we can determine the lower limit of
first class and upper limit of the last class after adjusting it with the
trend.
Class 11th - Statistics - Organisation of Data
Conversion of open-end series into a continuous series
Example: Classes Frequency
Less than 10 10
10-30 7
30-60 6
60-100 9
Above 100 8
Total 40
Lower limit of first class = 0 and upper limit of last class = 150.
Class 11th - Statistics - Organisation of Data
Conversion of open-end series into a continuous series
● Case3: Different class width of known class-intervals:
If class-intervals are of different width,then lower limit of the first
class and upper limit of the last class cannot be determined.
Class 11th - Statistics - Organisation of Data
Conversion of open-end series into a continuous series
Example: Classes Frequency
Less than 10 12
10-30 15
30-60 4
60-80 11
80-85 5
Above 85 3
Total 50
No.of students 2 5 10 12 17 4
Class 11th - Statistics - Organisation of Data
‘Less than’ Cumulative Frequency Distribution:
Marks 0-10 10-20 20-30 30-40 40-50 50-60
No.of students 2 5 10 12 17 4
Less than 10 2
No.of students 2 5 10 12 17 4
More than 50 4
Class 11th - Statistics - Organisation of Data
Equal and Unequal class-interval Series:
● Equal class-interval series:
When the classes in a series are of the same interval, it is called the
equal class-interval series. This is illustrated in the following table:
10-20 7
20-30 4
30-40 3
40-50 6
50-60 3
Total 23
Class 11th - Statistics - Organisation of Data
Equal and Unequal class-interval Series:
● Unequal class-interval series:
When the class-interval are not equal,it is called unequal class-interval series.It is
shown in the following table:
10-20 6
20-40 15
40-70 12
70-80 4
80-110 3
Total 30
Class 11th - Statistics - Organisation of Data
Mid-values Series:
● Mid-values or mid-point is the middle value of a class -interval.When
such mid-values are given,it is called mid-values series.
Mid-value 5 15 25 35 45
Frequency 4 8 10 6 2
Total 30
Class 11th - Statistics - Organisation of Data
Numericals of Continuous Series:
Illustration 1.
In a survey, it was found that 50 families bought milk in the following
quantities in a particular month.Classify the following data in an
individual,discrete,continuous and cumulative frequency series
19 25 12 21 20
11 7 9 11 14
20 15 22 15 17
10 23 18 11 19
25 22 5 18 17
11 6 21 24 26
8 5 9 20 23
16 22 22 23 13
16 11 16 22 17
23 22 10 7 21
Class 11th - Statistics - Organisation of Data
Numericals of Continuous Series:
Solution: Individual Series: In the Ascending order of the items
5 11 16 20 22
5 11 16 20 22
6 11 16 20 23
7 11 17 21 23
7 11 17 21 23
8 12 17 22 23
9 13 18 22 24
9 14 18 22 25
10 15 19 22 25
10 15 19 22 26
Class 11th - Statistics - Organisation of Data
Numericals of Continuous Series:
Solution: Discrete Series: In the Ascending order of the items
Quality of Tally Bars Frequency Quality of Tally Bars Frequency
Milk Milk
5 || 2 16 ||| 3
6 | 1 17 ||| 3
7 || 2 18 || 2
8 | 1 19 || 2
9 || 2 20 ||| 3
10 || 2 21 ||| 3
11 |||| 5 22 |||| | 6
12 | 1 23 |||| 4
13 | 1 24 | 1
14 | 1 25 || 2
15 || 2 26 | 1
Class 11th - Statistics - Organisation of Data
Numericals of Continuous Series:
Solution: Frequency Distribution
25-30 ||| 3
∑f=50
Class 11th - Statistics - Organisation of Data
Numericals of Continuous Series:
Solution: Cumulative Frequency Distribution
16 |||| |||| 9
17 |||| 5
18 ||| 3
19 | 1
∑f=25
Class 11th - Statistics - Organisation of Data
Numericals of Continuous Series:
Illustration 3:
100-150 ,150-200,200-250,250-300,300-350.
100-150 |||| 4
150-200 |||| | 6
250-300 |||| 5
300-350 || 2
∑f=30
Class 11th - Statistics - Organisation of Data
Numericals of Continuous Series:
(ii) Number of households spending less than 250 per month = 4+6+13 =23
Percentage of households spending less than 250 per month = 23/30 x 100=76.6%
Percentage of households spending more than 200 per month = 20/30 x 100=66.6%
Class 11th - Statistics - Organisation of Data
Numericals of Continuous Series:
Illustrations 4:
Convert the following cumulative frequency distribution into a
simple frequency distribution.
Marks Number of Students
More than 0 55
More than 5 51
More than 10 43
More than 15 28
More than 20 16
More than 25 6
More than 30 0
Class 11th - Statistics - Organisation of Data
Numericals of Continuous Series:
Solution: Conversion of Cumulative Frequency Distribution into simple Frequency Distribution
0-5 55-51=4
5-10 51-43=8
10-15 43-28=15
15-20 28-16=12
20-25 16-6=10
25-30 6-0=6
30-35 0
Class 11th - Statistics - Organisation of Data
Numericals of Continuous Series:
Illustration 5:
Convert the following inclusive series into exclusive series: Marks Frequency
19-24 2
25-29 7
30-34 6
35-39 2
40-44 3
Class 11th - Statistics - Organisation of Data
Numericals of Continuous Series:
Solution: Conversion of Inclusive Series into Exclusive Series
Marks Frequency
18.5-24.5 2
24.5-29.5 7
29.5-34.5 6
34.5-39.5 2
39.5-44.5 3
Class 11th - Statistics - Organisation of Data
Numericals of Continuous Series:
Illustration 6:
The monthly salary of some families in a society is given below:
10-20 6
20-30 9
30-40 12
40-50 17
50-60 11
60-70 5
Class 11th - Statistics - Organisation of Data
Numericals of Continuous Series:
(i)What is the lower limit of the second class interval?
5 6 8 10 11 13 6 8 5 13 8 10 3
18 6 8 5 16 11 8 5 8 5 8 6
Class 11th - Statistics - Organisation of Data
Numericals of Continuous Series:
Solution: The lowest values is 3 and highest value is 18. We have to
take a class interval of 4.The various classes will be 0-4,4-8 and so on
up to 16-20.
Marks Tally Bars Frequency
0-4 | 1
12-16 || 2
16-20 || 2
∑f=25
Class 11th - Statistics - Organisation of Data
Points to be Considered for Constructing Continuous Series:
● Number of classes should neither be too small nor too
large,preferably being between 5 and 15(though there is no hard
and fast rule).
● Normally , a class-interval should be a multiple of 5, as it is easy
to grasp number like 5,10,15 etc.
● All class-intervals should be equal in size.
● It is convenient to have lower limit of a class either equal to zero
or some multiple of 5.
● If possible,open-end classes should be avoided.
● Prefer to classify the data into an exclusive series(unless stated
otherwise).
● Make three columns to classify the data(1st one of variable, 2nd
for tally marks and 3rd for frequency).
Class 11th - Statistics - Organisation of Data
Bivariate Frequency Distribution:
● When the data is classified on the basis of two variables such as
height and weight, marks in statistics and economics etc, the
distribution is known as Bivariate frequency Distribution or
Two-way Frequency distribution.
Marks in 28 26 27 25 28 28 27 27 26 27 27 26 25 26 27
Accounts
Marks in 22 21 21 20 22 21 21 20 20 19 21 19 19 20 21
Economics
Class 11th - Statistics - Organisation of Data
Bivariate Frequency Distribution:
Solution:
Marks in Accounts(X) 25 26 27 28 Total
Marks in Economics(Y)
22 _ _ _ II(2) 2
Total 2 4 6 3 15
Class 11th - Statistics - Organisation of Data
Bivariate Frequency Distribution:
● The Following data represent the marks in English (X) and
Maths(Y) of 10 students. Prepare a two-way frequency
distribution with class-interval of 15-25, 25-35,35-45 and so on.
Marks
English(X) Maths(Y)
35 56
28 42
30 40
18 30
40 60
45 62
27 18
36 52
52 64
38 60
Class 11th - Statistics - Organisation of Data
Bivariate Frequency Distribution:
Solution: Marks in English(X) 15-25 25-35 35-45 45-55 Total
Marks in Maths(Y)
15-25 _ I(1) _ _ 1
25-35 I(2) _ _ _ 1
35-45 _ II(2) _ _ 2
45-55 _ _ I(1) _ 1
Total 1 3 4 2 10
Class 11th - Statistics - Organisation of Data
Univariate Frequency Distribution v/s Bivariate Frequency Distribution:
Meaning When data is classified on the basis When the data is classified on the basis
of single variable, the distribution is of two variables, the distribution is
known as univariate frequency known as Bivariate frequency
distribution. distribution.
Purpose It aims to make description about the It aims to determine the empirical
particular variable relationship between the two
variables.