Stat Unit 1
Stat Unit 1
STATISTICS
INTRODUCTION
Dr. Soya Mathew
Unit – 1 Portions
Need for quantifying data; Quantitative and qualitative data;
Frequency distributions – discrete and continuous; Histogram,
Frequency curve, Cumulative frequency curve; Measures of central
tendency – Mean, Median and Mode; Measures of Dispersion –
quartile deviation, standard deviation, coefficient of variation;
Skewness – Karl Pearson and Bowley’s formula;
Kurtosis(definition).
What is Statistics ?
Data Science
Data mining
Big data
Artificial intelligence
Image Analysis
Descriptive Statistics
Descriptive statistics include graphical and numerical procedures
that are used to summarize data and to transform data into information.
For example,
etc…
Variable:
The looks of people, their intelligence and aptitude for art and music change
from one individual to the other, they cannot be measured numerically but is
only expressed qualitatively as good, excellent or average etc…
For example,
𝑓 𝑓1 𝑓2 𝑓3 ….. 𝑓𝑛
𝑓 𝑓1 𝑓2 ….. 𝑓𝑛
Frequency Distribution
Quantitative Data collected from surveys, reports, interviews
etc… are summarized using Frequency distribution. A systematic
presentation of the values taken by a variable and the corresponding
frequency is called frequency distribution.
A tabular presentation of the frequency distribution is called
the frequency table.
There are two types of frequency distribution:
Discrete (or Ungrouped) Frequency Distribution and
Continuous (or Grouped) Frequency Distribution.
Example for Discrete (or Ungrouped) Frequency Distribution:
For example: In a quiz, the marks obtained by 20 students out of
30 are given as: 12, 15, 15, 29, 30, 21, 30, 30, 15, 17, 19, 15, 20, 20,
16, 21, 23, 24, 23, 21
But if the range of values assumed by the variable is vast, then continuous
frequency distribution is more convenient.
For example: distribution of marks in a class of 40 students
The first column of the table represents the Marks (class No. of students
interval) (Frequency)
10 – 19 5 10 – 20 5
20 – 29 4 20 – 30 4
30 – 39 1 30 – 40 1
40 – 49 1 40 – 50 1
To ensure the continuity and to get correct class limits, exclusive
method of classification should be adopted. To convert inclusive class
intervals to exclusive, we need to do some adjustment:
Adjusted Class Interval
In the above example,
9.5 – 19.5 5
20−19
Adjustment Factor is = 0.5 19.5 – 29.5
2 4
10 – 14 6 6
15 – 19 11 17
20 – 24 12 29
25 – 29 10 39
Note:
Frequency distribution of a single variable is called Univariate
Frequency Distribution.
40 - 44 2 1 - - 3
45 - 49 2 4 2 1 9
50 - 54 - 3 5 1 9
55 - 59 - 2 5 4 11
60 - 64 - 1 3 4 8
Total 4 11 15 10 40
In a bivariate distribution, if one of the variables is ignored and the distribution
of other variable alone is considered, then it is called Marginal Frequency
Distribution
Marginal Frequency Distribution Marginal Frequency Distribution
of Weight of Height
Weight (Kgs.) Number of Students Height(cms) Number of Students
140 - 150 4
40 - 44 3
45 - 49 9 150 - 160 11
50 - 54 9 160 - 170 15
55 - 59 11
170 - 180 10
60 - 64 8
Total 40
Total 40
In a bivariate distribution, the distribution of one of the variables subject to
a condition imposed on the other variable is considered is called
Conditional Frequency Distribution.
Conditional Frequency Distribution of weight given that height is in between
150 and 160 cms.
Weight (Kgs.) Number of Students
40 - 44 1
45 - 49 4
50 - 54 3
55 - 59 2
60 - 64 1
Total 11
The following data give the total number of iPods sold by a mail order company on
each of 30 days:
8, 25, 11, 15, 29, 22, 10, 5, 17, 21, 22, 13, 26, 16, 18, 12, 9, 26, 20, 16, 23, 14, 19, 23,
20, 16, 27, 16, 21, 14
iPods Sold Tally Marks Frequency
So 𝑅𝑎𝑛𝑔𝑒 = 29 − 5 = 24 15 – 20
25 – 30
The Frequency Distribution Table for the Data on iPods Sold
MEASURES
OF
CENTRAL TENDENCIES
CENTRAL TENDENCY:
In a frequency distribution the values cluster around a central
values. This property of concentration of the values around a central
value is called central tendency.
Median
Mode
Arithmetic Mean (A.M.):
The ‘Arithmetic Mean’ (or simply ‘mean’) represented by
𝑥ഥ is a most common measure of central tendency. The mean is a
common measure in which all the values play an equal role and is
used to obtain summary measure.
σ 𝑥𝑖
⟹ 𝑥ҧ =
𝑛
10 – 20 15 10 150
Solution:
20 – 30 25 25 625
σ 𝑓𝑖 𝑥𝑖 30 – 40 35 30 1050
𝑥ҧ =
𝑁 40 – 50 45 20 900
50 – 60 55 10 550
4000
⟹ 𝑥ҧ = 60 – 70 65 5 325
110
70 – 80 75 5 375
⟹ 𝑥ҧ = 36.36 marks Total N = 110 4000
2. The table below show the age of 55 patients selected to study the effectiveness of a particular
medicine. Calculate the mean age of the patients.
Age 0 – 10 10 – 20 20 – 30 30 – 40 40 – 50 50 – 60 60 – 70
Solution: 0 – 10 5 5 25
10 – 20 15 7 105
σ 𝑓𝑖 𝑥𝑖
𝑥ҧ = 20 – 30
𝑁 25 17 425
1765 30 – 40 35 12 420
⟹ 𝑥ҧ =
55 40 – 50 45 5 225
⟹ 𝑥ҧ = 32.09 50 – 60 55 2 110
60 – 70 65 7 455
Total N = 55 1765
3. Calculate the arithmetic mean for the following data:
Temperature (°𝑪) −𝟒𝟎 − (−𝟑𝟎) −𝟑𝟎 − (−𝟐𝟎) −𝟐𝟎 − (−𝟏𝟎) −𝟏𝟎 − 𝟎 0 – 10 10 – 20 20 – 30
No of Days 10 28 30 42 65 180 10
1565
⟹ 𝑥ҧ = −𝟏𝟎 − 𝟎 −5 42 −𝟐𝟏𝟎
365
0 – 10 5 65 325
⟹ 𝑥ҧ = 4.29 °𝐶
10 – 20 15 180 2700
20 – 30 25 10 250
0 – 10 5 150 – 140 = 10 50
σ 𝑓𝑖 𝑥𝑖
𝑥ҧ = 10 – 20 15 140 – 100 = 40 600
𝑁
20 – 30 25 100 – 80 = 20 500
5890
⟹ 𝑥ҧ = 30 – 40 35 80 – 80 = 0 0
150
40 – 50 45 80 – 70 = 10 450
⟹ 𝑥ҧ = 39.266
50 – 60 55 70 – 30 = 40 2200
70 – 80 75 14 – 0 = 14 1050
continuous series is 39.27
Total N = 150 5890
5. The following table shows the age of workers in a factory. Find out the average age
of workers.
Age 20 – 29 30 – 39 40 – 49 50 – 59 60 – 69
No of
Workers 10 8 6 4 2
Age Mid Point No. of 𝑓𝑥
(Class 𝑥 Workers (𝑓)
Solution: Interval)
20 – 29 24.5 10 245
σ 𝑓𝑖 𝑥𝑖
𝑥ҧ = 30 – 39 34.5 8 276
𝑁
Frequency 20 15 10 ? 2
20 – 30 15
frequency 30 – 40 10
40 – 50 𝑚
Given 𝑥ҧ = 25.4
50 – 60 2
Total
σ 𝑓𝑖 𝑥𝑖
𝑥ҧ =
𝑁 Class Mid Point Frequency 𝑓𝑥
Interval 𝑥 (𝑓)
1135+45𝑚
⟹ 25.4 = 10 – 20 15 20 300
47+𝑚
20 – 30 25 15 375
⟹ 25.4 47 + 𝑚 = 1135 + 45𝑚
30 – 40 35 10 350
⟹ 1193.8 + 25.4𝑚 = 1135 + 45𝑚
40 – 50 45 𝑚 45𝑚
Frequency 17 𝑚 32 𝑛 19 120
0 – 20 17
Given 𝑥ҧ = 50 20 – 40 𝑚
40 – 60 32
We have
60 – 80 𝑛
80 – 100 19
σ 𝑓𝑖 𝑥𝑖
𝑥ҧ = Total 120
𝑁
Class Mid Point Frequency (𝑓) 𝑓𝑥
3480+ 30𝑚 + 70𝑛
⟹ 50 = Interval 𝑥
68 + 𝑚 + 𝑛
⟹ 50(68 + 𝑚 + 𝑛) = 3480+ 30𝑚 + 70𝑛 0 – 20 10 17 170
⟹ 𝑚 − 𝑛 = 4 … … (1) 40 – 60 50 32 1600
Also, 68+𝑚+𝑛=120
60 – 80 70 𝑛 70 𝑛
⟹ 𝑚 + 𝑛 = 52 … … (2)
𝑚 = 28
Total 68 + 𝑚 + 𝑛 = 120 3480 + 30𝑚 + 70𝑛
𝑛 = 24
8. The average age of 20 students in a class is 16 years. One student whose age is 18 years has left the class.
Find out average age of rest of the students.
Solution:
Given, 𝑥ҧ = 16 and 𝑁 = 20
σ 𝑥𝑖
We have 𝑥ҧ =
𝑁
σ 𝑥𝑖
⟹ 16 =
20
⟹ σ 𝑥𝑖 = 320
302
∴ average age of 19 students = = 15.89 years
19
9. The mean marks got by 300 students in statistics was 45. The mean of the top 100 of them was found to be 70 and the
mean of the last 100 was known to be 20 . Find the mean of the remaining 100 students.
Solution:
σ 𝑥𝑖
We have 𝑥ҧ = 𝑁
4500
∴ Mean mark of remaining 100 students = = 45
100
10. The mean of a group of 100 observations is known to be 50. Later it was discovered that
two observations were misread as 92 and 8 instead of 192 and 88. Find the correct mean.
Solution:
σ 𝑥𝑖
We have 𝑥ҧ =
𝑁
5180
∴ Corrected mean = = 51.80
100
Weighted Arithmetic Mean:
Usually in computing Arithmetic Mean, equal importance is given to
all the observations of the data. However there are cases where all the items
are not of equal importance. In other words some items of a series are more
important as compared to the other items in the same series.
Def: Weighted mean is the mean of a set of values wherein each value or
measurement has a different weight or degree of importance. The following
is its formula:
σ 𝑥𝑊
𝑥ҧ = σ𝑊
Solution:
Here the weights associated to the observations 82, 86, 90 and 70 are 3, 5, 3 and 1
𝑥 : 82 86 90 70
𝑤:3 5 3 1
σ 𝑥𝑊
Average Score, 𝑥ҧ = σ𝑊
(82) (3)+(86)(5)+(90)(3)+(70)(1)
⟹ 𝑥ҧ =
3+5+3+1
246+430+270+70
⟹ 𝑥ҧ =
12
1016
⟹ 𝑥ҧ =
12
⟹ 𝑥ҧ = 84.67
2. A contractor employs three types of workers, male, female and children. To a male he
pays Rs.40 per days. To a female worker Rs.32 per day and to a child worker Rs.15 per
day. The number of male, female and children workers employed is 20, 15 and 15
respectively. Find out the average amount.
Solution:
Daily Wages in No: of Workers (𝑊) 𝑥𝑾
Rs. (𝑥)
Calculation of Weighted A.M:
40 20 800
σ 𝑥𝑊
𝑥ҧ = σ𝑊 32 15 480
1505 15 15 225
⟹ 𝑥ҧ =
50
Total 50 1505
⟹ 𝑥ҧ = 30.10 Rs.
Combined Arithmetic Mean:
If we have arithmetic mean, and the number of items of two or
more than two related groups , we can calculate the combined average of
these groups by applying the formula.
𝑛1 𝑥ҧ 1 +𝑛2 𝑥ҧ 2 +⋯+𝑛𝑛 𝑥ҧ 𝑛
𝑥ҧ 1,2…,𝑛 =
𝑛1 +𝑛2 +⋯+𝑛𝑛
Solution:
Given 𝑛1 =80, 𝑥ҧ 1 =10, 𝑛2 =20, 𝑥ҧ 2 =15
𝑛1 𝑥ҧ 1 +𝑛2 𝑥ҧ 2
We have, 𝑥ҧ 1,2 =
𝑛1 +𝑛2
80)(10 + 20)(15
⟹ 𝑥ҧ 1,2 =
80+20
800+300
⟹ 𝑥ҧ 1,2 =
100
1100
⟹ 𝑥ҧ 1,2 = = 11 years
100
.
2. The mean marks got by 300 students in the subject of statistics was 45. The mean of the top 100 of them
was found to be 70 and the mean of the last 100 was known to be 20. What is the mean marks of the
remaining students?
Solution:
𝑛1 𝑥ҧ 1 + 𝑛2 𝑥ҧ 2 + 𝑛3 𝑥ҧ 3
We have, 𝑥ҧ 1,2,3 =
𝑛1 + 𝑛2 + 𝑛2
⟹ 100 𝑥 = 4500
⟹ 𝑥 = 45
Merits of Arithmetic Mean:
1. It is rigidly defined.
2. It is easy to understand and easy to calculate.
3. It is based upon all the observations.
4. It is capable of further statistical analysis.
5. It is more stable than any other average.
Demerits of Arithmetic Mean:
1. It cannot be determined by inspection nor it can be located graphically.
2. Arithmetic mean cannot be used if we are dealing with qualitative
characteristics(such as, intelligence, honesty, beauty, etc.)
3. Arithmetic mean cannot be obtained if a single observation is missing
or lost or is illegible unless we drop it out and compute the arithmetic
mean of the remaining values.
4. Arithmetic mean is affected very much by extreme values.
MEDIAN:
Median is known as the positional average. Median divides a series
of ordered observations (ascending or descending) so that the number of
observations above is equal to the number of observations below it. The
median is used when an exact midpoint is desired.
• When the number of items in the series is odd then the median is the
mid-point of the ordered values.
• When the number of items in the series is even then the median is
average of two middle numbers of the ordered values.
Example
Case 2: If N is even then there are two central terms and the mean of
these two values gives the median.
Problems:
1. Obtain the value of median from the following data: 273, 115, 83, 415, 565, 382, 31, 45, 182.
Solution:
Arranging the data in the ascending order: 31, 45, 83, 115, 182, 273, 382, 415, 565.
Here 𝑁 = 9, odd
𝑁+1 th
∴ Median = term
2
9+1 th
= term
2
= 5th term
= 182
2. Find the value of the median from the following 8, 20, 50, 25, 15, 30
Solution:
Arranging the data in the ascending order : 8, 15, 20, 25, 30, 50
Here 𝑁 = 6, even
20+25
∴ Median =
2
45
=
2
= 22.5
3. Rahna’s maths quiz scores in 9 competitions were 88, 97, 87, 92, 90, 88, 93, 98, 95.
What was her median quiz score ?
Solution:
Arranging the data in the ascending order : 87, 88, 88, 90, 92, 93, 95, 97, 98
Here 𝑁 = 9, odd
𝑁+1 th
∴ Median = term
2
9+1 th
= term = 5th term
2
= 92
Solution:
Arranging the data in the ascending order :
Here 𝑁 = 8, even
No. of Persons: 16 24 26 30 20 6
Here N=122
𝑀𝑒𝑑𝑖𝑎𝑛 = 𝑜𝑏𝑠𝑒𝑟𝑣𝑎𝑡𝑖𝑜𝑛 𝑓𝑜𝑟 50 16 16
𝑁
𝑤ℎ𝑖𝑐ℎ 𝑐𝑓 𝑖𝑠 ≥ 100 24 40
2
150 26 66
⟹ 𝑀𝑒𝑑𝑖𝑎𝑛 = 𝑜𝑏𝑠𝑒𝑟𝑣𝑎𝑡𝑖𝑜𝑛 𝑓𝑜𝑟
122 175 30 96
𝑤ℎ𝑖𝑐ℎ 𝑐𝑓 𝑖𝑠 ≥
2
200 20 116
⟹ 𝑀𝑒𝑑𝑖𝑎𝑛 = 𝑜𝑏𝑠𝑒𝑟𝑣𝑎𝑡𝑖𝑜𝑛 𝑓𝑜𝑟
250 6 122
𝑤ℎ𝑖𝑐ℎ 𝑐𝑓 𝑖𝑠 ≥ 61
𝑁 = 122
⟹ 𝑀𝑒𝑑𝑖𝑎𝑛 = 150
2. Calculate the value of median from the following data:
Income
1,200 1,800 5,000 2,500 3,000 1,600 3,500
(₹)
No. of
12 16 2 10 3 15 7
Persons
1600 15 27
𝑀𝑒𝑑𝑖𝑎𝑛 = 𝑜𝑏𝑠𝑒𝑟𝑣𝑎𝑡𝑖𝑜𝑛 𝑓𝑜𝑟
1800 16 43
𝑁
𝑤ℎ𝑖𝑐ℎ 𝑐𝑓 𝑖𝑠 ≥ 2500 10 53
2
3000 3 56
⟹ 𝑀𝑒𝑑𝑖𝑎𝑛 = 𝑜𝑏𝑠𝑒𝑟𝑣𝑎𝑡𝑖𝑜𝑛 𝑓𝑜𝑟
3500 7 63
65
𝑤ℎ𝑖𝑐ℎ 𝑐𝑓 𝑖𝑠 ≥ 5000 2 65
2
⟹ 𝑀𝑒𝑑𝑖𝑎𝑛 = 1800 𝑁 = 65
Median in Continuous Series
1. Find the cumulative frequency for each class.
𝑁
2. Find the median class. It is the class interval for which cumulative frequency is ≥ .
2
3. To calculate median , the following formula is used
𝒉 𝑵
𝑴𝒆𝒅𝒊𝒂𝒏 = 𝑳 + −𝒄
𝒇 𝟐
where 𝑁 = σ 𝑓,
𝐿 = lower limit of median class,
h = width of the median class,
𝑓 = frequency of the median class,
𝑐 = cumulative frequency of the class preceeding the median class
Problems:
1. Calculate median from the following information.
Marks: 0 – 10 10 – 20 20 – 30 30 – 40 40 – 50
No. of Students: 44 60 36 44 8
Solution:
Marks No. of Students Cumulative Frequency
𝑁 192
Here = = 96𝑡ℎ 𝑡𝑒𝑟𝑚
2 2
0 – 10 44 44
20 – 30 36 140
96 falls in the class interval 10 – 20.
30 – 40 44 184
Therefore 10 – 20 is the median 40 – 50 8 192
𝑁 = 192
class.
ℎ 𝑁
𝑀𝑒𝑑𝑖𝑎𝑛 = 𝐿 + −𝑐 Marks No. of Cumulative Frequency
𝑓 2
Students
10
⟹ 𝑀𝑒𝑑𝑖𝑎𝑛 = 10 + 96 − 44
60 0 – 10 44 44
52 10 – 20 60 104
⟹ 𝑀𝑒𝑑𝑖𝑎𝑛 = 10 +
6
20 – 30 36 140
⟹ 𝑀𝑒𝑑𝑖𝑎𝑛 = 10 + 8.67
30 – 40 44 184
𝑁 = 192
2. Calculate median from the following data:
Group 60 – 64 65 – 69 70 – 74 75 – 79 80 – 84 85 – 89
Frequency 1 5 9 12 7 2
Solution:
Group Frequency Adjusted Class Cumulative
Interval Frequency
𝑁 36
Here = = 18𝑡ℎ 𝑡𝑒𝑟𝑚
2 2 60 – 64 1 59.5 – 64.5 1
65 – 69 5 64.5 – 69.5 6
The cumulative frequency value
70 – 74 9 69.5 – 74.5 15
18 falls in the class interval
75 – 79 12 74.5 – 79.5 27
85 – 89 2 84.5 – 89.5 36
∴ 74.5 – 79.5 is the median class.
ℎ 𝑁 Group Frequency Adjusted Class Cumulative
𝑀𝑒𝑑𝑖𝑎𝑛 = 𝐿 + −𝑐 Interval Frequency
𝑓 2
5
⟹ 𝑀𝑒𝑑𝑖𝑎𝑛 = 74.5 + 18 − 15 60 – 64 1 59.5 – 64.5 1
12
5 65 – 69 5 64.5 – 69.5 6
⟹ 𝑀𝑒𝑑𝑖𝑎𝑛 = 74.5 +
4
70 – 74 9 69.5 – 74.5 15
⟹ 𝑀𝑒𝑑𝑖𝑎𝑛 = 74.5 + 1.25
75 – 79 12 74.5 – 79.5 27
85 – 89 2 84.5 – 89.5 36
3. From the following data find out missing frequency if the median is 50.
Marks: 10 – 20 20 – 30 30 – 40 40 – 50 50 – 60 60 – 70
No. of 2 8 6 ? 15 10
Students:
Solution:
Marks No. of Students Cumulative Frequency
20 – 30 8 10
Median = 50 30 – 40 6 16
40 – 50 𝑚 16 + 𝑚
∴ Median Class = 50 – 60
50 – 60 15 31 + 𝑚
60 – 70 10 41 + 𝑚
ℎ 𝑁 Marks No. of Students Cumulative Frequency
𝑀𝑒𝑑𝑖𝑎𝑛 = 𝐿 + −𝑐
𝑓 2
10 – 20 2 2
10 41+𝑚
⟹ 50 = 50 + − 16 + 𝑚
15 2 20 – 30 8 10
10 41 + 𝑚−32−2𝑚
⟹ 0= ( ) 30 – 40 6 16
15 2
9−𝑚 40 – 50 𝑚 16 + 𝑚
⟹ 0=
2
50 – 60 15 31 + 𝑚
⟹ 9 − 𝑚 =0
60 – 70 10 41 + 𝑚
⟹ 𝑚=9
4. Calculate the missing frequency from the following distribution, it is
being given that median of the distribution is 24.
Age in years 0 – 10 10 – 20 20 – 30 30 – 40 40 – 50
No. of persons 5 25 ? 18 7
Solution:
Age in No. of Persons Cumulative Frequency
years
Let 𝑚 be the missing frequency. 0 – 10 5 5
10 – 20 25 30
Median = 24
20 – 30 𝑚 30 + 𝑚
∴ Median Class = 20 – 30 30 – 40 18 48 + 𝑚
40 – 50 7 55 + 𝑚
ℎ 𝑁
𝑀𝑒𝑑𝑖𝑎𝑛 = 𝐿 + −𝑐 Age in No. of Persons Cumulative Frequency
𝑓 2 years
10 55+𝑚 0 – 10
⟹ 24 = 20 + − 30 5 5
𝑚 2
10 55 + 𝑚 − 60 10 – 20 25 30
⟹ 4=
𝑚 2
20 – 30 𝑚 30 + 𝑚
⟹ 8𝑚 = 10 𝑚 − 5
⟹ 8𝑚 = 10𝑚 − 50 30 – 40 18 48 + 𝑚
⟹ 2m = 50
40 – 50 7 55 + 𝑚
⟹ m = 25
5. The median of the following frequency distribution is 56. Find the
missing frequencies.
Class Interval 0 – 20 20 – 40 40 – 60 60 – 80 80 – 100 100 – 120 Total
Frequency 16 𝒎 25 16 𝒏 10 90
∴ Median Class = 40 – 60 40 – 60 25 41 + 𝑚
60 – 80 16 57 + 𝑚
80 – 100 𝑛 57 + 𝑚 + 𝑛
100 – 120 10 67 + 𝑚 + 𝑛
ℎ 𝑁 Class Frequency Cumulative Frequency
𝑀𝑒𝑑𝑖𝑎𝑛 = 𝐿 + −𝑐
𝑓 2 Interval
20 67+𝑚+𝑛
⟹ 56 = 40 + − 16 + 𝑚 0 – 20 16 16
25 2
4 67 + 𝑚 + 𝑛 − 32 − 2𝑚
⟹ 16 = 20 – 40 𝑚 16 + 𝑚
5 2
⟹ 40 = 35 − 𝑚 + 𝑛 40 – 60 25 41 + 𝑚
⟹ 𝑛 − 𝑚 = 5 … … (1)
60 – 80 16 57 + 𝑚
Also 67 + 𝑚 + 𝑛 = 90
80 – 100 𝑛 57 + 𝑚 + 𝑛
⟹ 𝑚 + 𝑛 = 23 … … (2)
𝑚=9
𝑛 = 14
6. Find the missing frequencies in the following distribution. It is given that the
median of the distribution is 41 and the total number of observations is 82.
Class Interval 10 – 20 20 – 30 30 – 40 40 – 50 50 – 60 60 – 70 Total
Frequency 10 𝒎 15 20 𝒏 11 82
∴ Median Class = 40 – 50 30 – 40 15 25 + 𝑚
40 – 50 20 45 + 𝑚
50 – 60 𝑛 45 + 𝑚 + 𝑛
60 – 70 11 56 + 𝑚 + 𝑛
ℎ 𝑁
𝑀𝑒𝑑𝑖𝑎𝑛 = 𝐿 + −𝑐 Class Frequency Cumulative Frequency
𝑓 2 Interval
10 56+𝑚+𝑛
⟹ 41 = 40 + − 25 + 𝑚 10 – 20 10 10
20 2
1 56 + 𝑚 + 𝑛 −50 − 2𝑚
⟹ 1= 20 – 30 𝑚 10 + 𝑚
2 2
⟹ 4=6−𝑚+𝑛 30 – 40 15 25 + 𝑚
⟹ 𝑚 − 𝑛 = 2 … … (1)
40 – 50 20 45 + 𝑚
Also 56 + 𝑚 + 𝑛 = 82
⟹ 𝑚 + 𝑛 = 26 … … (2) 50 – 60 𝑛 45 + 𝑚 + 𝑛
1. Determine the value of median from the following data with the help of
'less than' and 'More than' Ogive Method. (2021)
10 – 20 6 20 8 10 100
20 – 30 8 30 14 20 94
30 – 40 12 40 26 30 86
40 – 50 18 50 44 40 74
50 – 60 25 60 69 50 56
60 – 70 16 70 85 60 31
70 – 80 8 80 93 70 15
80 – 90 5 90 98 80 7
90 – 2 100 100 90 2
100
2. Determine the value of median from the following data with the help of
'less than' and 'More than' Ogive Method.
Marks: 0 – 10 10 – 20 20 – 30 30 – 40 40 – 50 50 – 60
No. of 10 15 25 30 10 10
Students:
Marks No. of Upper Less Lowe Greater
Students Limit than r than
Ogive Limit Ogive
0 – 10 10 10 10 0 100
10 – 20 15 20 25 10 90
20 – 30 25 30 50 20 75
30 – 40 30 40 80 30 50
40 – 50 10 50 90 40 20
50 – 60 10 60 100 50 10
3. From the following data prepare frequency distribution table with class
intervals 2 – 2.5, 2.5 – 3 , 3 – 3.5, 3.5 – 4, 4 – 4.5, 4.5 – 5. Also locate the
median graphically.
3.7, 3.4, 4.1, 4.0, 3.7, 4.7, 3.3, 2.4, 3.1, 4.2, 3.8, 3.6, 4.2, 4.3, 2.9, 3.6,
3.3, 4.8, 4.0, 3.9, 3.5, 3.8, 3.8, 4.2, 3.9, 4.9, 3.2, 4.0, 3.8, 3.2, 2.7, 3.4,
3.3, 3.0, 3.1, 3.5, 3.7, 3.9, 4.3, 3.8, 3.7, 3.0, 4.4, 4.1, 3.6, 3.7, 3.3, 3.5,
3.7, 3.0, 2.9, 3.1, 3.3, 4.2.
Class Frequ Upper Less Lowe Greater
Interv ency Limit than r than
al Ogive Limit Ogive
2 – 2.5 1 2.5 1 2 54
2.5 – 3 3 3 4 2.5 53
3 – 3.5 15 3.5 19 3 50
3.5 – 4 20 4 39 3.5 35
4 – 4.5 12 4.5 51 4 15
4.5 – 5 3 5 54 4.5 3
𝑁 54
Here = = 27𝑡ℎ 𝑡𝑒𝑟𝑚
2 2
ℎ 𝑁
𝑀𝑒𝑑𝑖𝑎𝑛 = 𝐿 + −𝑐
𝑓 2
0.5
⟹ 𝑀𝑒𝑑𝑖𝑎𝑛 = 3.5 + 27 − 19
20
4
⟹ 𝑀𝑒𝑑𝑖𝑎𝑛 = 3.5 +
20
⟹ 𝑀𝑒𝑑𝑖𝑎𝑛 = 3.7
Merits and Demerits of Median:
Merits of Median:
Demerits of Median:
Example: 3, 3, 4, 5, 5, 5, 7
Mode = 5
Bimodal Data
A series of numbers is known to be bimodal if it has two
Modes.
Example: 105, 105, 105, 107, 108, 109, 109, 109, 110, 112
The ages of six persons who participated in an interview were 20, 21, 21,
24, 25, 24, 21 and 27 years. Find the mode of the data.
Solution:
∴ Mode = 21 years.
Example 3:
Solution:
Here the data has two values, 74 and 76, which appears 3 times.
All the other values appear less than 3 times. So the data set has two distinct
modes 74 and 76.
Mode in Discrete Series
Example: The following distribution shows the sizes of shirts sold on a textile
shop in Bangalore on a month. Calculate the mode.
No of shirts sold : 15 22 31 30 20
𝒇𝟏 − 𝒇𝟎
𝑴𝒐𝒅𝒆 = 𝑳 + 𝒉
𝟐𝒇𝟏 − 𝒇𝟎 − 𝒇𝟐
where,
𝐿 = lower limit of modal class,
𝑓1 = frequency of the modal class,
𝑓0 = frequency of the class preceeding the modal class
𝑓2 = frequency of the class succeeding the modal class
h = width of the modal class,
Problems:
1. Compute the mode of the following distribution.
Class: 0–7 7 – 14 14 – 21 21 – 28 28 – 35 35 – 42 42 – 49
Frequency: 19 25 36 72 51 43 28
7 – 14 25
lies in the class-interval 21 – 28.
14 – 21 36
28 – 35 51
35 – 42 73
42 – 49 28
𝑓1 − 𝑓0
𝑀𝑜𝑑𝑒 = 𝐿 + ℎ Class Frequency
2𝑓1 − 𝑓0 − 𝑓2
72 −36 0–7 19
⟹ 𝑀𝑜𝑑𝑒 = 21 + 7
2 72 −36 − 51
7 – 14 25
36
⟹ 𝑀𝑜𝑑𝑒 = 21 + 7 14 – 21 36
144 − 87
21 – 28 72
36
⟹ 𝑀𝑜𝑑𝑒 = 21 + 7 28 – 35
57 51
⟹ 𝑀𝑜𝑑𝑒 = 21 + 4.42 35 – 42 73
42 – 49 28
⟹ 𝑀𝑜𝑑𝑒 = 25.42
2. The production per day of a company (in Tons) on 60 days are given
Number of days: 7 13 22 10 8
Solution:
27 – 28 26.5 – 28.5 10
29 – 30 28.5 – 30.5 8
𝑓1 − 𝑓0
Mode = 𝐿 + ℎ Class Adjusted Frequency
2𝑓1 − 𝑓0 − 𝑓2
Class
Interval
22 −13
⟹ Mode = 24.5 + 2 21 – 22 20.5 – 22.5 7
2 22 −13 − 10
9 23 – 24 22.5 – 24.5 13
⟹ Mode = 24.5 + 2
44 −23
25 – 26 24.5 – 26.5 22
9
⟹ Mode = 24.5 + 2
21
27 – 28 26.5 – 28.5 10
No. of Students: 3 5 8 ? 3
40 – 50 3
𝑓1 − 𝑓0 Marks No. of Students
Mode = 𝐿 + ℎ
2𝑓1 − 𝑓0 − 𝑓2
0 – 10 3
8 −5
⟹ 26 = 20 + 10
2 8 −5−𝑚 10 – 20 5
3 20 – 30 8
⟹ 6= 10
11 − 𝑚
30 – 40 𝑚
⟹ 11 − 𝑚 = 5
40 – 50 3
⟹ 𝑚=6
4. Calculate the missing frequencies from the following data, if mode is
given as 24.
Class Interval: 0 – 10 10 – 20 20 – 30 30 – 40 40 – 50 Total
Frequency: 14 𝒎 27 𝒏 15 100
⟹ 3𝑚 − 2𝑛 = 27
20 – 30 27 41 + 𝑚
Also 56 + 𝑚 + 𝑛 = 100
30 – 40 𝑛 41 + 𝑚 + 𝑛
⟹ 𝑚 + 𝑛 = 44
Solving (1) and (2), we get
40 – 50 15 56 + 𝑚 + 𝑛
𝑚 = 23
𝑛 = 21
Merits and Demerits of Mode:
Merits of Mode:
Demerits of Mode:
(i) Mode is ill-defined. It is not always possible to find a clearly defined mode.
The importance of this relation is that we can estimate the value of any
one of them by knowing the values of the other two. However, this relationship
the area on the left of mode is greater than area on the right then
But for symmetrical distribution of mean, median and mode are identical.
Example:
1. If Z=400, and 𝑥=300
ҧ find median?
Solution:
⟹ 3𝑀𝑒𝑑𝑖𝑎𝑛 = 1000
⟹ 𝑀𝑒𝑑𝑖𝑎𝑛 = 333.33
Example:
2. For a given data the mean= 36.67 and median = 39.23 . Hence find mode.
Solution:
⟹ 𝑀𝑜𝑑𝑒 = 44.35
Locate Mode Graphically:
frequency.
frequency.
Class Intervals of Equal Width:
The variables is taken along the x – axis and class frequency along y –axis.
With class intervals as bases, rectangles with height proportional to class frequency
are drawn.
this case, the rectangles are drawn with height proportional to frequency density
should be drawn.
Step 3: Join diagonally the upper end points of the highest bar to the end points
of the adjacent bars.
Step 6: The point where the perpendicular meets the 𝑥 − 𝑎𝑥𝑖𝑠 gives the modal value.
Problems:
Age 0 – 10 10 – 20 20 – 30 30 – 40 40 – 50 50 – 60
No. of People 12 18 27 20 17 6
𝑓1 − 𝑓0
Mode = 𝐿 + ℎ Class Frequency
2𝑓1 − 𝑓0 − 𝑓2
27 −18 0 – 10 12
⟹ Mode = 20 + 10
2 27 −18−20
10 – 20 18
9
⟹ Mode = 200 + 10 20 – 30 27
54 −38
9 30 – 40 20
⟹ Mode = 20 + 10
16
40 – 50 17
⟹ Mode = 20 + 5.625
50 – 60 6
⟹ Mode = 25.63
Solution:
Mode = 25.6
Problems:
12 50 – 59 49.5 – 59.5 16
⟹ Mode = 39.5 + 10
26
60 – 69 59.5 – 69.5 6
⟹ Mode = 39.5 + 4.62
70 – 79 69.5 – 79.5 8
⟹ Mode = 44.12
Mode = 44.12
Problems:
3. The weights in grams of 50 apples picked at random from a market are as follows:
131, 113, 82, 75, 204, 81, 84, 118, 104, 110, 80, 107, 111, 141, 136, 123, 90,
78, 90, 115, 110, 98, 106, 99, 107, 84, 76, 186, 82, 100, 109, 128, 115, 107, 115,
119, 93, 187, 139, 129, 130, 68, 195, 123, 125, 111, 92, 86, 70, 126.
Form the grouped frequency table by dividing the variate range into intervals
of equal width, each corresponding to 20 gms in such a way that the mid-value
5. From the following data prepare frequency distribution table and hence
determine the mean , median and mode.
0, 1, 2, 3, 4, 5, 6, 6, 5, 4, 4, 5, 5, 4, 4, 3, 3, 3, 3, 2, 2, 2, 3, 2, 3, 2, 2, 2, 1, 1, 1, 0, 0, 1, 0.
Solution: 𝒙 𝒇 𝒇𝒙
σ 𝑓𝑖 𝑥𝑖
𝑥ҧ = 0 4 0
𝑁
94 1 5 5
⟹ 𝑥ҧ = 2 8 16
35
⟹ 𝑥ҧ = 2.686 3 7 21
4 5 20
⟹ 𝑀𝑒𝑎𝑛 = 2.686 5 4 20
Also, 𝑀𝑒𝑑𝑖𝑎𝑛 = 3 6 2 12
𝑀𝑜𝑑𝑒 = 2 Total 35 94