Chapter 03
Mean of first n Natural Numbers :
The first n natural numbers are 1, 2, ..., n.
The mean,
$$\bar{x} = \frac{1 + 2 + \cdots + n}{n} = \frac{1}{n} \cdot \frac{n(n+1)}{2} = \frac{n+1}{2}$$
$$\bar{x} = \frac{n+1}{2}$$

Advantages of Arithmetic mean:
• It is rigidly defined.
• It is easy to calculate.
• It is based upon all the observations.
• It is suitable for further algebraic treatment.
• It is less affected by sampling fluctuations.

Disadvantages of Arithmetic mean:
• It is affected very much by extreme values.
• It cannot be calculated if the extreme class is open.
• It is not suitable for extremely skewed distribution.
• It cannot be used if we are dealing with qualitative characteristics, such as intelligence, honesty, beauty, etc.
• It cannot be obtained if a single observation is missing or lost.

Example 3.1 : The daily wages of a group of farm workers are shown in the following frequency distribution.

Daily wages (Tk.)    Number of workers
50-55                  5
55-60                 10
60-65                 25
65-70                 35
70-75                 15
75-80                  7
80-85                  3

Computation of arithmetic mean by direct and indirect method.
Direct Method :

Daily wages (Tk.)    Number of workers fi    Mid value xi    fixi
50-55                  5                     52.5             262.5
55-60                 10                     57.5             575.0
60-65                 25                     62.5            1562.5
65-70                 35                     67.5            2362.5
70-75                 15                     72.5            1087.5
75-80                  7                     77.5             542.5
80-85                  3                     82.5             247.5
Total                100                                     6640.0
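The direct-method table of Example 3.1 can be checked with a few lines of Python. This is a minimal sketch, not the book's own working: the function names are hypothetical, and `grouped_mean_shortcut` shows the usual assumed-mean form of an indirect method, chosen here as an assumption because the chapter's indirect-method working is not reproduced in this text.

```python
# Example 3.1 data: class limits (Tk.) and number of workers per class.
classes = [(50, 55), (55, 60), (60, 65), (65, 70), (70, 75), (75, 80), (80, 85)]
freqs = [5, 10, 25, 35, 15, 7, 3]


def grouped_mean(classes, freqs):
    """Direct method: mean = sum(f_i * x_i) / N, where x_i is the class mid value."""
    mids = [(lo + hi) / 2 for lo, hi in classes]
    return sum(f * x for f, x in zip(freqs, mids)) / sum(freqs)


def grouped_mean_shortcut(classes, freqs, assumed_mean):
    """Assumed-mean variant (an assumption, see lead-in): A + sum(f_i * (x_i - A)) / N."""
    mids = [(lo + hi) / 2 for lo, hi in classes]
    deviation_sum = sum(f * (x - assumed_mean) for f, x in zip(freqs, mids))
    return assumed_mean + deviation_sum / sum(freqs)


direct = grouped_mean(classes, freqs)                  # 6640.0 / 100
shortcut = grouped_mean_shortcut(classes, freqs, 67.5)  # A = 67.5 is an arbitrary choice

# Mean of the first n natural numbers equals (n + 1) / 2.
n = 10
first_n_mean = sum(range(1, n + 1)) / n
```

Both functions should agree up to floating-point rounding whatever value of the assumed mean A is chosen, which is what makes the short-cut form convenient for hand calculation.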
44 Methods of Statistics Central Tendency and Its Measures 45
$$\bar{x} = \frac{1}{N} \sum f_i x_i = \frac{1}{100}(6640.0) = \text{Tk. } 66.40$$
Average daily wage is Tk. 66.40.

Indirect Method :

$$GM = \text{Antilog}\left[ \frac{1}{n} \sum_{i=1}^{n} \log x_i \right]$$

Class    fi    Mid value xi    log xi     fi log xi
5-10     2      7.5            0.87506    1.75012
10-15    4     12.5            1.09691    4.38764
15-20    2     17.5            1.24304    2.48608
20-25    1     22.5            1.35218    1.35218

Σfi = N = 10, Σ fi log xi = 10.37396

$$GM = \text{Antilog}\left[ \frac{1}{N} \sum f_i \log x_i \right] = \text{Antilog}\left[ \frac{1}{10}(10.37396) \right] = \text{Antilog}(1.037396) = 10.9 \text{ (Approx.)}$$

The average rate of change of yield of the new variety of wheat is 10.9%.

In case of frequency distribution, when f1, f2, ..., fn are the frequencies of x1, x2, ..., xn respectively, then
$$\frac{1}{HM} = \frac{1}{N} \sum_{i=1}^{n} \frac{f_i}{x_i}$$

Advantages of Harmonic Mean :
• It is rigidly defined.
• It is based upon all the observations.
• Sampling fluctuation is less.
• It is not affected much by extreme values.

Disadvantages of Harmonic Mean :
• It cannot be computed when there is any zero value in the series.
• It is not easily understood and is difficult to compute.
• It is very complex for further algebraic treatment.
• It cannot be computed if the extreme classes of the frequency distribution are open.
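The grouped formulas for the geometric and harmonic means can be sketched in Python as follows. The function names are hypothetical, logarithms are taken to base 10 so that the antilog is a power of 10, and the Example 3.1 wage table is reused purely as illustrative input (the chapter itself does not compute GM or HM for that table).

```python
import math


def grouped_gm(values, freqs):
    """GM = Antilog[(1/N) * sum(f_i * log10(x_i))]; every x_i must be positive."""
    N = sum(freqs)
    mean_log = sum(f * math.log10(x) for f, x in zip(freqs, values)) / N
    return 10 ** mean_log


def grouped_hm(values, freqs):
    """1/HM = (1/N) * sum(f_i / x_i); fails if any x_i is zero."""
    N = sum(freqs)
    return N / sum(f / x for f, x in zip(freqs, values))


# Illustrative input: mid values and frequencies of the Example 3.1 wage table.
mids = [52.5, 57.5, 62.5, 67.5, 72.5, 77.5, 82.5]
freqs = [5, 10, 25, 35, 15, 7, 3]

am = sum(f * x for f, x in zip(freqs, mids)) / sum(freqs)
gm = grouped_gm(mids, freqs)
hm = grouped_hm(mids, freqs)
print(am, gm, hm)
```

For positive data that are not all equal, the three printed values should come out in decreasing order, with the arithmetic mean largest and the harmonic mean smallest.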
and $\left( \frac{d_i}{A} \right)^2 \cdot \frac{1}{(1 + d_i/A)^2}$ is a positive quantity.
$\log G = \log A + 0 -$ a positive quantity
$\log A = \log G +$ a positive quantity
$\log A \ge \log G$
$A \ge G$ ............................................... (2)

For computing median from frequency distribution we first need to identify the median class (the class which contains the median). If the total frequency is N, the class having cumulative frequency equal to, or immediately higher than, N/2 will be the median class.
For frequency distribution, the formula for computing the median is
$$Me = L_m + \frac{\frac{N}{2} - F'_m}{f_m} \times h$$
where,
Lm = lower limit of the median class
N = total frequency
fm = frequency of the median class
F'm = cumulative frequency of the pre-median class
h = length of median class.

Advantages of Median :
• It is rigidly defined.
• It is easily understood and easy to compute.
• It is not influenced by extreme items.
• It can be calculated for distributions with open classes.
• It can be used in defining the median of attributes.

Disadvantages of Median :
• It is not based upon all the observations.
• It is not suitable for further algebraic treatment.
• It is much affected by sampling fluctuation.

Uses of Median :
• It is used in case of both quantitative and qualitative data.
• It is used for calculating the typical value in problems concerning wages, distribution of wealth, etc.

3.5 Quantiles : Quantiles also are some positional or location measures of the distribution. Quantiles are those values in a series which divide the whole distribution into a number of equal parts when the series is arranged in order of magnitude of observations. The following are the quantiles that are used in Statistics -
3 quartiles : Qi (i = 1, 2, 3); divide the whole distribution into four equal parts.
9 Deciles : Dj (j = 1, 2, ..., 9); divide the whole distribution into 10 equal parts.
99 Percentiles : Pk (k = 1, 2, ..., 99); divide the whole distribution into 100 equal parts.

Computation of quantiles from frequency distribution is very much similar to that of median. We first need to identify the corresponding quantile class. The classes having cumulative frequencies equal to or immediately higher than iN/4, jN/10 and kN/100 are respectively the ith quartile class, the jth decile class and the kth percentile class.

For frequency distributions the quantiles are computed as -
$$Q_i = L_i + \frac{\frac{iN}{4} - F'_i}{f_i} \times h ; \quad i = 1, 2, 3$$
$$D_j = L_j + \frac{\frac{jN}{10} - F'_j}{f_j} \times h ; \quad j = 1, 2, \ldots, 9$$
$$P_k = L_k + \frac{\frac{kN}{100} - F'_k}{f_k} \times h ; \quad k = 1, 2, \ldots, 99$$
i, j, k indicate the order of quartiles, deciles and percentiles respectively; F'i, F'j and F'k are respectively the cumulative frequencies of the class preceding the ith quartile, jth decile and kth percentile classes; h is the corresponding class interval.

It may be mentioned that
Q2 = D5 = P50 = Me ; Q1 = P25 ; Q3 = P75 ; D6 = P60 etc.
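The median and the other quantiles share one formula, so a single helper can cover all of them. The sketch below is an illustration under stated assumptions: the function name `grouped_quantile` is hypothetical, all classes are taken to have the same length h, and the Example 3.1 wage table serves as input.

```python
from itertools import accumulate


def grouped_quantile(lower_limits, freqs, h, i, parts):
    """i-th of `parts` partition values: L + (i*N/parts - F') / f * h."""
    N = sum(freqs)
    target = i * N / parts
    cum = list(accumulate(freqs))
    # Quantile class: first class whose cumulative frequency is equal to
    # or immediately higher than the target.
    idx = next(k for k, c in enumerate(cum) if c >= target)
    F_prev = cum[idx - 1] if idx > 0 else 0  # cumulative frequency of the preceding class
    return lower_limits[idx] + (target - F_prev) / freqs[idx] * h


# Example 3.1: lower class limits (Tk.), frequencies, common class length.
lower_limits = [50, 55, 60, 65, 70, 75, 80]
freqs = [5, 10, 25, 35, 15, 7, 3]
h = 5

median = grouped_quantile(lower_limits, freqs, h, 1, 2)    # 65 + (50 - 40)/35 * 5
q1 = grouped_quantile(lower_limits, freqs, h, 1, 4)        # 60 + (25 - 15)/25 * 5
q3 = grouped_quantile(lower_limits, freqs, h, 3, 4)        # 65 + (75 - 40)/35 * 5
d4 = grouped_quantile(lower_limits, freqs, h, 4, 10)       # 60 + (40 - 15)/25 * 5
p80 = grouped_quantile(lower_limits, freqs, h, 80, 100)    # 70 + (80 - 75)/15 * 5
```

Passing the order and the number of parts as integers keeps the target iN/parts exact, which matters when a cumulative frequency is exactly equal to the target (as for Q3 and D4 here).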
Graphical Location of Median and Quantiles :
Median, quartiles, deciles and percentiles can be located from an ogive; the necessary steps are briefly discussed below :
i) An ogive is drawn and the positions on the Y-axis are marked for different partition values (e.g., N/2 for median, N/4 for 1st quartile, 4N/10 for 4th decile etc.)
ii) From the corresponding points on the Y-axis, a line parallel to the X-axis is drawn which intersects the ogive at a certain point.
iii) From the corresponding point of intersection mentioned above, a perpendicular is drawn on the X-axis; the foot of the perpendicular is the desired partition value. The whole process is illustrated in figure 3.1 below :

[Fig. 3.1: ogive with cumulative frequency on the Y-axis and upper limits of class interval on the X-axis; the levels N/4, N/2 and 3N/4 locate Q1, Me and Q3.]

frequent value of a distribution. In the case of frequency distribution, mode is given by
$$Mo = L + \frac{f_0 - f_1}{2f_0 - f_1 - f_2} \times h$$
where, L = lower limit of modal class
f0 = frequency of modal class
f1 = frequency of pre-modal class
f2 = frequency of post-modal class
h = length of modal class
[The class which corresponds to the maximum frequency is the modal class]

Advantages of Mode :
• It is easy to understand and easy to calculate.
• It is not affected by extreme values.
• It can be located graphically.

Disadvantages of Mode :
• It is not rigidly defined - a distribution may have more than one mode.
• It is not based upon all the observations.
• It is not suitable for further algebraic treatment.

Uses of Mode :
• Mode is used to find the ideal size, e.g., in business forecasting, in meteorological forecasts of weather conditions, in the manufacture of ready-made garments, shoes, etc.
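The grouped-data mode formula can be sketched in Python as below. The function name `grouped_mode` is hypothetical. Two choices in the sketch are assumptions not stated in the text: a missing neighbouring class (when the modal class is the first or last one) is treated as having frequency 0, and when several classes tie for the maximum frequency the first one is used. The Example 3.1 wage table is the input.

```python
def grouped_mode(lower_limits, freqs, h):
    """Mo = L + (f0 - f1) / (2*f0 - f1 - f2) * h for the modal class."""
    # Modal class: the class with the maximum frequency (first one on ties).
    m = max(range(len(freqs)), key=lambda k: freqs[k])
    f0 = freqs[m]
    f1 = freqs[m - 1] if m > 0 else 0               # pre-modal class (assumed 0 if absent)
    f2 = freqs[m + 1] if m < len(freqs) - 1 else 0  # post-modal class (assumed 0 if absent)
    denominator = 2 * f0 - f1 - f2
    if denominator == 0:
        raise ValueError("formula undefined: modal class equals both neighbours")
    return lower_limits[m] + (f0 - f1) / denominator * h


# Example 3.1: modal class is 65-70 with f0 = 35, f1 = 25, f2 = 15.
lower_limits = [50, 55, 60, 65, 70, 75, 80]
freqs = [5, 10, 25, 35, 15, 7, 3]
mode = grouped_mode(lower_limits, freqs, 5)  # 65 + (35 - 25)/(70 - 25 - 15) * 5
```

The explicit check on the denominator reflects the first disadvantage listed above: when the modal class does not stand out from its neighbours, the formula gives no usable answer.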
[Fig. 3.2: Location of mode from frequency curve (frequency against mid-values of class intervals; Mo marked on the X-axis).]
[Fig. 3.3: Location of mode from the histogram (frequency against class intervals; Mo marked on the X-axis).]
[Figure: ogive, cumulative frequency against upper limits of class interval, with located values D4 = 83.0, Me = 86.5, P70 = 94.0 and Q3 = 97.0.]

ii) Location of mode from histogram.
[Figure: histogram, frequency against class interval, with located value Mo = 89.0.]

Yield (x)           2.8  3.0  3.2  3.4  3.6  3.8  4.0  4.2  4.4  4.6  4.8  5.0
No. of plots (f)      4   15   20   47   63   88   59   35   15   18   10    5

Compute arithmetic mean, geometric mean and harmonic mean and verify the relationship A.M. ≥ G.M. ≥ H.M.

2. The frequency distribution below gives the cost of production of sugar in different holdings :

Cost (Tk.)    No. of holdings
10-14         11
14-18         27
18-22         42
22-26         45
26-30         35
30-34         30
34-38         20
38-42         15

Compute :
(a) A. M., G. M. and H. M.
(b) Median, Q1, D4, Q3 and P80.
(c) Mode
(d) Draw a histogram and locate the mode.
(e) Draw a frequency curve and locate the mode.
(f) Draw an ogive and locate median, Q1, D4, Q3 and P80.

3. A set of 20 observations gives arithmetic mean 45 units and another set of 30 observations has arithmetic mean 60. The two sets are combined; find the arithmetic mean of the combined set.
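For exercise 3, the mean of a combined set is the size-weighted average of the separate means, (n1·x̄1 + n2·x̄2)/(n1 + n2). A minimal sketch follows; the function name `combined_mean` is hypothetical and the chapter's own derivation is not reproduced in this text.

```python
def combined_mean(sizes, means):
    """Weighted average of group means, weighted by group sizes."""
    total = sum(n * m for n, m in zip(sizes, means))  # sum of all observations
    return total / sum(sizes)


# Exercise 3: 20 observations with mean 45 and 30 observations with mean 60.
combined = combined_mean([20, 30], [45, 60])
print(combined)
```

The combined mean always lies between the two group means and sits closer to the mean of the larger group.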