Lecture-3&4- Measure of Centeral
Lecture-3&4- Measure of Centeral
2
Cont’d …
o When two or more groups are measured, the central tendency
provides the basis of comparison between them.
A typical average is better if;
It should be based on all observations
3
Cont’d …
The most common measures of central tendency include:
1. Mean (Arithmetic, Weighted, Geometric, and Harmonic)
2. Median
3. Mode
4
X 1 X 2 ... X n
x x2 ... xn
X 1 N
n
N
x
n
x i i 1
i
i 1
N
n
5
fi X i
X i 1
k
i 1
fi
f i Xi
X i 1
k
,
f i 1
i
6
Example
7
Grouped Data
8
Special properties of A.M
Cont’d
X 1 n1 X 2 n 2 X n i i
40(350) 60(380)
Xc i 1
368Birr
n1 n 2 2
40 60
n i 1
i
10
Merits and demerits of arithmetic mean
4 3 1 2 Total
4 4 3 2 13
16 12 3 4 35
Cont’d
15
Con'd….
Merits and Demerits of Arithmetic Mean
Merits:
It is based on all observations
It is suitable for further statistical analysis
It is easy to calculate and simple to understand
Demerits:
It is affected by extreme observations
It cannot be used in the case of open-ended classes
It cannot be used when dealing with qualitative
characteristics, such as intelligence, honesty, and beauty
By habtamu.A. 11/26/2024
3. Geometric mean
Con'd….
By habtamu.A. 11/26/2024
Con'd….
Values 3 4 5 6
Freq. 2 3 1 2
Con'd….
4. Harmonic Mean
Con'd….
Activity; discuses the advantage and disadvantage of A.M, G.M
and H.M
Exercise: The number of diarrhea episodes for 25child are
summarized in the following table.
diarrhea No child
episodes
1 3
2 3
3 f3
5 2
6 10
8 f6
If the arithmetic mean is 4.8,then what are the values of f3 and f6?
23
Median is as its name indicates the middle most value in the
arrangement which divides the data in to two equal parts
~
X X1 ~ 1
( n 1)
X X n X n
2 2 1
2 2
i.e
When n = 11, then the median is the 6th observation.
When n = 12, then the median is the 6.5th observation, which is an
observation halfway between the 6th and 7th ordered observation.
24
Example: For the same random sample, the ordered observations will be as:
23, 28, 28, 31, 32, 34, 37, 42, 50, 61.
Since n = 10, then the median is the 5.5th observation, i.e. = (32+34)/2 = 33.
Median of Group Data
~ w n
X Lme lcfbm
fm 2
th
n n
or lcf
2 2
25
Example: Find Median
26
Solution: To determine the median class, we have to take the class
that contains
th th
n 5763
th
2881.5 item
2 2
27
THE MODE ( X̂ )
The mode or modal value is the value with the highest frequency in
the data set. The mode of a set of data or distribution can be:
No mode: In this case all values appear equal number of times
Unimodal: If the distribution has only one mode
Bimodal: If the distribution has two modes
Multi-modal: If the distribution has more than two modes
28
Mode of Group Data
1
x Lmo w
1 2
Lmo = Lower boundary of modal class
Δ1 = difference of frequency between modal class and class before it
Δ2 = difference of frequency between modal class and class after it
w = class width
1
f mo
f 1
2
f mo
f 2
10-14 7 2
f mo
f 2
12 7 5
15-19 5
1
20-24 0 x L mo W
Total n = 30 1 2
6
x 4.5 5
65
x 4.5 0.55 5.05
30
Measure of position(quantiles)
Quantiles are measures of position that divide a dataset into
equal intervals, each containing a specific proportion of the data.
They help to describe the distribution of a dataset by identifying
values at specific points that divide the data into portions.
The most commonly used quantiles are: quartile, decile and
percentile
31
Con'd….
Con'd….
Example: The following data shows the age of 30 sampled
patients in JUSH 6, 9, 11, 14, 16, 17, 18, 21, 22, 22, 22, 22, 23,
25, 25, 26, 27, 28, 28, 32, 33, 34, 34, 36, 39, 39, 41, 45, 46, 49
Find the lower, middle and upper quartiles for the above data.
Solution: n = 30 Q1 1 (n 1)th
4 = 1
4 (30 1) th
34
Quartile for grouped data
Deciles
Con'd….
Percentile
Con'd….
Con'd….
Measures of Variation
Introduction
o Measures of central tendency locate the center of the distribution.
However, they do not tell how individual observations are
scattered on either side of the center. The spread of observations
around the center is known as dispersion or variability.
o In other words, the degree to which numerical data tends to
spread about an average value is called dispersion or variation of
the data.
Con'd….
The variance and standard deviation
Con'd….
Con'd….
Con'd….
By habtamu.A. 11/26/2024
Con'd….
Some important properties of variance and standard
deviation
11/26/2024
Coefficient of variation (CV)
o When two data sets have different units of measurement, or
their means differ sufficiently in size, the CV should be used as
a measure of dispersion. It is used to assess the relative
variability of data.
o The coefficient of variation is defined as the ratio of standard
deviation to the mean, usually expressed as a percent.
o Data with lower CV indicates less variability or consistency,
meaning the data is more tightly clustered around the mean.
o Data with higher CV indicates more variability relative to the
mean, meaning the data is more spread out.
o measure variation relative to the mean and present in percentage
(%)
Con'd….
Example: Last semester, the students of the nursing and anesthesia
departments took Stat273 course. At the end of the semester, the
following information was recorded.
Solution: The means of the two sets of data are very different, we
use coefficient of variation to compare variability
Con'd….
The coefficients of variation are calculated as
Nursing Anesthesia
By habtamu.A. 11/26/2024
Graphical representation
By habtamu.A. 11/26/2024
Generally,
o If the distribution of data is skewed to the left, the
mean is less
than the median, which is often less than the mode.
Median closer to the third quartile
11/26/2024
Measure of skewness
kurtosis
o Kurtosis is the degree of peakedness or flatness of a distribution
(tells us how tall and sharp the central peak is, relative to a
standard bell curve). It tells us the degree of data concentration
around the mean.
o When the curve of a distribution is relatively flatter than normal it
is known as platykurtic. The distribution is more peaked than
normal, it is called leptokuric. The normal distribution which is
not very high peaked or flat topped is called mesokurtic.
Measure of kurtosis
Interpretation of the value of 𝛼4
If 𝛼4 > 3 then the curve is
leptokurtic
If 𝛼4 = 3 then the curve is
mesokurtic
If 𝛼4 < 3 then the curve is
platykurtic
By habtamu.A.