dddddd2
dddddd2
Mode Median
• The mode is the most frequently appearing value in the
population or sample.
• The median is a balance point since it splits the data into
two piles each containing half the values.
• It is the value with a highest frequency.
• To find the median, we arrange the observations in
order from smallest to largest value.
• Example: consider five women having the following
• If there is an odd number of observations, the median is
weights; 100 kg, 100 kg, 130 kg, 140 kg, and 150 kg.
the middle value.
• If there is an even number of observations, the median is
• The value with the highest frequency is 100kg the average of the two middle values.
• Thus, in the sample of five women, the median value
• Thus the mode would equal 100 kg. would be 130 kg; since 130 kg is the middle weight.
7/19/2024
Measures of Variability
Mean
• Measures of dispersion measure how spread out a set
• The sample mean is perhaps the most important of the of data is.
three measures
• They are important for describing the spread of the data,
• It represents the balance point (or centre of gravity) of a or its amount of variation around a central value
distribution
• The mean of a sample or a population is computed by • For example, consider a population of four random
adding all of the observations and dividing by the variables {5, 5 ,5, 5}. Here, each of the random variables
number of observations. are equal, so there is no variation. The set {3, 5, 5, 7}, on
the other hand, has some variation since some random
• Returning to the example of the five women, the mean variables are different.
weight would equal (100 + 100 + 130 + 140 + 150)/5 =
620/5 = 124 kg. • The three parameters that are used to quantify the
amount of variation in a set of random variables are the
range, the variance, and the standard deviation.
2
Population variance and standard
deviation computational formulae Sample variance and standard deviation
computational formulae
• σ2 = Σ ( Xi - μ )2 / N • The variance of a sample is defined by a slightly
different formula, the numerator is divide by n – 1 instead
• Population standard deviation is given by Root σ2= σ of N
2 • s2 = Σ ( xi - x )2 / ( n - 1 )
► where σ2 is the population variance,
► μ is the population mean, ►where s2 is the sample variance,
► Xi is the ith element from the population, ► x is the sample mean,
► and N is the number of elements in the population. ► xi is the ith element from the sample,
► and n is the number of elements in the sample.
• Thus by definition, the variance of a random variable is
the average squared deviation from the population mean Standard deviation s s2
7/19/2024
Measures of position 1
• Measures of position tell where a specific data value Measures of position 2
falls within the data set or its relative position in
comparison with other data values.
• In a similar way we define the quartiles as the quarter
values of the data set, deciles as the one-tenth values of
• The most common measures of position are the data set and so on.
percentiles, deciles, and quartiles.
• Quartiles, deciles and percentiles (unlike the median
• Quartiles, deciles and percentiles are just a which acts as a measure of central tendency) give us an
generalization of the median. idea about the skewness of the data set.
Deciles
Quartiles • In a Data Set deciles are the 9 values that divide the sorted
data into 10 equal parts/groups.
• The first quartile is Q1=11 The second quartile is • Accordingly they are called the 1st, 2nd... 9th deciles (also
denoted as D1,D2 ... D9). If X1, X2, X3, .., Xn are the observed
Q2=40 (This is also the Median.) The third quartile is values and are assumed to be arranged in ascending (or
Q3=61 descending order) then the corresponding definition are as
follows:
• (To within 1 datum) One quarter of the data are below
D1 10th percentile =X(10)
Q1, two quarters below Q2, three quarters below Q3
D2 20th percentile =X(20)
D3 30th percentile =X(30)
• One quarter of the data (about) is between Q1 and Q2, D4 40th percentile =X(40)
etc. D5 50th percentile=X(50)
D6 60th percentile =X(60)
D7 70th percentile =X(70)
D8 80th percentile =X(80)
D9 90th percentile =X(90)
Deciles