0% found this document useful (0 votes)
14 views

Lecture 1 Utilization of Assessment Data Part 2

The document discusses different statistical concepts including descriptive statistics, populations and samples, types of data, measures of central tendency, and frequency distributions. It provides learning outcomes and examples to explain key terminology and how to classify different types of data.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
14 views

Lecture 1 Utilization of Assessment Data Part 2

The document discusses different statistical concepts including descriptive statistics, populations and samples, types of data, measures of central tendency, and frequency distributions. It provides learning outcomes and examples to explain key terminology and how to classify different types of data.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 132

LECTURE 1

Utilization of
Assessment
Data
2

LEARNING OUTCOMES:
At the end of this chapter, the students should be able to:
1. Apply statistics in research and in any systematic investigation;
2. Construct frequency distribution for a given set of scores;
3. Graph the scores using histogram and frequency distribution;
4. Calculate the mean, median, and mode, decile, quartile, and
percentile of the students’ scores;
5. Identify the different properties of the measure of central
tendency;
6. Identify the uses of the different measures of variability;
3

LEARNING OUTCOMES:
At the end of this chapter, the students should be able to:
7. Calculate the value and make an analysis of range, mean
deviation, quantitative, quartile deviation, variance and
standard deviation of given scores;
8. Differentiate standard deviation from coefficient of variation;
9. Identify the properties of the different measures of variability;
10. Apply the concept of skewness in identifying the performance
of the students;
11. Determine the spread of scores using the measures of
variation;
4

LEARNING OUTCOMES:
At the end of this chapter, the students should be able to:
12. Compare the performance of the students using measures of
central tendency and measures of variability;
13. Convert raw scores to standard scores;
14. Determine the relationship of two groups of scores; and
15. Computer r and p value of scores and make an analysis.
5

INTRODUCTION
• Statistics is a very important tool in the utilization of
the assessment data most especially in describing,
analyzing, and interpreting the performance of the
students in the assessment procedures.
• The teachers should have the necessary
background in the statistical procedures used in
assessment of student learning.
A Taxonomy of Statistics 6
7

AN OVERVIEW OF
STATISTICS
8

Data and Statistics


• Data consists of information coming from observations,
counts, measurements, or responses.
• Statistics is the science of collecting, organizing,
analyzing, and interpreting data in order to make
decisions.
• A population is the collection of all outcomes,
responses, measurement, or counts that are of interest.
• A sample is a subset of a population.
9

Populations & Samples


Example:

In a recent survey, 250 college students at Union College


were asked if they smoked cigarettes regularly. Thirty-five
(35) of the students said yes. Identify the population and the
sample.
10
11

DEFINITION OF STATISTICS
• Statistics is a branch of science, which deals with
collection, presentation, analysis and interpretation of
quantitative data.
• Descriptive statistics is a method concerned with
collecting, describing, and analyzing a set of data without
drawing conclusions (or inferences) about a large group.
• Inferential statistics is a branch of statistics, concerned
with the analysis of a subset of data leading to predictions
or inferences about the entire set of data.
Descriptive & Inferential 13

Statistics
Example:
In a recent study, volunteers who had less than 6 hours of
sleep were four times more likely to answer incorrectly on a
science test than were participants who had at least 8 hours
of sleep. Decide which part is the descriptive statistic and
what conclusion might be drawn using inferential statistics.
14

The statement “four times more likely to answer


incorrectly” is a descriptive statistic. An inference
drawn from the sample is that all individuals sleeping
less than 6 hours are more likely to answer science
question incorrectly than individuals who sleep at
least 8 hours.
15

DATA CLASSIFICATION
Types of Data 16
Qualitative & Quantitative Data
17
18

Qualitative Data
Qualitative data can be separated into two subgroups:
• Dichotomic – if it takes the form of a word with two
options (sex – male or female)
• Polynomic – if it takes the form of a word with more than
two options (education – primary school, secondary
school and university)
19

Qualitative Data
Data Sample of Qualitative Data

The data are the colors of backpacks. You sample five


students. One student has a red backpack, two students
have black backpacks, one student has a green backpack,
and one student has a gray backpack. The colors red,
black, green, and gray are qualitative data.
20

Quantitative Data
Quantitative data are always numbers and are the result of
counting or measuring attributes of a population.
Quantitative data can be separated into two subgroups:
• Discrete – if it is the result of counting (the number of
students of a given ethnic group in a class, the number of
books on a shelf)
• Continuous – if it is the result of measuring (distance
traveled, weight of luggage)
21

Quantitative Data
Data Sample of Quantitative Discrete Data

The data are the number of books students carry in their


backpacks. You sample five students. Two students carry
three books, one student carries four books, one student
carries two books, and one student carries one book. The
number of books (three, four, two, and one) are the
quantitative discrete data.
22

Quantitative Data
Data Sample of Quantitative Continuous Data

The data are the weights of backpacks with books in them.


You sample the same five students. The weights (in pounds)
of their backpacks are 6.2, 7, 6.8, 9.1, 4.3. Notice that
backpacks carrying three books can have different weights.
Weights are quantitative continuous data because weights are
measured.
23

You go to the supermarket and purchase three cans of soup


(19 ounces) tomato bisque, 14.1 ounces lentil, and 19 ounces
Italian wedding, two packages of nuts (walnuts and peanuts),
four different kinds of vegetable (broccoli, cauliflower, spinach,
and carrots), and two desserts (16 ounces Cherry Garcia ice
cream and two pounds (32 ounces) chocolate cookies).

Name data sets that are quantitative discrete, continuous, and


qualitative.
24

Solution

One Possible Solution:


• The three cans of soup, two packages of nuts, four kinds of
vegetables and two desserts are quantitative discrete data because
you count them.
• The weights of the soups (19 ounces, 14.1 ounces, 19 ounces) are
quantitative continuous data because you measure weights as
precisely as possible.
• Types of soups, nuts, vegetables and desserts are qualitative data
because they are categorical.
25
26

Work collaboratively to determine the correct data type (quantitative or


qualitative). Indicate whether quantitative data are continuous or discrete.

a) The number of pairs of shoes you own


b) The type of car you drive
c) Where you go on vacation
d) The distance it is from your home to the nearest grocery store
e) The number of classes you take per school year
f) The tuition for your classes
g) The type of calculator you use
h) Movie ratings
i) Political party preferences
27

Work collaboratively to determine the correct data type (quantitative or


qualitative). Indicate whether quantitative data are continuous or
discrete.

j) Weights of sumo wrestlers


k) Amount of money (in peso) won playing poker
l) Number of correct answers on a quiz
m) People’s attitudes toward the government
n) IQ scores
Numerical Scale of Measurement 28

• Nominal – consist of categories in each of which the number of respective


observations is recorded. The categories are in no logical order and have no
particular relationship. The categories are said to be mutually exclusive since
an individual, object, or measurement can be included in only one of them.
• Ordinal – contain more information. Consists of distinct categories in which
order is implied. Values in one category are larger or smaller than values in
other categories (e.g. rating-excelent, good, fair, poor).
• Interval – is a set of numerical measurements in which the distance between
numbers is of a known, constant size.
• Ratio – consists of numerical measurements where the distance between
numbers is of a known, constant size, in addition, there is a nonarbitrary zero
point.
Levels of Measurement
29
Nominal Level of Measurement
30
Ordinal Level of Measurement
31
Interval Level of Measurement
32
Ratio Level of Measurement
33
Summary of Levels of Measurement
34
35

FREQUENCY DISTRIBUTION

• Frequency distribution is a tabular arrangement of


data into appropriate categories showing the number
of observations in each category or group.
• There are two major advantages: (a) it encompasses
the size of the table; and (b) it makes the data more
interpretive.
36

Parts of Frequency Table


1. Class limit is the groupings or categories defined by the lower
and upper limits.
Example: LL – UL
10 – 14
15 – 19
20 – 24
Lower class limit (LL) represents the smallest number in each
group.
Upper class limit (UL) represents the highest number in each
group.
37

Parts of Frequency Table

2. Class size (c.i) is the width of each class interval.

Example: LL – UL
10 – 14
15 – 19
20 – 24

The class size in this score distribution is 5.


38

Parts of Frequency Table

3. Class boundaries are the numbers used to


separate each category in the frequency
distribution but without gaps created by the class
limits. The scores of the students are discrete. Add
0.5 to the upper limit to get the upper class
boundary and subtract 0.5 to the lower limit to
get the lower class boundary in each group or
category.
39

Parts of Frequency Table

Example: LL – UL Lcb - Ucb


10 – 14 9.5 – 14.5
15 – 19 14.5 – 19.5
20 – 24 19.5 – 24.5
40

Parts of Frequency Table

4. Class marks are the midpoint of the lower and


𝐿𝐿+𝑈𝐿
upper class limits. The formula is 𝑋𝑀 = 2 .

Example: LL – UL 𝑋𝑀
10 – 14 12
15 – 19 17
20 – 24 22
Steps in Constructing Frequency 41

Distribution
1. Compute the value of the range (R). Range is the
difference between the highest score and the lowest
score.
R = HS – LS

2. Determine the class size (c.i).


𝑅 𝑅
𝑐. 𝑖 = or 𝑐. 𝑖 =
𝑑𝑒𝑠𝑖𝑟𝑒𝑑 𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑐𝑙𝑎𝑠𝑠𝑒𝑠 𝑘
Steps in Constructing Frequency 42

Distribution
3. Set up the class limits of each class or category.
Each class defined by the lower limit and upper limit.
Use the lowest score as the lower limit of the first
class.
4. Set up the class boundaries if needed. Use the
formula:
𝐿𝐿 𝑜𝑓 𝑡ℎ𝑒 𝑠𝑒𝑐𝑜𝑛𝑑 𝑐𝑙𝑎𝑠𝑠 − 𝑈𝐿 𝑜𝑓 𝑡ℎ𝑒 𝑓𝑖𝑟𝑠𝑡 𝑐𝑙𝑎𝑠𝑠
𝑐𝑏 =
2
Steps in Constructing Frequency 43

Distribution

5. Tally the scores in the appropriate


classes.
6. Find the other parts if necessary such
as class marks, among others.
Steps in Constructing Frequency 44

Distribution

Example: Raw scores of 40 students


in a 50-item science quiz. Construct a
frequency distribution following the
steps given previously.
Steps in Constructing Frequency 45

Distribution

17 25 30 33 25 45 23 19
27 35 45 48 20 38 39 18
44 22 46 26 36 29 15-LS 21
50-HS 47 34 26 37 25 33 49
22 33 44 38 46 41 37 32
Steps in Constructing Frequency 46

Distribution
R = HS – LS Solve the value of k.
= 50-15 k = 1 + 3.3 log n
R = 35 k = 1 + 3.3 log 40
n = 40 k = 1 + 3.3 (1.602059991)
k = 1 + 5.286797971
k = 6.286797971
k=6
Steps in Constructing Frequency 47

Distribution
Find the class size.
𝑅
𝑐. 𝑖 =
𝑘
35
𝑐. 𝑖 =
6
𝑐. 𝑖 = 5.833
𝒄. 𝒊 = 𝟔
Steps in Constructing Frequency 48

Distribution
• Construct the class limit starting with the lowest score
as the lower limit of the first category.
• The last category should contain the highest score in
the distribution.
• Each category should contain 6 as the size of the
width (X).
• Count the number of scores that falls in each category
(f).
49

X Tally frequency (f)


15-20 //// 4
21-26 ///////// 9
27-32 /// 3
33-38 ////////// 10
39-44 //// 4
50

Find the class boundaries and class marks of the given score
distribution. n = 40

Class
X f Xm
Boundaries
15-20 4 14.5-20.5 17.5
21-26 9 20.5-26.5 23.5
27-32 3 26.5-32.5 29.5
33-38 10 32.5-38.5 35.5
39-44 4 38.5-44.5 41.5
45-50 10 44.5-50.5 47.5
51
52

Cumulative
Class Cumulative
X f Xm Relative Frequency Relative
Boundaries Frequency
Frequency
15-20 4 14.5-20.5 17.5 4/40 0.1 4
21-26 9 20.5-26.5 23.5 9/40 0.325 13
27-32 3 26.5-32.5 29.5 3/40 0.4 16
33-38 10 32.5-38.5 35.5 10/40 0.65 26
39-44 4 38.5-44.5 41.5 4/40 0.75 30

45-50 10 44.5-50.5 47.5 10/40 1.00 40


Graphical Representation of 53

Scores in Frequency Distribution


• The scores expressed in frequency distribution can be
meaningful and easier to interpret when they are graphed.
• Histogram consists of a set of rectangles having bases on the
horizontal axis which centers at the class marks. Histogram is
best used for graphical representation of discrete data or non-
continuous data.
• Frequency polygon is constructed by plotting the class marks
against the class frequencies. This is best used in representing
continuous data such as the scores of students in a given test.
54
55
56
DESCRIBING GROUP 57

PERFORMANCE
• There are two major concepts in describing the
assessed performance of the group: measures of
central tendency and measures of variability.
• Measures of central tendency are used to
determine the average score of a group of scores
while measures of variability indicate the spread of
scores in the group.
58

Measure of Central Tendency


o explain your
ideas
• Describes a set of scores with a single number that
describes the performance of the group.
• A single value that is used to describe the “center” of
the data.
• There are three (3) commonly used measures of
central tendency. These are the mean, median, and
mode.
Measure of Central Tendency 59

(Mean)
1. Mean is the most commonly used measure of the center of data and it
also referred as the “arithmetic average”.

Computation of Population Mean

Σ𝑋 𝑋1 + 𝑋2 + 𝑋3 + ⋯ 𝑋𝑛
𝜇= =
𝑁 𝑁
Computation of Sample Mean

Σ𝑥 𝑥1 + 𝑥2 + 𝑥3 + ⋯ 𝑥𝑛
𝑥ҧ = =
𝑛 𝑛
Measure of Central Tendency 60

(Mean)
1. Mean is the most commonly used measure of the center of data and it
also referred as the “arithmetic average”.

Computation of the Mean for Ungrouped Data

Σ𝑥
• 𝑥ҧ =
𝑛
Σ𝑓𝑥
• 𝑥ҧ =
𝑛
61

Scores of 15 students in Mathematics I quiz consist of 25 items. The highest


score is 25 and the lowest score is 10. Here are the scores: 25, 20, 18, 18, 17, 15,
x (scores)
25
15, 15, 14, 14, 13, 12, 12, 10, 10. Find the mean in the following scores.
20
18 Σ𝑥 = 228; n=15 The average performance of 15 students
18 who participated in a mathematics quiz
17 Σ𝑥 228
𝑥ҧ = = = 𝟏𝟓. 𝟐 consisting of 25 items is 15.2. The implication
15 𝑛 15
15 of this is that student who got scores below
15 15.2 did not perform well in the said
14 examination. Students who got scores higher
14
13
than 15.2 performed well in the examination
12 compared to the performance of the whole
12 class.
10
10
62

Find the Grade Point Average (GPA) of Laiza Seguerra for the first semester of
the school year 1452-1453. Use the table below:
Subjects Grade (Xi) Units (Wi) (Wi)(Xi)
BM112 1.25 3 3.75
Σ(𝑊𝑖 )(𝑋𝑖 )
BM101 1.00 3 3.00 𝑥ҧ =
Σ𝑊𝑖
AC103N 1.25 6 7.50
BEC111 1.00 3 3.00
32
MGE101 1.50 3 4.50 𝑥ҧ =
MKM101 1.25 3 3.75
26
FM111 1.50 3 4.50
𝑥ҧ = 𝟏. 𝟐𝟑
PEN2 1.00 2 2.00

𝜮 𝑾𝒊 = 𝟐𝟔 𝜮 𝑾𝒊 𝑿𝒊 = 𝟑𝟐. 𝟎𝟎
Measure of Central Tendency 63

(Mean)
Mean for Grouped Data
• Grouped data are the data or scores that are arranged in a
frequency distribution.
• Frequency distribution is the arrangement of scores according
to category of classes including the frequency.
• Frequency is the number of observations falling in a category.
• Mean for grouped data is also called midpoint method.
Σ𝑓𝑥𝑚
𝑥ҧ =
𝑛
Measure of Central Tendency 64

(Mean)
Mean for Grouped Data

Σ𝑓𝑥𝑚
𝑥ҧ =
𝑛
Where: x = mean value
f = frequency in each class or category
𝑥𝑚 = midpoint of each class or category
Σ𝑓𝑥𝑚 =summation of the product of 𝑓𝑥𝑚
Measure of Central Tendency 65

(Mean)
Steps in Solving Mean for Grouped Data
1. Find the midpoint or class mark (𝑥𝑚 ) of each class or
𝐿𝐿+𝑈𝐿
category using the formula 𝑥𝑚 = .
2
2. Multiply the frequency and the corresponding class
mark 𝑓𝑥𝑚 .
3. Find the sum of the results in step 2.
Σ𝑓𝑥𝑚
4. Solve the mean using the formula 𝑥ҧ = .
𝑛
66

Scores of 40 students in a science class consist of 60 items and they are


tabulated below.
X f 𝑿𝒎 𝒇𝒙𝒎
Σ𝑓𝑥𝑚 1345
10-14 5 12 60 𝑥ҧ = 𝑥ҧ = ഥ = 𝟑𝟑. 𝟔𝟑
𝒙
15-19 2 17 34
𝑛 40
20-24 3 22 66
Analysis: The mean performance of
25-29 5 27 135
40 students in science quiz is 33.63.
30-34 2 32 64 Those students who got scores below
35-39 9 37 333 33.63 did not perform well in the said
40-44 6 42 252 examination while those students who
45-49 3 47 141 got scores above 33.63 performed
50-54 5 52 260 well.
n = 40 𝜮𝒇𝒙𝒎 =1345
Measure of Central Tendency 67

(Mean)
Properties of the Mean
1. It measures stability. Mean is the most stable among other
measures of central tendency because every score contributes to
the value of the mean;
2. The sum of each score’s distance (deviations) from the mean is
zero;
3. It is easily affected by the extreme scores (outliers);
4. It may not be an actual score in the distribution;
5. It can be applied to interval level of measurement; and
6. It is very easy to compute.
Measure of Central Tendency 68

(Mean)

When to Use the Mean


1. Sampling stability is desired.
2. Other measures are to be computed such as
standard deviation, coefficient of variation and
skewness.
Measure of Central Tendency 69

(Median)
• Median is the second type of measures of central tendency.
• Median is what divides the scores in the distribution into two equal
parts.
• Fifty percent (50%) lies below the median value and 50% lies above the
median value.
• Also known as the middle score or the 50th percentile.
• Numbers must be arranged from lowest to highest.
• When the number cases are odd, the median score is a score that
has the same number of scores below and above it.
• When the scores are even, determine the average of the two middle
most scores that have equal number of scores below and above it.
Measure of Central Tendency 70

(Median)
Median of Ungrouped Data
1. Arrange the scores.
2. Determine the middle most score in a
distribution if n is an odd number and get the
average of the two middle most scores if n is
an even number.
Measure of Central Tendency 71

(Median)
Example 1

Find the median score of 7 students in an


English class.
Measure of Central Tendency 72

(Median)
Example 2

Find the median score of 8 students in an


English class.
Measure of Central Tendency 73

(Median)
Median of Grouped Data
Formula:
𝑛
𝑥෤ = 𝐿𝐵 2 − 𝑐𝑓𝑝 𝑐. 𝑖
𝑓𝑚

෥ = 𝑚𝑒𝑑𝑖𝑎𝑛 𝑣𝑎𝑙𝑢𝑒
𝒙
𝑛
MC = median class is a category containing the 2
𝑳𝑩 = 𝑙𝑜𝑤𝑒𝑟 𝑏𝑜𝑢𝑛𝑑𝑎𝑟𝑦 𝑜𝑓 𝑡ℎ𝑒 𝑚𝑒𝑑𝑖𝑎𝑛 𝑐𝑙𝑎𝑠𝑠 𝑀𝐶
cfp = cumulative frequency before the median class if the scores are arranged from
lowest to highest value
fm = frequency of the median class
c.i = size of the class interval
Measure of Central Tendency 74

(Median)
Steps in Solving Median for Grouped Data
1. Complete the table for cf<.
𝑛
2. Get of the scores in the distribution so that
2
you can identify MC.
3. Determine 𝐿𝐵 , cfp, fm, and c.i.
4. Solve the median using the formula.
Measure of Central Tendency 75

(Median)
Example 3

Scores of 40 students in a science class consist


of 60 items and they are tabulated. The highest
score is 54 and the lowest score is 10.
Measure of Central Tendency 76

(Median)
Properties of the Median
1. It may not be an actual observation in the
data set.
2. It can be applied in ordinal level.
3. It is not affected by extreme values because
median is a positional measure.
Measure of Central Tendency 77

(Median)

When to Use the Median


1. The exact midpoint of the score distribution is
desired.
2. There are extreme scores in the distribution.
Measure of Central Tendency 78

(Mode)
• Mode is the third measure of central tendency.
• The mode or the modal score is a score or scores that
occurred most in the distribution.
• Unimodal is a distribution of scores that consists of only one
mode.
• Bimodal is a distribution of scores that consists of two modes.
• Trimodal is a distribution of scores that consists of three
modes or multimodal is a distribution of scores that consists
of more than two modes.
Example 1. Scores of 10 students in Section A, Section B, and Section C
79

Scores of Section A Scores of Section B Scores of Section C

25 25 25

24 24 25

24 24 25

20 20 22

20 18 21

20 18 21

16 17 21

12 10 18

10 9 18

7 7 18
Measure of Central Tendency 80

(Mode)
Mode for Grouped Data
In solving the mode value using grouped data, use the formula:

𝑑1
𝑥ො = 𝐿𝐵 𝑐. 𝑖
𝑑1 + 𝑑2

𝑳𝑩 = 𝑙𝑜𝑤𝑒𝑟 𝑏𝑜𝑢𝑛𝑑𝑎𝑟𝑦 𝑜𝑓 𝑡ℎ𝑒 𝑚𝑜𝑑𝑎𝑙 𝑐𝑙𝑎𝑠𝑠


Modal Class (MC) = is a category containing the highest frequency
d1 = difference between the frequency of the modal class and the frequency above it,
when the scores are arranged from lowest to highest
d2 = difference between the frequency of the modal class and the frequency below it,
when the scores are arranged from lowest to highest
c.i = size of the class interval
Example 2. Scores of 40 students in a science class consist of 60 items and they are
81
tabulated below.
x f
10-14 5
15-19 2
20-24 3
25-29 5
30-34 2
35-39 9
40-44 6
45-49 3
50-54 5
n = 40
Measure of Central Tendency 82

(Mode)
Properties of the Mode
1. It can be used when the data are qualitative
as well as quantitative.
2. It may not be unique.
3. It is not affected by extreme values.
4. It may not exist.
Measure of Central Tendency 83

(Mode)
When to Use the Mode
1. When the “typical” value is desired.
2. When the data set is measured on a
nominal scale.
Measure of Central Tendency 84

(Quantiles)
• Quantile is a score distribution where the scores are
divided into different equal parts.
• Quartile is a score point that divides the scores in the
distribution into four (4) equal parts.
• Decile is a score point that divides the scores in the
distribution into ten (10) equal parts.
• Percentile is a score that divides the scores in the
distribution into hundred (100) equal parts.
Measure of Central Tendency 85

(Quantiles)
Quantiles for Ungrouped Data

Example: Using the given data 6, 8, 10, 12, 12,


14, 15, 16. Find Q1, Q3, D6, D9, P65, P99. (n = 9)
Measure of Central Tendency 86

(Quantiles)
Quartiles for Grouped Data

Example 1: The data for the scores of fifty (50)


students in Filipino class are given. Solve for the
value of Q1. (n=50)
Measure of Central Tendency 87

(Quantiles)
Quartiles for Grouped Data

Example 2: The data for the scores of fifty (50)


students in Filipino class are given. Solve for the
value of Q3. (n=50)
Measure of Central Tendency 88

(Quantiles)
Deciles for Grouped Data

Example 3: The data for the scores of fifty (50)


students in Filipino class are given. Solve for the
value of D5. (n=50)
Measure of Central Tendency 89

(Quantiles)
Deciles for Grouped Data

Example 4: The data for the scores of fifty (50)


students in Filipino class are given. Solve for the
value of D7. (n=50)
Measure of Central Tendency 90

(Quantiles)
Percentiles for Grouped Data

Example 5: The data for the scores of fifty (50)


students in Filipino class are given. Solve for the
value of P82. (n=50)
Measure of Central Tendency 91

(Quantiles)
Percentiles for Grouped Data

Example 6: The data for the scores of fifty (50)


students in Filipino class are given. Solve for the
value of P91. (n=50)
92

Measures of Variation
• Central tendency is the distribution of scores.
• Also, we want to know how close or how far is the distance of the
scores of students in a certain test from the average performance
of the group.
• Measure of variation is a single value that is used to describe the
spread of the scores in a distribution.
• The term variation is also known as variability or dispersion.
• There are several ways of describing the variation of scores:
absolute measures of variation & relative measures of
variation.
Let us consider the scores of students in three sections of mathematics class. We shall consider the spread
of scores based on graphical presentation.

Section A Section B Section C


12 12 12
12 12 12
14 12 12
15 13 12
17 13 12
18 14 12
18 17 13
18 20 26
19 20 26
23 28 26
23 28 26
30 30 30
𝐱ത = 𝟏𝟖. 𝟐𝟓 𝐱ത = 𝟏𝟖. 𝟐𝟓 𝐱ത = 𝟏𝟖. 𝟐𝟓
s = 5.15 s = 6.92 s = 7.63
• What can you observe about the mean and the standard deviation of the
three groups of scores?
• Which group of students performed well in the class?
• Which group of scores is most widespread? Less scattered?
• Before answering such questions, let us first discuss the different types of
measures of variation.
Types of Absolute Measures 95

of Variation (Range)
1. Range
- The difference between the highest score and the lowest score in a
distribution.
- The simplest and the crudest measure of variation, simplest
because we shall only consider the highest and lowest score.

Range for Ungrouped Data


R = HS – LS
Where, R = range value
HS = highest scores
LS = lowest score
Types of Absolute Measures 96

of Variation (Range)
Example 1: Find the range of the two groups of score distribution.
GROUP A GROUP B
10 (LS) 15 (LS)
12 16
15 16
17 17
25 17
26 23
28 25
30 26
35 (HS) 30 (HS)
Types of Absolute Measures 97

of Variation (Range)
Range for Grouped Data

R = HSUB - LSLB

Where, R = range value


HSUB = upper boundary of the highest score
LSLB = lower boundary of the lowest score
Types of Absolute Measures 98

of Variation (Range)
Example 2: Find the value of range of the scores of 50 students in Mathematics
achievement test. X f
25-32 3
33-40 7
41-48 5
49-56 4
57-64 12
65-72 6
73-80 8
81-88 3
89-97 2
Types of Absolute Measures 99

of Variation (Range)
Properties of Range
• It is quick and easy to understand.
• It is a rough estimation of variation.
• It is easily affected by the extreme scores.

Interpretation of Range Value


• When the range value is large, the scores in the distribution are
more dispersed, widespread or heterogeneous.
• When the range value is small, the scores in the distribution are
less dispersed, less scattered, or homogeneous.
Types of Absolute Measures 100

of Variation (IQR & QD)


2. Inter-quartile Range (IQR) & Quartile Deviation (QD)

IQR is the difference between the third quartile and the first quartile.

IQR = Q3 – Q1

Properties of Inter-quartile Range


1. Reduces the influence of extreme values.
2. Not as easy to calculate as the range.
3. Only considers the middle 50% of the scores in the distribution.
4. The point of dispersion is the median value.
Types of Absolute Measures 101

of Variation (IQR & QD)


Quartile deviation
- Indicates the distance we need to go above and below the median to include
the middle 50% of the scores;
- It is based on the range of the middle 50% of the scores, instead of the
range of the entire set.

𝑸𝟑 − 𝑸𝟏
𝑸𝑫 =
𝟐
Where, QD = quartile deviation value
𝑸𝟏 = value of first quartile
𝑸𝟑 = value of third quartile
Types of Absolute Measures 102

of Variation (IQR & QD)


Steps in Solving Quartile Deviation of Ungrouped Data
1. Solve for the value of the first quartile.
2. Solve for the value of the third quartile.
3. Solve for the value of QD using the formula.
Types of Absolute Measures 103

of Variation (IQR & QD)

Quartile Deviation of Ungrouped Data

Example 1: Using the given data 6, 8, 10, 12, 12, 14, 15,
16, 20, find the quartile deviation.
Quartile Deviation of Grouped Data 104

x f cf<
25-32 3 3
33-40 7 10
Example 2: The data given
41-48 5 15
below are the scores of fifty
49-56 4 19
(50) students in Filipino 57-64 12 31
class. Solve for the value of 65-72 6 37
quartile deviation (QD). 73-80 8 45
81-88 3 48
89-97 2 50
Types of Absolute Measures 105

of Variation (IQR & QD)


Interpretation of IQR and QD
• The larger the value of the IQR or QD, the more
dispersed the scores at the middle 50% of the
distribution.
• If the IQR or QD is small, the scores are less dispersed
at the middle 50% of the dispersion.
• The point of dispersion is the median value.
Types of Absolute Measures 106

of Variation (IQR & QD)


Analysis for IQR & QD
• When the value of IQR and QD is small, the scores are
clustered within the middle 50% of the score distribution.
• The scores are dispersed in the middle 50% of the
distribution when the value IQR & QD is large.
Types of Absolute Measures 107

of Variation (Mean Deviation)

3. Mean Deviation (MD) measures the average


deviation of the values from the arithmetic mean. It
gives equal weight to the deviation of every score
in the distribution.
Types of Absolute Measures 108

of Variation (Mean Deviation)


Mean Deviation for Ungrouped Data

𝜮Τ𝒙 − 𝒙
ഥ/
𝑴𝑫 =
𝒏
Where, MD = mean deviation value
x = individual score
𝑥ҧ = sample mean
n = number of cases
Types of Absolute Measures 109

of Variation (Mean Deviation)

Steps in Solving Mean Deviation for Ungrouped Data


1. Solve the mean value.
2. Subtract the mean value from each score.
3. Take the absolute value of the difference in step 2.
4. Solve the mean deviation using the formula.
Types of Absolute Measures 110

of Variation (Mean Deviation)


𝒙 𝒙−ഥ
𝒙 /𝒙 − ഥ
𝒙/
35 13.8 13.8
Example 1:
30 8.8 8.8
26 4.8 4.8
Find the mean deviation of 24 2.8 2.8
the scores of 10 students 20 -1.2 1.2

in a Mathematics test. 18 -3.2 3.2


18 -3.2 3.2
Given the scores: 35, 30,
16 -5.2 5.2
26, 24, 20, 18, 18, 16, 15, 15 -6.2 6.2
10. 10 -11.2 11.2
x = 212 /𝑥 − 𝑥/ҧ = 60.4
Types of Absolute Measures 111

of Variation (Mean Deviation)


Mean Deviation for Grouped Data

𝜮𝒇Τ𝒙𝒎 − 𝒙ഥ/
𝑴𝑫 =
𝒏
Where, MD = mean deviation value
f = class frequency
𝑥𝑚 = class mark or midpoint of each category
𝑥ҧ = sample mean
n = number of cases
Types of Absolute Measures 112

of Variation (Mean Deviation)


Steps in Solving Mean Deviation for Grouped Data
1. Solve for the value of the mean.
2. Subtract the mean value from each midpoint or class mark.
3. Take the absolute value of each difference.
4. Multiply the absolute value and the corresponding class frequency.
5. Find the sum of the results in step 4.
6. Solve for the mean deviation using the formula for grouped data.
Example 2: Find the mean deviation of the given scores below.

𝒙 𝒇 𝒙𝒎 𝒇𝒙𝒎 𝒙𝒎 − ഥ
𝒙 /𝒙𝒎 - ഥ
𝒙/ 𝒇/𝒙𝒎 - ഥ
𝒙/

10-14 5 12 60 -21.63 21.63 108.15

15-19 2 17 34 -16.23 16.23 33.26

20-24 3 22 66 -11.63 11.63 34.89

25-29 5 27 135 -6.63 6.63 33.15

30-34 2 32 64 -1.63 1.63 3.26

35-39 9 37 333 3.37 3.37 30.33

40-44 6 42 252 8.37 8.37 50.22

45-49 3 47 141 13.37 13.37 40.11

50-54 5 52 260 18.37 18.37 91.85

n = 40 f𝒙𝒎 = 𝟏, 𝟑𝟒𝟓 𝒇/𝒙𝒎 − ഥ


𝒙/= 425.22
Types of Absolute Measures 114

of Variation (Variance & SD)

4. Variance & Standard Deviation

Variance is one of the most important measures


of variation. It shows variation about the mean.
Types of Absolute Measures 115

of Variation (Variance & SD)


Variance of Ungrouped Data

Population Variance
Σ(𝑥 − 𝜇)2
𝜎2 =
𝑁

Sample Variance
Σ(𝑥 − 𝑥)
ҧ 2
𝑠2 =
𝑛−1
Types of Absolute Measures 116

of Variation (Variance & SD)


Steps in Solving Variation of Ungrouped Data
1. Solve for the mean value.
2. Subtract the mean value from each score.
3. Square the difference between the mean and each
score.
4. Find the sum of the results in step 3.
5. Solve for the population variance or sample variance
using the formula of ungrouped data.
Example 1: Using the data below, find the variance and standard deviation of
the scores of 10 students in a science quiz. Interpret the result.

𝒙 𝒙 − ഥ
𝒙 𝒙)𝟐
(𝒙 − ഥ
19 4.4 19.36
17 2.4 5.76
16 1.4 1.96
16 1.4 1.96
15 0.4 0.16
14 -0.6 0.36
14 -0.6 0.36
13 -1.6 2.56
12 -2.6 6.76
10 -4.6 21.16
x = 146
𝒙)𝟐 = 𝟔𝟎. 𝟒𝟎
𝜮(𝒙 − ഥ

𝒙 = 𝟏𝟒. 𝟔
Types of Absolute Measures 118

of Variation (Variance & SD)


Variance of Grouped Data

Population Variance
Σ𝑓(𝑥 − 𝜇)2
𝑚
𝜎2 =
𝑁

Sample Variance
Σ𝑓(𝑥 − 𝑥)
ҧ 2
𝑚
𝑠2 =
𝑛−1
Types of Absolute Measures 119

of Variation (Variance & SD)


Steps in Solving the Variance of Grouped Data
1. Solve for the mean value.
2. Subtract the mean value from each midpoint or class mark.
3. Square the difference between the mean value and midpoint or
class mark.
4. Multiply the squared differences and the corresponding class
frequency.
5. Find the sum of step 4.
6. Solve the population variance or sample variance using the formula
of grouped data.
Example 2: Score distribution of the test results of 40 students in
Filipino class consisting of 50 items. Solve the variance and standard
deviation and interpret the result.

𝒙 𝒇 𝒙𝒎 𝒇𝒙𝒎 ഥ
𝒙 𝒙𝒎 − ഥ
𝒙 𝒙)𝟐
(𝒙𝒎 − ഥ 𝒙)𝟐
𝒇(𝒙𝒎 − ഥ

15-20 3 17.5 52.5 33.7 -16.2 262.44 787.32

21-26 6 23.5 141 33.7 -10.2 104.04 624.24

27-32 5 29.5 147.5 33.7 -4.2 17.64 88.2

33-38 15 35.5 532.5 33.7 1.8 3.24 48.6

39-44 8 41.5 332 33.7 7.8 60.84 486.72

45-50 3 47.5 142.5 33.7 13.8 190.44 571.32

n = 40 𝒇𝒙𝒎 = 1 348 𝒇(𝒙𝒎 − ഥ


𝒙)𝟐 = 2 606.4
Types of Absolute Measures 121

of Variation (Variance & SD)

Standard deviation is the most important


measures of variation. It is also known as the
square root of the variance. It is the average
distance of all the scores that deviates from the
mean value.
Types of Absolute Measures 122

of Variation (Variance & SD)


Standard Deviation of Ungrouped Data

Population Standard Deviation


Σ(𝑋 − 𝜇)2
𝜎=
𝑁

Sample Variance
Σ(𝑥 − 𝑥)ҧ 2
𝑠=
𝑛−1
Types of Absolute Measures 123

of Variation (Variance & SD)


Steps in Solving Standard Deviation of Ungrouped Data
1. Solve for the mean value.
2. Subtract the mean value from each score.
3. Square the difference between the mean and each score.
4. Find the sum of step 3.
5. Solve for the population standard deviation or sample standard
deviation using the formula for ungrouped data.

Note: If the variance is already solved, take the square root of the
variance to get the value of the standard deviation.
Types of Absolute Measures 124

of Variation (Variance & SD)


Example:

Using the data on example 1, solve the population


and sample standard deviation.
Types of Absolute Measures 125

of Variation (Variance & SD)


Steps in Solving Standard Deviation of Grouped Data
1. Solve for the mean value.
2. Subtract the mean value from each midpoint or class mark.
3. Square the difference between the mean value and midpoint or
class mark.
4. Multiply the squared difference and the corresponding class
frequency.
5. Find the sum of the results in step 4.
6. Solve the population and sample standard deviation using the
formula for grouped data
Types of Absolute Measures 126

of Variation (Variance & SD)


Example:

Using the data on example 2, solve the population


and sample standard deviation.
Types of Absolute Measures 127

of Variation (Variance & SD)


Interpretation of Standard Deviation
1. If the value of SD is large, on the average, the scores in
the distribution will be far from the mean. Therefore, the
scores are spread out around the mean value. The
distribution is also known as heterogeneous.
2. If the value of SD is small, on the average, the scores in
the distribution will be closed to the mean. Hence, the
scores are less dispersed or the scores in the distribution
are homogeneous.
• What can you observe about the mean and the standard deviation of the
three groups of scores?
• Which group of students performed well in the class?
• Which group of scores is most widespread? Less scattered?
• Before answering such questions, let us first discuss the different types of
measures of variation.
What can you observe about the mean and the standard deviation of the three
groups of scores?

Answer: The mean of the three groups of scores is the same which is equal
to 18.25 and the standard deviation of section A = 5.15, section B = 6.92, and
section C = 7.63.
Which group of students performed well in the class?

Answer: In terms of performance, the three sections of students perform the


same because they have the same mean value of 18.25.
Which group of scores is most
widespread? Less scattered?

Answer: The SD of section A = 5.15, section B = 6.92 and section C =


7.63. The scores that are most scattered are those in section C because
they have the largest value of SD which is equal to 7.63. On the other
hand, the less scattered group of scores is in section A which has the
smallest value of SD which is equal to 5.15. Therefore, the smaller value
of SD on the average the closer the scores are to the mean value and the
larger value of SD on the average makes the scores scattered from the
mean value.
Types of Absolute Measures 132

of Variation (Variance & SD)

Properties of Variance and Standard Deviation


1. The most commonly used measures of variation most
especially in research.
2. It shows variation of the individual scores about the
mean.

You might also like