Lecture 1 Utilization of Assessment Data Part 2
Lecture 1 Utilization of Assessment Data Part 2
Utilization of
Assessment
Data
2
LEARNING OUTCOMES:
At the end of this chapter, the students should be able to:
1. Apply statistics in research and in any systematic investigation;
2. Construct frequency distribution for a given set of scores;
3. Graph the scores using histogram and frequency distribution;
4. Calculate the mean, median, and mode, decile, quartile, and
percentile of the students’ scores;
5. Identify the different properties of the measure of central
tendency;
6. Identify the uses of the different measures of variability;
3
LEARNING OUTCOMES:
At the end of this chapter, the students should be able to:
7. Calculate the value and make an analysis of range, mean
deviation, quantitative, quartile deviation, variance and
standard deviation of given scores;
8. Differentiate standard deviation from coefficient of variation;
9. Identify the properties of the different measures of variability;
10. Apply the concept of skewness in identifying the performance
of the students;
11. Determine the spread of scores using the measures of
variation;
4
LEARNING OUTCOMES:
At the end of this chapter, the students should be able to:
12. Compare the performance of the students using measures of
central tendency and measures of variability;
13. Convert raw scores to standard scores;
14. Determine the relationship of two groups of scores; and
15. Computer r and p value of scores and make an analysis.
5
INTRODUCTION
• Statistics is a very important tool in the utilization of
the assessment data most especially in describing,
analyzing, and interpreting the performance of the
students in the assessment procedures.
• The teachers should have the necessary
background in the statistical procedures used in
assessment of student learning.
A Taxonomy of Statistics 6
7
AN OVERVIEW OF
STATISTICS
8
DEFINITION OF STATISTICS
• Statistics is a branch of science, which deals with
collection, presentation, analysis and interpretation of
quantitative data.
• Descriptive statistics is a method concerned with
collecting, describing, and analyzing a set of data without
drawing conclusions (or inferences) about a large group.
• Inferential statistics is a branch of statistics, concerned
with the analysis of a subset of data leading to predictions
or inferences about the entire set of data.
Descriptive & Inferential 13
Statistics
Example:
In a recent study, volunteers who had less than 6 hours of
sleep were four times more likely to answer incorrectly on a
science test than were participants who had at least 8 hours
of sleep. Decide which part is the descriptive statistic and
what conclusion might be drawn using inferential statistics.
14
DATA CLASSIFICATION
Types of Data 16
Qualitative & Quantitative Data
17
18
Qualitative Data
Qualitative data can be separated into two subgroups:
• Dichotomic – if it takes the form of a word with two
options (sex – male or female)
• Polynomic – if it takes the form of a word with more than
two options (education – primary school, secondary
school and university)
19
Qualitative Data
Data Sample of Qualitative Data
Quantitative Data
Quantitative data are always numbers and are the result of
counting or measuring attributes of a population.
Quantitative data can be separated into two subgroups:
• Discrete – if it is the result of counting (the number of
students of a given ethnic group in a class, the number of
books on a shelf)
• Continuous – if it is the result of measuring (distance
traveled, weight of luggage)
21
Quantitative Data
Data Sample of Quantitative Discrete Data
Quantitative Data
Data Sample of Quantitative Continuous Data
Solution
FREQUENCY DISTRIBUTION
Example: LL – UL
10 – 14
15 – 19
20 – 24
Example: LL – UL 𝑋𝑀
10 – 14 12
15 – 19 17
20 – 24 22
Steps in Constructing Frequency 41
Distribution
1. Compute the value of the range (R). Range is the
difference between the highest score and the lowest
score.
R = HS – LS
Distribution
3. Set up the class limits of each class or category.
Each class defined by the lower limit and upper limit.
Use the lowest score as the lower limit of the first
class.
4. Set up the class boundaries if needed. Use the
formula:
𝐿𝐿 𝑜𝑓 𝑡ℎ𝑒 𝑠𝑒𝑐𝑜𝑛𝑑 𝑐𝑙𝑎𝑠𝑠 − 𝑈𝐿 𝑜𝑓 𝑡ℎ𝑒 𝑓𝑖𝑟𝑠𝑡 𝑐𝑙𝑎𝑠𝑠
𝑐𝑏 =
2
Steps in Constructing Frequency 43
Distribution
Distribution
Distribution
17 25 30 33 25 45 23 19
27 35 45 48 20 38 39 18
44 22 46 26 36 29 15-LS 21
50-HS 47 34 26 37 25 33 49
22 33 44 38 46 41 37 32
Steps in Constructing Frequency 46
Distribution
R = HS – LS Solve the value of k.
= 50-15 k = 1 + 3.3 log n
R = 35 k = 1 + 3.3 log 40
n = 40 k = 1 + 3.3 (1.602059991)
k = 1 + 5.286797971
k = 6.286797971
k=6
Steps in Constructing Frequency 47
Distribution
Find the class size.
𝑅
𝑐. 𝑖 =
𝑘
35
𝑐. 𝑖 =
6
𝑐. 𝑖 = 5.833
𝒄. 𝒊 = 𝟔
Steps in Constructing Frequency 48
Distribution
• Construct the class limit starting with the lowest score
as the lower limit of the first category.
• The last category should contain the highest score in
the distribution.
• Each category should contain 6 as the size of the
width (X).
• Count the number of scores that falls in each category
(f).
49
Find the class boundaries and class marks of the given score
distribution. n = 40
Class
X f Xm
Boundaries
15-20 4 14.5-20.5 17.5
21-26 9 20.5-26.5 23.5
27-32 3 26.5-32.5 29.5
33-38 10 32.5-38.5 35.5
39-44 4 38.5-44.5 41.5
45-50 10 44.5-50.5 47.5
51
52
Cumulative
Class Cumulative
X f Xm Relative Frequency Relative
Boundaries Frequency
Frequency
15-20 4 14.5-20.5 17.5 4/40 0.1 4
21-26 9 20.5-26.5 23.5 9/40 0.325 13
27-32 3 26.5-32.5 29.5 3/40 0.4 16
33-38 10 32.5-38.5 35.5 10/40 0.65 26
39-44 4 38.5-44.5 41.5 4/40 0.75 30
PERFORMANCE
• There are two major concepts in describing the
assessed performance of the group: measures of
central tendency and measures of variability.
• Measures of central tendency are used to
determine the average score of a group of scores
while measures of variability indicate the spread of
scores in the group.
58
(Mean)
1. Mean is the most commonly used measure of the center of data and it
also referred as the “arithmetic average”.
Σ𝑋 𝑋1 + 𝑋2 + 𝑋3 + ⋯ 𝑋𝑛
𝜇= =
𝑁 𝑁
Computation of Sample Mean
Σ𝑥 𝑥1 + 𝑥2 + 𝑥3 + ⋯ 𝑥𝑛
𝑥ҧ = =
𝑛 𝑛
Measure of Central Tendency 60
(Mean)
1. Mean is the most commonly used measure of the center of data and it
also referred as the “arithmetic average”.
Σ𝑥
• 𝑥ҧ =
𝑛
Σ𝑓𝑥
• 𝑥ҧ =
𝑛
61
Find the Grade Point Average (GPA) of Laiza Seguerra for the first semester of
the school year 1452-1453. Use the table below:
Subjects Grade (Xi) Units (Wi) (Wi)(Xi)
BM112 1.25 3 3.75
Σ(𝑊𝑖 )(𝑋𝑖 )
BM101 1.00 3 3.00 𝑥ҧ =
Σ𝑊𝑖
AC103N 1.25 6 7.50
BEC111 1.00 3 3.00
32
MGE101 1.50 3 4.50 𝑥ҧ =
MKM101 1.25 3 3.75
26
FM111 1.50 3 4.50
𝑥ҧ = 𝟏. 𝟐𝟑
PEN2 1.00 2 2.00
𝜮 𝑾𝒊 = 𝟐𝟔 𝜮 𝑾𝒊 𝑿𝒊 = 𝟑𝟐. 𝟎𝟎
Measure of Central Tendency 63
(Mean)
Mean for Grouped Data
• Grouped data are the data or scores that are arranged in a
frequency distribution.
• Frequency distribution is the arrangement of scores according
to category of classes including the frequency.
• Frequency is the number of observations falling in a category.
• Mean for grouped data is also called midpoint method.
Σ𝑓𝑥𝑚
𝑥ҧ =
𝑛
Measure of Central Tendency 64
(Mean)
Mean for Grouped Data
Σ𝑓𝑥𝑚
𝑥ҧ =
𝑛
Where: x = mean value
f = frequency in each class or category
𝑥𝑚 = midpoint of each class or category
Σ𝑓𝑥𝑚 =summation of the product of 𝑓𝑥𝑚
Measure of Central Tendency 65
(Mean)
Steps in Solving Mean for Grouped Data
1. Find the midpoint or class mark (𝑥𝑚 ) of each class or
𝐿𝐿+𝑈𝐿
category using the formula 𝑥𝑚 = .
2
2. Multiply the frequency and the corresponding class
mark 𝑓𝑥𝑚 .
3. Find the sum of the results in step 2.
Σ𝑓𝑥𝑚
4. Solve the mean using the formula 𝑥ҧ = .
𝑛
66
(Mean)
Properties of the Mean
1. It measures stability. Mean is the most stable among other
measures of central tendency because every score contributes to
the value of the mean;
2. The sum of each score’s distance (deviations) from the mean is
zero;
3. It is easily affected by the extreme scores (outliers);
4. It may not be an actual score in the distribution;
5. It can be applied to interval level of measurement; and
6. It is very easy to compute.
Measure of Central Tendency 68
(Mean)
(Median)
• Median is the second type of measures of central tendency.
• Median is what divides the scores in the distribution into two equal
parts.
• Fifty percent (50%) lies below the median value and 50% lies above the
median value.
• Also known as the middle score or the 50th percentile.
• Numbers must be arranged from lowest to highest.
• When the number cases are odd, the median score is a score that
has the same number of scores below and above it.
• When the scores are even, determine the average of the two middle
most scores that have equal number of scores below and above it.
Measure of Central Tendency 70
(Median)
Median of Ungrouped Data
1. Arrange the scores.
2. Determine the middle most score in a
distribution if n is an odd number and get the
average of the two middle most scores if n is
an even number.
Measure of Central Tendency 71
(Median)
Example 1
(Median)
Example 2
(Median)
Median of Grouped Data
Formula:
𝑛
𝑥 = 𝐿𝐵 2 − 𝑐𝑓𝑝 𝑐. 𝑖
𝑓𝑚
= 𝑚𝑒𝑑𝑖𝑎𝑛 𝑣𝑎𝑙𝑢𝑒
𝒙
𝑛
MC = median class is a category containing the 2
𝑳𝑩 = 𝑙𝑜𝑤𝑒𝑟 𝑏𝑜𝑢𝑛𝑑𝑎𝑟𝑦 𝑜𝑓 𝑡ℎ𝑒 𝑚𝑒𝑑𝑖𝑎𝑛 𝑐𝑙𝑎𝑠𝑠 𝑀𝐶
cfp = cumulative frequency before the median class if the scores are arranged from
lowest to highest value
fm = frequency of the median class
c.i = size of the class interval
Measure of Central Tendency 74
(Median)
Steps in Solving Median for Grouped Data
1. Complete the table for cf<.
𝑛
2. Get of the scores in the distribution so that
2
you can identify MC.
3. Determine 𝐿𝐵 , cfp, fm, and c.i.
4. Solve the median using the formula.
Measure of Central Tendency 75
(Median)
Example 3
(Median)
Properties of the Median
1. It may not be an actual observation in the
data set.
2. It can be applied in ordinal level.
3. It is not affected by extreme values because
median is a positional measure.
Measure of Central Tendency 77
(Median)
(Mode)
• Mode is the third measure of central tendency.
• The mode or the modal score is a score or scores that
occurred most in the distribution.
• Unimodal is a distribution of scores that consists of only one
mode.
• Bimodal is a distribution of scores that consists of two modes.
• Trimodal is a distribution of scores that consists of three
modes or multimodal is a distribution of scores that consists
of more than two modes.
Example 1. Scores of 10 students in Section A, Section B, and Section C
79
25 25 25
24 24 25
24 24 25
20 20 22
20 18 21
20 18 21
16 17 21
12 10 18
10 9 18
7 7 18
Measure of Central Tendency 80
(Mode)
Mode for Grouped Data
In solving the mode value using grouped data, use the formula:
𝑑1
𝑥ො = 𝐿𝐵 𝑐. 𝑖
𝑑1 + 𝑑2
(Mode)
Properties of the Mode
1. It can be used when the data are qualitative
as well as quantitative.
2. It may not be unique.
3. It is not affected by extreme values.
4. It may not exist.
Measure of Central Tendency 83
(Mode)
When to Use the Mode
1. When the “typical” value is desired.
2. When the data set is measured on a
nominal scale.
Measure of Central Tendency 84
(Quantiles)
• Quantile is a score distribution where the scores are
divided into different equal parts.
• Quartile is a score point that divides the scores in the
distribution into four (4) equal parts.
• Decile is a score point that divides the scores in the
distribution into ten (10) equal parts.
• Percentile is a score that divides the scores in the
distribution into hundred (100) equal parts.
Measure of Central Tendency 85
(Quantiles)
Quantiles for Ungrouped Data
(Quantiles)
Quartiles for Grouped Data
(Quantiles)
Quartiles for Grouped Data
(Quantiles)
Deciles for Grouped Data
(Quantiles)
Deciles for Grouped Data
(Quantiles)
Percentiles for Grouped Data
(Quantiles)
Percentiles for Grouped Data
Measures of Variation
• Central tendency is the distribution of scores.
• Also, we want to know how close or how far is the distance of the
scores of students in a certain test from the average performance
of the group.
• Measure of variation is a single value that is used to describe the
spread of the scores in a distribution.
• The term variation is also known as variability or dispersion.
• There are several ways of describing the variation of scores:
absolute measures of variation & relative measures of
variation.
Let us consider the scores of students in three sections of mathematics class. We shall consider the spread
of scores based on graphical presentation.
of Variation (Range)
1. Range
- The difference between the highest score and the lowest score in a
distribution.
- The simplest and the crudest measure of variation, simplest
because we shall only consider the highest and lowest score.
of Variation (Range)
Example 1: Find the range of the two groups of score distribution.
GROUP A GROUP B
10 (LS) 15 (LS)
12 16
15 16
17 17
25 17
26 23
28 25
30 26
35 (HS) 30 (HS)
Types of Absolute Measures 97
of Variation (Range)
Range for Grouped Data
R = HSUB - LSLB
of Variation (Range)
Example 2: Find the value of range of the scores of 50 students in Mathematics
achievement test. X f
25-32 3
33-40 7
41-48 5
49-56 4
57-64 12
65-72 6
73-80 8
81-88 3
89-97 2
Types of Absolute Measures 99
of Variation (Range)
Properties of Range
• It is quick and easy to understand.
• It is a rough estimation of variation.
• It is easily affected by the extreme scores.
IQR is the difference between the third quartile and the first quartile.
IQR = Q3 – Q1
𝑸𝟑 − 𝑸𝟏
𝑸𝑫 =
𝟐
Where, QD = quartile deviation value
𝑸𝟏 = value of first quartile
𝑸𝟑 = value of third quartile
Types of Absolute Measures 102
Example 1: Using the given data 6, 8, 10, 12, 12, 14, 15,
16, 20, find the quartile deviation.
Quartile Deviation of Grouped Data 104
x f cf<
25-32 3 3
33-40 7 10
Example 2: The data given
41-48 5 15
below are the scores of fifty
49-56 4 19
(50) students in Filipino 57-64 12 31
class. Solve for the value of 65-72 6 37
quartile deviation (QD). 73-80 8 45
81-88 3 48
89-97 2 50
Types of Absolute Measures 105
𝜮Τ𝒙 − 𝒙
ഥ/
𝑴𝑫 =
𝒏
Where, MD = mean deviation value
x = individual score
𝑥ҧ = sample mean
n = number of cases
Types of Absolute Measures 109
𝜮𝒇Τ𝒙𝒎 − 𝒙ഥ/
𝑴𝑫 =
𝒏
Where, MD = mean deviation value
f = class frequency
𝑥𝑚 = class mark or midpoint of each category
𝑥ҧ = sample mean
n = number of cases
Types of Absolute Measures 112
𝒙 𝒇 𝒙𝒎 𝒇𝒙𝒎 𝒙𝒎 − ഥ
𝒙 /𝒙𝒎 - ഥ
𝒙/ 𝒇/𝒙𝒎 - ഥ
𝒙/
Population Variance
Σ(𝑥 − 𝜇)2
𝜎2 =
𝑁
Sample Variance
Σ(𝑥 − 𝑥)
ҧ 2
𝑠2 =
𝑛−1
Types of Absolute Measures 116
𝒙 𝒙 − ഥ
𝒙 𝒙)𝟐
(𝒙 − ഥ
19 4.4 19.36
17 2.4 5.76
16 1.4 1.96
16 1.4 1.96
15 0.4 0.16
14 -0.6 0.36
14 -0.6 0.36
13 -1.6 2.56
12 -2.6 6.76
10 -4.6 21.16
x = 146
𝒙)𝟐 = 𝟔𝟎. 𝟒𝟎
𝜮(𝒙 − ഥ
ഥ
𝒙 = 𝟏𝟒. 𝟔
Types of Absolute Measures 118
Population Variance
Σ𝑓(𝑥 − 𝜇)2
𝑚
𝜎2 =
𝑁
Sample Variance
Σ𝑓(𝑥 − 𝑥)
ҧ 2
𝑚
𝑠2 =
𝑛−1
Types of Absolute Measures 119
𝒙 𝒇 𝒙𝒎 𝒇𝒙𝒎 ഥ
𝒙 𝒙𝒎 − ഥ
𝒙 𝒙)𝟐
(𝒙𝒎 − ഥ 𝒙)𝟐
𝒇(𝒙𝒎 − ഥ
Sample Variance
Σ(𝑥 − 𝑥)ҧ 2
𝑠=
𝑛−1
Types of Absolute Measures 123
Note: If the variance is already solved, take the square root of the
variance to get the value of the standard deviation.
Types of Absolute Measures 124
Answer: The mean of the three groups of scores is the same which is equal
to 18.25 and the standard deviation of section A = 5.15, section B = 6.92, and
section C = 7.63.
Which group of students performed well in the class?