GE 4 Module 10
GE 4 Module 10
Prepared by:
RRBuizaJr_GE 4 Module 10
INTRODUCTION
LEARNING OUTCOMES:
After completing this module, you are expected to:
Distinguish ungrouped and grouped data
Organize and graph data using: Frequency Distribution Table,
RAW DATA
Data recorded in sequence in which they are collected and before they are processed
or ranked are called raw data.
21 19 24 25 29 34 26 27 37 33
18 20 19 22 19 19 25 22 25 23
25 19 31 19 23 18 23 19 23 26
22 28 21 20 22 22 21 20 19 21
25 23 18 37 27 23 21 25 21 24
Suppose we collect information on these ages (in years) of 50 students selected from a
university. The data values, in the order they are collected, are presented above.
These are called UNGROUPED DATA. An ungrouped data set contains information on
each member of a sample or population individually.
Suppose we ask the same 50 students about their status. In this data, F, SO, J and SE
are the abbreviations for Freshmen, Sophomore, Junior and Senior, respectively. This
is an example of QUALITATIVE (OR CATEGORICAL) RAW DATA.
J F SO SE J J SE J J J
F F J F F F SE SO SE J
J F SE SO SO F J F SE SE
RRBuizaJr_GE 4 Module 10
SO SE J SO SO J J SO F SO
SE SE F SE J SO F J SO SO
Note: The data presented in the previous examples are also called ungrouped data.
A Frequency Distribution for qualitative data lists all categories and the number of
elements that belong to each of the categories.
Example: A sample of 100 students enrolled in Libon Community College was asked
what they intended to do after graduation. Forty-four said they wanted to work for
private companies/businesses, 16 said they wanted to work for the federal
government, 23 wanted to work for state or local governments, and 17 intended to start
their own businesses.
A sample of 30 employees from large companies was selected, and these employees
were asked how stressful their jobs were. The responses of these employees are
recorded below, where very represents very stressful, somewhat means somewhat
stressful and none stands for not stressful at all. Construct a frequency distribution
table for these data.
RRBuizaJr_GE 4 Module 10
Relative Frequency and Percentage Distribution of Stress on Job
Stress on Job Relative Frequency Percentage (%)
Very 10/30 = .333 .333(100) = 33.3
Somewhat 14/30 = .467 .467(100) = 46.7
None 6/30 = .200 .200(100) = 20.0
TOTAL 1.000 100
RRBuizaJr_GE 4 Module 10
The table below shows the breakdown of cases per province in Bicol according to the
active cases, recoveries and deaths related to COVID-19 as of August 22, 2020.
Province Active Cases Recoveries Deaths
Albay 198 155 26
Camarines Norte 16 12 3
Camarines Sur 242 136 8
Catanduanes 19 19 1
Masbate 36 79 0
Sorsogon 40 42 4
TOTAL 551 443 32
Data Source: DOH CHD – Bicol Facebook Post, August 22, 2020
Hi! For your first task, answer each item below in a sheet of paper:
1) Construct a separate bar graph for active cases, recoveries and
deaths. You can decide on what interval to use for each category.
Below each graph, make a brief interpretation of the illustration.
2) Construct a separate pie chart for active cases, recoveries and
deaths. Show your computation for the size of the angle for the
corresponding province. Below each chart, make a brief interpretation
of the illustration.
RRBuizaJr_GE 4 Module 10
As we can observe, the classes are
non-overlapping, that is, each value on
earning belongs to one and only one class.
The second column in the table lists the
number of employees who have earnings
within each class. The numbers listed in the
second column are called FREQUENCIES,
which give the number of values that belong
to different classes. The frequencies are
denoted by f.
For quantitative data, the frequency of a
class represents the number of values in the
data set that fall in that class. The example
contains six classes. Each class has a lower limit and an upper limit. The values 801,
1001, 1201, 1401, 1601 and 1801 give the lower limits, and the values 1000, 1200,
1400, 1600, 1800 and 2000 are the upper limits of the six classes, respectively.
8 25 11 15 29 22 10 5 17 21
22 13 26 16 18 12 9 26 20 16
23 14 19 23 20 16 32 16 21 14
Step 3: Identify the number of class using, √ where n is the class size.
√ √ = 5.48 5
5.48 is between the integers 5 and 6
6
Note: Choose the larger integer for the number of class.
Number of Class = 6
RRBuizaJr_GE 4 Module 10
Step 4: Compute for the Range
Range = Highest Score – Lowest Score
Range = 32 – 5 = 27
Note: Since there are 6 number of class, the table should have 6 rows.
Class Tally Frequency
Row 1 5 (Lowest Score) - __
Row 2 5+5 = 10 - __
Row 3 10+5 = 15 - __
Row 4 15+5 = 20 - __
Row 5 20+5 = 25 - __
Row 6 25+5 = 30 - __
The constant 5 that was added is the class width. Next task is to find the upper
limit for each row. Just reduce one from the preceding lower limit.
Class Tally Frequency
Row 1 5 – (10 – 1 = 9)
Row 2 10 – (15 – 1 = 14)
Row 3 15 – (20 – 1 = 19)
Row 4 20 – (25 – 1 = 24)
Row 5 25 – (30 – 1 = 29)
Row 6 30 – (29 + 5 = 34)
Note: To find the last value for the upper limit, add the class width. Next is to
tally and find the frequency per class.
Class Tally Frequency
5–9 III 3
10 – 14 IIII – I 6
15 – 19 IIII – III 8
20 – 24 IIII – III 8
25 – 29 IIII 4
30 – 34 I 1
RRBuizaJr_GE 4 Module 10
Given the participant’s ages at Retirement Seminar below, construct a
frequency distribution table to organize the data. Show your step-by-step
process in your paper.
56 58 62 62 69 48 70 71 72 55 56
57 65 64 59 60 61 49 65 70 66 58
62 67 59 62 64 63 48 52 54 74
HISTOGRAMS
A histogram can be drawn for a frequency distribution, a relative frequency
distribution, or a percentage distribution. It is called a FREQUENCY HISTOGRAM, a
RELATIVE FREQUENCY HISTOGRAM or a PERCENTAGE HISTOGRAM depending
on whether frequencies, relative frequencies, or percentages are marked on the
vertical axis.
SHAPES OF HISTOGRAM
A SYMMETRIC HISTOGRAM is identical on both sides
of its central point.
A SKEWED HISTOGRAM is non-symmetric. For a
skewed histogram the tail on one side is longer than the
tail on the other side. A SKEWED-TO-THE-RIGHT
HISTOGRAM has a longer tail on the right side. A
SKEWED-TO-THE-LEFT HISTOGRAM has a longer tail
on the left side.
RRBuizaJr_GE 4 Module 10
A UNIFORM or RECTANGULAR HISTOGRAM has the same
frequency for each class.
POLYGONS
A graph formed by joining the midpoints of the tops of successive bars in a
histogram with straight lines is called a POLYGON. A polygon with relative frequencies
marked on the vertical axis is called a RELATIVE FREQUENCY POLYGON. Similarly,
a polygon with percentages marked on the vertical axis is called a PERCENTAGE
POLYGON.
RRBuizaJr_GE 4 Module 10
Class Limits Class Boundaries Frequency Cumulative Frequency
5–9 4.5 – 9.5 3 3
10 – 14 9.5 – 14.5 6 3+6=9
15 – 19 14.5 – 19.5 8 3 + 6 + 8 = 17
20 – 24 19.5 – 24.5 8 3 + 6 + 8 + 8 = 25
25 – 29 24.5 – 29.5 4 3 + 6 + 8 + 8 + 4 = 29
30 – 34 29.5 – 34.5 1 3 + 6 + 8 + 8 + 4 + 1= 30
When plotted on a diagram the cumulative frequencies give a curve that is called
an OGIVE (pronounced as o-jive).
RRBuizaJr_GE 4 Module 10
One advantage of an ogive is that it can be used to approximate the cumulative
frequency for any interval. For example, to find the number of days for which 17 or
fewer iPods were sold. First, draw a vertical line from 17 on the horizontal axis up to
the ogive. Then draw a horizontal line from the point where this line intersects the
ogive to the vertical axis. This point gives the cumulative frequency of the class 5 to
17. We can draw an ogive for cumulative relative frequency and cumulative
percentage distributions the same way as we did for the cumulative frequency
distribution.
STEM-AND-LEAF DISPLAY
In a stem-and-leaf display of quantitative data, each value is divided into two
potions – a stem and a leaf. The leaves for each stem are shown separately in a
display. An advantage of a stem-and-leaf display over a frequency distribution is that
by preparing a stem-and-leaf display we do not lose information on individual
observations. A stem-and-leaf display is constructed only for quantitative data.
Example: The following are the scores of 30 college students on a statistical test.
75 52 80 96 65 79 71 87 93 95
69 72 81 61 76 86 79 68 50 92
83 84 77 64 71 87 72 92 57 98
Construct a stem-and-leaf display.
Sometimes, a data set may contain too many stems, with each stem containing
only a few leaves. In such cases, we may want to condense the stem-and-leaf display
by grouping the stems.
Example: The following stem-and-leaf display is prepared for the number of hours that
25 students spent working on computers during the past month.
RRBuizaJr_GE 4 Module 10
DOTPLOTS
One of the simplest methods for graphing and understanding quantitative data is
to create a dotplot. As with most graphs, statistical software should be used to make a
dotplot for large data sets, but it can also be created by hand. Dotplots can help us
detect outliers (also called extreme values) in a data set. Outliers are the values that
are extremely large or extremely small with respect to the rest of the data values.
Step 1: First, we draw a horizontal line (let us call this as number line) with numbers
that cover the given data from the minimum to maximum value.
Step 2: Place a dot above the value on the numbers line that represents each distance
listed in the table. If there are two or more observations with the same value, we stack
dots vertically above each other to represent those values.
RRBuizaJr_GE 4 Module 10
Given the participant’s ages at Retirement Seminar below, construct the
following:
a. Histogram
b. Polygon
c. Ogive
d. Stem-and-Leaf Display
e. Dotplot
56 58 62 62 69 48 70 71 72 55 56
57 65 64 59 60 61 49 65 70 66 58
62 67 59 62 64 63 48 52 54 74
EVALUATION
The following scores below are the scores in a final examination in
English 10
55 65 58 52 49
64 43 62 65 61
67 75 70 55 67
44 65 36 40 50
49 57 44 49 64
63 38 76 63 59
55 70 60 77 57
46 52 56 71 62
Using the following data above, answer the following:
1. Construct the following:
a. Frequency Distribution Table
b. Histogram
c. Polygon
d. Ogive
e. Stem-and-Leaf Display
f. Dotplot
RRBuizaJr_GE 4 Module 10
REFLECTION
Now that you have completed all the tasks given above, I want you to reflect
and assess yourself through answering the questions below on your journal.
Before starting this module, what level of knowledge you have about the
lesson? Rate yourself (1 – 10) and justify it.
After completing this module, what level of knowledge you have about the lesson?
Rate yourself (1 – 10) and justify it.
What are the challenges you’ve encountered while doing this module? Cite at least
three.
What are your ways forward after completing this module?
What is your learning in this module? Create a poem/poster/slogan/any type of
illustration that will summarize your learning about the topic.
REFERENCES
Mann, P. (2010). Introductory Statistics (7th edition). Hoboken, NJ:
Wiley.
RRBuizaJr_GE 4 Module 10
RUBRIC FOR REFLECTION
RRBuizaJr_GE 4 Module 10