0% found this document useful (0 votes)
767 views

Module Two: Frequency Distribution and Their Graphic Representations

1. The document discusses frequency distributions and their graphic representations. It defines grouped and ungrouped frequency distributions and explains how to organize data into these forms of distributions. 2. Specific steps are provided to construct a grouped frequency distribution from a set of data, including determining the class interval size, assigning data to intervals, and calculating frequencies. 3. An example shows organizing test score data from 60 students into a grouped frequency distribution with class intervals of 5 points. The distribution is presented in a table with class limits and frequencies.

Uploaded by

DarcknyPusod
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
767 views

Module Two: Frequency Distribution and Their Graphic Representations

1. The document discusses frequency distributions and their graphic representations. It defines grouped and ungrouped frequency distributions and explains how to organize data into these forms of distributions. 2. Specific steps are provided to construct a grouped frequency distribution from a set of data, including determining the class interval size, assigning data to intervals, and calculating frequencies. 3. An example shows organizing test score data from 60 students into a grouped frequency distribution with class intervals of 5 points. The distribution is presented in a table with class limits and frequencies.

Uploaded by

DarcknyPusod
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 14

Module Two

Frequency Distribution and Their Graphic Representations

Objectives

After studying this chapter, the student will be able to:

1. define grouped and ungrouped frequency distributions;


2. organize data in the form of a frequency distribution;
3. construct graphic representations of frequency distributions;
4. critique how frequency distributions may differ from one another;
5. evaluate a distribution of scores.

Introduction

After we obtain a set of measurements, a common next step is to put them in a systematic
order by grouping them in classes. A set of individual measurements taken as they come does
not convey much useful information. Generally we have a conception of how large they run
numerically.

The most common method of summarizing large data is to present them in condensed
form in tables or charts. The study of statistical presentation took most of the time in statistics
courses. The scope of statistics has grown to such an extent that less time is devoted to this kind
of work since then can mostly be done with the use of computers. Nevertheless, it is necessary
to discuss how students and researchers summarize data in frequency distributions.

Data obtained from surveys or research studies are usually a large set of measurements
and are by their very nature disorganized and varied. It is necessary therefore to organize them
in some systematic fashion where the numbers will emerge. The primary steps in arriving at
some meaningful interpretation of data require setting up a frequency distribution.

Frequency Distribution

A frequency distribution is an arrangement of data from highest to lowest values or vice


versa with a frequency of occurrence. It can be done in two ways, grouped frequency
distribution and ungrouped frequency distribution. It is a tabular arrangement of data into
classes according to the size or magnitude along with corresponding class frequencies (the
number of values fall in
each class). A frequency deals with numerical data hence; we have a so-called ungrouped data
and grouped data.

Ungrouped Data or Raw Data are data which have not been arranged in a systemic order.
It is also called a raw data, while grouped data are those presented in the form of frequency
distribution. When the data or numerical raw data is arranged in ascending or descending order
is called an array.

Valuable information about a large data set can be gained and a good overall picture of it
can be taken by grouping the data into a number of classes. For instance, consider the scores of
50 students in Statistical Methods listed in Table 1. These scores ranged from 30 to 95. If you
arrange all the scores from highest to lowest then place a slash mark (tally) alongside each score
every time it occurs, the result shown in Table 2 is an ungrouped frequency distribution of
scores. Note that the scores are widely spread out and a number of scores have a frequency of
zero. Usually, it is advisable to “group” the scores into what is referred to as class interval and
then obtain a frequency distribution of group scores.

Table 1.
Total Quizzes Scores of a random sample of 60 Graduate students in Statistical Methods during
the first semester of school year 2012-2013

67 73 64 77 33 70
43 35 41 78 63 70
87 39 52 54 45 86
71 95 57 61 44 85
88 57 78 55 84 83
30 71 53 50 48 82
69 41 47 69 94 80
30 67 39 49 30 68
47 69 84 69 30 70
67 100 36 66 36 70

These data can be arranged in either descending or ascending order to form an array.
This is done so that it will be easy to construct a frequency distribution. A frequency
distribution is any arrangement of data that shows the frequency of occurrence of values falling
within arbitrarily defined ranged of variable known as class interval. This is the case of a
grouped frequency distribution.

For ungrouped case, a frequency distribution is an ordering of data from either highest to
lowest or lowest to highest with a frequency of occurrence. Table 2 shows the ungrouped
frequency distribution in descending order of the data given in Table 1. The tallies are also
shown.
Table 2.
Ungrouped Frequency Distribution of Scores in Statistical Methods Course

Score Tally F Score Tally F Score Tally F


100 / 1 73 / 1 49 / 1
95 / 1 71 // 2 48 / 1
94 / 1 70 //// 4 47 // 2
93 / 1 69 //// 4 45 / 1
88 / 1 68 / 1 44 / 1
87 / 1 67 /// 3 43 / 1
86 / 1 66 / 1 41 // 2
85 / 1 64 / 1 39 // 2
84 // 2 63 / 1 36 // 2
83 / 1 61 / 1 35 / 1
82 / 1 55 / 1 33 / 1
80 / 1 54 / 1 30 //// 4
78 // 2 53 / 1
77 / 1 52 / 1
76 / 1 50 / 1

Class Interval and Frequencies

The first thing to decide is the size of the class interval. How many units shall it contain?
Obviously, the interval selected must not be so large that we lose the discrimination provided by
the original measurement, nor should the interval be so fine that the purposes of grouping are
defeated. The following are the steps in constructing a grouped frequency distribution.

Step 1. Find the difference between the highest and the lowest score values. Call this difference,
the range. For the given data in Table 2, the range is (100 - 30) = 70.

Step 2. Decide on the size of the class intervals. The general practice is to prefer not fewer than
10 or more than 20 class intervals. In our example an interval of 5 points will give 15
class intervals. The range 70 divided by the class size 5 gives 14 class intervals plus 1,
that will give a total of 15 class intervals. We shall designate the class interval by the
symbol “ i ”.

Step 3. Start the intervals with their lowest scores at multiple of the size of the interval – when
the interval is 5, start with 5, 10, 15, 20, etc.; when the intervals is 3, start with 3, 6, 9,
12, 15, etc. In our present example, the lowest class interval begins with 30. Add to this
i-1, to obtain the higher limit of the class interval. The top and bottom scores for each
interval are called class limits.

Step 4. The next higher class interval begins at the integer following the upper limit of the
lowest class interval. In our example, the next integer is 30. Follow the same steps as in
No. 3 to obtain the upper limit of the second class interval.
Follow these procedures for each successive higher class interval until all the
scores are included in their designated class interval.

Step 5. Assign each obtained score to the class interval within which it is included.
The group frequency distribution of the data in Table 2 appears in Table 3.

Table 3
Frequency Distribution of Scores of 50 Students in Statistical Methods

Class Interval Frequency


96 – 100 1
91 – 95 3
86 – 90 3
81 – 85 5
76 – 80 5
71 – 75 3
66 – 70 13
61 – 65 3
56 – 60 0
51 – 55 4
46 – 50 5
41 – 45 5
36 – 40 4
31 – 35 2
26 – 30 _4__
60

Class Limits, Exact Limits of the Class Interval

Class limits are the smallest and largest observations which are numerical data, events, etc
in each class.  The exact limits of a number are equal to its apparent value plus and minus one-
half of the unit of measurement. The same is true with class intervals. Thus, the exact limits of
the given interval 26 - 30, 31 - 35, and 36 - 40 are 25.5 - 30.5, 30.5 - 35.5 and 35.5 - 40.5,
respectively.

Midpoints

In grouping the data in class intervals, we assume that all the observations are
concentrated at the midpoint of the interval, that is, we regard the midpoint as the representative
score of all the observations in that interval. The midpoint is halfway between the exact limits of
the interval. It may be obtained by averaging the class interval or lower limits. Thus, the
midpoint of the class intervals 26 - 30, 31 - 35, and 36 - 40 or the midpoints of the exact limits
25.5 - 30.5, 30.5 - 35.5 and 35.5 - 40.5 are 28, 33 and 38, respectively.

Cumulative Frequency and Cumulative Percentage Distribution

Sometimes we have a need for information regarding the number or percentage of values
“greater than” or “less than” a specified value. The answer is readily available by the preparation
of a cumulative frequency distribution. The cumulative frequencies are obtained by adding
successively, starting from the bottom, the individual frequencies. The top entry in the
cumulative frequency column is always equal to N.
From a cumulative frequency distribution we can obtain the number of cases or
frequencies below the upper exact limits of that interval.

The cumulative percentage distribution is obtained by dividing each entry in the


cumulative frequency by N and multiplying by 100. Note that the top entry must be 100%.

Table 4
Frequency Distribution of Scores of 60 Students in Statistical Methods with the exact limits,
midpoint, cumulative frequency and cumulative percent

Class f Exact Cumulative Cumulative


Limits/ Limit Midpoint Frequency Percent
Intervals > < > <

96 - 100 1 95.5 - 100.5 98 1 60 2 100


91 - 95 3 90.5 - 95.5 93 4 59 7 98
86 - 90 3 85.5 - 90.5 88 7 56 12 93
81 - 85 5 80.5 - 85.5 83 12 53 20 88
76 - 80 5 77.5 - 80.5 78 17 48 28 80
71 - 75 3 70.5 - 75.5 73 20 43 33 72
66 - 70 13 65.5 - 70.5 68 33 40 55 67
61 - 65 3 60.5 - 65.5 63 36 27 60 45
56 - 60 0 55.5 - 60.5 58 36 24 60 40
51 - 55 4 50.5 - 55.5 53 40 24 67 40
46 - 50 5 45.5 - 50.5 48 45 20 75 33
41 - 45 5 40.5 - 45.5 43 50 15 83 25
36 - 40 4 35.5 - 40.5 36 54 10 90 17
31 - 35 2 30.5 - 35.5 33 56 6 93 10
26 - 30 4 25.5- 30.5 28 60 4 100 7

Exercise 2a

Exercises:

1. The data below are the grades of 50 students in Methods of Research class:

92 80 85 81 78
50 62 77 78 66
79 73 57 39 79
90 87 80 60 89
79 68 75 73 65
79 46 69 80 54
59 59 70 85 48
65 78 95 80 49
48 64 42 89 68
82 74 70 59 25

Questions:

a. Make a tally of scores from lowest to highest.


b. Construct a grouped frequency distribution with a class size of 5.
c. List down the exact limits, cumulative frequency, cumulative percent and midpoints.

2. The following scores represent the final examination grade for an elementary statistics
course.

21 65 75 30 57
74 52 70 82 36
80 77 81 95 41
65 92 85 55 76
52 25 64 75 78
25 80 98 81 67
41 71 83 54 64
72 88 62 74 43
60 78 89 76 84
48 84 90 35 70
34 67 47 82 69
74 63 80 85 61

a. Make a tally of scores from lowest to highest.


b. Set-up a grouped frequency distribution using a class size of 7
c. List down the exact limits, cumulative frequency, cumulative percent and midpoints.

Graphic Representation of Frequency Distribution

Frequency distributions provide valuable information, but sometimes a picture


representation is of much importance and of greater value. It has been said that a picture is worth
a thousand words for it conveys lots of meanings and information. Different pictorial
representations can be used to represent a data. With the graphics features available in the
computer, anybody can present the data in a more interesting and attractive manner. Some of the
most common graphs used by researchers are bar graphs, histogram, frequency polygons, and
ogives.

Graphs typically have two coordinate axes: the x-axis (the horizontal axis) and the y-axis
(the vertical axis). A graph is a device for showing numerical values or relationship in pictorial
form. It enables us to think about a problem in visual terms. Graphic representations of
frequency distribution enable us to visualize important properties of a frequency distribution. A
bar graph is a graphical representation of a frequency distribution in which vertical bars are
centered above each category along the x-axis and are separated from each other by a space
(Jackson, 2012).

Histogram is a graph of a frequency distribution represented by areas in the form of bars.


In histogram, the vertical bars are centered above scores on the x-axis; however, in a histogram,
the bars touch each other to indicate that the scores on the variable represent related, increasing
values. The points marked off along the horizontal axis, or abscissa, are the upper and lower
limits of each class interval. Instead of class limits, just the midpoints of the class intervals can
be indicated. Frequencies are denoted on the vertical axis. The area under the histogram
represents the total frequency.

Frequency Polygon is another graphic representation of a frequency distribution. The


vertical axis is labeled with frequencies and the midpoints of the class boundaries are scaled
along the horizontal axis. The frequency associated with each class indicated by a dot placed
over the midpoints of the class interval, which are then joined by straight lines. Frequency
polygon is a line graph of the frequencies of individual scores or intervals.
A histogram and a frequency polygon based on the grouped frequency distribution in
Table 2 are shown in Figures 1 and 2 and in the succeeding page. It is expected that you will be
able to organize any kind of data you will have in your respective classes and offices considering
the nominal measurement scales with the corresponding description.

12.5
F re q u e n c y

10.0

7.5

5.0

2.5

Mean =64.55
Std. Dev. =17.004
N =60
0.0
30 40 50 60 70 80 90 100

Score in Statistics

Figure 1. Histogram for Scores of 60 Students in Statistical Methods

Matrix 4.
Description and use of the different graphical forms commonly applied in
presenting the data

Frequency Bar Graph Histogram Frequency Polygon


Distribution
Description A list of all A pictorial graph A pictorial graph A line graph
scores with bars with bars representing the
occurring in representing the representing the frequency of
the frequency of frequency of occurrence of items
distribution occurrence of occurrence of items for quantitative
along with the items for for quantitative variables
frequency of qualitative variables
each variables
Use with Nominal, Nominal data Typically ordinal, Typically ordinal,
ordinal, interval, or ratio data; interval, or ratio data;
interval, or most appropriate for most appropriate for
ratio data discrete data continuous data
All histograms are bar graphs but the converse is not true. Bar charts need not have class
intervals of equal widths. Histograms are graphic representations of interval or ratio scaled
frequency distribution, while bar graphs are employed with nominal or ordinal variables.

Cumulative frequency or ogive is a graphic representation of the sum of all the scores in a
frequency distribution up to any given point. Plotting the cumulative frequencies against the
lower (or upper) limits results in a “greater than” or “less than” ogive. The “less than” ogive
corresponding to the cumulative frequency distribution is given in Table 2. This is shown in
Figure 2.

Figure 2. Frequency Polygon for Scores of 60 Students in Statistical Methods

How Frequency Distribution Differs


Four important properties of frequency distribution are central location, variations,
skewness, and kurtosis.

Central location refers to a value of the variable near the center of the frequency
distribution. It is a middle point. Measures of central locations are called averages.

Variation refers to the extent of the clustering about a central value.

Skewness refers to the symmetry of the frequency distribution. A distribution is


positively skewed if it tails off at the high end of the horizontal axis and there are relatively
fewer frequencies at this end. On the other hand, a distribution is negatively skewed if there are
relatively fewer scores associated with the left hand, or low side of the horizontal axis. (See
Figures 3- 5).

Figure 3 Graph of Cumulative Frequency Distribution or ogive for the Scores of 60 students in
Statistical Methods

Y
X
Mean Median Mode

Figure 4. Negatively Skewed Distribution

X
Mode Median Mean

Figure 5. Positively Skewed Distribution


Figure 6. Normal Distribution (Symmetrical Distribution)

Kurtosis refers to the flatness or peakedness of one distribution in relation to another. If a


distribution is characterized by a piling up of scores in the center of the distribution, that is, it is
more peaked than another, it is said to be leptokurtic distribution. If it is less peaked, it is said to
be platykurtic distribution. If it takes the ideal form of the normal curve, it is referred to as a
mesokurtic distribution.

A : Mesokurtic
B: Leptokurtic
C: Platykurtic

Figure 7. Three Frequency Distributions Different in Kurtosis


Exercises

1. The data below are the grades of 50 students in Mathematics class:

95 80 56 90 91
40 42 87 78 56
79 63 57 39 79
90 87 80 60 89
79 68 75 73 65
79 46 69 80 54
59 59 70 85 48
65 78 93 80 49
48 64 42 89 68
82 71 74 59 45

Using a class size of 5,

1.1 Construct a frequency distribution.


1.2 Present the data above in graphical forms: histogram, frequency polygon
and ogive.

2. The following scores represent the final examination grade for an elementary statistics
course.

23 60 79 32 57
74 52 70 82 36
80 77 81 95 41
65 92 85 55 76
52 10 64 75 78
25 80 98 81 67
41 71 83 54 64
72 88 62 74 43
60 78 89 76 84
48 84 90 15 70
34 67 17 82 69
74 63 80 85 61

2.1 Construct a frequency distribution using a class size of 7.


2.2 Graph the histogram, frequency polygon and ogive.
3. The following scores represent the final examination scores for an elementary statistics
course:

23 60 79 32 57 74 70 52 82 36
80 77 81 95 41 65 85 92 55 76
52 10 64 75 78 25 98 80 81 67
41 71 83 54 64 72 63 88 74 43
60 78 89 76 84 48 90 84 15 79
34 67 17 82 69 74 80 63 85 61

3.1 Construct a frequency distribution using a class size of 5.


3.2 Graph the histogram, frequency polygon and ogive.

Exercise 2.c

Revisit the data file you created in Part II of Exercise 1. Present some data in grouped and
ungrouped frequency distributions.

Terms and Concepts to Remember

Abscissa Platykurtic distribution


Ordinate Leptokurtic distribution
Bar graph Mesokurtic distribution
Histogram Positively Skewed Distribution
Cumulative Frequency Negatively Skewed Distribution
Polygon Class Limits
Frequency Polygon

You might also like