0% found this document useful (0 votes)
4 views

Lecture 2-Organizing and Summarizing Data

The document provides an overview of frequency distribution and its importance in organizing and presenting data for research. It explains how to create frequency distribution tables, various types of statistical charts and graphs, including histograms, frequency polygons, and ogives, as well as their applications in data analysis. Additionally, it discusses the significance of recognizing distribution shapes and offers examples to illustrate these concepts.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
4 views

Lecture 2-Organizing and Summarizing Data

The document provides an overview of frequency distribution and its importance in organizing and presenting data for research. It explains how to create frequency distribution tables, various types of statistical charts and graphs, including histograms, frequency polygons, and ogives, as well as their applications in data analysis. Additionally, it discusses the significance of recognizing distribution shapes and offers examples to illustrate these concepts.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 35

Joao Saldanha University/Inst

Elementary Statistics
(STATS 1)
Lecture 2: Frequency
Distribution and Graph

Lecturer: Januario da Costa, PhD Cand.


Email: [email protected]
Introduction
When doing a research, data collected must be organize in some ways it is easy
for readers to read, understand and interpreted.
• One of the most convenient way to organize data is Frequency
Distribution.
• After Organizing data, it must also be presented in a way that can be
easily understood by reader that will be benefit from our findings.
The most convenient way is built Statistical Charts and Graphs.
The charts and graphs illustrated here are histograms, frequency polygons,
ogives, pie graphs, Pareto charts, and time series graphs.
A graph that combines the characteristics of a frequency distribution and a
histogram, called a stem and leaf plot, is also explained in this module.
Organizing Data
Frequency Distribution
A frequency distribution is the organization of raw data in table form, using classes and
frequencies.
Steps to develop Frequency Distribution:
• Find the minimum and Maximum of the data Value
• Determine Number of Classes. Keep it mind that Number of classes should be between 5-20.
• Calculate for Class Width.

• Draw four column table. First column for Class Boundaries, second column for Frequency,
third column for relative frequency, and the fourth column for cumulative frequency.
• Develop Lower-class boundary.
• Develop Upper-class boundary.
• Develop the frequency table by counting the tally on the raw data according to their
respective class boundaries.
Example 1: Quantitative Frequency Distribution
Suppose we want to study 50 wealthiest people in Timor-Leste. The first data
we want to get is their age. When data are in original form they are Called
RAW DATA.

If we look at the raw data, there is little information that we can get from here.
Frequency Distribution Table
Class Limits Frequency Relative Frequency Cumulative Frequency

Minimum Value = 27 Maximum Value = 92


Example 2: Qualitative (Categorical) Frequency Distribution
Twenty-five army inductees were given a blood test to determine their blood
type. The data set is

Class Frequency Relative Frequency Cumulative Frequency


Grouped Frequency Distribution
When the range of the data is large, the data must be grouped into classes
that are more than one unit in width, in what is called a grouped frequency
distribution.
Example 3: Grouped Frequency Distribution
For example, a distribution of the age for 15 people.
27 43 43 45 38
55 46 45 38 39
52 46 39 40 42
Class Limits Class Boundaries Frequency Relative Frequency Cumulative
Frequency

Age of 15 People Age of 15 people


9
8 7
7 6
6 5
Frequency

Frequency
5 4
4 3
3
2
2
1
1
0 0
27-32 33-37 39-43 45-47 51-54 57-60 26.5-32.5 32.5-38.5 38.5-44.5 44.5-50.5 50.5-56.5 56.5-62.5

Age Class Boundary


Histogram, Frequency
Polygon and Ogives
Statistical Charts and Graphs
Statistical graphs can be used to describe the data set or to analyze it. Graphs
are also useful in getting the audience’s attention in a publication or a
speaking presentation. They can be used to discuss an issue, reinforce a
critical point, or summarize a data set. They can also be used to discover a
trend or pattern in a situation over a period of time.

The three most commonly used graphs in research are:


1. The histogram.
2. The frequency polygon.
3. The cumulative frequency graph, or ogive
Histogram
The histogram is a graph that displays the data by using contiguous
vertical bars (unless the frequency of a class is 0) of various heights to
represent the frequencies of the classes.
From Example 1:
Suppose we want to study 50 wealthiest people in Timor-Leste.
Class Boundaries Frequency Relative Frequency Cumulative Frequency
27-35 1 2% 2%
36-44 3 6% 8%
45-53 9 18% 26%
54-62 15 30% 56%
63-71 10 20% 76%
72-80 3 6% 82%
81-89 7 14% 96%
90-98 2 4% 100%
TOTAL 50 100%
Histogram
Example 1:
From Example 2:
Twenty-five army inductees were given a blood test to determine their blood
type.
Blood Type Frequency Relative Frequency Cumulative Frequency
A 5 21% 21%
AB 3 13% 33%
B 7 29% 63%
O 9 38% 100%
24 100%
Histogram
Example 2:
Relative Frequency
The histogram is distributions of frequency, and it can be converted to
distributions using proportions instead of raw data as frequencies.
These types of graphs are called relative frequency graphs.
From Example 1:
Suppose we want to study 50 wealthiest people in Timor-Leste.
Class Boundaries Frequency Relative Frequency Cumulative Frequency
27-35 1 2% 2%
36-44 3 6% 8%
45-53 9 18% 26%
54-62 15 30% 56%
63-71 10 20% 76%
72-80 3 6% 82%
81-89 7 14% 96%
90-98 2 4% 100%
TOTAL 50 100%
Relative Frequency
Example 1:
From Example 2:
Twenty-five army inductees were given a blood test to determine their blood
type.
Blood Type Frequency Relative Frequency Cumulative Frequency
A 5 21% 21%
AB 3 13% 33%
B 7 29% 63%
O 9 38% 100%
24 100%
Relative Frequency
Example 2:
Frequency Polygons
The frequency polygon is a graph that displays the data by using lines that
connect points plotted for the frequencies at the midpoints of the classes.
The frequencies are represented by the heights of the points.
From Example 1:
Suppose we want to study 50 wealthiest people in Timor-Leste.
Class Class Relative Cumulative
Boundaries Midpoint Frequency Frequency Frequency
27-35 1 2% 2%
36-44 3 6% 8%
45-53 9 18% 26%
54-62 15 30% 56%
63-71 10 20% 76%
72-80 3 6% 82%
81-89 7 14% 96%
90-98 2 4% 100%
TOTAL 50 100%
Frequency Polygon
Example 1:
The Ogives
The ogive is a graph that represents the cumulative frequencies for the
classes in a frequency distribution.
From Example 1:
Suppose we want to study 50 wealthiest people in Timor-Leste.
Class Relative Cumulative
Boundaries Frequency Frequency Frequency
Less than 27 0 0% 0%
27-35 1 2% 2%
36-44 3 6% 8%
45-53 9 18% 26%
54-62 15 30% 56%
63-71 10 20% 76%
72-80 3 6% 82%
81-89 7 14% 96%
90-98 2 4% 100%
TOTAL 50 100%
The Ogives
Example 1:
Combination of Histograms and Ogives
The ogive is a graph that represents the cumulative frequencies for the
classes in a frequency distribution.
From Example 1:
Suppose we want to study 50 wealthiest people in Timor-Leste.
Class Relative Cumulative
Boundaries Frequency Frequency Frequency
Less than 27 0 0% 0%
27-35 1 2% 2%
36-44 3 6% 8%
45-53 9 18% 26%
54-62 15 30% 56%
63-71 10 20% 76%
72-80 3 6% 82%
81-89 7 14% 96%
90-98 2 4% 100%
TOTAL 50 100%
Combination of Histogram and the
Ogives
Example 1:
Other Types of Graphs
Bar Graphs
A bar graph represents the data by using vertical or horizontal bars
whose heights or lengths represent the frequencies of the data.
Pareto Chart
A Pareto chart is used to represent a frequency distribution for a
categorical variable, and the frequencies are displayed by the
heights of vertical bars, which are arranged in order from highest to
lowest.
Pie Chart
A pie chart is a circle that is divided into sections or wedges
according to the percentage of frequencies in each category of the
distribution.
Time Series Graph
A time series graph represents data that occur over a specific period
of time.
Distribution Shapes
Introduction
When one is describing data, it is important to be able to recognize the
shapes of the distribution values.
A distribution can have many shapes, and one method of analyzing a
distribution is to draw a histogram or frequency polygon for the
distribution.
From Example 1:
Top 50 Wealthiest People in T.L
Histogram
16
16 14
14 12
12
10

Frequency
10
8
Frequency

8
6
6
4
4
2 2
0 0
20 30 40 50 60 70 80 90 100
27- 36- 45- 54- 63- 72- 81- 90-
35 44 53 62 71 80 89 98 Age
Age
Distribution Shapes
Plotting With Excel

You might also like