0% found this document useful (0 votes)
1 views

MathEng3_EDA_Lesson 4_Frequency Distribution and Graphs

This document outlines the objectives and methods for organizing and representing data using frequency distributions and various graphical representations. It covers topics such as creating frequency distribution tables, categorical and grouped frequency distributions, and interpreting different types of graphs. The document also provides examples and guidelines for constructing these distributions effectively.

Uploaded by

vlairlinda
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
1 views

MathEng3_EDA_Lesson 4_Frequency Distribution and Graphs

This document outlines the objectives and methods for organizing and representing data using frequency distributions and various graphical representations. It covers topics such as creating frequency distribution tables, categorical and grouped frequency distributions, and interpreting different types of graphs. The document also provides examples and guidelines for constructing these distributions effectively.

Uploaded by

vlairlinda
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 98

Frequency Distributions

and Graphs
MATHENG3 – Engineering Data Analysis
Engr. Jerime C. Jimenez, MSCE, SO2
Lesson Objectives

After completing this chapter, the student should be able to:

ENGR. JERIME C. JIMENEZ, MSCE, SO2


• Organize data using a frequency distribution.

• Represent data in frequency distributions graphically, using


histograms, and ogives.

• Represent data using bar graphs, Pareto charts, time series graphs,
pie graphs, and dot plots.

• Draw and interpret a stem and leaf plot.


ENGR. JERIME C. JIMENEZ, MSCE, SO2
Organizing Data
Organize data using a frequency distribution
Organizing Data
Ages of the Top 50 Wealthiest People
45 46 64 57 85

ENGR. JERIME C. JIMENEZ, MSCE, SO2


92 51 71 54 48
27 66 76 55 69
54 44 54 75 46
61 68 78 61 83
88 45 89 67 56
81 58 55 62 38
55 56 64 81 38
49 68 91 56 68
46 47 83 71 62

When the data is presented in original form, they are called raw data.
Organizing Data
Ages of the Top 50 Wealthiest People
45 46 64 57 85

ENGR. JERIME C. JIMENEZ, MSCE, SO2


92 51 71 54 48
27 66 76 55 69
54 44 54 75 46
61 68 78 61 83
88 45 89 67 56
81 58 55 62 38
55 56 64 81 38
49 68 91 56 68
46 47 83 71 62

A frequency distribution is the organization of raw data in table form, using classes and frequencies
Frequency Distribution
Ages of the Top 50
CLASS LIMITS TALLY FREQUENCY
Wealthiest People

ENGR. JERIME C. JIMENEZ, MSCE, SO2


45 46 64 57 85
92 51 71 54 48
27 66 76 55 69
54 44 54 75 46
61 68 78 61 83
88 45 89 67 56
81 58 55 62 38
55 56 64 81 38
Frequency Distribution Table
49 68 91 56 68
46 47 83 71 62
Each raw data value is placed into a
quantitative or qualitative category called a
class.
Frequency Distribution
Ages of the Top 50
CLASS LIMITS TALLY FREQUENCY
Wealthiest People
27 – 35 | 1

ENGR. JERIME C. JIMENEZ, MSCE, SO2


45 46 64 57 85
36 – 44 ||| 3
92 51 71 54 48
45 – 53 |||| |||| 9
27 66 76 55 69 54 – 62 |||| |||| |||| 15
54 44 54 75 46 63 – 71 |||| |||| 10
61 68 78 61 83 72 – 80 ||| 3

88 45 89 67 56 81 – 89 |||| || 7
90 – 98 || 2
81 58 55 62 38
50
55 56 64 81 38
Frequency Distribution Table
49 68 91 56 68
46 47 83 71 62
ENGR. JERIME C. JIMENEZ, MSCE, SO2
Categorical and
Grouped
Organize data using a frequency distribution
Categorical Frequency Distributions
The categorical frequency distribution is used for data that can be placed in
specific categories, such as nominal- or ordinal-level data.

ENGR. JERIME C. JIMENEZ, MSCE, SO2


Example: Twenty-five army inductees were given a blood test to determine their
blood type. The data set is

A B B AB O
O O B AB B
B B O A O
A O O O AB
AB A O B A
Categorical Frequency Distributions
The categorical frequency distribution is used for data that can be placed in
specific categories, such as nominal- or ordinal-level data.

ENGR. JERIME C. JIMENEZ, MSCE, SO2


Example: Twenty-five army inductees were given a blood test to determine their
blood type. The data set is

A B B AB O CLASS TALLY FREQUENCY PERCENT


O O B AB B
B B O A O
A O O O AB
AB A O B A

Step 1: Create the frequency distribution table.


Categorical Frequency Distributions
The categorical frequency distribution is used for data that can be placed in
specific categories, such as nominal- or ordinal-level data.

ENGR. JERIME C. JIMENEZ, MSCE, SO2


Example: Twenty-five army inductees were given a blood test to determine their
blood type. The data set is

A B B AB O CLASS TALLY FREQUENCY PERCENT


O O B AB B A
B
B B O A O
O
A O O O AB
AB
AB A O B A

Step 2: Tally the data and place the results in column ‘Tally’.
Categorical Frequency Distributions
The categorical frequency distribution is used for data that can be placed in
specific categories, such as nominal- or ordinal-level data.

ENGR. JERIME C. JIMENEZ, MSCE, SO2


Example: Twenty-five army inductees were given a blood test to determine their
blood type. The data set is

A B B AB O CLASS TALLY FREQUENCY PERCENT


O O B AB B A ||||
B |||| ||
B B O A O
O ||||
A O O O AB ||||
AB A O B A AB ||||

Step 3: Count the tallies and place the results in column ‘Frequency’.
Categorical Frequency Distributions
The categorical frequency distribution is used for data that can be placed in
specific categories, such as nominal- or ordinal-level data.

ENGR. JERIME C. JIMENEZ, MSCE, SO2


Example: Twenty-five army inductees were given a blood test to determine their
blood type. The data set is

A B B AB O CLASS TALLY FREQUENCY PERCENT


O O B AB B A |||| 5
B |||| || 7
B B O A O
O ||||
A O O O AB 9
||||
AB A O B A AB |||| 4
25

Step 4: Find the percentage of values in each class by using the formula:
𝒇
% = ⋅ 𝟏𝟎𝟎
𝒏
Categorical Frequency Distributions
The categorical frequency distribution is used for data that can be placed in
specific categories, such as nominal- or ordinal-level data.

ENGR. JERIME C. JIMENEZ, MSCE, SO2


Example: Twenty-five army inductees were given a blood test to determine their
blood type. The data set is

A B B AB O CLASS TALLY FREQUENCY PERCENT


O O B AB B A |||| 5 20%
B |||| || 7 28%
B B O A O
O ||||
A O O O AB 9 36%
||||
AB A O B A AB |||| 4 16%
25 100%

Conclusion: More people have type ‘O’ blood than any other type.
Grouped Frequency Distributions
When the range of the data is large, the data must be grouped into classes that are
more than one unit in width, in what is called a grouped frequency distribution.

ENGR. JERIME C. JIMENEZ, MSCE, SO2


Example: A distribution of the blood glucose levels in milligrams per deciliter for 50
randomly selected college students is shown.

CLASS CLASS
TALLY FREQUENCY
LIMITS BOUNDARIES
58 – 64 57.5 – 64.5 | 1
65 – 71 64.5 – 71.5 |||| | 6
72 – 78 71.5 – 78.5 |||| |||| 10
79 – 85 78.5 – 85.5 |||| |||| |||| 14
86 – 92 85.5 – 92.5 |||| |||| || 12
93 – 99 92.5 – 99.5 |||| 5
100 – 106 99.5 – 106.5 || 2
Total 50
Grouped Frequency Distributions

ENGR. JERIME C. JIMENEZ, MSCE, SO2


CLASS CLASS
TALLY FREQUENCY
LIMITS BOUNDARIES
58 – 64
65 – 71
72 – 78
79 – 85
86 – 92
93 – 99
100 – 106

Class Limits – these are the numerical


figures used to specify the lower and
upper limits of a class.
Grouped Frequency Distributions
Lower Class Limit
This represents the smallest data
value that can be included in the class.

ENGR. JERIME C. JIMENEZ, MSCE, SO2


CLASS CLASS
TALLY FREQUENCY
LIMITS BOUNDARIES
58 – 64
65 – 71
72 – 78
Upper Class Limit
79 – 85 It represents the largest data value
that can be included in the class.
86 – 92
93 – 99
100 – 106

Class Limits – these are the numerical


figures used to specify the lower and
upper limits of a class.
Grouped Frequency Distributions

ENGR. JERIME C. JIMENEZ, MSCE, SO2


CLASS CLASS
TALLY FREQUENCY
LIMITS BOUNDARIES
58 – 64 57.5 – 64.5
65 – 71 64.5 – 71.5
72 – 78 71.5 – 78.5
79 – 85 78.5 – 85.5
86 – 92 85.5 – 92.5
93 – 99 92.5 – 99.5
100 – 106 99.5 – 106.5

Class Boundaries – these numbers are


used to separate the classes so that there
are no gaps in the frequency distribution.
Basic rule of thumb
The class limits should have the
Grouped Frequency Distributions same decimal place value as the
data, but the class boundaries
should have one additional place
value and end in a 5.

ENGR. JERIME C. JIMENEZ, MSCE, SO2


CLASS CLASS
TALLY FREQUENCY
LIMITS BOUNDARIES
Lower Boundary = Lower Limit – 0.5
58 – 64 57.5 – 64.5
𝐿𝑜𝑤𝑒𝑟 𝐵𝑜𝑢𝑛𝑑𝑎𝑟𝑦 = 58 − 0.5 = 57.5
65 – 71 64.5 – 71.5
72 – 78 71.5 – 78.5
79 – 85 78.5 – 85.5
86 – 92 85.5 – 92.5
93 – 99 92.5 – 99.5
Upper Boundary = Upper Limit + 0.5
100 – 106 99.5 – 106.5 𝑈𝑝𝑝𝑒𝑟 𝐵𝑜𝑢𝑛𝑑𝑎𝑟𝑦 = 106 + 0.5 = 106.5

Class Boundaries – these numbers are


used to separate the classes so that there
are no gaps in the frequency distribution.
Note:
Do not subtract the limits of a
Grouped Frequency Distributions single class. It will result in an
incorrect answer.

ENGR. JERIME C. JIMENEZ, MSCE, SO2


CLASS CLASS
TALLY FREQUENCY
LIMITS BOUNDARIES
58 – 64 57.5 – 64.5
65 − 58 = 7
65 – 71 64.5 – 71.5
7
72 – 78 71.5 – 78.5
7 79 – 85 78.5 – 85.5
7
86 – 92 85.5 – 92.5
7
93 – 99 92.5 – 99.5
7
100 – 106 99.5 – 106.5

𝐶𝑙𝑎𝑠𝑠 𝑊𝑖𝑑𝑡ℎ = 𝑈𝑝𝑝𝑒𝑟 𝐶𝑙𝑎𝑠𝑠 𝐿𝑖𝑚𝑖𝑡 − 𝐿𝑜𝑤𝑒𝑟 𝐶𝑙𝑎𝑠𝑠 𝐿𝑖𝑚𝑖𝑡


𝐶𝑙𝑎𝑠𝑠 𝑊𝑖𝑑𝑡ℎ = 𝑈𝑝𝑝𝑒𝑟 𝐶𝑙𝑎𝑠𝑠 𝐵𝑜𝑢𝑛𝑑𝑎𝑟𝑦 − 𝐿𝑜𝑤𝑒𝑟 𝐶𝑙𝑎𝑠𝑠 𝐵𝑜𝑢𝑛𝑑𝑎𝑟𝑦
Grouped Frequency Distributions
Class Width

Rules to follow in constructing the grouped frequency distribution:

ENGR. JERIME C. JIMENEZ, MSCE, SO2


1. There should be between 5 and 20 classes.

2. It is preferable but not absolutely necessary that the class width be an odd number.
This ensures that the midpoint of each class has the same place value as the data.

𝑙𝑜𝑤𝑒𝑟 𝑏𝑜𝑢𝑛𝑑𝑎𝑟𝑦 + 𝑢𝑝𝑝𝑒𝑟 𝑏𝑜𝑢𝑛𝑑𝑎𝑟𝑦


𝑋𝑚 =
2
or
𝑙𝑜𝑤𝑒𝑟 𝑙𝑖𝑚𝑖𝑡 + 𝑢𝑝𝑝𝑒𝑟 𝑙𝑖𝑚𝑖𝑡
𝑋𝑚 =
2
Grouped Frequency Distributions
Class Width

Rules to follow in constructing the grouped frequency distribution:

ENGR. JERIME C. JIMENEZ, MSCE, SO2


3. The classes must be mutually exclusive. Mutually exclusive classes have
nonoverlapping class limits so that data cannot be placed into two classes.

Age Age
10 – 20 10 – 20
20 – 30 21 – 31
30 – 40 32 – 42
40 – 50 43 – 53

4. The classes must be continuous.


5. The classes must be exhaustive.
Grouped Frequency Distributions
Class Width

Rules to follow in constructing the grouped frequency distribution:

ENGR. JERIME C. JIMENEZ, MSCE, SO2


6. The classes must be equal in width. This avoids a distorted view of the data.
One exception occurs when a distribution has a class that is open-ended.

Age Frequency Minutes Frequency


10 – 20 3 Below 110 16
21 – 31 6 110 – 114 24
32 – 42 4 115 – 119 38
43 – 53 10 120 – 124 14
54 and above 8 125 – 129 5
Grouped Frequency Distributions
Example: These data represent the record high temperatures in degrees Fahrenheit
(°F) for each of the 50 states. Construct a grouped frequency distribution for the data,

ENGR. JERIME C. JIMENEZ, MSCE, SO2


using 7 classes.
Find the highest value and lowest value: H = 134 and L = 100
112 110 107 116 120
100 118 112 108 113 Find the range: 𝑅 =𝐻−𝐿
127 117 114 110 120
𝑅 = 134 − 100 = 34
120 116 115 121 117
134 118 118 113 105 Select the number of class: no. of class = 7
118 122 117 120 110
Class Width: 𝑅
105 114 118 119 118 𝑊𝑖𝑑𝑡ℎ =
𝑛𝑜. 𝑜𝑓 𝑐𝑙𝑎𝑠𝑠𝑒𝑠
110 114 122 111 112
109 105 106 104 114 34
𝑊𝑖𝑑𝑡ℎ = = 4.9 ≈ 5
7
112 109 110 111 114

Step 1: Determine the classes.


Grouped Frequency Distributions
Example: These data represent the record high temperatures in degrees Fahrenheit
(°F) for each of the 50 states. Construct a grouped frequency distribution for the data,

ENGR. JERIME C. JIMENEZ, MSCE, SO2


using 7 classes. CLASS CLASS
# TALLY FREQUENCY
112 110 107 116 120 LIMITS BOUNDARIES

100 118 112 108 113 1


127 117 114 110 120 2
120 116 115 121 117 3
134 118 118 113 105 4
118 122 117 120 110 5
105 114 118 119 118 6
110 114 122 111 112 7
109 105 106 104 114
112 109 110 111 114 𝐶𝑙𝑎𝑠𝑠 𝑤𝑖𝑑𝑡ℎ = 5

Step 1: Determine the classes.


Grouped Frequency Distributions
Example: These data represent the record high temperatures in degrees Fahrenheit
(°F) for each of the 50 states. Construct a grouped frequency distribution for the data,

ENGR. JERIME C. JIMENEZ, MSCE, SO2


using 7 classes. CLASS CLASS
# TALLY FREQUENCY
112 110 107 116 120 LIMITS BOUNDARIES

100 118 112 108 113 1 100 104


127 117 114 110 120 2 105 109
120 116 115 121 117 3 110 114
134 118 118 113 105 4 115 119
118 122 117 120 110 5 120 124
105 114 118 119 118 6 125 129
110 114 122 111 112 7 130 134
109 105 106 104 114
112 109 110 111 114 𝐶𝑙𝑎𝑠𝑠 𝑤𝑖𝑑𝑡ℎ = 5

Step 1: Determine the classes.


Grouped Frequency Distributions
Example: These data represent the record high temperatures in degrees Fahrenheit
(°F) for each of the 50 states. Construct a grouped frequency distribution for the data,

ENGR. JERIME C. JIMENEZ, MSCE, SO2


using 7 classes. CLASS CLASS
# TALLY FREQUENCY
112 110 107 116 120 LIMITS BOUNDARIES

100 118 112 108 113 1 100 104 99.5 104.5


127 117 114 110 120 2 105 109 104.5 109.5
120 116 115 121 117 3 110 114 109.5 114.5
134 118 118 113 105 4 115 119 114.5 119.5
118 122 117 120 110 5 120 124 119.5 124.5
105 114 118 119 118 6 125 129 124.5 129.5
110 114 122 111 112 7 130 134 129.5 134.5
109 105 106 104 114
112 109 110 111 114 𝐶𝑙𝑎𝑠𝑠 𝑤𝑖𝑑𝑡ℎ = 5

Step 2: Tally the data.


Grouped Frequency Distributions
Example: These data represent the record high temperatures in degrees Fahrenheit
(°F) for each of the 50 states. Construct a grouped frequency distribution for the data,

ENGR. JERIME C. JIMENEZ, MSCE, SO2


using 7 classes. CLASS CLASS
# TALLY FREQUENCY
112 110 107 116 120 LIMITS BOUNDARIES

100 118 112 108 113 1 100 104 99.5 104.5 ||

127 117 114 110 120 2 105 109 104.5 109.5 |||| |||

120 116 115 121 117 3 110 114 109.5 114.5 |||| |||| |||| |||

134 118 118 113 105 4 115 119 114.5 119.5 |||| |||| |||

118 122 117 120 110 5 120 124 119.5 124.5 |||| ||

105 114 118 119 118 6 125 129 124.5 129.5 |

110 114 122 111 112 7 130 134 129.5 134.5 |

109 105 106 104 114


112 109 110 111 114 𝐶𝑙𝑎𝑠𝑠 𝑤𝑖𝑑𝑡ℎ = 5

Step 3: Find the numerical frequencies from the tallies.


Grouped Frequency Distributions
Example: These data represent the record high temperatures in degrees Fahrenheit
(°F) for each of the 50 states. Construct a grouped frequency distribution for the data,

ENGR. JERIME C. JIMENEZ, MSCE, SO2


using 7 classes. CLASS CLASS
# TALLY FREQUENCY
112 110 107 116 120 LIMITS BOUNDARIES

100 118 112 108 113 1 100 104 99.5 104.5 || 2


127 117 114 110 120 2 105 109 104.5 109.5 |||| ||| 8
120 116 115 121 117 3 110 114 109.5 114.5 |||| |||| |||| ||| 18
134 118 118 113 105 4 115 119 114.5 119.5 |||| |||| ||| 13
118 122 117 120 110 5 120 124 119.5 124.5 |||| || 7
105 114 118 119 118 6 125 129 124.5 129.5 | 1
110 114 122 111 112 7 130 134 129.5 134.5 | 1
109 105 106 104 114 Total 50
112 109 110 111 114 𝐶𝑙𝑎𝑠𝑠 𝑤𝑖𝑑𝑡ℎ = 5

Step 3: Find the numerical frequencies from the tallies.


Grouped Frequency Distributions
Cumulative Frequency Distribution is a distribution that shows the number of
data values less than or equal to a specific value.

ENGR. JERIME C. JIMENEZ, MSCE, SO2


CLASS CUMULATIVE
# FREQUENCY
BOUNDARIES FREQUENCY

1 99.5 104.5 2 Less than 99.5 0


2 104.5 109.5 8 Less than 104.5 2
3 109.5 114.5 18 Less than 109.5 10
4 114.5 119.5 13 Less than 114.5 28
5 119.5 124.5 7 Less than 119.5 41
6 124.5 129.5 1 Less than 124.5 48
7 129.5 134.5 1 Less than 129.5 49
Total 50 Less than 134.5 50
Frequency Distributions
After the raw data have been organized into a frequency distribution, it will be
analyzed by looking for peaks and extreme values. The peaks show which class or

ENGR. JERIME C. JIMENEZ, MSCE, SO2


classes have the most data values compared to the other classes. Extreme values,
called outliers, show large or small data values that are relative to other data values.
Ungrouped Frequency Distributions
When the range of the data values is relatively small, a frequency distribution can be
constructed using single data values for each class. This type of distribution is called

ENGR. JERIME C. JIMENEZ, MSCE, SO2


ungrouped frequency distribution.
Example: The data shown here represent the number of miles per gallon (mpg) that
30 selected four-wheel drive sport utility vehicles obtained in city driving. Construct a
frequency distribution, and analyze the distribution.
12 16 15 12 19 Find the highest value and lowest value: H = 19 and L = 12
17 18 16 14 13
Find the range: 𝑅 =𝐻−𝐿
12 12 12 15 16
14 16 15 12 18 𝑅 = 19 − 12 = 7
16 17 16 15 16
18 15 16 15 14
Ungrouped Frequency Distributions
Example: The data shown here represent the number of miles per gallon (mpg) that
30 selected four-wheel drive sport utility vehicles obtained in city driving. Construct a
frequency distribution, and analyze the distribution.

ENGR. JERIME C. JIMENEZ, MSCE, SO2


12 16 15 12 19 CLASS CLASS
TALLY FREQUENCY
LIMITS BOUNDARIES
17 18 16 14 13
12 12 12 15 16
14 16 15 12 18
16 17 16 15 16
18 15 16 15 14

H = 19 and L = 12
Ungrouped Frequency Distributions
Example: The data shown here represent the number of miles per gallon (mpg) that
30 selected four-wheel drive sport utility vehicles obtained in city driving. Construct a
frequency distribution, and analyze the distribution.

ENGR. JERIME C. JIMENEZ, MSCE, SO2


12 16 15 12 19 CLASS CLASS
TALLY FREQUENCY
LIMITS BOUNDARIES
17 18 16 14 13
12
12 12 12 15 16
13
14 16 15 12 18
14
16 17 16 15 16
15
18 15 16 15 14
16
H = 19 and L = 12 17
18
19
Ungrouped Frequency Distributions
Example: The data shown here represent the number of miles per gallon (mpg) that
30 selected four-wheel drive sport utility vehicles obtained in city driving. Construct a
frequency distribution, and analyze the distribution.

ENGR. JERIME C. JIMENEZ, MSCE, SO2


12 16 15 12 19 CLASS CLASS
TALLY FREQUENCY
LIMITS BOUNDARIES
17 18 16 14 13
12 11.5 12.5
12 12 12 15 16
13 12.5 13.5
14 16 15 12 18
14 13.5 14.5
16 17 16 15 16
15 14.5 15.5
18 15 16 15 14
16 15.5 16.5
H = 19 and L = 12 17 16.5 17.5
18 17.5 18.5
19 18.5 19.5
Ungrouped Frequency Distributions
Example: The data shown here represent the number of miles per gallon (mpg) that
30 selected four-wheel drive sport utility vehicles obtained in city driving. Construct a
frequency distribution, and analyze the distribution.

ENGR. JERIME C. JIMENEZ, MSCE, SO2


12 16 15 12 19 CLASS CLASS
TALLY FREQUENCY
LIMITS BOUNDARIES
17 18 16 14 13
12 11.5 12.5 |||| |
12 12 12 15 16
13 12.5 13.5 |
14 16 15 12 18
14 13.5 14.5 |||
16 17 16 15 16
15 14.5 15.5 |||| |
18 15 16 15 14
16 15.5 16.5 |||| |||

H = 19 and L = 12 17 16.5 17.5 ||


18 17.5 18.5 |||
19 18.5 19.5 |
Ungrouped Frequency Distributions
Example: The data shown here represent the number of miles per gallon (mpg) that
30 selected four-wheel drive sport utility vehicles obtained in city driving. Construct a
frequency distribution, and analyze the distribution.

ENGR. JERIME C. JIMENEZ, MSCE, SO2


12 16 15 12 19 CLASS CLASS
TALLY FREQUENCY
LIMITS BOUNDARIES
17 18 16 14 13
12 11.5 12.5 |||| | 6
12 12 12 15 16
13 12.5 13.5 | 1
14 16 15 12 18
14 13.5 14.5 ||| 3
16 17 16 15 16
15 14.5 15.5 |||| | 6
18 15 16 15 14
16 15.5 16.5 |||| ||| 8
H = 19 and L = 12 17 16.5 17.5 || 2
18 17.5 18.5 ||| 3
19 18.5 19.5 | 1
Ungrouped Frequency Distributions
Example: The data shown here represent the number of miles per gallon (mpg) that
30 selected four-wheel drive sport utility vehicles obtained in city driving. Construct a
frequency distribution, and analyze the distribution.

ENGR. JERIME C. JIMENEZ, MSCE, SO2


12 16 15 12 19 CUMULATIVE
FREQUENCY
17 18 16 14 13
Less than 11.5 0
12 12 12 15 16
Less than 12.5 6
14 16 15 12 18
16 17 16 15 16 Less than 13.5 7

18 15 16 15 14 Less than 14.5 10


Less than 15.5 16
H = 19 and L = 12 Less than 16.5 24
Less than 17.5 26
Less than 18.5 29
Less than 19.5 30
ENGR. JERIME C. JIMENEZ, MSCE, SO2
Histograms,
Frequency Polygons,
and Ogives
Represent data in frequency distributions graphically, using
histograms, frequency polygons, and ogives.
Graphing the Data
Graph
A diagram that shows relationships between variables: how changes in one variable

ENGR. JERIME C. JIMENEZ, MSCE, SO2


are related to changes in another, how one variable (the dependent variable) is a
function of another (the independent variable).
The purpose of graphs in statistics is to convey the data to the viewers in pictorial
form. It is easier for most people to comprehend the meaning of data presented
graphically than data presented numerically in tables or frequency distributions.

Histogram Frequency Polygon Ogive


Graphing the Data
Constructing a Histogram, Frequency Polygon, and Ogive
STEP 1 Draw and label the x and y axes.

ENGR. JERIME C. JIMENEZ, MSCE, SO2


STEP 2 On the x axis, label the class boundaries of the frequency distribution
for the histogram and ogive. Label the midpoints for the frequency
polygon.

STEP 3 Plot the frequencies for each class, and draw the vertical bars for the
histogram and the lines for the frequency polygon and ogive.

NOTE: Remember that the lines for the frequency polygon begin and end on
the x axis while the lines for the ogive begin on the x axis.
Histogram
The histogram is a graph that displays the data by using contiguous vertical bars
(unless the frequency of a class is 0) of various heights to represent the frequencies of

ENGR. JERIME C. JIMENEZ, MSCE, SO2


the classes.

When to Use Histogram?


• The data should be numerical.
• A histogram is used to check the shape of the data distribution.
• Used to check whether the process changes from one period to another.
• Used to determine whether the output is different when it involves two or more
processes.
• Used to analyze whether the given process meets the customer requirements.
Histogram
Types of Histogram

ENGR. JERIME C. JIMENEZ, MSCE, SO2


Uniform Histogram
A uniform histogram reveals that the
number of classes is too small, and each
class has the same number of elements. It
may involve distribution that has several
peaks.
Histogram
Types of Histogram

ENGR. JERIME C. JIMENEZ, MSCE, SO2


Bimodal Histogram
If a histogram has two peaks, it is said to be
bimodal. Bimodality occurs when the data
set has observations on two different kinds
of individuals or combined groups if the
centers of the two separate histograms are
far enough to the variability in both the data
sets.
Histogram
Types of Histogram

ENGR. JERIME C. JIMENEZ, MSCE, SO2


Symmetric Histogram
A symmetric histogram is also called a bell-
shaped histogram. When you draw the
vertical line down the center of the
histogram, and the two sides are identical in
size and shape, the histogram is said to be
symmetric. The diagram is perfectly
symmetric if the right half portion of the
image is similar to the left half. The
histograms that are not symmetric are
known as skewed.
Histogram
Types of Histogram

ENGR. JERIME C. JIMENEZ, MSCE, SO2


Probability Histogram
A Probability Histogram shows a pictorial
representation of a discrete probability
distribution. It consists of a rectangle
centered on every value of x, and the area of
each rectangle is proportional to the
probability of the corresponding value. The
probability histogram diagram is begun by
selecting the classes. The probabilities of
each outcome are the heights of the bars of
the histogram.
Histogram
Example: Construct a histogram to represent the data shown for the record high
temperatures for each of the 50 states.
y

ENGR. JERIME C. JIMENEZ, MSCE, SO2


CLASS
# FREQUENCY
BOUNDARIES

1 99.5 104.5 2
2 104.5 109.5 8
3 109.5 114.5 18
4 114.5 119.5 13
5 119.5 124.5 7
6 124.5 129.5 1
7 129.5 134.5 1
Total 50
x

Step 1: Draw and label the x and y axes.


Histogram
Example: Construct a histogram to represent the data shown for the record high
temperatures for each of the 50 states.
y

ENGR. JERIME C. JIMENEZ, MSCE, SO2


CLASS
# FREQUENCY
BOUNDARIES
18
1 99.5 104.5 2
2 104.5 109.5 8 15

3 109.5 114.5 18 12
4 114.5 119.5 13
9
5 119.5 124.5 7
6 124.5 129.5 1 6

7 129.5 134.5 1 3
Total 50
0 x
99.5° 104.5° 109.5° 114.5° 119.5° 124.5° 129.5° 134.5°

Step 2: Represent the frequency in the y axis and the class boundary on the x axis.
Histogram
Example: Construct a histogram to represent the data shown for the record high
temperatures for each of the 50 states.
y

ENGR. JERIME C. JIMENEZ, MSCE, SO2


CLASS
# FREQUENCY
BOUNDARIES 18
18
1 99.5 104.5 2
2 104.5 109.5 8 15
13
3 109.5 114.5 18 12
4 114.5 119.5 13
9 8
5 119.5 124.5 7 7

6 124.5 129.5 1 6

7 129.5 134.5 1 3 2
1 1
Total 50
0 x
99.5° 104.5° 109.5° 114.5° 119.5° 124.5° 129.5° 134.5°

Step 3: Using frequencies as the heights, draw vertical bars for each class.
Frequency Polygon
The frequency polygon is a graph that displays the data by using lines that connect
points plotted for the frequencies at the midpoints of the classes. The frequencies are

ENGR. JERIME C. JIMENEZ, MSCE, SO2


represented by the heights of the points.

𝑙𝑜𝑤𝑒𝑟 𝑏𝑜𝑢𝑛𝑑𝑎𝑟𝑦 + 𝑢𝑝𝑝𝑒𝑟 𝑏𝑜𝑢𝑛𝑑𝑎𝑟𝑦


𝑋𝑚 =
2
or
𝑙𝑜𝑤𝑒𝑟 𝑙𝑖𝑚𝑖𝑡 + 𝑢𝑝𝑝𝑒𝑟 𝑙𝑖𝑚𝑖𝑡
𝑋𝑚 =
2

Frequency Polygon
Frequency Polygon
Example: Construct a histogram to represent the data shown for the record high
temperatures for each of the 50 states.

ENGR. JERIME C. JIMENEZ, MSCE, SO2


CLASS
# MIDPOINTS FREQUENCY
BOUNDARIES

1 99.5 104.5 2
2 104.5 109.5 8
3 109.5 114.5 18
4 114.5 119.5 13
5 119.5 124.5 7
6 124.5 129.5 1
7 129.5 134.5 1

Step 1: Find the midpoints of each class.


Frequency Polygon
Example: Construct a histogram to represent the data shown for the record high
temperatures for each of the 50 states.
y

ENGR. JERIME C. JIMENEZ, MSCE, SO2


CLASS
# MIDPOINTS FREQ
BOUNDARIES
18
1 99.5 104.5 102 2
2 104.5 109.5 107 8 15

3 109.5 114.5 112 18 12


4 114.5 119.5 117 13
9
5 119.5 124.5 122 7
6 124.5 129.5 127 1 6

7 129.5 134.5 132 1 3

x
102° 107° 112° 117° 122° 127° 132°

Step 2: Draw the x and y axes. Label the x axis with the midpoint of each class, and then use
a suitable scale on the y axis for the frequencies.
Frequency Polygon
Example: Construct a histogram to represent the data shown for the record high
temperatures for each of the 50 states.
y

ENGR. JERIME C. JIMENEZ, MSCE, SO2


CLASS
# MIDPOINTS FREQ
BOUNDARIES
18
1 99.5 104.5 102 2
2 104.5 109.5 107 8 15

3 109.5 114.5 112 18 12


4 114.5 119.5 117 13
9
5 119.5 124.5 122 7
6 124.5 129.5 127 1 6

7 129.5 134.5 132 1 3

x
102° 107° 112° 117° 122° 127° 132°

Step 3: Using the midpoints for the x values and the frequencies as the y values, plot the
points.
Frequency Polygon
Example: Construct a histogram to represent the data shown for the record high
temperatures for each of the 50 states.
y

ENGR. JERIME C. JIMENEZ, MSCE, SO2


CLASS
# MIDPOINTS FREQ
BOUNDARIES
18
1 99.5 104.5 102 2
2 104.5 109.5 107 8 15

3 109.5 114.5 112 18 12


4 114.5 119.5 117 13
9
5 119.5 124.5 122 7
6 124.5 129.5 127 1 6

7 129.5 134.5 132 1 3

x
102° 107° 112° 117° 122° 127° 132°

Step 4: Connect adjacent points with line segments. Draw a line back to the x axis at the
beginning and end of the graph, at the same distance that the previous and next midpoints
would be located.
Ogive
The ogive is a graph that represents the cumulative frequencies for the classes in a
frequency distribution. This type of graph is also called the cumulative frequency

ENGR. JERIME C. JIMENEZ, MSCE, SO2


graph. The cumulative frequency is the sum of the frequencies accumulated up to
the upper boundary of a class in the distribution.

Ogive
Ogive
Example: Construct a histogram to represent the data shown for the record high
temperatures for each of the 50 states.

ENGR. JERIME C. JIMENEZ, MSCE, SO2


CLASS
# FREQUENCY
BOUNDARIES

1 99.5 104.5 2
2 104.5 109.5 8
3 109.5 114.5 18
4 114.5 119.5 13
5 119.5 124.5 7
6 124.5 129.5 1
7 129.5 134.5 1
Total 50

Step 1: Find the cumulative frequency for each class.


Ogive
Example: Construct a histogram to represent the data shown for the record high
temperatures for each of the 50 states.

ENGR. JERIME C. JIMENEZ, MSCE, SO2


50

CUMULATIVE 45
FREQUENCY
40
Less than 99.5 0
35
Less than 104.5 2
30
Less than 109.5 10
25
Less than 114.5 28
20
Less than 119.5 41
15
Less than 124.5 48
10
Less than 129.5 49
5
Less than 134.5 50

99.5° 104.5° 109.5° 114.5° 119.5° 124.5° 129.5° 134.5°

Step 2: Draw the x and y axes. Label the x axis with the class boundaries. Use an appropriate
scale for the y axis to represent the cumulative frequencies.
Ogive
Example: Construct a histogram to represent the data shown for the record high
temperatures for each of the 50 states.

ENGR. JERIME C. JIMENEZ, MSCE, SO2


50

CUMULATIVE 45
FREQUENCY
40
Less than 99.5 0
35
Less than 104.5 2
30
Less than 109.5 10
25
Less than 114.5 28
20
Less than 119.5 41
15
Less than 124.5 48
10
Less than 129.5 49
5
Less than 134.5 50

99.5° 104.5° 109.5° 114.5° 119.5° 124.5° 129.5° 134.5°

Step 3: Plot the cumulative frequency at each upper class boundary. Upper boundaries are
used since the cumulative frequencies represent the number of data values accumulated up to
the upper boundary of each class.
Ogive
Example: Construct a histogram to represent the data shown for the record high
temperatures for each of the 50 states.

ENGR. JERIME C. JIMENEZ, MSCE, SO2


50

CUMULATIVE 45
FREQUENCY
40
Less than 99.5 0
35
Less than 104.5 2
30
Less than 109.5 10
25
Less than 114.5 28
20
Less than 119.5 41
15
Less than 124.5 48
10
Less than 129.5 49
5
Less than 134.5 50

99.5° 104.5° 109.5° 114.5° 119.5° 124.5° 129.5° 134.5°

Step 4: Connect adjacent points with line segments.


Relative Frequency Graphs
The relative frequency graph is a graph showing the distribution of the data
converted into proportions instead of raw data as frequencies. To convert a frequency

ENGR. JERIME C. JIMENEZ, MSCE, SO2


into a proportion or relative frequency, divide the frequency for each class by the total
of the frequencies. The sum of the relative frequencies will always be ‘1’.

𝑓𝑟𝑒𝑞𝑢𝑒𝑛𝑐𝑦 𝑐𝑜𝑢𝑛𝑡
𝑅𝑒𝑙𝑎𝑡𝑖𝑣𝑒 𝐹𝑟𝑒𝑞𝑢𝑒𝑛𝑐𝑦 =
𝑡𝑜𝑡𝑎𝑙 𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑓𝑟𝑒𝑞𝑢𝑒𝑛𝑐𝑦
Relative Frequency Graphs
Example: Construct a histogram, frequency polygon, and ogive using relative
frequencies for the distribution of the miles that 20 randomly selected runners ran
during a given week.

ENGR. JERIME C. JIMENEZ, MSCE, SO2


CLASS
# FREQUENCY
BOUNDARIES

1 5.5 10.5 1
2 10.5 15.5 2
3 15.5 20.5 3
4 20.5 25.5 5
5 25.5 30.5 4
6 30.5 35.5 3
7 35.5 40.5 2
Total 20

Step 1: Convert each frequency to a proportion or relative frequency by dividing the frequency
for each class by the total number of observations.
Relative Frequency Graphs
Example: Construct a histogram, frequency polygon, and ogive using relative
frequencies for the distribution of the miles that 20 randomly selected runners ran
during a given week.

ENGR. JERIME C. JIMENEZ, MSCE, SO2


CLASS
# FREQ REL. FREQ.
BOUNDARIES

1 5.5 10.5 1 1/20 = 0.05


2 10.5 15.5 2 2/20 = 0.10
3 15.5 20.5 3 3/20 = 015
4 20.5 25.5 5 5/20 = 0.25
5 25.5 30.5 4 4/20 = 0.20
6 30.5 35.5 3 3/20 = 0.15
7 35.5 40.5 2 2/20 = 0.10
Total 20 Total 1.00

Step 2: Find the cumulative relative frequencies.


Relative Frequency Graphs
Example: Construct a histogram, frequency polygon, and ogive using relative
frequencies for the distribution of the miles that 20 randomly selected runners ran
during a given week.

ENGR. JERIME C. JIMENEZ, MSCE, SO2


CLASS CUMULATIVE CUMULATIVE
# FREQ REL. FREQ.
BOUNDARIES FREQUENCY REL. FREQ.

1 5.5 10.5 1 1/20 = 0.05 Less than 5.5 0 0.00


2 10.5 15.5 2 2/20 = 0.10 Less than 10.5 1 0.05
3 15.5 20.5 3 3/20 = 015 Less than 15.5 3 0.15
4 20.5 25.5 5 5/20 = 0.25 Less than 20.5 6 0.30
5 25.5 30.5 4 4/20 = 0.20 Less than 25.5 11 0.55
6 30.5 35.5 3 3/20 = 0.15 Less than 30.5 15 0.75
7 35.5 40.5 2 2/20 = 0.10 Less than 35.5 18 0.90
Total 20 Total 1.00 Less than 40.5 20 1.00

Step 3: Draw each graph (histogram, freq. polygon, and ogive).


Relative Frequency Graphs
CLASS
# REL. FREQ.
BOUNDARIES

1 5.5 10.5 1/20 = 0.05 y Histogram for Runners’ Miles

ENGR. JERIME C. JIMENEZ, MSCE, SO2


2 10.5 15.5 2/20 = 0.10
3 15.5 20.5 3/20 = 0.15
0.25
4 20.5 25.5 5/20 = 0.25 0.25
5 25.5 30.5 4/20 = 0.20

Relative frequency
0.20
6 30.5 35.5 3/20 = 0.15 0.20
7 35.5 40.5 2/20 = 0.10
0.15 0.15
Total 1.00 0.15

0.10 0.10
0.10

0.05
0.05

0 x
5.5 10.5 15.5 20.5 25.5 30.5 35.5 40.5
Miles
Relative Frequency Graphs
CLASS MID
# BOUNDARIES POINTS
REL. FREQ.

1 5.5 10.5 8 1/20 = 0.05 y Frequency Polygon for Runners’ Miles

ENGR. JERIME C. JIMENEZ, MSCE, SO2


2 10.5 15.5 13 2/20 = 0.10

3 15.5 20.5 18 3/20 = 0.15


0.25
4 20.5 25.5 23 5/20 = 0.25

Relative frequency
5 25.5 30.5 28 4/20 = 0.20 0.20

6 30.5 35.5 33 3/20 = 0.15


0.15
7 35.5 40.5 38 2/20 = 0.10

Total 1.00 0.10

0.05

0
8 13 18 23 28 33 38
x
Miles
Relative Frequency Graphs
CUM. CUM. REL.
FREQ. FREQ.
y Ogive for Runners’ Miles

ENGR. JERIME C. JIMENEZ, MSCE, SO2


Less than 5.5 0 0.00

Less than 10.5 1 0.05

Less than 15.5 3 0.15 1.00

Less than 20.5 6 0.30

Relative frequency
0.80
Less than 25.5 11 0.55

Less than 30.5 15 0.75


0.60
Less than 35.5 18 0.90

Less than 40.5 20 1.00 0.40

0.20

0 x
5.5 10.5 15.5 20.5 25.5 30.5 35.5 40.5
Miles
Graphs
Other Types of

ENGR. JERIME C. JIMENEZ, MSCE, SO2


Other Types of Graphs

ENGR. JERIME C. JIMENEZ, MSCE, SO2


Bar Graph
A Bar Graph represents the data by using vertical or horizontal bars whose heights or
lengths represent the frequencies of the data.

ENGR. JERIME C. JIMENEZ, MSCE, SO2


Bar Graph
Example: The table shows the average money spent by first-year college students.
Draw a horizontal and vertical bar graph for the data.
y First-Year College Student Spending

ENGR. JERIME C. JIMENEZ, MSCE, SO2


Average Money spent by
First-year College Students Electronics

Electronics $ 728
Dorm décor

Dorm décor 344


Clothing
Clothing 141

Shoes 72 Shoes

0 x
$100 $200 $300 $400 $500 $600 $700 $800

Step 1: Draw and label the x and y axes. For the horizontal bar graph place the frequency
scale on the x axis, and for the vertical bar graph place the frequency scale on the y axis
Bar Graph
Example: The table shows the average money spent by first-year college students.
Draw a horizontal and vertical bar graph for the data.
y Average Amount Spent

ENGR. JERIME C. JIMENEZ, MSCE, SO2


$800
Average Money spent by
First-year College Students $700

$600
Electronics $ 728
$500
Dorm décor 344
$400

Clothing 141 $300

$200
Shoes 72
$100

0 x
Electronics Dorm décor Clothing Shoes

Step 1: Draw and label the x and y axes. For the horizontal bar graph place the frequency
scale on the x axis, and for the vertical bar graph place the frequency scale on the y axis
Bar Graph
Bar graphs can also be used to compare data for two or more groups. These types of bar
graphs are called compound bar graphs.

ENGR. JERIME C. JIMENEZ, MSCE, SO2


Pareto Chart
A Pareto Chart is used to represent a frequency distribution for a categorical
variable, and the frequencies are displayed by the heights of vertical bars, which are

ENGR. JERIME C. JIMENEZ, MSCE, SO2


arranged in order from highest to lowest.
Pareto Chart
Example: The data shown here consist of the number of police calls for specific
categories that a local municipality received in 2011. Draw and analyze a Pareto Chart
for the data.

ENGR. JERIME C. JIMENEZ, MSCE, SO2


Category Number Category Number

Juvenile complaint 92 Juvenile complaint 92


Loud
27 Drug offenses 79
noise/music/party
Drug offenses 79 Disabled vehicle 65
Driving under the Driving under the
38 38
influence influence
Disabled vehicle 65 Loud
27
noise/music/party

Step 1: Arrange the data from the largest to smallest according to frequency
Pareto Chart
Example: The data shown here consist of the number of police calls for specific
categories that a local municipality received in 2011. Draw and analyze a Pareto Chart
for the data. y Police Calls

ENGR. JERIME C. JIMENEZ, MSCE, SO2


Category Number 100
90
Juvenile complaint 92 80
70
Drug offenses 79
60

Disabled vehicle 65 50
40
Driving under the
38 30
influence
20
Loud
27 10
noise/music/party
0 x

Step 2: Draw and label the x and y axes. Then, draw the bars corresponding to the
frequencies.
Pareto Chart

Suggestions for Drawing

ENGR. JERIME C. JIMENEZ, MSCE, SO2


Pareto Charts

1. Make the bars the same


width.

2. Arrange the data from largest


to smallest according to
frequency.

3. Make the units that are used


for the frequency equal in
size.
Time Series Graph
A time series graph is a line graph that shows data such as measurements, sales or
frequencies over time. They can be used to show a pattern or trend in the data and are

ENGR. JERIME C. JIMENEZ, MSCE, SO2


useful for making predictions about the future such as weather forecasting or financial
growth.
Time Series Graph
Example: The data show the percentage of U.S. adults who smoke. Draw and analyze
a time series graph for the data.

ENGR. JERIME C. JIMENEZ, MSCE, SO2


Year 1970 1980 1990 2000 2010
Percent 37 33 25 23 19

Step 1: Draw and label the x and y axes. Label the x axis for years and the y axis for the
percent.
Time Series Graph
Year 1970 1980 1990 2000 2010
Percent 37 33 25 23 19

ENGR. JERIME C. JIMENEZ, MSCE, SO2


y Percent of U.S. Adult Smokers

40

35

30
Percent

25

20

15

10

0 x
1970 1980 1990 2000 2010
Year
Time Series Graph
When two or more data sets are compared on the same graph it is called a compound
time series graph.

ENGR. JERIME C. JIMENEZ, MSCE, SO2


Pie Graph
A pie graph is a circle that is divided into sections or wedges according to the
percentage of frequencies in each category of the distribution.

ENGR. JERIME C. JIMENEZ, MSCE, SO2


Pie Graph
Example: This frequency distribution shows the number of pounds of each snack food
eaten during the Super Bowl. Construct a pie graph for the data.

ENGR. JERIME C. JIMENEZ, MSCE, SO2


Snack Pounds
Potato chips 11.2 million
Tortilla chips 8.2 million
Pretzels 4.3 million
Popcorn 3.8 million
Snack nuts 2.5 million
30.0 million

Step 1: Since there are 360° in a circle, the frequency for each class must be converted to a
𝑓
proportional part of the circle. This conversion is done by using the formula 𝐷𝑒𝑔𝑟𝑒𝑒𝑠 = ⋅ 360°
𝑛
Pie Graph
Example: This frequency distribution shows the number of pounds of each snack food
eaten during the Super Bowl. Construct a pie graph for the data.

ENGR. JERIME C. JIMENEZ, MSCE, SO2


Snack Pounds Degrees
11.2
Potato chips 11.2 million Potato chips ⋅ 360° = 134°
30
Tortilla chips 8.2 million
Pretzels 4.3 million 8.2
Tortilla chips ⋅ 360° = 98°
30
Popcorn 3.8 million
Snack nuts 2.5 million 4.3
Pretzels ⋅ 360° = 52°
30.0 million 30
3.8
Popcorn ⋅ 360° = 46°
30
2.5
Snack nuts ⋅ 360° = 30°
30

𝑓
Step 2: Each frequency must also be converted to a percentage. % = ⋅ 100
𝑛
Pie Graph
Example: This frequency distribution shows the number of pounds of each snack food
eaten during the Super Bowl. Construct a pie graph for the data.

ENGR. JERIME C. JIMENEZ, MSCE, SO2


Snack Pounds Degrees %
11.2
Potato chips 11.2 million 134° Potato chips ⋅ 100 = 37.3%
30
Tortilla
8.2 million 98°
chips 8.2
Tortilla chips ⋅ 100 = 27.3%
Pretzels 4.3 million 52° 30
Popcorn 3.8 million 46° 4.3
Pretzels ⋅ 100 = 14.3%
Snack nuts 2.5 million 30° 30
30.0 million 360° 3.8
Popcorn ⋅ 100 = 12.7%
30
2.5
Snack nuts ⋅ 100 = 8.3%
30

Step 3: Using a protractor and a compass, draw the graph, using the appropriate degree
measures found in Step 1, and label each section with the name and percentages.
Pie Graph
Example: This frequency distribution shows the number of pounds of each snack food
eaten during the Super Bowl. Construct a pie graph for the data.

ENGR. JERIME C. JIMENEZ, MSCE, SO2


Snack Pounds Degrees %
Super Bowl Snacks
Potato chips 11.2 million 134° 37.3% Snack nuts
8%
Tortilla
8.2 million 98° 27.3%
chips
Popcorn
Pretzels 4.3 million 52° 14.3% 13%

Potato chips
Popcorn 3.8 million 46° 12.7% 38%
Snack nuts 2.5 million 30° 8.3%
30.0 million 360° 99.9%

Pretzels
14%

Tortilla chips
27%
Dotplot
A dotplot is a statistical graph in which each data value is plotted as a point (dot)
above the horizontal axis. Dotplots are used to show how the data values are

ENGR. JERIME C. JIMENEZ, MSCE, SO2


distributed and to see if there are any extremely high or low data values.
Dotplot
Example: The data show the number of named storms each year for the last 40 years.
Construct and analyze a dotplot for the data.

ENGR. JERIME C. JIMENEZ, MSCE, SO2


19 15 14 7 6 11 11
9 16 8 8 11 9 8
16 12 13 14 13 12 7
15 15 19 11 4 6 13
10 15 7 12 6 10
28 12 8 7 12 9

Step 1: Find the lowest and highest data values, and decide what scale to use on the
horizontal axis.
Dotplot
Example: The data show the number of named storms each year for the last 40 years.
Construct and analyze a dotplot for the data.

ENGR. JERIME C. JIMENEZ, MSCE, SO2


19 15 14 7 6 11 11
9 16 8 8 11 9 8
16 12 13 14 13 12 7
15 15 19 11 4 6 13
10 15 7 12 6 10
28 12 8 7 12 9

Lowest data: 4
Highest data: 28
5 10 15 20 25 30

Step 2: Draw a horizontal line, and draw the scale on the line
Dotplot
Example: The data show the number of named storms each year for the last 40 years.
Construct and analyze a dotplot for the data.

ENGR. JERIME C. JIMENEZ, MSCE, SO2


19 15 14 7 6 11 11
9 16 8 8 11 9 8
16 12 13 14 13 12 7
15 15 19 11 4 6 13
10 15 7 12 6 10
28 12 8 7 12 9

Lowest data: 4
Highest data: 28
5 10 15 20 25 30
The graph shows that the majority of the named storms occur
with frequency between 6 and 16 per year. There are only 3
years when there were 19 or more named storms per year.

Step 3: Plot each data value above the line. If the value occurs more than once, plot the other
point above the first point.
Stem and Leaf Plots
The stem and leaf plot is a method of organizing data and is a combination of sorting
and graphing. It has the advantage over a grouped frequency distribution of retaining

ENGR. JERIME C. JIMENEZ, MSCE, SO2


the actual data while showing them in graphical form.
Stem and Leaf Plot
Example 1: An insurance company researcher conducted a survey on the number of
car thefts in a large city for a period of 30 days last summer. The raw data are shown.
Construct a stem and leaf plot by using classes 50-54, 55-59, 60-64, 65-69, 70-74, and

ENGR. JERIME C. JIMENEZ, MSCE, SO2


75-79

52 62 51 50 69 50, 51, 51, 52, 53, 53, 55, 55, 56, 57, 57, 58, 59, 62, 63,
65, 65, 66, 66, 67, 68, 69, 69, 72, 73, 75, 75, 77, 78, 79
58 77 66 53 57
75 56 55 67 73
79 59 68 65 72
57 51 63 69 75
65 53 78 66 55

Step 1: Arrange the data in order.


Stem and Leaf Plot
Example 1: An insurance company researcher conducted a survey on the number of
car thefts in a large city for a period of 30 days last summer. The raw data are shown.
Construct a stem and leaf plot by using classes 50-54, 55-59, 60-64, 65-69, 70-74, and

ENGR. JERIME C. JIMENEZ, MSCE, SO2


75-79

52 62 51 50 69 50, 51, 51, 52, 53, 53, 55, 55, 56, 57, 57, 58, 59, 62, 63,
65, 65, 66, 66, 67, 68, 69, 69, 72, 73, 75, 75, 77, 78, 79
58 77 66 53 57
75 56 55 67 73 50 – 54 50, 51, 51, 52, 53, 53
79 59 68 65 72 55 – 59 55, 55, 56, 57, 57, 58, 59
57 51 63 69 75 60 – 64 62, 63
65 53 78 66 55 65 – 69 65, 65, 66, 66, 67, 68, 69, 69
70 – 74 72, 73
75 - 79 75, 75, 77, 78, 79

Step 2: Separate the data according to the classes.


Stem and Leaf Plot
Example 1: An insurance company researcher conducted a survey on the number of
car thefts in a large city for a period of 30 days last summer. The raw data are shown.
Construct a stem and leaf plot by using classes 50-54, 55-59, 60-64, 65-69, 70-74, and

ENGR. JERIME C. JIMENEZ, MSCE, SO2


75-79

50 – 54 50, 51, 51, 52, 53, 53 Leading digit (stem) Trailing digit (leaf)
55 – 59 55, 55, 56, 57, 57, 58, 59 5 011233
60 – 64 62, 63 5 5567789
65 – 69 65, 65, 66, 66, 67, 68, 69, 69 6 23
70 – 74 72, 73 6 55667899
75 - 79 75, 75, 77, 78, 79 7 23
7 55789

Step 3: Separate the data according to the classes.


Stem and Leaf Plot
Example 1: An insurance company researcher conducted a survey on the number of
car thefts in a large city for a period of 30 days last summer. The raw data are shown.
Construct a stem and leaf plot by using classes 50-54, 55-59, 60-64, 65-69, 70-74, and

ENGR. JERIME C. JIMENEZ, MSCE, SO2


75-79

50 – 54 50, 51, 51, 52, 53, 53 Leading digit (stem) Trailing digit (leaf)
55 – 59 55, 55, 56, 57, 57, 58, 59 5 011233
60 – 64 62, 63 5 5567789
65 – 69 65, 65, 66, 66, 67, 68, 69, 69 6 23
70 – 74 72, 73 6 55667899
75 - 79 75, 75, 77, 78, 79 7 23
7 55789

When you analyze a stem and leaf plot, look for peaks and gaps in the distribution. See if the
distribution is symmetric or skewed. Check the variability of the data by looking at the
spread.
Stem and Leaf Plot
Example 2: The number of stories in two selected samples of tall buildings in Atlanta
and Philadelphia is shown. Construct a back-to-back stem and leaf plot, and compare
the distributions.

ENGR. JERIME C. JIMENEZ, MSCE, SO2


Atlanta Philadelphia Atlanta
55 70 44 36 40 61 40 38 32 30 26, 28, 29, 31, 32, 32, 32, 32, 32, 34, 34, 36, 38,
63 40 44 34 38 58 40 40 25 30 40, 40, 44, 44, 47, 50, 50, 52, 52, 53, 55, 60, 63 70
60 47 52 32 32 54 40 36 30 30
Philadelphia
50 53 32 28 31 53 39 36 34 33
25, 30, 30, 30, 30, 32, 32, 33, 34, 36, 36, 36, 38,
52 32 34 32 50 50 38 36 39 32 38, 39, 39, 40, 40, 40, 40, 50, 53, 54, 58, 61
26 29

Step 1: Arrange the data in order.


Stem and Leaf Plot
Example 2: The number of stories in two selected samples of tall buildings in Atlanta
and Philadelphia is shown. Construct a back-to-back stem and leaf plot, and compare
the distributions.

ENGR. JERIME C. JIMENEZ, MSCE, SO2


Atlanta
Atlanta Philadelphia
26, 28, 29, 31, 32, 32, 32, 32, 32, 34,
34, 36, 38, 40, 40, 44, 44, 47, 50, 50, 986 2 5
52, 52, 53, 55, 60, 63 70 8644222221 3 000022346668899
Philadelphia 74400 4 0000
25, 30, 30, 30, 30, 32, 32, 33, 34, 36, 532200 5 0348
36, 36, 38, 38, 39, 39, 40, 40, 40, 40,
30 6 1
50, 53, 54, 58, 61
0 7

Step 2: Construct a stem and leaf plot, using the same digits as stems. Place the digits for the
leaves for Atlanta on the left side of the stem and the digits for the leaves for Philadelphia on
the right side.
Stem and Leaf Plot
Example 2: The number of stories in two selected samples of tall buildings in Atlanta
and Philadelphia is shown. Construct a back-to-back stem and leaf plot, and compare
the distributions.

ENGR. JERIME C. JIMENEZ, MSCE, SO2


Atlanta
Atlanta Philadelphia
26, 28, 29, 31, 32, 32, 32, 32, 32, 34,
34, 36, 38, 40, 40, 44, 44, 47, 50, 50, 986 2 5
52, 52, 53, 55, 60, 63 70 8644222221 3 000022346668899
Philadelphia 74400 4 0000
25, 30, 30, 30, 30, 32, 32, 33, 34, 36, 532200 5 0348
36, 36, 38, 38, 39, 39, 40, 40, 40, 40,
30 6 1
50, 53, 54, 58, 61
0 7

Step 3: Compare the distributions. The buildings in Atlanta have a large variation in the
number of stories per building. Although both distributions are peaked in the 30- to 39-story
class, Philadelphia has more buildings in this class. Atlanta has more buildings that have 40
or more stories than Philadelphia does.
Frequency Distributions
and Graphs
MATHENG3 – Engineering Data Analysis
Engr. Jerime C. Jimenez, MSCE, SO2

You might also like