0% found this document useful (0 votes)

23 views59 pages

BIOL 2163 Lecture 2 - Summarizing and Graphing Data

Uploaded by

Zara16

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

23 views59 pages

BIOL 2163 Lecture 2 - Summarizing and Graphing Data

Uploaded by

Zara16

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 59

Chapter 2

Summarizing and Graphing Data

2-1 Overview
2-2 Frequency Distributions
2-3 Visualizing Data - Histograms
2-3 Visualizing Data - Statistical Graphics
2-3 Supplemental - Critical Thinking: Bad Graphs
Section 2-1
Overview

Copyright © 2010, 2007, 2004

Pearson Education, Inc. All
Rights Reserved.
Overview
Descriptive vs. Inferential Statistics

Descriptive Statistics: Methods used to

summarize or describe the important
characteristics of a set of data.
Inferential Statistics: Methods that use sample
data to make inferences about a population

In this lecture we look at descriptive statistics.

Overview
Important Characteristics of Data
1. Center: A representative or average value that indicates
where the middle of the data set is located.
2. Variation: A measure of the amount that the data values vary.
3. Distribution: The nature or shape of the spread of data over
the range of values (such as bell-shaped, uniform, or skewed).
4. Outliers: Sample values that
90
lie very far away from the 80
vast majority of other sample 70
60
values. 50 East
40 West
5. Time: Changing 30 North
20
characteristics of the data 10
over time. 0
1st Qtr 2nd Qtr 3rd Qtr 4th Qtr
Section 2-2
Frequency Distributions

Copyright © 2010, 2007, 2004

Pearson Education, Inc. All
Rights Reserved.
Copyright © 2010 Pearson Education
Key Concept

When working with large data sets, it is often helpful

to organize and summarize data by constructing a
table called a frequency distribution, defined later.
Because computer software and calculators can
generate frequency distributions, the details of
constructing them are not as important as what they
tell us about data sets. It helps us understand the
nature of the distribution of a data set.
Definition

 Frequency Distribution
(or Frequency Table)
shows how a data set is partitioned among several
categories (or classes) by listing all of the categories
along with the number of data values in each of the
categories.
Pulse Rates of Females and Males

Original Data – Appendix B, Data Set 1

Frequency Distribution
Pulse Rates of Females

The frequency for a

particular class is the
number of original values
that fall into that class.
Frequency Distributions

Definitions
Lower Class Limits
are the smallest numbers that can actually belong to
different classes

Lower Class
Limits
Upper Class Limits
are the largest numbers that can actually
belong to different classes

Upper Class
Limits
Class Boundaries
are the numbers used to separate classes, but
without the gaps created by class limits

59.5
69.5
79.5
Class 89.5
Boundaries 99.5
109.5
119.5
129.5
Class Midpoints
are the values in the middle of the classes and can be found by adding the lower class
limit to the upper class limit and dividing the sum by two

64.5
74.5
84.5
Class
94.5
Midpoints
104.5
114.5
124.5
Class Width
is the difference between two consecutive lower
class limits or two consecutive
lower class boundaries

10
10
Class
10
Width 10
10
10
Reasons for Constructing
Frequency Distributions
1. Large data sets can be summarized.

2. We can analyze the nature of data.

3. We have a basis for constructing important

graphs.
Constructing A Frequency Distribution
1. Determine the number of classes (should be between 5 and 20).

2. Calculate the class width (round up).

(maximum value) – (minimum value)

class width  number of classes

3. Starting point: Choose the minimum data value or a convenient value

below it as the first lower class limit.
4. Using the first lower class limit and class width, proceed to list the other
lower class limits.
5. List the lower class limits in a vertical column and proceed to enter the
upper class limits.
6. Take each individual data value and put a tally mark in the appropriate
class. Add the tally marks to get the frequency.
Relative Frequency Distribution

includes the same class limits as a frequency

distribution, but the frequency of a class is
replaced with a relative frequencies (a
proportion) or a percentage frequency ( a
percent)
class frequency
relative frequency =
sum of all frequencies

percentage class frequency

=  100%
frequency sum of all frequencies
Relative Frequency Distribution

Total Frequency = 40 * 12/40  100 = 30%

Cumulative Frequency Distribution
The cumulative frequency for a class is the
sum of the frequencies for that class and all
previous classes.

Cumulative Frequencies
Frequency Tables
Critical Thinking: Interpreting
Frequency Distributions
In later chapters, there will be frequent reference to data
with a normal distribution. One key characteristic of a
normal distribution is that it has a “bell” shape.

 The frequencies start low, then increase to one or two

high frequencies, then decrease to a low frequency.

 The distribution is approximately symmetric, with

frequencies preceding the maximum being roughly a
mirror image of those that follow the maximum.
The Normal Distribution
Gaps

Gaps
The presence of gaps can show that we have
data from two or more different populations.

 However, the converse is not true, because data

from different populations do not necessarily
result in gaps.
Recap

In this Section we have discussed

 Important characteristics of data
 Frequency distributions
 Procedures for constructing frequency distributions
 Relative frequency distributions
 Cumulative frequency distributions
Section 2-3
Visualizing Data - Histograms

Copyright © 2010, 2007, 2004

Pearson Education, Inc. All
Rights Reserved.
Copyright © 2010 Pearson Education
Key Concept

We use a visual tool called a

histogram to analyze the shape of
the distribution of the data.
Histogram

A graph consisting of bars of equal width drawn

adjacent to each other (without gaps). The
horizontal scale represents the classes of
quantitative data values and the vertical scale
represents the frequencies. The heights of the bars
correspond to the frequency values.
Histogram
Basically a graphic version of a frequency
distribution.
Histogram
The bars on the horizontal scale are labeled with
one of the following:
(1) Class boundaries
(2) Class midpoints
(3) Lower class limits (introduces a small error)

Horizontal Scale for Histogram: Use class

boundaries or class midpoints.
Vertical Scale for Histogram: Use the class
frequencies.
Relative Frequency Histogram
Has the same shape and horizontal scale as a histogram, but the vertical scale is
marked with relative frequencies instead of actual frequencies
Critical Thinking
Interpreting Histograms
Objective is not simply to construct a histogram, but rather
to understand something about the data.

When graphed, a normal distribution has a “bell” shape.

Characteristics of the bell shape are

(1) The frequencies increase to a maximum, and then

decrease, and

(2) symmetry, with the left half of the graph roughly a

mirror image of the right half.

The histogram on the next slide illustrates this.

Critical Thinking
Interpreting Histograms
Recap

In this Section we have discussed

 Histograms
 Relative Frequency Histograms
Section 2-3 Continued
Visualizing Data - Statistical Graphics

This section discusses other types of statistical

graphs.
Our objective is to identify a suitable graph for
representing the data set. The graph should be
effective in revealing the important
characteristics of the data.
Frequency Polygon
Uses line segments connected to points directly above class midpoint values
Relative Frequency Polygon
Uses relative frequencies (proportions or percentages) for the vertical scale.
Cumulative Frequency Graph (or
Ogive)
A line graph that depicts cumulative frequencies
Dot Plot
Consists of a graph in which each data value is plotted as a point (or dot) along a
scale of values. Dots representing equal values are stacked.
Stemplot (or Stem-and-Leaf Plot)
Represents quantitative data by separating each value into two parts: the stem
(such as the leftmost digit) and the leaf (such as the rightmost digit)

Pulse Rates of Females

Bar Graph

Uses bars of equal width to show frequencies of

categories of qualitative data. Vertical scale
represents frequencies or relative frequencies.
Horizontal scale identifies the different categories
of qualitative data.
A multiple bar graph has two or more sets of bars,
and is used to compare two or more data sets.
Multiple Bar Graph
Median Income of Males and Females
Pareto Chart
A bar graph for qualitative data, with the bars arranged in descending order
according to frequencies
Pie Chart
A graph depicting qualitative data as slices of a circle, size of slice is proportional
to frequency count
Scatter Plot (or Scatter Diagram)
A plot of paired (x,y) data with a horizontal x-axis and a vertical y-axis. Used to
determine whether there is a relationship between the two continuous variables
Time-Series Graph
Data that have been collected at different points in time: time-series data
Important Principles
Suggested by Edward Tufte
For small data sets of 20 values or fewer, use a table
instead of a graph.
A graph of data should make the viewer focus on
the true nature of the data, not on other elements,
such as eye-catching but distracting design features.
Do not distort data, construct a graph to reveal the
true nature of the data.
Almost all of the ink in a graph should be used for
the data, not the other design elements.
Important Principles
Suggested by Edward Tufte
Don’t use screening consisting of features such as
slanted lines, dots, cross-hatching, because they
create the uncomfortable illusion of movement.
Don’t use area or volumes for data that are actually
one-dimensional in nature. (Don’t use drawings of
dollar bills to represent budget amounts for
different years.)
Never publish pie charts, because they waste ink on
nondata components, and they lack appropriate
scale.
Car Reliability Data
Recap
In this section we saw that graphs are excellent
tools for describing, exploring and comparing data.
Describing data: Histogram - consider distribution,
center, variation, and outliers.
Exploring data: features that reveal some useful
and/or interesting characteristic of the data set.
Comparing data: Construct similar graphs to
compare data sets.
Section 2-3 Supplemental
Critical Thinking: Bad Graphs

Some graphs are bad in the sense that they

contain errors.
Some are bad because they are technically
correct, but misleading.
It is important to develop the ability to
recognize bad graphs and identify exactly how
they are misleading.
Nonzero Axis
Are misleading because one or both of the axes begin at some value other than
zero, so that differences are exaggerated.
Pictographs
Pictographs are drawings of objects. Three-dimensional
objects - money bags, stacks of coins, army tanks (for army
expenditures), people (for population sizes), barrels (for oil
production), and houses (for home construction) are
commonly used to depict data.
These drawings can create false impressions that distort the
data.
If you double each side of a square, the area does not
merely double; it increases by a factor of four; if you double
each side of a cube, the volume does not merely double; it
increases by a factor of eight.
Pictographs using areas and volumes can therefore be very
misleading.
Annual Incomes of Groups with Different
Education Levels

Bars have same width, too busy, too difficult to understand.

Annual Incomes of Groups with Different
Education Levels

Misleading. Depicts one-dimensional data with three-

dimensional boxes. Last box is 64 times as large as first box,
but income is only 4 times as large.
Annual Incomes of Groups with Different
Education Levels

Fair, objective, unencumbered by distracting features.

Daily Oil Consumption – USA vs. Japan

Part (b) is designed to exaggerate the difference by increasing each

dimension in proportion to the actual amounts of oil consumption.
Misleading . Depicts one-dimensional data with three-dimensional objects.

StayFolio - Home2 Suites by Hilton - Huntsville Research Park Area, AL - 1
No ratings yet
StayFolio - Home2 Suites by Hilton - Huntsville Research Park Area, AL - 1
2 pages
HollowWorld - HWA1 - Nightwail
100% (1)
HollowWorld - HWA1 - Nightwail
71 pages
Tes10 ch02
No ratings yet
Tes10 ch02
40 pages
2. presenting of data_١١١٠٥٩
No ratings yet
2. presenting of data_١١١٠٥٩
39 pages
002 Frequency Distribution PSY102
No ratings yet
002 Frequency Distribution PSY102
59 pages
Course: Biostatistics: Haramaya University, Chms
100% (1)
Course: Biostatistics: Haramaya University, Chms
49 pages
Introductory Statistics (Chapter 2)
No ratings yet
Introductory Statistics (Chapter 2)
3 pages
1. Descriptive Statistics (1)
No ratings yet
1. Descriptive Statistics (1)
65 pages
Math 140 Chapter 2 Notes
No ratings yet
Math 140 Chapter 2 Notes
5 pages
Week 1 - Ch 2
No ratings yet
Week 1 - Ch 2
49 pages
Organizing-Data_250120_180858
No ratings yet
Organizing-Data_250120_180858
32 pages
Chapter 2, Part A Descriptive Statistics
No ratings yet
Chapter 2, Part A Descriptive Statistics
5 pages
Topic 3
No ratings yet
Topic 3
22 pages
Introductory Statistics (Chapter 2)
No ratings yet
Introductory Statistics (Chapter 2)
3 pages
2- Presenting Data Part
No ratings yet
2- Presenting Data Part
42 pages
Data Explorations-Frequency Distributions
No ratings yet
Data Explorations-Frequency Distributions
21 pages
Biostat Lecture 3-1
No ratings yet
Biostat Lecture 3-1
162 pages
1st Mid
No ratings yet
1st Mid
19 pages
Frequency Distributions: Essentials of Statistics For The Behavioral Sciences
No ratings yet
Frequency Distributions: Essentials of Statistics For The Behavioral Sciences
45 pages
L1 Descriptive Stats
No ratings yet
L1 Descriptive Stats
149 pages
Unit 2 - Summarizing Data - Charts and Tables
100% (1)
Unit 2 - Summarizing Data - Charts and Tables
33 pages
Section 2.1, Frequency Distributions and Their Graphs
No ratings yet
Section 2.1, Frequency Distributions and Their Graphs
2 pages
Business Statistics For R: Name PRN
No ratings yet
Business Statistics For R: Name PRN
30 pages
Week 2 Data Presentation
No ratings yet
Week 2 Data Presentation
37 pages
Screenshot 2025-02-20 at 1.50.52 PM
No ratings yet
Screenshot 2025-02-20 at 1.50.52 PM
39 pages
2 Organizing and Visualizing Variables
No ratings yet
2 Organizing and Visualizing Variables
36 pages
chapter 3 descriptive biostatistics
No ratings yet
chapter 3 descriptive biostatistics
103 pages
Chapter 2. Presenting Data in Tables and Charts: Objectives
No ratings yet
Chapter 2. Presenting Data in Tables and Charts: Objectives
44 pages
Fundamentals of Statistics-Frequency Distribution: WEEK # 06
No ratings yet
Fundamentals of Statistics-Frequency Distribution: WEEK # 06
39 pages
Describing Data New
No ratings yet
Describing Data New
13 pages
MATH 101 - Data Management
No ratings yet
MATH 101 - Data Management
44 pages
BADB1014 Quantitative Methods - Lesson 3
No ratings yet
BADB1014 Quantitative Methods - Lesson 3
23 pages
Behavioral Statistics: Chapter 2 - Describing Data With Tables and Graphs
No ratings yet
Behavioral Statistics: Chapter 2 - Describing Data With Tables and Graphs
47 pages
Data Organization
No ratings yet
Data Organization
69 pages
Statanalysis C2a
No ratings yet
Statanalysis C2a
6 pages
Chapter 2
No ratings yet
Chapter 2
22 pages
Biostatistics Module 3
No ratings yet
Biostatistics Module 3
9 pages
Unit-2 3
No ratings yet
Unit-2 3
76 pages
Chapter 2 Methods of Data Collection and Presentation
No ratings yet
Chapter 2 Methods of Data Collection and Presentation
35 pages
Frequency Distribution & Graghs
No ratings yet
Frequency Distribution & Graghs
28 pages
Describing Data_Frequency Distribution
No ratings yet
Describing Data_Frequency Distribution
15 pages
Math Midterm
No ratings yet
Math Midterm
9 pages
graphical representation of data
No ratings yet
graphical representation of data
8 pages
HRHHRHRHRHRRHHR
No ratings yet
HRHHRHRHRHRRHHR
11 pages
2.Data presentation
No ratings yet
2.Data presentation
26 pages
DATA PRESENTATION
No ratings yet
DATA PRESENTATION
36 pages
Data Visualization & Data Exploration - Unit II
No ratings yet
Data Visualization & Data Exploration - Unit II
26 pages
frequency distribution & Graphs
No ratings yet
frequency distribution & Graphs
39 pages
Part 1 Descriptive
No ratings yet
Part 1 Descriptive
42 pages
Session-4-5-6-Statistics For Data Analytics-Dr - Girish - Bagale - IsAGx5vCqq
No ratings yet
Session-4-5-6-Statistics For Data Analytics-Dr - Girish - Bagale - IsAGx5vCqq
21 pages
8614, Unit 3 5
No ratings yet
8614, Unit 3 5
48 pages
Jim 106 Webex 1 26112022 Chapter 2
No ratings yet
Jim 106 Webex 1 26112022 Chapter 2
38 pages
Lecture-02 Data Organization and Presentation
No ratings yet
Lecture-02 Data Organization and Presentation
36 pages
Chapter 2
No ratings yet
Chapter 2
74 pages
Basic Statistics
No ratings yet
Basic Statistics
23 pages
Statistic Frequency Distribution
100% (4)
Statistic Frequency Distribution
66 pages
Organizing and Graphing Data
No ratings yet
Organizing and Graphing Data
83 pages
Ch - 2 (Organizing and Graphing Data)
No ratings yet
Ch - 2 (Organizing and Graphing Data)
83 pages
Tes9e ch02
No ratings yet
Tes9e ch02
102 pages
CH 2 Notes Filled
No ratings yet
CH 2 Notes Filled
22 pages
Module 3 Data Presentation
No ratings yet
Module 3 Data Presentation
9 pages
De-Mystifying Math and Stats for Machine Learning: Mastering the Fundamentals of Mathematics and Statistics for Machine Learning
From Everand
De-Mystifying Math and Stats for Machine Learning: Mastering the Fundamentals of Mathematics and Statistics for Machine Learning
Seaport AI Madhavan
No ratings yet
L2 - COMS 2302 01 Phase 1 Steps 1, 2
No ratings yet
L2 - COMS 2302 01 Phase 1 Steps 1, 2
47 pages
Lecture 2 Non - Human Primates
No ratings yet
Lecture 2 Non - Human Primates
46 pages
L 3 Nutrition
No ratings yet
L 3 Nutrition
120 pages
BIOLOGY CSEC Lecture 1
100% (1)
BIOLOGY CSEC Lecture 1
43 pages
Lecture 2 - PHOTOSYNTHESIS HSB
0% (1)
Lecture 2 - PHOTOSYNTHESIS HSB
57 pages
Lecture 4 THE RESPIRATORY SYSTEM
No ratings yet
Lecture 4 THE RESPIRATORY SYSTEM
48 pages
EX405
No ratings yet
EX405
11 pages
Extinguishment of Obligations (Art. 1231) Modes
No ratings yet
Extinguishment of Obligations (Art. 1231) Modes
50 pages
Chapter 4: Absorbers Rules of Thumb For Chemical Engineers, 5th Edition by Stephen Hall
No ratings yet
Chapter 4: Absorbers Rules of Thumb For Chemical Engineers, 5th Edition by Stephen Hall
11 pages
Aba Draft Version 1.0 9.1.12
No ratings yet
Aba Draft Version 1.0 9.1.12
12 pages
Lippert Electronic Slash Hydraulic Leveling Slideout Owners Manual
No ratings yet
Lippert Electronic Slash Hydraulic Leveling Slideout Owners Manual
31 pages
Industrial Security Management: By: FMB
100% (1)
Industrial Security Management: By: FMB
162 pages
Blue Victoria 50 Combinatorics Problems Set (Final)
No ratings yet
Blue Victoria 50 Combinatorics Problems Set (Final)
92 pages
The Basics of Hacking and Pen Testing
No ratings yet
The Basics of Hacking and Pen Testing
30 pages
Office Procedures and Practices
No ratings yet
Office Procedures and Practices
6 pages
Experimental and Statistical Investigation On Synergistic - 2023 - Developments
No ratings yet
Experimental and Statistical Investigation On Synergistic - 2023 - Developments
13 pages
Narrative On Financial Analysis
100% (1)
Narrative On Financial Analysis
28 pages
En - Installation Guide
No ratings yet
En - Installation Guide
49 pages
Neurogenesis
No ratings yet
Neurogenesis
4 pages
PC-ABS Cycoloy C2950HF
No ratings yet
PC-ABS Cycoloy C2950HF
3 pages
1st Periodic Test Eng Grade 6 2023-24
No ratings yet
1st Periodic Test Eng Grade 6 2023-24
6 pages
1-4 - April 22
No ratings yet
1-4 - April 22
3 pages
RQA
No ratings yet
RQA
4 pages
Class - 7 - Maths - Annual Exam Paper 20-21
100% (5)
Class - 7 - Maths - Annual Exam Paper 20-21
5 pages
C-TEC-QT601-2
No ratings yet
C-TEC-QT601-2
2 pages
Descriptive Essay Introduction Examples
No ratings yet
Descriptive Essay Introduction Examples
5 pages
War Hors
No ratings yet
War Hors
21 pages
English Summer H.W
No ratings yet
English Summer H.W
4 pages
Character Sheet DND RAGHHHHHHHHHHHHHHH-1
No ratings yet
Character Sheet DND RAGHHHHHHHHHHHHHHH-1
3 pages
First bite of an Apple
No ratings yet
First bite of an Apple
7 pages
Red Hat Package Manager and Syslog Server
No ratings yet
Red Hat Package Manager and Syslog Server
26 pages
28.13 Vammika S m23 Piya Tan
No ratings yet
28.13 Vammika S m23 Piya Tan
13 pages
Bank of Tanzania Academy FP Report
No ratings yet
Bank of Tanzania Academy FP Report
6 pages
Steampunk Compendium V6-133-151
No ratings yet
Steampunk Compendium V6-133-151
19 pages

BIOL 2163 Lecture 2 - Summarizing and Graphing Data

Uploaded by

BIOL 2163 Lecture 2 - Summarizing and Graphing Data

Uploaded by

Chapter 2

Summarizing and Graphing Data

Copyright © 2010, 2007, 2004

Descriptive Statistics: Methods used to

In this lecture we look at descriptive statistics.

Copyright © 2010, 2007, 2004

When working with large data sets, it is often helpful

Original Data – Appendix B, Data Set 1

The frequency for a

2. We can analyze the nature of data.

3. We have a basis for constructing important

2. Calculate the class width (round up).

(maximum value) – (minimum value)

3. Starting point: Choose the minimum data value or a convenient value

includes the same class limits as a frequency

percentage class frequency

Total Frequency = 40 * 12/40  100 = 30%

 The frequencies start low, then increase to one or two

 The distribution is approximately symmetric, with

 However, the converse is not true, because data

In this Section we have discussed

Copyright © 2010, 2007, 2004

We use a visual tool called a

A graph consisting of bars of equal width drawn

Horizontal Scale for Histogram: Use class

When graphed, a normal distribution has a “bell” shape.

(1) The frequencies increase to a maximum, and then

(2) symmetry, with the left half of the graph roughly a

The histogram on the next slide illustrates this.

In this Section we have discussed

Copyright © 2010, 2007, 2004

This section discusses other types of statistical

Pulse Rates of Females

Uses bars of equal width to show frequencies of

Copyright © 2010, 2007, 2004

Some graphs are bad in the sense that they

Bars have same width, too busy, too difficult to understand.

Misleading. Depicts one-dimensional data with three-

Fair, objective, unencumbered by distracting features.

Part (b) is designed to exaggerate the difference by increasing each

You might also like