TPS5e Lecture PPT Ch1.1
Exploring Data
1.1 Analyzing Categorical Data
The Practice of Statistics, 5th Edition
Starnes, Tabor, Yates, Moore
Categorical Variables
Categorical variables place individuals into one of several
groups or categories.
Frequency Table
Variable: Format. Values: the categories listed in the Format column.

Format               Count of Stations   Percent of Stations
Adult Contemporary                1556                  11.2
Adult Standards                   1196                   8.6
Contemporary Hit                   569                   4.1
Country                           2066                  14.9
News/Talk                         2179                  15.7
Oldies                            1060                   7.7
Religious                         2014                  14.6
Rock                               869                   6.3
Spanish Language                   750                   5.4
Other Formats                     1579                  11.4
Total                            13838                  99.9
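The percent column can be recomputed from the counts. The following is a minimal Python sketch, assuming the counts in the station format table are exact; the variable names (`counts`, `percents`) are illustrative and not part of the lecture.

```python
# Counts of stations by format, copied from the frequency table.
counts = {
    "Adult Contemporary": 1556,
    "Adult Standards": 1196,
    "Contemporary Hit": 569,
    "Country": 2066,
    "News/Talk": 2179,
    "Oldies": 1060,
    "Religious": 2014,
    "Rock": 869,
    "Spanish Language": 750,
    "Other Formats": 1579,
}

# The total is the sum of all category counts.
total = sum(counts.values())

# Relative frequency = count / total, written as a percent
# rounded to one decimal place (assumed rounding rule).
percents = {fmt: round(100 * n / total, 1) for fmt, n in counts.items()}

for fmt, n in counts.items():
    print(f"{fmt:<20}{n:>8}{percents[fmt]:>8.1f}")
print(f"{'Total':<20}{total:>8}{sum(percents.values()):>8.1f}")
```

The rounded percents add to 99.9 rather than 100 because each entry is rounded separately before summing.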
How to examine a marginal distribution:
1) Use the data in the table to calculate the marginal distribution (in percents) of the row or column totals.
2) Make a graph to display the marginal distribution.
Response            Percent
Almost no chance    194/4826 = 4.0%
Some chance         712/4826 = 14.8%
A 50-50 chance      1416/4826 = 29.3%
A good chance       1421/4826 = 29.4%
Almost certain      1083/4826 = 22.4%
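As a rough illustration of step 1, the marginal distribution of the response can be computed from a two-way table by summing each row and dividing by the grand total. This sketch assumes the male and female counts that appear in the conditional distribution table later in these slides; the name `two_way` and the dictionary layout are hypothetical choices for the example.

```python
# Two-way table of counts: response (rows) by gender (columns).
# Counts are taken from the conditional distribution table in this lecture.
two_way = {
    "Almost no chance": {"Male": 98, "Female": 96},
    "Some chance": {"Male": 286, "Female": 426},
    "A 50-50 chance": {"Male": 720, "Female": 696},
    "A good chance": {"Male": 758, "Female": 663},
    "Almost certain": {"Male": 597, "Female": 486},
}

# Row totals: add across the columns for each response.
row_totals = {resp: sum(by_gender.values()) for resp, by_gender in two_way.items()}

# Grand total: the number of individuals in the whole table.
grand_total = sum(row_totals.values())

# Marginal distribution of the response, in percents rounded to one decimal.
marginal = {resp: round(100 * n / grand_total, 1) for resp, n in row_totals.items()}

for resp, n in row_totals.items():
    print(f"{resp:<18}{n}/{grand_total} = {marginal[resp]:.1f}%")
```

The same approach applied to the column totals would give the marginal distribution of gender instead.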
How to examine or compare conditional distributions:
1) Select the row(s) or column(s) of interest.
2) Use the data in the table to calculate the conditional distribution (in percents) of the row(s) or column(s).
3) Make a graph to display the conditional distribution. Use a side-by-side bar graph or segmented bar graph to compare distributions.
Response            Male                 Female
Almost no chance    98/2459 = 4.0%       96/2367 = 4.1%
Some chance         286/2459 = 11.6%     426/2367 = 18.0%
A 50-50 chance      720/2459 = 29.3%     696/2367 = 29.4%
A good chance       758/2459 = 30.8%     663/2367 = 28.0%
Almost certain      597/2459 = 24.3%     486/2367 = 20.5%
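The conditional distributions in the table can be reproduced with a short Python sketch. It assumes the counts shown in the Male and Female columns; the helper `conditional_distribution` is a hypothetical name introduced only for this example.

```python
# Two-way table of counts: response (rows) by gender (columns).
two_way = {
    "Almost no chance": {"Male": 98, "Female": 96},
    "Some chance": {"Male": 286, "Female": 426},
    "A 50-50 chance": {"Male": 720, "Female": 696},
    "A good chance": {"Male": 758, "Female": 663},
    "Almost certain": {"Male": 597, "Female": 486},
}


def conditional_distribution(table, group):
    """Percent of each response among individuals in one column (group)."""
    # Step 1: select the column of interest and find its total.
    group_total = sum(row[group] for row in table.values())
    # Step 2: divide each count by that column total, not the grand total.
    return {resp: round(100 * row[group] / group_total, 1)
            for resp, row in table.items()}


male = conditional_distribution(two_way, "Male")
female = conditional_distribution(two_way, "Female")

# Step 3 (text stand-in for a side-by-side bar graph): one '#' per 2 percent.
for resp in two_way:
    print(f"{resp:<18} Male   {male[resp]:>5.1f}% {'#' * round(male[resp] / 2)}")
    print(f"{'':<18} Female {female[resp]:>5.1f}% {'#' * round(female[resp] / 2)}")
```

Dividing by each column's own total is what makes the two groups comparable even though the number of males and females differs.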
Caution!
Even a strong association between two categorical variables can
be influenced by other variables lurking in the background.