Chapter 9 (1)

Chapter 9 discusses the chi-squared (χ2) test, which is used to determine relationships between two categorical variables. It explains how to set up observed and expected value tables, calculate degrees of freedom, and interpret significant results. The chapter also includes homework assignments that require applying the χ2 test to various datasets, including crime rates and marital status.

Uploaded by

hcolegrove05

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOC, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

6 views4 pages

Chapter 9 (1)

Uploaded by

hcolegrove05

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOC, PDF, TXT or read online on Scribd

You are on page 1/ 4

Chapter 9

Chi Squared

Concepts: So far we have looked at statistical tests where the independent variable was
categorical and the dependent variable was numerical. The χ2 (chi squared) test allows
you look for relationships when both variables are categorical. For example, if you
hypothesize that nationality will have an impact on people’s favorite colors because
people will be influenced by colors that are important to their nation (such as the colors
of their flag or the colors of their national sports teams). Nationality and favorite color
are categorical variables. It is more or less meaningless to say that your favorite color is
13, or that you are from the nation of 7, and even if you labeled all nations with numbers,
it would not be the case that nation number 5 was more of a nation than nation number 2.
As with the other tests we have used, a significant result means that there is a relationship
while a non-significant result means that we have no evidence that the one variable
influences the other. So if we tested the hypothesis about nationality and favorite color
and we got a significant result, we could conclude that nationality does influence
someone’s choice of favorite color.

Application: In order to apply a χ2 test you need to arrange the data into a table where
one axis is the independent variable and the other is the dependent variable. The test is
symmetric so it doesn’t matter which you put where. Strictly fictional data for our
hypothesis above might look like this:
Red Orange Yellow Green Blue Purple
United States of America 251 149 123 158 350 59
Australia 231 102 240 263 203 49
The Netherlands 124 406 89 117 204 65
Germany 306 189 257 145 132 57
Brazil 134 113 163 351 223 64
Notice that the number of people surveyed in each country is different. This will not
affect the test.
Using our usual plan, the first two steps can be taken for granted. The third step
we use the formula df = (#R-1)(#C-1) where #R is the number of rows, and #C is the
number of columns. The example here would have 20 degrees of freedom. For the fifth
step we will use the function CHISQ.DIST.RT (χ2,df).
The first step is to take a sum of each column, each row, and the total of all of the
data. The will result in a spreadsheet that looks like this:
Observed Red Orange Yellow Green Blue Purple
United 251 149 123 158 350 59
States of
America
1090 R1
Australia 231 102 240 263 203 49 1088 R2
The 124 406 89 117 204 65
Netherland
s
1005 R3
Germany 306 189 257 145 132 57 1086 R4

51
Brazil 134 113 163 351 223 64 1048 R5
1046 959 872 1034 1112 294 5317
C1 C2 C3 C4 C5 C6 Total

We will call this our ‘observed’ table since it is the table of our actual observations.
Next we will generate a table of expected values. This is the table that would
have generated the sums for the rows and columns if there were absolutely no
relationship between the variables. That is what the table would look like if all of the
variation was due to distributions of the two variables without either variable influencing
the other. Notice the labels that are next to the sums on the observed table. We will use
these labels to represent the sums they are next to. Each cell in the table can be uniquely
identified by its row and column, for example the USA response to red is row 1, column
1 while the German response to blue is row 4, column 5. The expected value for each
cell is found by taking the product of the row sum and column sum that correspond with
that cell, and dividing the result by the total. So, the expected value for the row 1,
column 1 cell is R1xC1/T. It is useful here to remember that the ‘$’ sign in Excel fixed a
value, and that the letters and numbers of the cell labels can be fixed independently. This
allows us to write one expression in the first cell and then copy it over the rest of the
cells. In our example, the row sums are in the ‘H’ column and the column sums are in
the ‘7’ row, so in the first cell of our ‘expected’ table we can write ‘=$H2*B$7/$H$7’,
copy that expression to the other cells in the table and we will get the following table:
Expected Red Orange Yellow Green Blue Purple
United
States of
America 214.433 196.598 178.762 211.973 227.963 60.2708
Australia 214.039 196.237 178.434 211.584 227.545 60.1602
The
Netherlands
197.711 181.267 164.822 195.443 210.186 55.5708
Germany 213.646 195.876 178.106 211.195 227.127 60.0497
Brazil 206.17 189.022 171.874 203.805 219.179 57.9485
Now we are ready to generate a χ2 table. The value we will compare to the critical value
will be the sum of all of the cells in this table. The values in those cells are generated by
inserting the values from the corresponding cells in the observed (O) and expected (E)
tables in the following expression:
In our example, the end of the spreadsheet will look like this:
Red Orange Yellow Green Blue Purple
United
States of
America 6.23574 11.5237 17.3943 13.7427 65.3307 0.0268
Australia 1.34395 45.2545 21.2421 12.4944 2.64761 2.07032

52
The
Netherlands
27.4811 278.623 34.8801 31.4838 0.18207 1.59993
Germany 39.9224 0.24139 34.9465 20.7476 39.8415 0.15488
Brazil 25.2634 30.5752 0.45821 106.309 0.0666 0.63196 Chi Squared
872.7154

Probability
5.0131E-172

Since the probability is much smaller than .05, there is a real relationship in this fictional
data.

Homework #9

Apply the χ2 test to each of the following data sets, and write a brief explanation of what
the results mean.

1) This data set is the incident rate for several types of crimes in a sample of states. The
numbers represent number of reports per 100,000 people.
Robber Vehicl
Murder Rape y Assault Burglary Larceny e
Alabama 8.2 34.5 141.4 247.8 953.8 2650 288.3
Colorado 3.7 43.4 84.6 264.7 744.8 2735.2 559.5
Hawaii 1.9 26.9 78.5 147.8 767.9 3308.4 716.4
Kansas 3.7 384 65.3 280 689.2 2758.1 339.6
Massachusetts 2.7 27.1 119 308.1 541.1 1527.4 295.1
Montana 1.9 32.2 18.9 228.5 389.2 2543 210.7
New Mexico 7.4 54.1 98.7 541.9 1093.9 2639.9 414.5
Oklahoma 5.3 41.7 91 370.5 1006 2644.2 391.8
South Dakota 2.3 46.7 18.6 108.1 324.4 1343.7 108.4
Virgina 6.1 22.7 99.2 154.8 392.1 2035 211.1

2) This data set is from an experiment on ant navigation. The categories of the
independent variable are ants that are new to the foraging arena (recruits) and ants that
have been to the food source before (experienced). The categories of the dependent
variable are whether the ant went the direction indicated by the odor cue, the direction
indicated by the light cue, or simply returned to the entrance of the maze.
Light Odor Back
Recruited 6 43 2
Experienced 60 38 7

53
3) This data set represents the marital status of American women during each of the last
five censuses.
Women 1960 1970 1980 1990 2000
Married 66.7 61.3 55.9 53.4 52.1
Never Married 16.6 20.7 22.4 22.3 23.5
Sep/Divorced 4.9 5.9 9.4 11.9 13.2
Widowed 11.8 12.1 12.3 11.4 11.1

4) This data set is the same as the last except it is for American men.
Men 1960 1970 1980 1990 2000
Married 71.8 67.8 62.0 60.0 57.0
Never Married 20.9 25.0 28.3 28.1 29.2
Sep/Divorced 3.6 4.0 7.0 9.2 11.1
Widowed 3.6 3.1 2.6 2.7 2.7

5) One possible explanation of the great diversity of hair color in Europeans is that there
was a strong sexual selection pressure on women in the tundra environment that covered
Europe in the last ice age. One prediction of this hypothesis is that hair color
distributions should be different between men and women. Does the following sample
support this?
Blonde Light Brown Black Red Gray
Brown
Men 12 19 40 24 1 5
Women 33 14 40 7 5 1

6) The same issues discussed in question 5 also apply to eye color. This data is a similar
sample of eye colors.
Blue Gray Green Hazel Brown Black
Men 30 3 12 14 35 5
Women 22 15 26 8 28 1

Double Fianchetto The Modern Chess Lifestyle by Daniel Hausrath 2020
100% (8)
Double Fianchetto The Modern Chess Lifestyle by Daniel Hausrath 2020
407 pages
William Lllewellyn's Anabolics, 11th Edition 2017 271 300
No ratings yet
William Lllewellyn's Anabolics, 11th Edition 2017 271 300
30 pages
DL RS 299a
100% (3)
DL RS 299a
9 pages
Robert v. Hogg, Allen T. Craig - Introduction To M
No ratings yet
Robert v. Hogg, Allen T. Craig - Introduction To M
448 pages
Lesson Design - Animal Characteristics
No ratings yet
Lesson Design - Animal Characteristics
5 pages
93 ChiSquare
No ratings yet
93 ChiSquare
4 pages
Z Test Formula
No ratings yet
Z Test Formula
6 pages
DAV Unit 4 Material
No ratings yet
DAV Unit 4 Material
49 pages
Activity 5 Stat
No ratings yet
Activity 5 Stat
6 pages
01 Sample Problems For Chapter 1 - ANSWER KEY
No ratings yet
01 Sample Problems For Chapter 1 - ANSWER KEY
13 pages
MIT9_63F09_lec04
No ratings yet
MIT9_63F09_lec04
7 pages
Running Head: Testing Hypothesis For Means 1
No ratings yet
Running Head: Testing Hypothesis For Means 1
9 pages
Chi Square - Problem Set
No ratings yet
Chi Square - Problem Set
2 pages
Chi Square Test
No ratings yet
Chi Square Test
11 pages
Chi Square (Lab 7)
No ratings yet
Chi Square (Lab 7)
6 pages
Unit-4 Hypothesis Testing F T Z Chi Test
No ratings yet
Unit-4 Hypothesis Testing F T Z Chi Test
17 pages
Chi Square 2 - 2
No ratings yet
Chi Square 2 - 2
13 pages
Mini Project Statistics)
100% (1)
Mini Project Statistics)
22 pages
PSAI Unit 5
No ratings yet
PSAI Unit 5
25 pages
Adstat Final Exam Reviewer2
No ratings yet
Adstat Final Exam Reviewer2
29 pages
Unit III
No ratings yet
Unit III
9 pages
Pearson Chpt 12
No ratings yet
Pearson Chpt 12
12 pages
CU ASwR Lab03 Sol
No ratings yet
CU ASwR Lab03 Sol
5 pages
4.02 Comparing Group Means - T-Tests and One-Way ANOVA Using Stata, SAS, R, and SPSS (2009)
No ratings yet
4.02 Comparing Group Means - T-Tests and One-Way ANOVA Using Stata, SAS, R, and SPSS (2009)
51 pages
Geog 3mb3 Section 4
No ratings yet
Geog 3mb3 Section 4
30 pages
Statistics2024_Final sds
No ratings yet
Statistics2024_Final sds
15 pages
Pearson R Correlation: Test
No ratings yet
Pearson R Correlation: Test
5 pages
Applied Econometrics Using Stata
No ratings yet
Applied Econometrics Using Stata
48 pages
Biostatistics FAQ
No ratings yet
Biostatistics FAQ
1 page
Week 13
No ratings yet
Week 13
21 pages
Ana Assignment 5019
No ratings yet
Ana Assignment 5019
7 pages
Solution Manual For Introductory Statistics 2nd Edition Gould Ryan 0321978277 9780321978271
100% (48)
Solution Manual For Introductory Statistics 2nd Edition Gould Ryan 0321978277 9780321978271
36 pages
Univariate_statistical_methods
No ratings yet
Univariate_statistical_methods
37 pages
3 - Bidimensional Statistics
No ratings yet
3 - Bidimensional Statistics
41 pages
Chapter 7 Notes
No ratings yet
Chapter 7 Notes
13 pages
Categorical Data Analysis
100% (1)
Categorical Data Analysis
20 pages
Chi Square Test
No ratings yet
Chi Square Test
4 pages
Chapter 4 _STAT1204 A
No ratings yet
Chapter 4 _STAT1204 A
10 pages
BS IMI U8 Oct23
No ratings yet
BS IMI U8 Oct23
100 pages
ST102 Exercise 1
No ratings yet
ST102 Exercise 1
4 pages
What Statistical Analysis Should I Use? Statistical Analyses Using SPSS
No ratings yet
What Statistical Analysis Should I Use? Statistical Analyses Using SPSS
40 pages
Inter Pet at Ion
No ratings yet
Inter Pet at Ion
11 pages
Statistical Analysis of Longitudinal Categorical Data in the Social and Behavioral Sciences An introduction With Computer Illustrations 1st Edition scribd download
100% (20)
Statistical Analysis of Longitudinal Categorical Data in the Social and Behavioral Sciences An introduction With Computer Illustrations 1st Edition scribd download
14 pages
Lecture3 - Contingency Analysis
No ratings yet
Lecture3 - Contingency Analysis
16 pages
8. Raghunath Chatterjee_Statistical Tests_Lecture
No ratings yet
8. Raghunath Chatterjee_Statistical Tests_Lecture
47 pages
Chi sq tutorial
No ratings yet
Chi sq tutorial
7 pages
Student t test
No ratings yet
Student t test
12 pages
Statistics
No ratings yet
Statistics
50 pages
Business Statistics, Chi Square Test
No ratings yet
Business Statistics, Chi Square Test
9 pages
Statistics Review
No ratings yet
Statistics Review
7 pages
Adstat Final Exam Reviewer2highlighted
No ratings yet
Adstat Final Exam Reviewer2highlighted
29 pages
Lecture Notes 3
No ratings yet
Lecture Notes 3
56 pages
Assignment 8 ChiSquareTest 1
No ratings yet
Assignment 8 ChiSquareTest 1
5 pages
ml unit 3
No ratings yet
ml unit 3
46 pages
Exercises (Chapter 3 and 4) : A B C D
No ratings yet
Exercises (Chapter 3 and 4) : A B C D
3 pages
Basic Commands SPSS
No ratings yet
Basic Commands SPSS
25 pages
The Two Sample Problem: 3.1 Observational Versus Randomized Studies
No ratings yet
The Two Sample Problem: 3.1 Observational Versus Randomized Studies
18 pages
578 Assignment 1
No ratings yet
578 Assignment 1
6 pages
L1 MultivDescriptive
No ratings yet
L1 MultivDescriptive
11 pages
Statistics in Education PDF
No ratings yet
Statistics in Education PDF
14 pages
The Numerate Leader: How to Pull Game-Changing Insights from Statistical Data
From Everand
The Numerate Leader: How to Pull Game-Changing Insights from Statistical Data
Thomas A. King
No ratings yet
Mindful Math 3: Use Your Statistics to Solve These Puzzling Pictures
From Everand
Mindful Math 3: Use Your Statistics to Solve These Puzzling Pictures
Robyn Djuritschek
No ratings yet
Mindful Math 2: Use Your Geometry to Solve These Puzzling Pictures
From Everand
Mindful Math 2: Use Your Geometry to Solve These Puzzling Pictures
Ann McNair
No ratings yet
A Study On Anti Islanding Detection Algorithms
No ratings yet
A Study On Anti Islanding Detection Algorithms
7 pages
Marcelo v. Sandiganbayan Case Digest
0% (1)
Marcelo v. Sandiganbayan Case Digest
2 pages
Thép W6
No ratings yet
Thép W6
4 pages
3... Solid Geom
No ratings yet
3... Solid Geom
3 pages
I Am An African
No ratings yet
I Am An African
15 pages
07 Chapter-5
No ratings yet
07 Chapter-5
35 pages
2019 Review of Cable Installation Protection Mitigation and Habitat Recoverability
No ratings yet
2019 Review of Cable Installation Protection Mitigation and Habitat Recoverability
140 pages
Colonoscopy Procedure
No ratings yet
Colonoscopy Procedure
4 pages
STD Ix Eng Aug 03 WP - pm2 - Wind
No ratings yet
STD Ix Eng Aug 03 WP - pm2 - Wind
3 pages
Bill of Quantities: Contract Duration
No ratings yet
Bill of Quantities: Contract Duration
3 pages
New Solar Catalog Rev 05
No ratings yet
New Solar Catalog Rev 05
39 pages
Perro Hot Dog
No ratings yet
Perro Hot Dog
22 pages
Syllabus GEC CON 2020
No ratings yet
Syllabus GEC CON 2020
6 pages
ROintro 1
No ratings yet
ROintro 1
15 pages
WOPTS
No ratings yet
WOPTS
1 page
उपदेशसाहस्री - Chapter 18 WFW, transliteration and translation
No ratings yet
उपदेशसाहस्री - Chapter 18 WFW, transliteration and translation
137 pages
Chapter 9 - Intuitionism
No ratings yet
Chapter 9 - Intuitionism
7 pages
Pds Hempadur Zinc 17360 En-Gb
No ratings yet
Pds Hempadur Zinc 17360 En-Gb
2 pages
Mind Map of Organic CompleteA
No ratings yet
Mind Map of Organic CompleteA
3 pages
EMP Bentu Process Flow 2022
No ratings yet
EMP Bentu Process Flow 2022
1 page
LWR Model
No ratings yet
LWR Model
23 pages
Refinement of Shed Microspore Culture Jaya Supena
No ratings yet
Refinement of Shed Microspore Culture Jaya Supena
6 pages
Annex 1 RESOLUTION MEPC.123 (53) Adopted On 22 July 2005 Guidelines For Ballast Water Management Equivalent Compliance (G3)
No ratings yet
Annex 1 RESOLUTION MEPC.123 (53) Adopted On 22 July 2005 Guidelines For Ballast Water Management Equivalent Compliance (G3)
4 pages
16861-123 Grant W14-ZA
No ratings yet
16861-123 Grant W14-ZA
3 pages
Strategic Management and the Circular Economy 1st edition by Marcello Tonelli, NicolÃ² Cristoni 1351592696 9781351592697 download
100% (3)
Strategic Management and the Circular Economy 1st edition by Marcello Tonelli, NicolÃ² Cristoni 1351592696 9781351592697 download
86 pages
Hacer Carbonated Basic
No ratings yet
Hacer Carbonated Basic
2 pages
Lecture#5-(staticrelay)
No ratings yet
Lecture#5-(staticrelay)
19 pages

Chapter 9 (1)

Uploaded by

Chapter 9 (1)

Uploaded by

Chapter 9

You might also like