0% found this document useful (0 votes)

27 views4 pages

3 SAS 1 Independence

This document discusses chi-square statistics for analyzing 2x2 contingency tables. It defines the chi-square test statistic Q, which follows a chi-square distribution with 1 degree of freedom under the null hypothesis of no association between the row and column variables. It also discusses the Pearson chi-square statistic QP, which is asymptotically equivalent to Q. The document provides an example analysis using PROC FREQ in SAS to calculate Q, QP, and other statistics for a 2x2 table comparing treatment and outcome. Both Q and QP are highly significant, indicating a strong association between treatment and response.

Uploaded by

vaxor paradose

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

27 views4 pages

3 SAS 1 Independence

Uploaded by

vaxor paradose

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 4

2.2.

Chi-Square Statistics 17

2.2 Chi-Square Statistics

Table 2.2 displays the generic 2 2 table, including row and column marginal totals.

Table 2.2 2 2 Contingency Table

Row Column
Levels 1 2 Total
1 n11 n12 n1C
2 n21 n22 n2C
Total nC1 nC2 n

Under the randomization framework that produced Table 2.1, the row marginal totals n1C and
n2C are fixed since 60 patients were randomly allocated to one of the treatment groups and 64 to
the other. The column marginal totals can be regarded as fixed under the null hypothesis of no
treatment difference for each patient (since each patient would have the same response regardless
of the assigned treatment, under this null hypothesis). Then, given that all of the marginal totals
n1C , n2C , nC1 , and nC2 are fixed under the null hypothesis, the probability distribution from the
randomized allocation of patients to treatment can be written
n1C Šn2C ŠnC1 ŠnC2 Š
Prfnij g D
nŠn11 Šn12 Šn21 Šn22 Š

which is the hypergeometric distribution. The expected value of nij is

ni C nCj
Efnij jH0 g D D mij
n
and the variance is
n1C n2C nC1 nC2
V fnij jH0 g D D vij
n2 .n 1/

For a sufﬁciently large sample, n11 approximately has a normal distribution, which implies that

.n11 m11 /2
QD
v11

approximately has a chi-square distribution with one degree of freedom. It is the ratio of a squared
difference from the expected value versus its variance, and such quantities follow the chi-square
distribution when the variable is distributed normally. Q is often called the randomization (or
Mantel-Haenszel) chi-square. It doesn’t matter how the rows and columns are arranged; Q takes
the same value since
jn11 n22 n12 n21 j n1C n2C
jn11 m11 j D jnij mij j D D jp1 p2 j
n n

where pi D .ni1 =n1C / is the observed proportion in column 1 for the i th row.
18 Chapter 2: The 2 2 Table

A related statistic is the Pearson chi-square statistic. This statistic is written

2 X
X 2
.nij mij /2 n .p1 p2 /2
QP D D QD
mij .n 1/ f.1=n1C C 1=n2C /pC .1 pC /g
i D1 j D1

where pC D .nC1 =n/ is the proportion in column 1 for the pooled rows.
If the cell counts are sufficiently large, QP is distributed as chi-square with one degree of freedom.
As n grows large, QP and Q converge. A useful rule for determining adequate sample size for
both Q and QP is that the expected value mij should exceed 5 (and preferable 10) for all of the
cells. While Q is discussed here in the framework of a randomized allocation of patients to two
groups, Q and QP are also appropriate for investigating the hypothesis of no association for all of
the sampling frameworks described previously.
The following PROC FREQ statements produce a frequency table and the chi-square statistics
for the data in Table 2.1. The data are supplied in frequency (count) form. An observation
is supplied for each configuration of the values of the variables TREAT and OUTCOME. The
variable COUNT holds the total number of observations that have that particular configuration.
The WEIGHT statement tells the FREQ procedure that the data are in frequency form and names
the variable that contains the frequencies. Alternatively, the data could be provided as case records
for the individual patients; with this data structure, there would be 124 data lines corresponding to
the 124 patients, and neither the variable COUNT nor the WEIGHT statement would be required.
The CHISQ option in the TABLES statement produces chi-square statistics.

data respire;
input treat $ outcome $ count;
datalines;
placebo f 16
placebo u 48
test f 40
test u 20
;

proc freq;
weight count;
tables treat*outcome / chisq;
run;

Output 2.1 displays the data in a 2 2 table. With an overall sample size of 124, and all expected
cell counts greater than 10, the sampling assumptions for the chi-square statistics are met. PROC
FREQ prints out a warning message when more than 20% of the cells in a table have expected
counts less than 5. (You can specify the EXPECTED option in the TABLE statement to produce
the expected cell counts along with the cell percentages.)
2.2. Chi-Square Statistics 19

Output 2.1 Frequency Table

Frequency Table of treat by outcome

Percent
Row Pct outcome
Col Pct
treat f u Total

placebo 16 48 64
12.90 38.71 51.61
25.00 75.00
28.57 70.59

test 40 20 60
32.26 16.13 48.39
66.67 33.33
71.43 29.41

Total 56 68 124
45.16 54.84 100.00

Output 2.2 contains the table with the chi-square statistics.

Output 2.2 Chi-Square Statistics

Statistic DF Value Prob

Chi-Square 1 21.7087 <.0001

Likelihood Ratio Chi-Square 1 22.3768 <.0001

Continuity Adj. Chi-Square 1 20.0589 <.0001

Mantel-Haenszel Chi-Square 1 21.5336 <.0001

Phi Coefficient -0.4184

Contingency Coefficient 0.3860

Cramer's V -0.4184

Fisher's Exact Test

Cell (1,1) Frequency (F) 16

Left-sided Pr <= F 2.838E-06

Right-sided Pr >= F 1.0000

Table Probability (P) 2.397E-06

Two-sided Pr <= P 4.754E-06

Sample Size = 124

The randomization statistic Q is labeled “Mantel-Haenszel Chi-Square,” and the Pearson chi-
square QP is labeled “Chi-Square.” Q has a value of 21.5336 and p < 0:0001; QP has a value
of 21.7087 and p < 0:0001. Both of these statistics are clearly signiﬁcant. There is a strong
20 Chapter 2: The 2 2 Table

association between treatment and outcome such that the test treatment results in a more favorable
response outcome than the placebo. The row percentages in Output 2.1 show that the test treatment
resulted in 67% favorable response and the placebo treatment resulted in 25% favorable response.
The output also includes a statistic labeled “Likelihood Ratio Chi-Square.” This statistic, often
written QL , is asymptotically equivalent to Q and QP . The statistic QL is described in Chapter
8 in the context of hypotheses for the odds ratio, for which there is some consideration in Section
2.5. QL is not often used in the analysis of 2 2 tables. Some of the other statistics are discussed
in the next section.

2.3 Exact Tests

Sometimes your data include small and zero cell counts. For example, consider the data in Table 2.3
from a study on treatments for healing severe infections. Randomly assigned test treatment and
control are compared to determine whether the rates of favorable response are the same.

Table 2.3 Severe Infection Treatment Outcomes

Treatment Favorable Unfavorable Total
Test 10 2 12
Control 2 4 6
Total 12 6 18

Obviously, the sample size requirements for the chi-square tests described in Section 2.2 are not
met by these data. However, if you can consider the margins (12, 6, 12, 6) to be ﬁxed, then the
random assignment and the null hypothesis of no association imply the hypergeometric distribution

n1C Šn2C ŠnC1 ŠnC2 Š

Prfnij g D
nŠn11 Šn12 Šn21 Šn22 Š

The row margins may be fixed by the treatment allocation process; that is, subjects are randomly
assigned to Test and Control. The column totals can be regarded as fixed by the null hypothesis;
there are 12 patients with favorable response and 6 patients with unfavorable response, regardless of
treatment. If the data are the result of a sample of convenience, you can still condition on marginal
totals being fixed by addressing the null hypothesis that the patients are interchangeable; that is,
the observed distributions of outcome for the two treatments are compatible with what would be
expected from random assignment. That is, all possible assignments of the outcomes for 12 of the
patients to Test and for 6 to Control are equally likely.
Recall that a p-value is the probability of the observed data or more extreme data occurring under
the null hypothesis. With Fisher’s exact test, you determine the p-value for this table by summing
the probabilities of the tables that are as likely or less likely, given the fixed margins. Table 2.4
includes all possible table configurations and their associated probabilities.

Chi-Square Distribution
100% (1)
Chi-Square Distribution
14 pages
L3 Categorical Data Analysis
No ratings yet
L3 Categorical Data Analysis
25 pages
Chi Square Tests2013
No ratings yet
Chi Square Tests2013
37 pages
Chi Square
100% (3)
Chi Square
2 pages
Chi Square Tests 2020
No ratings yet
Chi Square Tests 2020
42 pages
371 Chapter 15 Sum 15
No ratings yet
371 Chapter 15 Sum 15
34 pages
New Microsoft Word Document
100% (1)
New Microsoft Word Document
7 pages
Lesson 11 CHI SQUARE TEST OF SIGNIFICANCE2 (Autosaved)
No ratings yet
Lesson 11 CHI SQUARE TEST OF SIGNIFICANCE2 (Autosaved)
17 pages
Chi-Square: Heibatollah Baghi, and Mastee Badii
No ratings yet
Chi-Square: Heibatollah Baghi, and Mastee Badii
37 pages
Tests Using Contingency Tables: Test For Independence
No ratings yet
Tests Using Contingency Tables: Test For Independence
15 pages
Chisquare and Fisher Exact Test - v2
No ratings yet
Chisquare and Fisher Exact Test - v2
15 pages
Chapter 11 - Chi-Square Test
No ratings yet
Chapter 11 - Chi-Square Test
12 pages
Chi Square: Objectives
No ratings yet
Chi Square: Objectives
8 pages
Goodness of Fit Tests Contingency Tables
No ratings yet
Goodness of Fit Tests Contingency Tables
49 pages
Chi Square Statistics
100% (1)
Chi Square Statistics
7 pages
10measures of Association
No ratings yet
10measures of Association
249 pages
08. Chi-square Test
No ratings yet
08. Chi-square Test
46 pages
Market Risk Var: Model-Building Approach
No ratings yet
Market Risk Var: Model-Building Approach
38 pages
Chi Square Test
100% (1)
Chi Square Test
23 pages
Chi Square
No ratings yet
Chi Square
37 pages
Chi Square
No ratings yet
Chi Square
11 pages
Chi-Square Test: DR Ramakanth
No ratings yet
Chi-Square Test: DR Ramakanth
38 pages
ChiSquare Examples
No ratings yet
ChiSquare Examples
8 pages
Basic Biostatistics - Wakgari Module 17-21
No ratings yet
Basic Biostatistics - Wakgari Module 17-21
82 pages
X Test PDF
No ratings yet
X Test PDF
38 pages
Bios Tat
No ratings yet
Bios Tat
12 pages
Module 6 Chi-Square T Z Test
100% (1)
Module 6 Chi-Square T Z Test
72 pages
Chi Square Test
No ratings yet
Chi Square Test
23 pages
MAED 204 PA 299 Module 10 in Stat.
No ratings yet
MAED 204 PA 299 Module 10 in Stat.
8 pages
Lecture 13-14-15 Chi - Square Test
No ratings yet
Lecture 13-14-15 Chi - Square Test
22 pages
Chi Square Report
No ratings yet
Chi Square Report
35 pages
Week 16_ Testing for Independene_ Pearson Chi-Square Test
No ratings yet
Week 16_ Testing for Independene_ Pearson Chi-Square Test
18 pages
Measurement 6th Sem (H) DSE4 Lec 4 05 05 2020
No ratings yet
Measurement 6th Sem (H) DSE4 Lec 4 05 05 2020
19 pages
Chi square tests.pdf
No ratings yet
Chi square tests.pdf
45 pages
Test of Association
No ratings yet
Test of Association
27 pages
A. Find Critical Value Problem Statement: Code: Output:: Syit - Cost Aim: Chi-Squared Test
No ratings yet
A. Find Critical Value Problem Statement: Code: Output:: Syit - Cost Aim: Chi-Squared Test
42 pages
Chi Square Test 2
No ratings yet
Chi Square Test 2
27 pages
Chi Square Test
No ratings yet
Chi Square Test
16 pages
Biostatistics Practical Answers - 2023
No ratings yet
Biostatistics Practical Answers - 2023
11 pages
BIOL200_memo5_Nour_03-04
No ratings yet
BIOL200_memo5_Nour_03-04
3 pages
Probability and Statistics - Lecture 4
No ratings yet
Probability and Statistics - Lecture 4
35 pages
Chapter 6
No ratings yet
Chapter 6
13 pages
Lecture 17- Ch10- ChiSquare Test
No ratings yet
Lecture 17- Ch10- ChiSquare Test
35 pages
FRM Part 1: Basic Statistics
No ratings yet
FRM Part 1: Basic Statistics
28 pages
Chi-Square Test
No ratings yet
Chi-Square Test
23 pages
Chi Square Test
No ratings yet
Chi Square Test
9 pages
6.3 Chi-Square (2)
No ratings yet
6.3 Chi-Square (2)
35 pages
Chi-square (χ2) test compiled
No ratings yet
Chi-square (χ2) test compiled
34 pages
Com 201 Chi Square Test Ug2 Rad
No ratings yet
Com 201 Chi Square Test Ug2 Rad
27 pages
Biostatistics L11+12 2021
No ratings yet
Biostatistics L11+12 2021
9 pages
CHI-SQUARE TESTS
No ratings yet
CHI-SQUARE TESTS
3 pages
Chi-Square Test and Its Application in Hypothesis
No ratings yet
Chi-Square Test and Its Application in Hypothesis
3 pages
Chi-square tests
No ratings yet
Chi-square tests
6 pages
C22 P09 Chi Square Test
No ratings yet
C22 P09 Chi Square Test
33 pages
Nonparametric Testing
No ratings yet
Nonparametric Testing
4 pages
Chi—Square Test
No ratings yet
Chi—Square Test
12 pages
Kami Export - Vihan Aggarwal - Chi-Square WS #3 - Homogeneity Test 2024.3
No ratings yet
Kami Export - Vihan Aggarwal - Chi-Square WS #3 - Homogeneity Test 2024.3
9 pages
Chi Square Statistics
No ratings yet
Chi Square Statistics
7 pages
Rapidminer Report
No ratings yet
Rapidminer Report
28 pages
Chapter 10, Part A Statistical Inferences About Means and Proportions With Two Populations
No ratings yet
Chapter 10, Part A Statistical Inferences About Means and Proportions With Two Populations
48 pages
Introduction To Biostatistics Syllabus
No ratings yet
Introduction To Biostatistics Syllabus
8 pages
Generalized Linear Models For Categorical And Continuous Limited Dependent Variables Michael Smithson instant download
No ratings yet
Generalized Linear Models For Categorical And Continuous Limited Dependent Variables Michael Smithson instant download
91 pages
Test of Goodness of Fit
No ratings yet
Test of Goodness of Fit
38 pages
Chi - Square Test: PG Students: DR Amit Gujarathi DR Naresh Gill
No ratings yet
Chi - Square Test: PG Students: DR Amit Gujarathi DR Naresh Gill
32 pages
(eBook PDF) CFA Program Curriculum 2019 Level II Volumes 1-6 Box Set 2024 Scribd Download
100% (1)
(eBook PDF) CFA Program Curriculum 2019 Level II Volumes 1-6 Box Set 2024 Scribd Download
41 pages
Factor Analysis and Structural Equations Modelling: Statistics For Psychology
No ratings yet
Factor Analysis and Structural Equations Modelling: Statistics For Psychology
46 pages
ANOVA - Analysis of Variance (Slides)
No ratings yet
ANOVA - Analysis of Variance (Slides)
41 pages
FALLSEM2023-24 SWE2020 ETH VL2023240103291 2023-11-22 Reference-Material-II
No ratings yet
FALLSEM2023-24 SWE2020 ETH VL2023240103291 2023-11-22 Reference-Material-II
26 pages
Solutions Icda HW
No ratings yet
Solutions Icda HW
13 pages
JD - Data Science Analyst 2025
No ratings yet
JD - Data Science Analyst 2025
2 pages
Statprob Q4 Module 6
No ratings yet
Statprob Q4 Module 6
17 pages
Slides Classification Naivebayes
No ratings yet
Slides Classification Naivebayes
6 pages
Steagall Et Al 2008 Antinociceptive Effects of Tramadol and Acepromazine in Cats
No ratings yet
Steagall Et Al 2008 Antinociceptive Effects of Tramadol and Acepromazine in Cats
8 pages
Z-test-and-T-test
No ratings yet
Z-test-and-T-test
15 pages
ANOVA - Example - Welch and G-H - Key
No ratings yet
ANOVA - Example - Welch and G-H - Key
6 pages
Akaike's Information Criterion For Estimated Model - MATLAB Aic
No ratings yet
Akaike's Information Criterion For Estimated Model - MATLAB Aic
5 pages
Review Question - C3 - SACR3080
No ratings yet
Review Question - C3 - SACR3080
10 pages
Tugas2 Regresi Linear Berganda - Ipynb - Colab
No ratings yet
Tugas2 Regresi Linear Berganda - Ipynb - Colab
3 pages
"Compositional Data Analysis in Practice" by Michael Greenacre Universitat Pompeu Fabra (Barcelona, Spain), Chapman and Hall/CRC, 2018
No ratings yet
"Compositional Data Analysis in Practice" by Michael Greenacre Universitat Pompeu Fabra (Barcelona, Spain), Chapman and Hall/CRC, 2018
3 pages
CH 19
No ratings yet
CH 19
6 pages
Statistics for Reliability Modeling
No ratings yet
Statistics for Reliability Modeling
13 pages
Orbit Forecasting Model
No ratings yet
Orbit Forecasting Model
6 pages
T TEST
No ratings yet
T TEST
3 pages
Illustrating The T Distribution
No ratings yet
Illustrating The T Distribution
18 pages
A Data Set That Consists of Observations On A Variable or Several Variables Over Time Is Called
No ratings yet
A Data Set That Consists of Observations On A Variable or Several Variables Over Time Is Called
10 pages
Statistics For Management Unit 3 2marks
No ratings yet
Statistics For Management Unit 3 2marks
4 pages
Measures of Central Tendency
No ratings yet
Measures of Central Tendency
10 pages
F-Ratio Table 2005 - 01
No ratings yet
F-Ratio Table 2005 - 01
5 pages
Digital Signal and Image Processing using MATLAB, Volume 3: Advances and Applications, The Stochastic Case
From Everand
Digital Signal and Image Processing using MATLAB, Volume 3: Advances and Applications, The Stochastic Case
Gérard Blanchet
3/5 (1)
Shortcuts to College Calculus Refreshment Kit
From Everand
Shortcuts to College Calculus Refreshment Kit
Juan Acevedo
No ratings yet

3 SAS 1 Independence

Uploaded by

3 SAS 1 Independence

Uploaded by

2.2.

2.2 Chi-Square Statistics

Table 2.2 2 2 Contingency Table

which is the hypergeometric distribution. The expected value of nij is

A related statistic is the Pearson chi-square statistic. This statistic is written

Output 2.1 Frequency Table

Frequency Table of treat by outcome

Output 2.2 contains the table with the chi-square statistics.

Output 2.2 Chi-Square Statistics

Statistic DF Value Prob

Chi-Square 1 21.7087 <.0001

Likelihood Ratio Chi-Square 1 22.3768 <.0001

Continuity Adj. Chi-Square 1 20.0589 <.0001

Mantel-Haenszel Chi-Square 1 21.5336 <.0001

Phi Coefficient -0.4184

Contingency Coefficient 0.3860

Fisher's Exact Test

Cell (1,1) Frequency (F) 16

Left-sided Pr <= F 2.838E-06

Right-sided Pr >= F 1.0000

Table Probability (P) 2.397E-06

Two-sided Pr <= P 4.754E-06

Sample Size = 124

2.3 Exact Tests

Table 2.3 Severe Infection Treatment Outcomes

n1C Šn2C ŠnC1 ŠnC2 Š

You might also like