離散資料分析 Categorical Data Analysis: 陳俞成 Email:[email protected]
離散資料分析 Categorical Data Analysis: 陳俞成 Email:[email protected]
離散資料分析
Categorical Data Analysis
陳俞成
Email:[email protected]
2005.10.17
Partial Tables
Partial Tables
Partial Tables
I The association in partial tables are called
conditional associations.
I Conditional associations refer to the effect of X
on Y conditional on fixing Z at some level.
I Conditional associations in partial tables can be
quite different from associations in marginal
tables.
I It can be misleading to analyze only a marginal
table of a multi-way contingency table.
陳俞成 Email:[email protected] 離散資料分析 Categorical Data Analysis
Partial Association
大綱
Cochran-Mantel-Haenszel Methods
Chapter 3 Three-Way Contingency Tables
Exact Inference About Conditional Associations
I Figure 3.2.
I A marginal association can have different
direction from the conditional associations is
called Simpson’s paradox.
I This result applies to quantitative as well as
categorical variables.
n11k n22k
I Sample estimate of θXY (k) is θ̂XY (k) = n12k n21k
n11+ n22+
I Sample estimate of θXY is θ̂XY = n12+ n21+
I Y = response(success, failure),
X = drug treatment(A,B),
and Z = clinic(1,2)
18×8 2×32
I θ̂XY (1) = 12×12 = 1 and θ̂XY (2) = 8×8 =1
20×40
I θ̂XY = 20×20 = 2 > 1
Homogeneous Association
I There is homogeneous X − Y association in a
2 × 2 × K table when
θXY (1) = θXY (2) = · · · = θXY (K ) .
I The effect of X on Y is the same at each level of
Z , and a single number describes the X − Y
conditional associations.
I Conditional idependence of X and Y is the
special case in which each conditional odds ratio
equals 1.0.
陳俞成 Email:[email protected] 離散資料分析 Categorical Data Analysis
Partial Association
大綱
Cochran-Mantel-Haenszel Methods
Chapter 3 Three-Way Contingency Tables
Exact Inference About Conditional Associations
Homogeneous Association
I Homogeneous X − Y association in an I × J × K
table means that any conditional odds ratio
formed using two levels of X and two levels of Y
is the same at each level of Z .
I When X − Y conditional odds ratios are identical
at each level of Z , the same property holds for
the other association. For instance, the
conditional odds ratio between two levels of X
and two levels of Z is identical at each level of Y .
陳俞成 Email:[email protected] 離散資料分析 Categorical Data Analysis
Partial Association
大綱
Cochran-Mantel-Haenszel Methods
Chapter 3 Three-Way Contingency Tables
Exact Inference About Conditional Associations
Homogeneous Association
Homogeneous Association
I When homogeneous association does not exist,
the conditional odds ratio for any pair of variables
changes across levels of the third variable.
I For X = smoking(yes, no), Y = lung cancer(yes,
no), and Z = age(< 45, 45 − 65, > 65), suppose
θXY (1) = 1.2, θXY (2) = 2.8, θXY (3) = 6.2. Then,
smoking has a weak effect on lung cancer for
young people, but the effect strengthens
considerably with age.
陳俞成 Email:[email protected] 離散資料分析 Categorical Data Analysis
Partial Association
大綱
Cochran-Mantel-Haenszel Methods
Chapter 3 Three-Way Contingency Tables
Exact Inference About Conditional Associations
I For 2 × 2 × K tables
I H0 : X and Y are conditionally independent, given Z
I H0 : θXY (i) = 1, i = 1, 2, · · · , K
n1+k n+1k
µ11k = E (n11k ) =
n++k
n1+k n2+k n+1k n+2k
Var(n11k ) = 2
n++k (n++k − 1)
Lung Cancer
Some Caveats
I R.Tarone showed(Biometrika, 72: 91-95(1985))
that adjust the Breslow-Day statistic by
subtracting
Some Caveats
I If for each case-control pair we had information
about whether each subject had been a smoker,
we could form a 2 × 2 table relating whether the
control had ever been a smoker(yes,no) to
whether the case had ever been a smoker(yes,no).
I Chapter 9 discusses some of these.
I §6.5.1 discusses an alternative test of
homogeneity of odds ratios, based on models.
I §7.3presents generalizations of the CMH test for
I × J × K tables.
陳俞成 Email:[email protected] 離散資料分析 Categorical Data Analysis
Partial Association
大綱
Cochran-Mantel-Haenszel Methods
Chapter 3 Three-Way Contingency Tables
Exact Inference About Conditional Associations
P ∗ ≤ P( k n11k = observed
P
Summary
I Partial Association
I Large-sample significance tests for conditional
independence and homogeneity
I Small-sample significance tests for conditional
independence and homogeneity