0% found this document useful (0 votes)

41 views

Lesson-2 1

Uploaded by

albao.elaine21

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

41 views

Lesson-2 1

Uploaded by

albao.elaine21

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 27

Polytechnic University of the Philippines

College of Science
Department of Mathematics and Statistics

CATEGORICAL DATA
ANALYSIS

Prepared by:
Ms. Katrina D. Elizon
Contingency Tables
A 2−way contingency table a.k.a. cross-tabulation
is simply a two-way array containing the joint
distribution of two categorical random variables.

Contingency tables can be used to display either

the joint frequency distribution or the joint
probability distribution.

A two-way table with I rows and J columns is called

an I × J (read I–by–J ) table.
Probabilities for contingency tables can be of three
types:

1. Join Distribution - {πi,j} = {P(X = i, Y = j)}

form the joint distribution of X and Y. They satisfy

∑
πi,j = 1

2. Marginal Distribution - the row and column

totals of the joint probabilities.

3. Condit ional Dis tribut ion - ref ers to

probability Distribution of Y at xed level of x.
fi
Contingency Tables
Heart Attack
Group Total
Yes No
Placebo 179 10,843 11,022
Aspirin 105 10,932 11,037
Total 284 21,775 22,059

Heart Attack
Group Total
Yes No
Placebo
Aspirin
Total
Independence

X and Y are statistically independent if true

conditional distribution of Y is identical at each
level of x.
When two variables are independent, the
probability of any particular column outcome i is
the same in each row.

Statistical independence is, equivalently, the

property that all joint probabilities equal the
product of their marginal probabilities.
Several ways to compare probabilities in a
table:
1. Risk Difference
2. Relative Risk
3. Odds ratio
Several ways to compare probabilities in a table:
1. Risk Difference
The difference of proportions π1 − π2 compares
the success probabilities in the two rows.
It equals zero when π1 − π2, that is, when the
response is independent of the group
classi cation.
p1 and p2 denote the sample proportions of
successes.
The sample difference p1 − p2 estimates π1 − π2.
fi
Several ways to compare probabilities in a table:
1. Risk Difference
The estimated standard error of p1 − p2:

p1(1 − p1) p2(1 − p2)

SE = +
n1 n2
The standard error decreases, and hence the
estimate of π1 − π2 improves, as the sample sizes
increase.
100(1 − α) % (Wald) con dence interval for π1 − π2 is:
(p1 − p2) ± zα/2(SE)
fi
Question???
What are the possible values
obtained by taking the difference of
two probabilities?
Several ways to compare probabilities in a table:
2. Relative Risk
A difference between two proportions of a certain
xed size usually is more important when both
proportions are near 0 or 1 than when they are
near the middle of the range.
For 2 × 2 tables, the relative risk is the ratio:
π1
relative risk =
π2
A relative risk value of 1.0 corresponds to
independence.
fi
Question???
What are the possible values if we
divide two probabilities?
Question???
Do you think it is necessary to
identify one classi cation as a
response variable in order to
estimate the relative risk?
fi
Several ways to compare probabilities in a table:
3. Odds Ratio
For a probability of success π, the odds of success
are de ned to be
π
odds =
1−π
The odds of an event are simply the probability
of the event occurring divided by the probability
that the event does not occur.
The odds are nonnegative, with value greater
than 1.0 when a success is more likely than a
failure.
fi
Question???
If the probability of success is 0.80,
compute the value of odds? And
interpret the value.
Several ways to compare probabilities in a table:
3. Odds Ratio
The success probability itself is the function of the
odds,
odds
π=
odds + 1
In 2 × 2 tables, the ratio of the two odds from the
two rows,
odds1 π1 /(1 − π1)
θ= =
odds2 π2(1 − π2)
is the odds ratio.
Question???
What are the possible values if we
divide two odds ratio?
Question???
If X and Y are independent, what is
the value of the odds ratio?
Several ways to compare probabilities in a table:
3. Odds Ratio
The independence value θ = 1 is a baseline for
comparison.
When θ > 1, the odds of success are higher in row
1 than in row 2. (π1 > π2)
When θ < 1 , the odds of success is less likely in
row 1 than in row 2. (π1 < π2)
Values of θ farther from 1.0 in a given direction
represent stronger association.
Question???
What does it mean if the computed
odds ratio for a 2 × 2 table is θ = 3?
Question???
Calculate the odds ratios for tables 1 and 2. What
did you notice?
Table 1

Heart Attack
Group Total
Yes No
Placebo 179 10,843 11,022
Aspirin 105 10,932 11,037
Total 284 21,775 22,059

Table 2

Group
Group Total
Placebo Aspirin
Yes 179 105 284
No 10,843 10,932 21,775
Total 11,022 11,037 22,059
Question???
Calculate the odds ratio for table 1, and try to
reverse the order of the row or the order of the
column. What did you notice when we compared
the odds ratio to the previous result?

Table 1

Heart Attack
Group Total
Yes No
Placebo 179 10,843 11,022
Aspirin 105 10,932 11,037
Total 284 21,775 22,059
Several ways to compare probabilities in a table:
3. Odds Ratio
When both variables are response variables, the
odds ratio can be de ned using joint probabilities as
π11 /π12 π11π22
θ= =
π21 /π22 π12π21
The odds ratio is also called the cross-product
ratio, because it equals the ratio of the products
π11π22 a n d π12π21 o f c e l l p ro b a b i l i t i e s f ro m
diagonally opposite cells.
fi
Several ways to compare probabilities in a table:
3. Odds Ratio
The sample version of θ replaces πij ’s by pij ’s, or,
equivalently by nij’s:

̂ p11 p22 n11n22

θ= =
p12 p21 n12n21
Take Note!!!
When p1 and p2 are both close to
zero, the odds ratio and relative
risk take similar values.
For rare events (small risks):
odds ratio ≈ relative risk.
Exercise 1:
Consider the following two studies reported:
a. A study reported (January 3, 1990) that, of smokers who
get lung cancer, “women were 1.4 times more
vulnerable than men to get small-cell lung cancer.”
Is 1.4 an odds ratio, or a relative risk?

a. A National Cancer Institute study about tamoxifen and

breast cancer reported (June 23, 1995) that the women
taking the drug were 37% less likely to experience
invasive breast cancer compared with the women taking
placebo. Find the relative risk for:
(i) those taking the drug compared to those taking placebo,
(ii) those taking placebo compared to those taking the drug.
Exercise 2:
In the Philippines, the estimated annual probability
that a woman over the age of 32 dies of lung cancer
equals 0.001204 for current smokers and 0.000211
for nonsmokers

a. Calculate and interpret the difference of

proportions and the relative risk. Which is more
informative for these data? Why?

b. Calculate and interpret the odds ratio. Explain why

the relative risk and odds ratio take similar values.
Exercise 3:
A 20-year study of Filipino male physicians noted that the
proportion who died from lung cancer was 0.00151 per year
for cigarette smokers and 0.00009 per year for nonsmokers.
The proportion who died from heart disease was 0.00670 for
smokers and 0.00320 for nonsmokers.

a. Describe the association of smoking with lung cancer and

with heart disease, using the difference of proportions, the
relative risk, and the odds ratio. Interpret.

b. Which response (lung cancer or heart disease) is more

strongly related to cigarette smoking, in terms of the
reduction in deaths that could occur with an absence of
smoking?

Agresti Cda
No ratings yet
Agresti Cda
191 pages
Free PMI-SP Questions
No ratings yet
Free PMI-SP Questions
7 pages
S1500 MS v00 PDF
100% (1)
S1500 MS v00 PDF
140 pages
Progress Measurement Procedure PDF
50% (2)
Progress Measurement Procedure PDF
19 pages
Lecture Notes 2
No ratings yet
Lecture Notes 2
40 pages
categorical
No ratings yet
categorical
45 pages
Statistics and Probability
No ratings yet
Statistics and Probability
42 pages
Inferences On Two-Way Contingency Tables
No ratings yet
Inferences On Two-Way Contingency Tables
45 pages
Two-Way Tables - Measures of Association
No ratings yet
Two-Way Tables - Measures of Association
33 pages
Odds Vs Risk - Common Pitfalls
No ratings yet
Odds Vs Risk - Common Pitfalls
3 pages
Odds Ratio, Hazard Ratio and Relative Risk: Janez Stare Delphine Maucort-Boulch
No ratings yet
Odds Ratio, Hazard Ratio and Relative Risk: Janez Stare Delphine Maucort-Boulch
9 pages
Methods For Proportions
No ratings yet
Methods For Proportions
19 pages
Or & RR PDF
No ratings yet
Or & RR PDF
5 pages
Lesson 5 Odds or 95CI PowerPoint
No ratings yet
Lesson 5 Odds or 95CI PowerPoint
16 pages
Odds Ratio
No ratings yet
Odds Ratio
16 pages
Odds and Relative Risk
No ratings yet
Odds and Relative Risk
4 pages
Contingency Table
No ratings yet
Contingency Table
11 pages
Mordibity and Mortality Measurements-3 (2)
No ratings yet
Mordibity and Mortality Measurements-3 (2)
33 pages
Table 1. Calculating A Risk Ratio and An Odds Ratio in A Cohort Study Develop Outcome Do Not Develop Outcome
No ratings yet
Table 1. Calculating A Risk Ratio and An Odds Ratio in A Cohort Study Develop Outcome Do Not Develop Outcome
1 page
Class Lecture-3
No ratings yet
Class Lecture-3
10 pages
Odds Ratio
No ratings yet
Odds Ratio
12 pages
Risk Assessment II
No ratings yet
Risk Assessment II
4 pages
Outline Note Allan Agresti
No ratings yet
Outline Note Allan Agresti
187 pages
Measures of Association
No ratings yet
Measures of Association
56 pages
6 Contingency Tables
No ratings yet
6 Contingency Tables
72 pages
RR and OR
No ratings yet
RR and OR
17 pages
Lecture_1_Odds_Ratio
No ratings yet
Lecture_1_Odds_Ratio
43 pages
Logistic regression_2021 ch-8
No ratings yet
Logistic regression_2021 ch-8
52 pages
Applied Statistics II-2 and III
100% (1)
Applied Statistics II-2 and III
59 pages
Measure of Association
No ratings yet
Measure of Association
18 pages
Lecture 3.measures of Effectiveness
No ratings yet
Lecture 3.measures of Effectiveness
38 pages
Lecture 2
No ratings yet
Lecture 2
48 pages
Odds Ratios and Risk Ratios: What's The Difference and Why Does It Matter?
No ratings yet
Odds Ratios and Risk Ratios: What's The Difference and Why Does It Matter?
1 page
Odds Ratios-Current Best Practice and Use: JAMA Guide To Statistics and Methods
No ratings yet
Odds Ratios-Current Best Practice and Use: JAMA Guide To Statistics and Methods
2 pages
B Relative Risk and Odds Ratios Examples
No ratings yet
B Relative Risk and Odds Ratios Examples
8 pages
PHPS30020 Week1 - 29nov2023 (Effect Measures Estimates of Risk)
No ratings yet
PHPS30020 Week1 - 29nov2023 (Effect Measures Estimates of Risk)
24 pages
1measures of Association
No ratings yet
1measures of Association
105 pages
Prospecti®e Studies Usually Condition On The Totals (N N: Comparing Two Proportions
No ratings yet
Prospecti®e Studies Usually Condition On The Totals (N N: Comparing Two Proportions
6 pages
Solutions Odd For Categorical
No ratings yet
Solutions Odd For Categorical
28 pages
1categorical Data Analysis (Chi Square) June 2022
No ratings yet
1categorical Data Analysis (Chi Square) June 2022
194 pages
Basic Concepts: Probability
No ratings yet
Basic Concepts: Probability
32 pages
Measures of Associations
No ratings yet
Measures of Associations
68 pages
Konsep Dasar Probabilitas
No ratings yet
Konsep Dasar Probabilitas
32 pages
Binary data questions
No ratings yet
Binary data questions
3 pages
Point Estimation
No ratings yet
Point Estimation
21 pages
Statistical Reasoning for Everyday Life 5th Edition Bennett Test Bank download pdf
100% (16)
Statistical Reasoning for Everyday Life 5th Edition Bennett Test Bank download pdf
38 pages
Statistical Reasoning for Everyday Life 5th Edition Bennett Test Bank download
100% (2)
Statistical Reasoning for Everyday Life 5th Edition Bennett Test Bank download
40 pages
The University of Jordan/ Faculty of Graduate Studies. Faculty of Medicine Family and Community Medicine Department
No ratings yet
The University of Jordan/ Faculty of Graduate Studies. Faculty of Medicine Family and Community Medicine Department
39 pages
RM
No ratings yet
RM
12 pages
Session 5 Week 3-1
No ratings yet
Session 5 Week 3-1
24 pages
Chap 1 Basic Probability Concept
100% (1)
Chap 1 Basic Probability Concept
32 pages
5 Measures of association-converted
No ratings yet
5 Measures of association-converted
32 pages
The Odds Ratio Principles and Applications
No ratings yet
The Odds Ratio Principles and Applications
3 pages
Catedatach2 PDF
No ratings yet
Catedatach2 PDF
100 pages
logistic regression
No ratings yet
logistic regression
79 pages
Odds Ratio
No ratings yet
Odds Ratio
3 pages
Lecture 3-5 - Analyzing Contingency Tables: Azadeh Alimadad. DANA 4820 Jan 17 - 24, 2022
No ratings yet
Lecture 3-5 - Analyzing Contingency Tables: Azadeh Alimadad. DANA 4820 Jan 17 - 24, 2022
25 pages
Basic Concepts: Probability
No ratings yet
Basic Concepts: Probability
32 pages
Measures of Effect
No ratings yet
Measures of Effect
51 pages
Chapter3 Part1 Slides
No ratings yet
Chapter3 Part1 Slides
38 pages
Math for Computer Applications
From Everand
Math for Computer Applications
The Editors of REA
No ratings yet
Impulse Balance Theory and its Extension by an Additional Criterion
From Everand
Impulse Balance Theory and its Extension by an Additional Criterion
Reinhard Selten
1/5 (1)
Probability Theory: A Concise Course
From Everand
Probability Theory: A Concise Course
Y. A. Rozanov
4/5 (2)
DLP English 7 - Albao
No ratings yet
DLP English 7 - Albao
8 pages
Group 3 SQC Final
No ratings yet
Group 3 SQC Final
11 pages
Eng3 Quarter2 Module 5 and 6 Hybrid Revised
No ratings yet
Eng3 Quarter2 Module 5 and 6 Hybrid Revised
18 pages
Eng3 Quarter2 Module 3 Hybrid Revised
No ratings yet
Eng3 Quarter2 Module 3 Hybrid Revised
9 pages
Eng3 Quarter2 Module 1 Hybrid Revised
No ratings yet
Eng3 Quarter2 Module 1 Hybrid Revised
9 pages
B. SPT Memo TOR MAES
No ratings yet
B. SPT Memo TOR MAES
8 pages
Juan Paredes Poster
100% (1)
Juan Paredes Poster
1 page
Planeación
No ratings yet
Planeación
2 pages
Meaning and Definition of Grammar
No ratings yet
Meaning and Definition of Grammar
8 pages
Games: Equity Versus Efficiency? Evidence From Three-Person Generosity Experiments
No ratings yet
Games: Equity Versus Efficiency? Evidence From Three-Person Generosity Experiments
14 pages
(Thesis) Color Emotion in Arc
100% (1)
(Thesis) Color Emotion in Arc
210 pages
A Grammatical Error Analysis of The Students' Writing of Reporting A School of The Eleventh Year Students' of SMA Negeri 1 Semin in 2011
No ratings yet
A Grammatical Error Analysis of The Students' Writing of Reporting A School of The Eleventh Year Students' of SMA Negeri 1 Semin in 2011
77 pages
Atraction and Beauty
No ratings yet
Atraction and Beauty
4 pages
PipeLay Data Sheet
No ratings yet
PipeLay Data Sheet
2 pages
Johya Melliza v. Legacion
No ratings yet
Johya Melliza v. Legacion
4 pages
Javascript Interview Questions and Answers
No ratings yet
Javascript Interview Questions and Answers
26 pages
Artificial Intelligence & Managing Risk in Banking: Banking For The Future
No ratings yet
Artificial Intelligence & Managing Risk in Banking: Banking For The Future
21 pages
ACS 14 Most Essential Parts of A Business Letter
No ratings yet
ACS 14 Most Essential Parts of A Business Letter
3 pages
Consumers Equilibrium
No ratings yet
Consumers Equilibrium
52 pages
Y013AA1H2ES - Datasheet
No ratings yet
Y013AA1H2ES - Datasheet
1 page
Borang Penilaian Prestasi Ncs-Core Abilities: Jpk/Ca/Pp
No ratings yet
Borang Penilaian Prestasi Ncs-Core Abilities: Jpk/Ca/Pp
4 pages
Oral Presentation Mcqs
No ratings yet
Oral Presentation Mcqs
10 pages
Cache Design
No ratings yet
Cache Design
59 pages
NX
No ratings yet
NX
25 pages
Prosp
No ratings yet
Prosp
146 pages
Basic Understanding of Machinery Vibration
100% (1)
Basic Understanding of Machinery Vibration
48 pages
OPEN CLOZ
No ratings yet
OPEN CLOZ
3 pages
Elder Impluse System
No ratings yet
Elder Impluse System
16 pages
(Paper) P-CPICH Power and Antenna Tilt Optimization in UMTS Networks
No ratings yet
(Paper) P-CPICH Power and Antenna Tilt Optimization in UMTS Networks
6 pages
Test 10
No ratings yet
Test 10
27 pages
J. C. Burkill-Theory of Ordinary Differential Equations
100% (3)
J. C. Burkill-Theory of Ordinary Differential Equations
125 pages
Virginia Woolf The Lady in The Looking Glass
No ratings yet
Virginia Woolf The Lady in The Looking Glass
5 pages

Lesson-2 1

Uploaded by

Lesson-2 1

Uploaded by

Polytechnic University of the Philippines

Contingency tables can be used to display either

A two-way table with I rows and J columns is called

1. Join Distribution - {πi,j} = {P(X = i, Y = j)}

2. Marginal Distribution - the row and column

3. Condit ional Dis tribut ion - ref ers to

X and Y are statistically independent if true

Statistical independence is, equivalently, the

p1(1 − p1) p2(1 − p2)

̂ p11 p22 n11n22

a. A National Cancer Institute study about tamoxifen and

a. Calculate and interpret the difference of

b. Calculate and interpret the odds ratio. Explain why

a. Describe the association of smoking with lung cancer and

b. Which response (lung cancer or heart disease) is more

You might also like