P8120_Lecture_5_2025 - annotated

This lecture focuses on hypothesis testing for a single binomial proportion, detailing methods for testing hypotheses, defining null and alternative hypotheses, and calculating p-values. It outlines two approaches: the Score Test using normal approximation and the Exact Test using the binomial distribution for small sample sizes. Additionally, it emphasizes the importance of understanding Type I and Type II errors in hypothesis testing.

Uploaded by

liangxuange

P8120 Analysis of Categorical Data

Lecture 5: Inference for a Single Proportion (Part 2)

February 6th, 2025
Learning Objectives
• Test hypotheses about a single binomial proportion using approximate and exact methods

Practice Problems: Posted.

Lecture 4 Review

• The likelihood function gives the probability of the observed (response) data, expressed as a function of
the parameter.
o $L(\pi) = P(X = 7 \mid \pi) = \binom{50}{7}\,\pi^{7}(1-\pi)^{50-7}$

• A range of plausible values for a population parameter is given by a confidence interval.

Two methods to calculate confidence intervals
o Wald Confidence Interval (Approximate): used when $n\hat{p} \ge 5$ and $n(1-\hat{p}) \ge 5$
o Exact Confidence Interval: used when $n\hat{p} < 5$ or $n(1-\hat{p}) < 5$
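
The review items above can be checked numerically. The following is a minimal Python sketch (not the course's SAS workflow) that evaluates the likelihood for the acupuncture data (7 refusals out of 50), locates its maximizer on a grid, and computes a Wald 95% interval. The grid step of 0.01 and the multiplier 1.96 are assumptions chosen for illustration.

```python
from math import comb, sqrt

n, x = 50, 7  # acupuncture example: 7 refusals among 50 participants

def likelihood(pi):
    """Binomial likelihood L(pi) = P(X = x | pi) for the observed data."""
    return comb(n, x) * pi**x * (1 - pi)**(n - x)

# Grid search for the maximizer; the 0.01 step is an arbitrary illustrative choice.
grid = [i / 100 for i in range(1, 100)]
pi_mle = max(grid, key=likelihood)

# Wald 95% CI: p_hat +/- 1.96 * sqrt(p_hat * (1 - p_hat) / n)
p_hat = x / n
half_width = 1.96 * sqrt(p_hat * (1 - p_hat) / n)
wald_ci = (p_hat - half_width, p_hat + half_width)

print(pi_mle)                                      # 0.14
print(round(wald_ci[0], 3), round(wald_ci[1], 3))  # 0.044 0.236
```

The grid maximizer coincides with the sample proportion 7/50 = 0.14, and the interval agrees with the 4.4% to 23.6% reported later in this lecture.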
____________________________________________________________________________________

Hypothesis Testing
A hypothesis test is a method for using sample data to decide between two competing claims (hypotheses) about a
population parameter. If it were possible to carry out a census of the entire population, then we would know
which of the two hypotheses is correct, but usually we need to decide between the
two hypotheses using information from a sample.

The null hypothesis, H0, is usually chosen to represent “no change” or “no association” whereas the alternative
hypothesis, H1, usually specifies a change, difference, or association.

One Sided vs. Two-Sided Tests

Let $\pi_0$ = null/hypothesized value for $\pi$. Recall:

$\pi$ = true proportion (population)
$\hat{p}$ = best guess of $\pi$, aka the point estimate
$\pi_0$ = hypothesized value (known/given)

One-sided
$H_0: \pi \le \pi_0$
$H_1: \pi > \pi_0$

$H_0: \pi \ge \pi_0$
$H_1: \pi < \pi_0$

Two-sided
$H_0: \pi = \pi_0$
$H_1: \pi \ne \pi_0$

P8120 Spring 2025 1

The null hypothesis H0 is rejected in favor of H1 only if sample evidence strongly suggests that H0 is false. If the sample
does not provide such evidence, then H0 is not rejected.

Therefore, the two possible outcomes of a hypothesis test are "reject H0" or "fail to reject H0".

ASIDE: Do not accept the null hypothesis H0

• In a court case, defendants are guilty or not guilty; there is no verdict of “innocent”.
• In a statistical test, the null hypothesis is rejected or not rejected.
• If p > 0.05, avoid “the drug was ineffective” or “there was no difference between groups”.
Instead, say “we did not see evidence of a drug effect” or “there was no significant difference between groups.”

Template for Conducting a Hypothesis Test

(1) Explicitly define the population parameter of interest.

(2) State the null and alternative hypotheses.
(3) State the significance level of the test.
(4) State any necessary assumptions.
(5) State the form of the test statistic and its null distribution.
(6) State the decision rule and compute the p-value.
(7) State your conclusions in the context of the problem.


Recall the definition of p-value:

A p-value is the probability of observing a test statistic (data) as extreme as or more extreme
than the test statistic (data) that we actually observed, given that the null hypothesis is true.

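Also recall that there are different errors that can be made when conducting a hypothesis test. Consider the table
below:

                      H0 true             H1 true
Reject H0             Type I error        Correct decision
Fail to reject H0     Correct decision    Type II error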

α = Type I error rate = P(reject H0 | H0 true)

β = Type II error rate = P(fail to reject H0 | H1 true)

1 − β = Power of the test = P(reject H0 | H1 true)
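
To make these three definitions concrete, here is a small Python sketch that computes them exactly for a two-sided normal-approximation test with n = 50 and a null value of 0.20 (the setting of the acupuncture example in this lecture). The rejection rule |z| ≥ 1.96 and the alternative value 0.30 are assumptions picked purely for illustration; they are not taken from the notes.

```python
from math import comb, sqrt

def binom_pmf(k, n, pi):
    """P(X = k) for X ~ Bin(n, pi)."""
    return comb(n, k) * pi**k * (1 - pi)**(n - k)

def rejection_region(n, pi0, z_crit=1.96):
    """Counts x for which the two-sided test of H0: pi = pi0 rejects."""
    se0 = sqrt(pi0 * (1 - pi0) / n)  # standard error under H0
    return [x for x in range(n + 1) if abs((x / n - pi0) / se0) >= z_crit]

def prob_reject(n, pi0, pi_true):
    """P(reject H0 | true proportion = pi_true), summed over the binomial."""
    return sum(binom_pmf(x, n, pi_true) for x in rejection_region(n, pi0))

n, pi0 = 50, 0.20
type1 = prob_reject(n, pi0, pi0)      # alpha: reject although H0 is true
power_30 = prob_reject(n, pi0, 0.30)  # power at the hypothetical alternative 0.30
type2_30 = 1 - power_30               # beta at that same alternative

print(rejection_region(n, pi0)[:5])   # smallest counts that lead to rejection
print(type1, power_30, type2_30)
```

Because X is discrete, the attained Type I error rate is generally close to, but not exactly equal to, the nominal 0.05.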

There are TWO ways to perform a hypothesis test for a single proportion. We will review both in this lecture.

1) Score Test (Approximate)

o uses the normal approximation to the binomial distribution;
o Requirement: $n\pi_0 \ge 5$ and $n(1-\pi_0) \ge 5$

2) Exact Hypothesis Test

o uses the binomial distribution when a normal approximation is inappropriate;
o typically used when $n\pi_0 < 5$ or $n(1-\pi_0) < 5$
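
The choice between the two approaches can be sketched as a small helper. This is an illustrative Python function (the name `choose_test` and the `threshold` parameter are hypothetical, not part of the course materials); it returns the score test when the rule of thumb is met and falls back to the exact test otherwise.

```python
def choose_test(n, pi0, threshold=5):
    """Return 'score' if the normal-approximation rule of thumb holds, else 'exact'."""
    if n * pi0 >= threshold and n * (1 - pi0) >= threshold:
        return "score"
    return "exact"

print(choose_test(50, 0.20))  # score  (expected counts 10 and 40)
print(choose_test(6, 0.20))   # exact  (expected counts 1.2 and 4.8)
```

The two calls correspond to the two acupuncture examples worked in this lecture (n = 50 and n = 6, both with a null value of 0.20).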


Approach #1: Score Test for a Single Proportion

We want to test whether the true proportion of interest is different from (larger or smaller than) some particular value
$\pi_0$ using our data ($n$ = sample size, $\hat{p}$ = MLE of $\pi$).

Two-sided
$H_0: \pi = \pi_0$
$H_1: \pi \ne \pi_0$

If $n\pi_0 \ge 5$ and $n(1-\pi_0) \ge 5$,

then by the Central Limit Theorem, we know that, approximately:

$\hat{p} \sim N\left(\pi, \dfrac{\pi(1-\pi)}{n}\right)$

But under the null hypothesis $H_0: \pi = \pi_0$, we would have:

$\hat{p} \sim N\left(\pi_0, \dfrac{\pi_0(1-\pi_0)}{n}\right)$


Example. Again, consider the acupuncture accrual example. Originally, investigators were interested in
determining if the true proportion of those who would refuse acupuncture therapy was 20% (H0, hypothesized
value). They observed that 7 of the 50 participants refused acupuncture therapy the first month and assumed that
the number of participants who refuse acupuncture treatment (X) follows a Bin(50, π) distribution. Is there
evidence that the true proportion of those who refused acupuncture treatment is different from 20%?

Let’s examine this question using the 7 steps:

(1) $\pi$ = true proportion of those who would refuse acupuncture treatment.

(2) H0: $\pi = 0.20$

H1: $\pi \ne 0.20$

(3) Set α = 0.05

(4) Assumptions

(5) Test Statistic and Null Distribution

$z = \dfrac{\hat{p} - \pi_0}{\sqrt{\dfrac{\pi_0(1-\pi_0)}{n}}}$ =

(6) Decision

Approach 1 (Critical value approach)

Approach 2 (p-value approach)

(7) Conclusion
At the 5% level of significance, we have insufficient evidence to conclude that the true
proportion of those who refuse acupuncture treatment is different from 20%.

Model Answer for manuscript:

Fourteen percent of the sample refused acupuncture therapy (95% CI: 4.4% to 23.6%). There is
insufficient evidence to conclude that the true proportion of those who refuse acupuncture therapy is
different from 20% (p-value = 0.2888).
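
The numbers behind steps (4) to (6) can be reproduced with a short Python sketch, offered as an independent cross-check rather than a replacement for the SAS analysis. The helper `norm_cdf` is a hypothetical name, and the critical value 1.96 is the usual two-sided 5% cutoff, assumed here.

```python
from math import sqrt, erf

def norm_cdf(z):
    """Standard normal CDF, written with the error function."""
    return 0.5 * (1 + erf(z / sqrt(2)))

n, x, pi0 = 50, 7, 0.20

# Step (4): rule-of-thumb check for the normal approximation (10 and 40 here)
assumptions_ok = n * pi0 >= 5 and n * (1 - pi0) >= 5

# Step (5): score statistic, using the standard error under H0
p_hat = x / n
z = (p_hat - pi0) / sqrt(pi0 * (1 - pi0) / n)

# Step (6): critical-value approach and p-value approach
reject = abs(z) >= 1.96
p_two_sided = 2 * (1 - norm_cdf(abs(z)))

print(assumptions_ok, round(z, 4), reject, round(p_two_sided, 4))
# True -1.0607 False 0.2888
```

Both approaches lead to the same decision: |z| is smaller than 1.96 and the p-value exceeds 0.05, so H0 is not rejected.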
Below is the SAS code and output for addressing the question above.

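data acupuncture;
input refused; /* cell counts: 7 refused, 43 did not */
cards;
7
43
;
run;

title 'Acupuncture example: Score test';

/* order = data makes the first row (the 7 refusals) the level being tested;
   weight refused treats each value as a cell count */
proc freq data = acupuncture order = data;
table refused / binomial(p=0.20) alpha = 0.05;
weight refused;
run;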


Example. Again, consider the acupuncture accrual example. What if we now wanted to test if the true proportion
of those who refused acupuncture therapy is less than 20%?
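
One way the one-sided version could be worked through, reusing the same data (7 of 50) and the same test statistic as in the two-sided example. This is a sketch: the lower-tail critical value $-1.645$ is the standard 5% point of $N(0,1)$ and is not stated in the notes.

```latex
\begin{aligned}
H_0 &: \pi \ge 0.20 \qquad H_1 : \pi < 0.20 \\[4pt]
z &= \frac{\hat{p} - \pi_0}{\sqrt{\pi_0(1-\pi_0)/n}}
   = \frac{0.14 - 0.20}{\sqrt{0.20 \times 0.80 / 50}} \approx -1.06 \\[4pt]
p\text{-value} &= P(Z \le -1.06) \approx 0.144
\end{aligned}
```

Since $z \approx -1.06$ is not below $-1.645$ (equivalently, $0.144 > 0.05$), H0 is not rejected at the 5% level. The one-sided p-value is half of the two-sided p-value of 0.2888 because the observed proportion falls on the side specified by H1.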

Notation Recap:
$\pi$ The population proportion (parameter); often referred to as the “truth”.
$\hat{p}$ The sample proportion (statistic); computed from your sample data.
$\pi_0$ The null value for $\pi$. This is the value you want to compare $\pi$ against.


Approach #2: Exact Hypothesis Test for a Single Proportion

We want to test whether the true proportion of interest is different from (larger or smaller than) some particular value $\pi_0$
using our data ($n$ = sample size, $x$ = observed number of successes).

Null and Alternative Hypotheses:

Two-sided
$H_0: \pi = \pi_0$
$H_1: \pi \ne \pi_0$

But now it may be the case that $n\pi_0 < 5$ or $n(1-\pi_0) < 5$. So, we cannot use Z and trust that it follows a
normal distribution. Instead, we work directly with the binomial distribution to calculate a p-value.

Recall: X = random variable corresponding to number of successes

$X \sim \text{Bin}(n, \pi)$

P-value = Probability of observing our data or data more extreme if H0 is true

There are several suggestions for how to compute a p-value for an exact binomial test. Here is how SAS does it:

Computing Exact two-sided p-value:

1. Compute $A = P(X \ge x_0 \mid \pi = \pi_0)$ and $B = P(X \le x_0 \mid \pi = \pi_0)$, where $x_0$ is the observed value of $x$
2. P-value = $2 \cdot \min(A, B)$

$P(X = x \mid \pi = \pi_0) = \binom{n}{x}\,\pi_0^{x}(1-\pi_0)^{n-x}$
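
The two-step recipe above can be sketched in Python as follows. The function names are hypothetical, and capping the result at 1 is an added safeguard assumed here (doubling the smaller tail can exceed 1 when the observed count sits near the center of the null distribution). As an illustration, the function is applied to the earlier data of 7 refusals out of 50, for comparison with the approximate p-value of 0.2888.

```python
from math import comb

def binom_pmf(k, n, pi):
    """P(X = k) for X ~ Bin(n, pi)."""
    return comb(n, k) * pi**k * (1 - pi)**(n - k)

def exact_two_sided_p(x_obs, n, pi0):
    """Two-sided exact p-value computed as 2 * min(A, B), capped at 1."""
    upper = sum(binom_pmf(k, n, pi0) for k in range(x_obs, n + 1))  # A = P(X >= x_obs)
    lower = sum(binom_pmf(k, n, pi0) for k in range(0, x_obs + 1))  # B = P(X <= x_obs)
    return min(1.0, 2 * min(upper, lower))

# Earlier example: 7 refusals out of 50, null value 0.20
print(exact_two_sided_p(7, 50, 0.20))
```

Other definitions of a two-sided exact p-value exist, so other software may report a slightly different number for the same data.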


Example. Out of 6 participants in a pilot study, 2 refused acupuncture treatment. Test the claim that the true
proportion of participants who will refuse acupuncture treatment is different from 20%.

Let’s examine this claim using the 7 steps:

(1) $\pi$ = true proportion of those who would refuse acupuncture treatment

(2) H0: $\pi = 0.20$

H1: $\pi \ne 0.20$

(3) Set α = 0.05

(4) Assumptions

(5) Test Statistic and Null Distribution

$A = P(X \ge x_0 \mid \pi = \pi_0)$

$B = P(X \le x_0 \mid \pi = \pi_0)$

(6) Decision

P-value = $2 \cdot \min(A, B)$ =

(7) Conclusion
At the 5% level of significance, we have insufficient evidence to conclude that the true
proportion of those who refused acupuncture therapy is different from 20%.

Model Answer for manuscript:

33.3% of the sample refused acupuncture treatment (95% CI: 4.3% to 77.7%). There is insufficient
evidence to conclude that the true proportion that refuse is different from 20% (p-value = 0.6893).
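
The p-value quoted in the model answer can be verified by hand. The following worked calculation is a sketch using $n = 6$, $x_0 = 2$, and $\pi_0 = 0.20$, with intermediate values rounded to four decimals.

```latex
\begin{aligned}
P(X = 0) &= (0.8)^6 \approx 0.2621 \\
P(X = 1) &= 6\,(0.2)(0.8)^5 \approx 0.3932 \\
P(X = 2) &= 15\,(0.2)^2(0.8)^4 \approx 0.2458 \\[4pt]
A &= P(X \ge 2 \mid \pi = 0.20) = 1 - 0.2621 - 0.3932 \approx 0.3446 \\
B &= P(X \le 2 \mid \pi = 0.20) = 0.2621 + 0.3932 + 0.2458 \approx 0.9011 \\[4pt]
p\text{-value} &= 2 \cdot \min(A, B) = 2\,(0.3446) \approx 0.6893
\end{aligned}
```

Since $0.6893 > 0.05$, H0 is not rejected, which matches the conclusion in step (7).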


Below is the SAS code and output for addressing the question above.

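data acupuncture_exact;
input refused; /* cell counts: 2 refused, 4 did not */
cards;
2
4
;
run;

title 'Acupuncture example: Exact Test';

/* the exact statement requests the binomial-based (exact) test;
   the variable must be named refused, as in the data step above */
proc freq data = acupuncture_exact order = data;
table refused / binomial (p=0.20) alpha = 0.05;
exact binomial;
weight refused;
run;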
