0% found this document useful (0 votes)

50 views56 pages

9sample Size Determination

This document discusses determining appropriate sample sizes for studies. It covers estimating sample sizes for single populations, comparing two populations, and hypothesis testing. Key factors that influence sample size calculations include the required precision, confidence level, estimates of variance, and desired power.

Uploaded by

Muhe Man

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

50 views56 pages

9sample Size Determination

Uploaded by

Muhe Man

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 56

Sample Size Determination

Wakgari Deressa, PhD

School of Public Health
Addis Ababa University
• An essential part of planning any study
is to decide how many people need to
be studied
Sample Size
• Sample Size: The number of study
subjects selected to represent a given
study population.
• Important to make inferences based on
the findings from the sample.
• Should be sufficient to represent the
characteristics of interest of the study
population.
• In estimating a certain characteristic of
a population, sample size calculations
are important to ensure that estimates
are obtained with required precision or
confidence
• The accuracy of the envisaged results
determine the size of the sample
Example
• A prevalence of 10% from a sample size of
20
– would have a 95% CI of 1% to 31%,
– which is not very precise or informative.
• But, a prevalence of 10% from a sample
of size 400
– would have a 95% CI of 7% to 13%,
– which may be considered sufficiently
accurate.
• In studies concerned with detecting an
effect (e.g. a difference between two
groups), sample size calculations are
important to ensure the detection of
whether association exists or not.
• Large sample size can result in statistical
significance when Δ is very small (not of
practical, clinical or public health
importance).
• Small sample size can result in a non
statistically significant finding even when Δ
is large (of practical, clinical or public
health importance).
• If the sample is too small, then even if
large differences are observed, it will be
impossible to show that these are due to
anything more than sampling variation.
Sample size determination depends on the:
– Objective of the study
– Design of the study
• Descriptive/Analytic
– Accuracy of the measurements to be made
– Degree of precision required for generalization
– Plan for statistical analysis
– Degree of confidence with which to conclude
• Common questions:
– “How many subjects should I study?”
– Too small sample = Waste of time and resources
= Results have no practical use
– Too large sample = Waste of resources
= Data quality compromised
When deciding on sample size:
PRECISION COST
∆

Sample size = Precision = Cost

• The feasible sample size is also
determined by the availability of
resources:
– time
– manpower
– transport
– available facility, and
– money
1. Sample Size: Single Sample
• The aim is to have a large enough sample
with which to estimate a population mean
or proportion within a narrow interval with
high reliability.
• Concerned with the precision of the
estimate (“narrowness of the CI”).
estimate ± d units
Sample size for single sample
includes:
A. Sample size for estimating a single
population mean
B. Sample size to estimate a single
population proportion
A. Sample size for estimating
a single population mean
• AIM: Estimate µ
• WANT: Estimate ( ) ± d units
where d = Margin of error =
= Absolute precision
= Half of the width (w) of CI
Steps:
1. Specify d (or w = 2d)
2. Use known σ2 or estimate using s2
Standard error of the
estimator of the parameter
3. of interest

Where d = e in some text books

Example:
1. Find the minimum sample size needed to estimate
the drop in heart rate (µ) for a new study using a
higher dose of propranolol than the standard one.
We require that the two-sided 95% CI for µ be no
wider than 5 beats per minute and the sample sd
for change in heart rate equals 10 beats per
minute.
2 2 2
n = (1.96) 10 /(2.5) = 62 patients
2. Suppose that for a certain group of cancer patients, we
are interested in estimating the mean age at diagnosis.
We would like a 95% CI of 5 years wide. If the
population SD is 12 years, how large should our
sample be?
• Suppose d=1
• Then the sample size increases
3. A hospital director wishes to estimate the
mean weight of babies born in the
hospital. How large a sample of birth
records should be taken if she/he wants a
95% CI of 0.5 wide? Assume that a
reasonable estimate of  is 2. Ans: 246
birth records.
But the population 2 is most of the
time unknown
As a result, it has to be estimated from:
• Pilot or preliminary sample:
– Select a pilot sample and estimate 2 with
the sample variance, s2
• Previous or similar studies
B. Sample size to estimate a single
population proportion
• Aim: Estimate p
• Want: Estimate ± d units where d = Z•SE
(95% CI of width=2d)
Steps:
1. Specify d (or w = 2d)
2. Use estimated p (use p=0.5 if no
information)
3. Solve for n
1. Suppose that you are interested to know the
proportion of infants who breastfed >18
months of age in a rural area. Suppose that in
a similar area, the proportion (p) of breastfed
infants was found to be 0.20. What sample
size is required to estimate the true proportion
within ±3% points with 95% confidence. Let
p=0.20, d=0.03, α=5%
• Suppose there is no prior information
about the proportion (p) who breastfeed
• Assume p=q=0.5 (most conservative)
• Then the required sample size increases
• An estimate of p is not always available.
• However, the formula may also be used
for sample size calculation based on
various assumptions for the values of p.
• P = 0.1  n = (1.96)2(0.1)(0.9)/(0.05)2 = 138
P = 0.2  n = (1.96)2(0.2)(0.8)/(0.05)2 = 246
P = 0.3  n = (1.96)2(0.3)(0.7)/(0.05)2 = 323
P = 0.5  n = (1.96)2(0.5)(0.5)/(0.05)2 = 384
P = 0.7  n = (1.96)2(0.7)(0.3)/(0.05)2 = 323
P = 0.8  n = (1.96)2(0.8)(0.2)/(0.05)2 = 246
• For a fixed absolute precision (d), the
required sample size increases as P
increases form 0 to 0.5, and then
decreases in the same way as the
prevalence approaches 1.
2. A survey is planned to determine what
proportion of the medical students have
regularly chewed khat. If no estimate of p is
available and a pilot sample cannot be
drawn, what sample size would be required
if a 95% confidence is desired, and d=0.04
is to be used.
Ans: 600 students
2. Sample Size: Two Samples
A. Estimation of the difference between two
population means
B. Estimation of the difference between two
population proportions
A. Sample size for estimating
a difference in two means
• Aim: Estimate μ1-μ2
• Want: within ± d units,
where d = Zα/2.SE
(95% CI of width= w =2d)
• If equal sample size in both groups is
required, then:

2 2 2 2
• Use σ1 , σ2 or estimate using s1 and s2
B. Sample size for estimating a
difference in two proportions
• Aim: Estimate p1-p2
• Want: within ± d units
where d = Zα/2•SE
(95% CI of width = w = 2d)
• If equal sample sizes in both groups, then:

• Use estimates of p1, p2 or (or p1=p2 =0.5 if

unknown)
Points for Consideration
1. Sample size estimates might need to be adjusted to compensate
for non-response rate, patient dropout or loss to follow-up, lack
of compliance, etc.
2. If sampling is from a finite population of size N, then:
n0
n=
 n0 
1 + 
 N

where n0 is the sample from an infinite population. When N is

large in comparison to n, (i.e., n/N ≤ 0.05), the finite population
correction may be ignored.
3. Design effect for complex cluster sampling. Common values:
multiply n by 2, 3, …5.
3. Sample Size Based on
Hypothesis Testing
• The method of determining sample size in
the preceding sections takes into account
the probability of a type I error, but not a
type II error since the level of confidence is
determined by the confidence level (1-α).
• However, in many statistical inference
procedures, type II and type I errors are
considered when determining the sample
size.
Significance Difference Between Two Groups

• Using power of a study to determine

sample size = significant difference
= Hypothesis testing

• Aim: Have large enough samples to

detect a difference in population means
(or in population proportions)
• We would like to maintain low probability of
a Type I error (α) and low probability of a
Type II error (β) [high power = 1 - β].

Significance level of a test = α = Type I error

1 – α = Confidence 1 – β = Power
• Type I error (α) = The probability of
rejecting Ho when it is true

• Type II error () = The probability of not

rejecting Ho when it is false
• Power (1-) = the probability H0 is rejected
given that it is false
= P (rejecting Ho/H1 is true)
• If the power of a test is low, then there is
little chance of detecting a difference even if
one really exists
• Power is an important part of the design of a
study
– Power (1 - β) = 50%, Zβ = 0.00
– Power (1 - β) = 75%, Zβ = 0.67
– Power (1 - β) = 80%, Zβ = 0.84
– Power (1 – β) = 90%, Zβ = 1.28
• Power is one-sided and Zβ is always one-
sided
• Most of the studies recommend power of
80%.
Factors affecting the power
• If α decreases, the power decreases
• When the difference between Ho and HA
increases, then the power increases
• When  increases, then the power
decreases
• If the sample size (n) increases, the power
increases
Factors affecting the sample size
• The sample size increases as 2 increases
• The sample size increases as the
significance level (α) is made smaller (α
decreases)
• The sample size increases as the required
power increases
• The sample size decreases as the absolute
value of the difference between the Ho and
HA) increases
Ho = There is no difference between the
two groups
Ho: µ1 - µ2 = 0
P1 - P 2 = 0
HA = There is a difference between the
two groups
HA: µ1 - µ2 ≠ 0
P1 - P 2 ≠ 0
A. Comparison between two
means (Equal sample sizes)

∆ = /μ1-μ2/

The means and variances of the two respective groups

are (µ1, 2 ) and (µ2, 22).
1
Example
1. Determine the sample sizes required to detect a
difference of 5 mm in mean blood pressure
between individuals receiving placebo and those
receiving drug with α =5% and power of 0.80
• Assume σ1=σ2 = 15 mm in each group.
• We are interested in testing:
Ho: μ1- μ2 = 5, HA: μ1- μ2 ≠ 5

• We would need 142 individuals in each group

2. Suppose that the true blood pressure distribution
among OC users is normal with µ1 and 12.
Similarly, for non-users the distribution is normal
with µ2 and 22.We wish to test the hypothesis
that Ho: µ1 = µ2 versus µ1 ≠ µ2. Determine the
appropriate sample size for the study using α
=0.05 and a power of 80%. It was revealed by
the small study that: sample mean1=132.86,
s1=15.34, sample mean2=127.44, and s2=18.23.
Use the sample data to estimate population
parameters.
n = (15.342+18.232)(1.96+0.84)2/(132.86-127.44)2
= 152 in each group
B. Comparison between two
means (Unequal sample sizes)

λ =n2/n1
In some text books, λ = k = r
3. Suppose we anticipate twice as many non
OC users as OC users entering the study
using the previous example. Determine
the sample size to achieve an 80% power
in the study using α=0.05. λ = 2.

n1 = (15.342+18.232/2)(1.96+0.84)2/(5.42)2
= 108 OC users
and n2 = 2(108) = 216 non-OC users.
C. Comparison between two
proportions (Equal sample sizes)
• To test the hypothesis,
Ho: p1-p2 vs HA: p1-p2 ≠ 0,
|p1-p2| = ∆
with α and power (1-)
Where

∆ = p1-p2
• Let p1=0.35, p2=0.25, and Δ=p1-p2=0.35-
0.25 =0.10

• We would need approximately 329

subjects in each group
D. Comparison between two
proportions (Unequal sample sizes)

Note: This formula is quite general, and applies to cross-sectional,

case-control and cohort studies.
Example
• A study is proposed to study the effect of a new
anticoagulant therapy. Patients are to be
randomly divided into two groups: one receives
the anticoagulant, and the other placebo. The
groups are then followed for the incidence of
major bleeding events over 3 years. Suppose
that 5% of treated patients and 22% of controls
are anticipated to experience a major event over
3 years. How large sample should such a study
be to have an 80% chance of finding a
significance difference at a ratio of 1:2 for
treated and control at α =5%.
Solution
p1=0.05, p2=0.22, = (0.05+2*0.22)/(1+2) = 0.16
q1=0.95, q2=0.78, ∆ = 0.22-0.05 = 0.17
• If the OR or RR and one of the
proportions are known, we can compute
the unknown proportion by:

P2
P1  P1 = P2 * RR
1  P2
P2 
OR
Example
• A case-control study to compare the efficacy of
a vaccine for the prevention of child-hood
tuberculosis with a placebo. Let the proportion of
unvaccinated children is 30%, with an estimated
OR of at least 2.
P2 = 0.3, q2 = 0.7, OR = 2.0
P1 = 0.3/(0.3+0.7/2) = 0.462
• With equal cases and controls, what sample size
is required to detect, with 80% power and at α
5%?
= 140 in each group
Summary
• Sample size calculations depend on a
number of assumptions:
– the hypothesized difference of interest, Δ
– the probability of Type I error (α)
– the probability of Type II error (β)
– the variance
• Choice of sample size depends on a
balance of reasonable assumptions, time,
effort, and expense
• Sample sizes provide a minimum estimate
of the desired sample sizes for the study

Biostatistics-I MCQS: Topic: Sample Descriptive Statics
100% (9)
Biostatistics-I MCQS: Topic: Sample Descriptive Statics
40 pages
2sample Size Determination Jan 2023
No ratings yet
2sample Size Determination Jan 2023
69 pages
Sample Size Determination
No ratings yet
Sample Size Determination
66 pages
L6 Sample Size Estimation
No ratings yet
L6 Sample Size Estimation
16 pages
Sample Size (1) .PPTX - Read-Only
No ratings yet
Sample Size (1) .PPTX - Read-Only
43 pages
Sampling
No ratings yet
Sampling
30 pages
5 Sample size determ
No ratings yet
5 Sample size determ
29 pages
Sample Size
No ratings yet
Sample Size
45 pages
Sample Size Determination
No ratings yet
Sample Size Determination
29 pages
SampleSizeNew(01062022)
No ratings yet
SampleSizeNew(01062022)
49 pages
Kajal Srivastava SPM Deptt. S.N.Medical College,: Determining The Size of A Sample
No ratings yet
Kajal Srivastava SPM Deptt. S.N.Medical College,: Determining The Size of A Sample
38 pages
Samplesize - Ug 2021
No ratings yet
Samplesize - Ug 2021
22 pages
DR Pinzon - Sample Size Klinik
No ratings yet
DR Pinzon - Sample Size Klinik
45 pages
Sample_Size_calculation_2024-Esnat Chirwa
No ratings yet
Sample_Size_calculation_2024-Esnat Chirwa
38 pages
Sample - Size - Calculation - LeyeADEOMI
No ratings yet
Sample - Size - Calculation - LeyeADEOMI
42 pages
Sample Size Determination
No ratings yet
Sample Size Determination
29 pages
Sample Size Estimation
No ratings yet
Sample Size Estimation
20 pages
13. Sample Size Determination
No ratings yet
13. Sample Size Determination
38 pages
PDF Sample Size Determination
No ratings yet
PDF Sample Size Determination
22 pages
Sample Size Calculation
No ratings yet
Sample Size Calculation
30 pages
Type of Studies: Sample Size Determination
No ratings yet
Type of Studies: Sample Size Determination
14 pages
7 Sample Size Determination
No ratings yet
7 Sample Size Determination
27 pages
EPIB 660 - 2008 - Session 8 - Sample Size Calculations
No ratings yet
EPIB 660 - 2008 - Session 8 - Sample Size Calculations
17 pages
Sample Size Calculations: DR R.P. Nerurkar
No ratings yet
Sample Size Calculations: DR R.P. Nerurkar
48 pages
Stat Lea Int Cal PDF
No ratings yet
Stat Lea Int Cal PDF
5 pages
Sample Size Determination
No ratings yet
Sample Size Determination
42 pages
Sample - Size - Calculation - LeyeADEOMI
No ratings yet
Sample - Size - Calculation - LeyeADEOMI
42 pages
Exercises 5 4
No ratings yet
Exercises 5 4
8 pages
Sample Size.determination
No ratings yet
Sample Size.determination
36 pages
Submodule6 Sample-Size-Determination Ver2 21nov2018
No ratings yet
Submodule6 Sample-Size-Determination Ver2 21nov2018
9 pages
5. sample size (1)
No ratings yet
5. sample size (1)
22 pages
Metlit 10-Besar Sampel_20210920
No ratings yet
Metlit 10-Besar Sampel_20210920
41 pages
Sample Size Calculation
No ratings yet
Sample Size Calculation
14 pages
Sample Size
No ratings yet
Sample Size
30 pages
Sample_size_calculation
No ratings yet
Sample_size_calculation
3 pages
Week 9(1).pdf
No ratings yet
Week 9(1).pdf
4 pages
Sample Size (PH.D.) Aswan - PPSX
No ratings yet
Sample Size (PH.D.) Aswan - PPSX
44 pages
Sample Size A Rough Guide: Ronán Conroy
No ratings yet
Sample Size A Rough Guide: Ronán Conroy
30 pages
How To Calculate Sample Size F
No ratings yet
How To Calculate Sample Size F
9 pages
Sample Size Estimation
No ratings yet
Sample Size Estimation
14 pages
DOC-20230605-WA0005.
No ratings yet
DOC-20230605-WA0005.
13 pages
Sample Size Determination 03202012
No ratings yet
Sample Size Determination 03202012
28 pages
Lesson 4 Sample Size Determination
No ratings yet
Lesson 4 Sample Size Determination
16 pages
Sample Size Determination: Janice Weinberg, SCD Professor of Biostatistics Boston University School of Public Health
No ratings yet
Sample Size Determination: Janice Weinberg, SCD Professor of Biostatistics Boston University School of Public Health
28 pages
Sample Size Determination
No ratings yet
Sample Size Determination
3 pages
Sample Size Determination: Maj. Tun Tun Win
No ratings yet
Sample Size Determination: Maj. Tun Tun Win
38 pages
sampling size dertimantion (1)
No ratings yet
sampling size dertimantion (1)
26 pages
Sample Size Estimation
No ratings yet
Sample Size Estimation
14 pages
Designing Methodology
No ratings yet
Designing Methodology
93 pages
Sample Size Merwyn
No ratings yet
Sample Size Merwyn
40 pages
Sample Size and Power of Study
No ratings yet
Sample Size and Power of Study
5 pages
L5 Sample Size Calculation
No ratings yet
L5 Sample Size Calculation
23 pages
Lect 5 Sample Size Estimation-2013
No ratings yet
Lect 5 Sample Size Estimation-2013
17 pages
3165
No ratings yet
3165
6 pages
Sample Size Determination: BY DR Zubair K.O
100% (1)
Sample Size Determination: BY DR Zubair K.O
43 pages
Sample Size Determination: BY DR Zubair K.O
100% (1)
Sample Size Determination: BY DR Zubair K.O
43 pages
Final Assignment 2
No ratings yet
Final Assignment 2
10 pages
Research Methods
No ratings yet
Research Methods
103 pages
Sample Size Calculation, Test of Significance,Parametric Non Parametric Test,SPSS
No ratings yet
Sample Size Calculation, Test of Significance,Parametric Non Parametric Test,SPSS
106 pages
Sampling in Statistics
From Everand
Sampling in Statistics
Stephanie Glen
No ratings yet
Elementary Statistics
From Everand
Elementary Statistics
jay prakash Maheshwari
5/5 (1)
4_5778384412320205850
No ratings yet
4_5778384412320205850
35 pages
Edited_Health_Centers_2nd_Q_and_6months_2014al_Performance_RMTemplate(3)
No ratings yet
Edited_Health_Centers_2nd_Q_and_6months_2014al_Performance_RMTemplate(3)
19 pages
Biostatistics chapter 3
No ratings yet
Biostatistics chapter 3
66 pages
LO3 Collect and Handle Sample
No ratings yet
LO3 Collect and Handle Sample
63 pages
Lo 2
No ratings yet
Lo 2
145 pages
Final Exam Health Survay
0% (1)
Final Exam Health Survay
4 pages
JJ Research
No ratings yet
JJ Research
37 pages
Copy of Assessment of Women During Labor and Delivery
No ratings yet
Copy of Assessment of Women During Labor and Delivery
19 pages
Anwer
No ratings yet
Anwer
42 pages
Statistical Methods For Environmental Pollution Monitoring
No ratings yet
Statistical Methods For Environmental Pollution Monitoring
20 pages
Significance Test and Confidence Intervals (N 30) : Hlaing Minn Latt
No ratings yet
Significance Test and Confidence Intervals (N 30) : Hlaing Minn Latt
24 pages
Statdisk User Manual
0% (1)
Statdisk User Manual
20 pages
Even You Can Learn Statistics and Analytics: An Easy to Understand Guide, 4th Edition David M. Levine 2024 scribd download
100% (8)
Even You Can Learn Statistics and Analytics: An Easy to Understand Guide, 4th Edition David M. Levine 2024 scribd download
66 pages
Unit 3 Part II
No ratings yet
Unit 3 Part II
45 pages
Standard Test Method for Determination of Total Solids in Biomass
No ratings yet
Standard Test Method for Determination of Total Solids in Biomass
3 pages
Lost in Translation: How Not To Make Qualitative Research More Scientific
No ratings yet
Lost in Translation: How Not To Make Qualitative Research More Scientific
8 pages
Infineon-TDA21470-FITReport-v04_00-EN
No ratings yet
Infineon-TDA21470-FITReport-v04_00-EN
1 page
Statistical Analysis Illustrated - Foundations
No ratings yet
Statistical Analysis Illustrated - Foundations
91 pages
5 Assignment5
67% (3)
5 Assignment5
10 pages
ConfidenceIntervalFormulaMeaning, Calculation, SolvedExamples 1710827614746
No ratings yet
ConfidenceIntervalFormulaMeaning, Calculation, SolvedExamples 1710827614746
8 pages
Results of Statistical Analysis of Pressure Relief Valve Proof Test Data
No ratings yet
Results of Statistical Analysis of Pressure Relief Valve Proof Test Data
20 pages
Simple Model For Wall Deflection Caused by Braced Excavation in Clays
No ratings yet
Simple Model For Wall Deflection Caused by Braced Excavation in Clays
16 pages
HLST 2301 Notes Print Me
No ratings yet
HLST 2301 Notes Print Me
29 pages
Research Methodology
No ratings yet
Research Methodology
145 pages
The Intersection of Marketing and Human Resource Disciplines: Employer Brand Equity As A Mediator in Recruitment Process
100% (1)
The Intersection of Marketing and Human Resource Disciplines: Employer Brand Equity As A Mediator in Recruitment Process
11 pages
Tugas Ibu Sumah No 2 PDF
No ratings yet
Tugas Ibu Sumah No 2 PDF
42 pages
1977 - Sample Size To Set Tolerance Interval
No ratings yet
1977 - Sample Size To Set Tolerance Interval
8 pages
1-s2.0-S0015028224000815-main
No ratings yet
1-s2.0-S0015028224000815-main
3 pages
State Practice ACI 317
50% (2)
State Practice ACI 317
8 pages
Weibull Distribution - Real Statistics Using Excel
No ratings yet
Weibull Distribution - Real Statistics Using Excel
16 pages
Statistics: Complementary: Syllabus For B.Sc. (Mathematics/Cs Main) CBCSSUG 2019 (2019 Admission Onwards)
No ratings yet
Statistics: Complementary: Syllabus For B.Sc. (Mathematics/Cs Main) CBCSSUG 2019 (2019 Admission Onwards)
9 pages
Two Mark Questions With Answers (1)
No ratings yet
Two Mark Questions With Answers (1)
31 pages
Elliott 2007
No ratings yet
Elliott 2007
23 pages
Statistical Inference
100% (1)
Statistical Inference
33 pages
Eco 5
No ratings yet
Eco 5
30 pages
24 Bonferroni Inequality
No ratings yet
24 Bonferroni Inequality
3 pages
Confidence Intervals For Point Biserial Correlation
No ratings yet
Confidence Intervals For Point Biserial Correlation
6 pages
Frisco TX Cancer Report Final
No ratings yet
Frisco TX Cancer Report Final
20 pages

9sample Size Determination

Uploaded by

9sample Size Determination

Uploaded by

Sample Size Determination

Wakgari Deressa, PhD

Sample size = Precision = Cost

Where d = e in some text books

• Use estimates of p1, p2 or (or p1=p2 =0.5 if

where n0 is the sample from an infinite population. When N is

• Using power of a study to determine

• Aim: Have large enough samples to

Significance level of a test = α = Type I error

• Type II error () = The probability of not

The means and variances of the two respective groups

• We would need 142 individuals in each group

• We would need approximately 329

Note: This formula is quite general, and applies to cross-sectional,

You might also like