0% found this document useful (0 votes)

6 views

Biostatistics 2b statistical testing theory

The lecture focuses on statistical testing theory, discussing three tests: the binomial test, normal approximation of the binomial test, and one sample t-test, along with confidence intervals for binomial and normal distributions. It introduces the concepts of null and alternative hypotheses, type I and II errors, significance levels, and the eight steps of conducting a statistical test. Additionally, it covers the calculation of critical values, P-values, and the power of tests, illustrating these concepts through a practical example involving a new medicine's efficacy compared to an old one.

Uploaded by

mijnspammail11223

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

6 views

Biostatistics 2b statistical testing theory

Uploaded by

mijnspammail11223

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 18

Lecture notes of Biostatistics:

Lecture 𝟐𝒃 Statistical testing theory

In this lecture we focus on the concepts of statistical testing theory.

We consider three tests:

The ‘binomial’ test,

The normal approximation of the binomial test,

The one sample t-test,

Furthermore we introduce confidence intervals for 𝑝 in case of a binomial distribution

and µ in case of a normal distribution.

1
First concepts of statistical testing theory (1)
Imagine some medicine has been developed to cure patients having some disease 𝐷. The
medicine has to replace on old medicine. It is known that the old medicine is successful in
60% of all cases.

The new medicine is distributed to 200 patients and 145 are cured by means of the new
medicine.

Can we prove that the new medicine is better than the old one?

Problem:

We would like to draw a conclusion about

𝑝 = the probability that a patient is cured by the new medicine (is unknown)
145
The outcome of the estimator 𝑝̂ = 𝑋⁄𝑛 is = 72.5% , this is larger than 60% but is not free
200
from chance. Can we really conclude that the true probability 𝑝 is larger than 60%?

In statistical testing theory we define a null hypothesis and an alternative hypothesis.

For our problem we define:

Null hypothesis 𝐻0 : 𝑝 = 0.60

Alternative hypothesis 𝐻1 : 𝑝 > 0.60

Our approach:

We first assume that the null hypothesis is true.

We want to prove the alternative hypothesis.

If we carry out a statistical test then we shall take one of the following two actions:

(1) We reject the null hypothesis, which means that the alternative hypothesis is proven,
(2) We don’t reject the null hypothesis, which means that we did not succeed to prove
the alternative hypothesis.

2
First concepts of statistical testing theory (2)

Doing statistics is never free from errors. Possible errors in testing theory are:

The error of the first kind (type I error): rejecting the null hypothesis if the null hypothesis is
true.

(This is the worst error we can make.)

The error of the second kind (type II error): not rejecting the null hypothesis if the null
hypothesis is false.

If we don’t reject the null hypothesis, we can also say that we accept the null hypothesis.

The decision of the rejection of 𝐻0 is determined for a large part by the significance level 𝛼.

We require that the probability of the error of the first kind is at most 𝛼.

Often one chooses 𝛼 = 5% or 𝛼 = 1%.

The action to reject the null hypothesis will be based on the test statistic, which determines
the statistical test for the major part.

3
The binomial test

For our example the situation is just simple.

We only observe: 𝑋 = the number of cured patients (by the new medicine)

So our test statistic has to be 𝑋.

Assuming a common probability 𝑝 for curing for all patients and independence between
patients a binomial situation arises with the consequence that

𝑋 has the binomial distribution with 𝑛 = 200 and unknown 𝑝 .

We are testing the null hypothesis 𝐻0 : 𝑝 = 0.60 against the alternative hypothesis 𝐻1 : 𝑝 >
0.60 .

It is natural to reject the null hypothesis for large values of 𝑋, say if 𝑋 ≥ 𝑐 for some ‘critical
value’ 𝑐 because if 𝑝 > 0.60 then larger values of 𝑋 get higher probabilities.

The number 𝑐 is determined by the significance level 𝛼.

Let us take 𝛼 = 5%.

Assuming the null hypothesis 𝑋 has the binomial distribution with 𝑛 = 200 and 𝑝 = 0.60.

We select the number 𝑐 such that (1) 𝑃(𝑋 ≥ 𝑐) ≤ 0.05, and (2) 𝑃(𝑋 ≥ 𝑐) approximates 0.05
as close as possible.

We use Excel to determine the number 𝑐:

Cumulative probabilities 𝑃(𝑋 ≤ 𝑥) can be computed by means of the statistical function

BINOM.DIST.

We find for the binomial distribution with 𝑛 = 200 and 𝑝 = 0.60:

𝑃(𝑋 ≥ 131) = 1 − 𝑃(𝑋 ≤ 130) = 1 − 0.936 = 0.064

𝑃(𝑋 ≥ 132) = 1 − 𝑃(𝑋 ≤ 131) = 1 − 0.953 = 0.047

So here we take 𝑐 = 132, we reject the null hypothesis if 𝑋 ≥ 132.

Note this procedure can be established before 𝑋 is realized (before the experiment is carried
out).

4
The eight steps of a statistical test
Besides the choice of 𝛼 doing a statistical test can be described by eight steps:

1. Formulate the assumptions of your model.

2. State the null hypothesis 𝐻0 and alternative hypothesis 𝐻1 .
3. Give the appropriate test statistic.
4. Give the distribution of the test statistic in case of 𝐻0 .
5. Find / compute the outcome of the test statistic.
6. a. Determine critical value(s) , or b. find/compute the P-value.
7. Decide whether you reject the null hypothesis
8. Formulate your conclusions (in ordinary language, avoiding mathematics).

The P-value will be introduced later on.

Elaboration for our binomial test.

(𝑋 = the number of cured patients by the new medicine )

1. 𝑋 has the binomial distribution with 𝑛 = 200 and unknown success probability
𝑝,
2. We test 𝐻0 : 𝑝 = 0.60 against 𝐻1 : 𝑝 > 0.60,
3. Test Statistic: 𝑋
4. Under 𝐻0 : 𝑋 has the binomial distribution with 𝑛 = 200 and 𝑝 = 0.60
5. Outcome of 𝑋: 145
6. We reject the null hypothesis if 𝑋 ≥ 132 (see earlier calculations)
7. As 𝑋 = 145 we reject the null hypothesis
8. We have proven that the probability that patients are cured by means of the
new medicine is larger than 60% (is thus better than the old one).

Remark:

If the experiment gave 𝑋 = 125, then we did not reject the null hypothesis. Then we did not
prove the alternative hypothesis of a higher probability (higher than 60%) for the new
medicine.

5
The power of this binomial test
This binomial test can be seen as a procedure for taking some action regarding
proving the alternative hypothesis.

Working with significance level 𝛼 = 5% means that there is some risk of at most 5%
for stating that the alternative hypothesis is true given a true null hypothesis (error of
the first kind).

At the contrary, suppose that the null hypothesis is not true. Imagine that the true
probability 𝑝 is given by 𝑝 = 0.70 (hence 𝐻1 is true).

What is now the probability of rejecting the null hypothesis (the right action now).

We have to calculate the probability

𝑃(𝑟𝑒𝑗𝑒𝑐𝑡 𝑡ℎ𝑒 𝑛𝑢𝑙𝑙 ℎ𝑦𝑝𝑜𝑡ℎ𝑒𝑠𝑖𝑠) = 𝑃(𝑋 ≥ 132) if 𝑋~𝐵(200, 0.70)

(𝑋~𝐵(200, 0.70) : 𝑋 has the binomial distribution with 𝑛 = 200 and 𝑝 = 0.70.)

This probability is called the power of the test for 𝑝 = 0.70 .

Using Excel we can compute it rather quickly:

𝑃(𝑋 ≥ 132) = 1 − 𝑃(𝑋 ≤ 131) = 1 − 0.096 = 90.4%

This means that if the true probability of being cured is 70% then then the null
hypothesis is rejected with probability 90.4% in favor of the alternative hypothesis.

6
One sided and two sided tests

In our example we tested 𝐻0 : 𝑝 = 0.60 against 𝐻1 : 𝑝 > 0.60, we reject the null hypothesis if
𝑋 ≥ 𝑐.

If we tested (for some reason) 𝐻0 : 𝑝 = 0.60 against 𝐻1 : 𝑝 < 0.60 then we would reject the null
hypothesis if 𝑋 ≤ 𝑐. The critical value 𝑐 can be determined using the significance level 𝛼.

These tests are called one sided tests.

If we test 𝐻0 : 𝑝 = 0.60 against 𝐻1 : 𝑝 ≠ 0.60 then we have to reject the null hypothesis if 𝑋 ≤
𝑐1 or 𝑋 ≥ 𝑐2 . The critical values are determined by

𝑃(𝑋 ≤ 𝑐1 ) ≤ 𝛼 ⁄2 and 𝑃(𝑋 ≥ 𝑐2 ) ≤ 𝛼 ⁄2 using the binomial distribution with 𝑛 = 200 and 𝑝 =
0.60, select the values 𝑐1 and 𝑐2 such that the probabilities 𝑃(𝑋 ≤ 𝑐1 ) and 𝑃(𝑋 ≥ 𝑐2 )
approximate 𝛼 ⁄2 in the best way.

This is called a two sided test.

7
P-values (of the binomial test)

In our example we have 𝑋 = 145 where we rejected the null hypothesis if 𝑋 ≥ 𝑐.

In this case the P-value is the probability 𝑃(𝑋 ≥ 145) , computed according to the
𝐵(200, 0.60)-distribution (the distribution of the test statistic under 𝐻0 ).

Using Excel we get: 𝑃(𝑋 ≥ 145) = 1 − 𝑃(𝑋 ≤ 144) = 1 − 0.99985 ≈ 0.000

The general rule for P-values is: Reject the null hypothesis if P-value ≤ 𝜶.

(Here events 𝑋 ≥ 𝑐 and P-value ≤ 𝛼 are equivalent.)

For our example: we again reject the null hypothesis since the P-value is (much) smaller than
𝛼 = 5%.

If we tested 𝐻0 : 𝑝 = 0.60 against 𝐻1 : 𝑝 < 0.60 then we reject the null hypothesis if 𝑋 ≤ 𝑐 and
the P-value would be 𝑃(𝑋 ≤ 145), computed with 𝐵(200, 0.6).

Finally, if we test 𝐻0 : 𝑝 = 0.60 against 𝐻1 : 𝑝 ≠ 0.60 then the two sided P-value is as follows:

Compute (in this case) 𝑃(𝑋 ≤ 145) and 𝑃(𝑋 ≥ 145), according to 𝐵(200, 0.60).

Take the smallest probability and multiply with 2.

This rule is invented to maintain the rule ‘Reject the null hypothesis if P-value ≤ 𝜶.’

Here 𝑃(𝑋 ≥ 145) = 0.00015 is the smallest probability, the two sided P-value would be

2 × 0.00015 = 0.00030 ≈ 0.000 .

8
The normal approximation of the binomial test
This is, as a matter of fact, a second version of the previous test. Note that the binomial
distribution can be approximated by a normal distribution if 𝑛𝑝 ≥ 5 and 𝑛(1 − 𝑝) ≥ 5.

For our example the distribution of the test statistic is the 𝐵(200, 0.60) distribution which can
be approximated by the normal distribution with µ = 𝑛𝑝 = 120 and 𝜎 = √𝑛𝑝(1 − 𝑝) = 6.928.

Hence 𝑍 = (𝑋 − µ)⁄𝜎 = (𝑋 − 120)⁄6.928 has a standard normal distribution if 𝑝 = 0.60

Instead of 𝑋 we may use 𝑍 as test statistic.

The eight steps of the normal approximation of the binomial test are as follows:

(1) 𝑋 has the binomial distribution with 𝑛 = 200 and unknown success probability 𝑝
(2) We test 𝐻0 : 𝑝 = 0.60 against 𝐻1 : 𝑝 > 0.60
(3) Test statistic: 𝑍 = (𝑋 − µ)⁄𝜎 = (𝑋 − 120)⁄6.928
(4) Under 𝐻0 : 𝑍 ~ 𝑁(0,1)
145−120
(5) Outcome of 𝑍: 𝑍 = 6.928
= 3.61
(6) We reject if 𝑍 ≥ 𝑐, 𝛼 = 5% and standard normal table: 𝑐 = 1.645
(7) As 𝑍 = 3.61 we reject the null hypothesis
(8) We have proven that the probability that patients are cured by means of the
new medicine is larger than 60% (the new one is thus better than the old one).

Remarks:

We applied the normal approximation of the binomial distribution without additional

correction.

The P-value is the probability 𝑃(𝑍 ≥ 3.61) computed according to 𝑁(0,1).

We get: 𝑃(𝑍 ≥ 3.61) = 1 − 𝑃(𝑍 < 3.61) = 1 − 0.9998 = 0.0002 ≈ 0.000

9
Again the power for 𝒑 = 𝟎. 𝟕𝟎
Let us calculate the probability 𝑃(𝑟𝑒𝑗𝑒𝑐𝑡 𝑡ℎ𝑒 𝑛𝑢𝑙𝑙 ℎ𝑦𝑝𝑜𝑡ℎ𝑒𝑠𝑖𝑠) = 𝑃(𝑍 ≥ 1.645) for 𝑝 = 0.70 for
the normal approximation of the binomial test.

Note that in case of 𝑝 = 0.70 test statistic 𝑍 does not have a standard normal distribution but
𝑋 has approximately a normal distribution with µ = 𝑛𝑝 = 140 and 𝜎 = √𝑛𝑝(1 − 𝑝) = 6.481,
hence

𝑍2 = (𝑋 − µ)⁄𝜎 = (𝑋 − 140)⁄6.481 does have a standard normal distribution.

Since
𝑋−120 𝑋−140+140−120 6.481
𝑍= = × = (𝑍2 + 3.086) × 0.9355
6.928 6.481 6.928

We have:

𝑃(𝑍 ≥ 1.645) = 𝑃((𝑍2 + 3.086) × 0.9355 ≥ 1.645) = ⋯ = 𝑃(𝑍2 ≥ −1.33)

= 𝑃(𝑍2 ≤ 1.33) = 0.9082 = 90.8%

Which approximates the previous calculated power.

Power calculations often are carried out for finding a minimal number 𝑛 that satisfies a
desired level of the power.

Suppose we want to raise the power for 𝑝 = 0.70. We want to have power 95% for 𝑝 = 0.70

For arbitrary sample size 𝑛 the test statistic of the normal approximation is

𝑍 = (𝑋 − 0.60 × 𝑛)⁄√0.24 × 𝑛) and we reject 𝐻0 if 𝑍 ≥ 1.645 for 𝛼 = 5%.

If 𝑝 = 0.70 then 𝑍2 = (𝑋 − 0.70𝑛)⁄√0.21 × 𝑛 has the standard normal distribution instead of

𝑍.

(𝑋−060𝑛) (𝑋−0.70𝑛+0.70𝑛−0.60𝑛) √0.21𝑛

As 𝑍 = = × = (𝑍2 + 0.2182√𝑛) × 0.9355
√0.24𝑛 √0.21𝑛 √0.24𝑛

we get

𝑃(𝑍 ≥ 1.645) = 𝑃(𝑍2 + 0.2182√𝑛 ≥ 1.758) = 𝑃(𝑍2 ≥ 1.758 − 0.2182√𝑛 ) = 0.95

If we use the standard normal distribution in the inverse way we get

1.758+1.645
1.758 − 0.2182√𝑛 = −1.645 hence take √𝑛 = 0.2182
= 15.60, thus 𝑛 = 243.2,

take 𝑛 = 244.

10
The one sample t-test (1)
Suppose a group of 𝑛 patients which have a high score for systolic blood pressure.

Each patients receives a treatment to lower the systolic blood pressure.

For each patient the (systolic) blood pressure before and the blood pressure after the
treatment have been measured.

We define: 𝑋 = blood pressure before treatment minus blood pressure after treatment

Assume the variable 𝑋 is normally distributed and that the data can be summarized as
follows:

𝑛 = 25, 𝑋̅ = 5.32, 𝑆 = √∑𝑖(𝑋𝑖 − 𝑋̅)2 /(𝑛 − 1) = 4.74 (fictive data)

(Methods for checking a normal distribution are postponed to lecture 4.)

We want to answer the following question using statistical testing theory:

Does the treatment really reduce the (systolic) blood pressure?

We assume that 𝑋 has a normal distribution with expectation µ and standard deviation 𝜎.

The stochastic variable 𝑋 denotes the individual blood pressure reduction.

So µ denotes the average blood pressure reduction for the population of patients (long run
average).

It is therefore natural to test 𝐻0 : µ = 0 against 𝐻1 : µ > 0 .

The null hypothesis states that there is no blood pressure reduction on the average, and we
want to prove that there exists some blood pressure reduction.

11
The one sample t-test (2)

Searching for a testing statistic

Because 𝐻0 and 𝐻1 are statements about the expectation µ the corresponding estimator 𝑋̅ =
(𝑋1 + 𝑋2 + ⋯ + 𝑋𝑛 )⁄𝑛 should play a dominant role in this testing problem.

We know: 𝑋̅ has a normal distribution with expectation µ and standard deviation 𝜎⁄√𝑛 .

(𝑋̅−µ)
From standard theory we get furthermore: 𝑍= ~ 𝑁(0,1)
𝜎 ⁄ √𝑛

In case of independent stochastic variables 𝑋𝑖 ~ 𝑁(µ, 𝜎 2 ), one can prove the following result:

(𝑋̅−µ)
has a t-distribution with 𝑛 − 1 degrees of freedom.
𝑆 ⁄ √𝑛

The shape of the t-distributions resemble the shape of the 𝑁(0,1)-distribution, they are again
symmetric around 0 but the tails are thicker.

Note: 𝑃( −1.96 < 𝑍 < 1.96) = 0.95

Applying the table of Student’s t-distribution we get:

(𝑋̅ −µ)
𝑃 (−2.064 < < 2.064) = 0.95 as 𝑛 − 1 = 24
𝑆 ⁄√ 𝑛

This kind of calculations are necessary for performing tests or constructing confidence
intervals, here the difference between 2.064 and 1.96 is caused by estimation of 𝜎 which is
unknown as well.

(𝑋̅−µ)
The theoretical result about renders the test statistic.
𝑆 ⁄ √𝑛
𝑋̅
Note that under 𝐻0 the expectation µ disappears, we get 𝑇 = 𝑆⁄ which is observable
√𝑛
and hence suitable for being a test statistic.
𝑋̅
Under 𝐻0 the test statistic 𝑇 = 𝑆⁄ has a t-distribution with 𝑛 − 1 degrees of freedom
√𝑛
(short notation 𝑑𝑓 = 𝑛 − 1).

12
The one sample t-test (3)
We are now ready to do the eight steps of the one sample t-test for our example. We choose
𝛼 = 5% .

(1) The individual blood pressure reductions are independent and normally distributed
with expectation µ and standard deviation 𝜎 .
(2) We test 𝐻0 : µ = 0 against 𝐻1 : µ > 0.
𝑋̅
(3) Test statistic: 𝑇 =
𝑆⁄√𝑛
(4) Under 𝐻0 : 𝑇 has the t-distribution with 𝑛 − 1 = 24 degrees of freedom
5.32
(5) Outcome of 𝑇: 𝑇 = = 5.61
4.74/√25
(6) We reject the null hypothesis if 𝑇 ≥ 𝑐.
t-table, 𝛼 = 5% : 𝑐 = 1.711
(7) As 𝑇 = 5.61 we reject the null hypothesis.
(8) We conclude the treatment is successful in reducing the blood pressure.

Remarks:

The one sided P-value is here 𝑃(𝑇 ≥ 5.61), calculated according the t-distribution
with 𝑑𝑓 = 24. From the t-table we conclude that this P-value is smaller than 0.0005
(since 5.61 is larger than 3.745).

If (for some reason) we tested 𝐻0 : µ = 0 against 𝐻1 : µ ≠ 0 we reject the null hypothesis if

𝑇 ≤ −2.064 or 𝑇 ≥ 2.064 (use the column 𝑡0.025 ).

The two sided P-value here is 2 × 𝑃(𝑇 ≥ 5.61) = 𝑃(𝑇 ≤ −5.61) + 𝑃(𝑇 ≥ 5.61).

𝑋̅ −5
If e.g. we test 𝐻0 : µ = 5 against 𝐻1 : µ > 5 then the test statistic becomes 𝑇 = , under
𝑆 ⁄√ 𝑛
𝐻0 the new test statistic 𝑇 has again the t-distribution with 𝑛 − 1 = 24 degrees of
freedom.

13
Confidence intervals for µ (1)
Applying the t-test we concluded that the treatment has some effect on the blood
pressure, there is some blood pressure reduction.

Note that the t-test does not give information about the size of the effect.

The estimator 𝑋̅ gives this information, but the inaccuracy of estimation should
expressed as well.

The appropriate way of expressing inaccuracy of estimation is constructing

confidence intervals.

A 95% confidence interval for µ is an interval (𝐿, 𝑅) such that

𝑃(𝐿 < µ < 𝑅) = 0.95

holds. Instead of the confidence level 95% one may choose other levels, e.g. 99% or
90% etc.

The boundaries 𝐿 and 𝑅 have to be statistics which can be computed from the
sample. For the construction of the confidence interval we need again:

(𝑋̅−µ)
has a t-distribution with 𝑛 − 1 degrees of freedom (here: 𝑛 − 1 = 24)
𝑆 ⁄ √𝑛
From the table of the t-distribution we can conclude that the following event occurs with
probability 95%:

(𝑋̅−µ)
−2.064 < < 2.064
𝑆 ⁄√ 𝑛

The next inequalities are equivalent:

𝑆 𝑆
−2.064 × < 𝑋̅ − µ < 2.064 ×
√𝑛 √𝑛

𝑆 𝑆
2.064 × > − 𝑋̅ + µ > −2.064 ×
√ 𝑛 √𝑛

𝑆 𝑆
𝑋̅ − 2.064 × < µ < 𝑋̅ + 2.064 ×
√𝑛 √𝑛

𝑆 𝑆
So we should take: 𝐿 = 𝑋̅ − 2.064 × and 𝑅 = 𝑋̅ + 2.064 ×
√𝑛 √𝑛

14
Confidence intervals for µ (2)

In general the boundaries of a confidence interval for µ are given by:

𝑆
𝑋̅ ± 𝑐
√𝑛

where 𝑐 depends on the confidence level (and the degrees of freedom)

𝑆
Note that is the standard error of the mean, it is 𝑠𝑒(𝑋̅).
√𝑛

For our example we get:

𝑆 4.74
2.064 × = 2.064 × = 1.96
√ 𝑛 5

The 95% confidence interval for µ is therefore (5.32 − 1.96, 5.32 + 1.96) =
(3.36, 7.28).

We are 95% confident that average reduction of blood pressure (population mean) is
lying between 3.36 and 7.28 .

15
Confidence intervals for p
In a similar way confidence intervals for 𝑝 can be constructed if we observe a count 𝑋
that has a binomial distribution with certain 𝑛 and unknown success probability 𝑝.

We only consider large 𝑛 such that we can apply the normal approximation of the
binomial distribution.

So our starting point is that the distribution of the count 𝑋 is approximately the normal
distribution with expectation µ = 𝑛𝑝 and standard deviation 𝜎 = √𝑛𝑝(1 − 𝑝).

Hence
𝑋−𝑛𝑝 𝑝̂−𝑝 𝑋
𝑍= = has the standard normal distribution with 𝑝̂ = 𝑛
√𝑛𝑝(1−𝑝) √𝑝(1−𝑝)/𝑛

It can established furthermore (beyond the scope of this course):

𝑝̂−𝑝
𝑍2 = has the standard normal distribution as well.
√𝑝̂(1−𝑝̂)/𝑛

Note: 𝑆𝐷(𝑝̂ ) = √𝑝(1 − 𝑝)/𝑛 and hence 𝑠𝑒(𝑝̂ ) = √𝑝̂ (1 − 𝑝̂ )/𝑛.

So we conclude:

(𝑝̂ − 𝑝)⁄𝑠𝑒(𝑝̂ ) has the standard normal distribution.

Applying the standard normal distribution we get that the next event has probability 95%:

−1.96 < (𝑝̂ − 𝑝)⁄𝑠𝑒(𝑝̂ ) < 1.96

The following inequalities are equivalent:

−1.96 × 𝑠𝑒(𝑝̂ ) < 𝑝̂ − 𝑝 < 1.96 × 𝑠𝑒(𝑝̂ )

1.96 × 𝑠𝑒(𝑝̂ ) > − 𝑝̂ + 𝑝 > −1.96 × 𝑠𝑒(𝑝̂ )

𝑝̂ − 1.96 × 𝑠𝑒(𝑝̂ ) < 𝑝 < 𝑝̂ + 1.96 × 𝑠𝑒(𝑝̂ )

So the boundaries of the 95% confidence interval are given by: 𝑝̂ ± 1.96 × 𝑠𝑒(𝑝̂ )

In general: 𝑝̂ ± 𝑐 × 𝑠𝑒(𝑝̂ ) where 𝑐 depends on the confidence level.

145
If 𝑝̂ = 200 = 0.725 and 𝑛 = 200 we get 95% confidence interval (0.663, 0.787), then
we are 95% confident that the true 𝑝 is lying between 66.3% and 78.7%.

16
Assignment of lectures 𝟐𝒂 and 𝟐𝒃 (CLT and testing theory)
Send your solutions by mail. Use a Word file with your typed solutions, or a Word file
converted to a pdf-file. In case of handwritten solutions: collect your handwritten pages in
one Word file or one pdf-file.

Exercise 1
Consider random numbers 𝑋𝑖 that are independent and all have the uniform distribution on
the interval (0,1). We study the sample mean 𝑋̅ = (𝑋1 + 𝑋2 + ⋯ + 𝑋𝑛 )⁄𝑛.
𝑎.
For the sample sizes 𝑛 = 100, 𝑛 = 1000, 𝑛 = 10 000 and 𝑛 = 100 000 approximate the
probability 𝑃(𝑋̅ ≤ 0.508) using the central limit theorem (CLT).
𝑏.
The probability 𝑃(𝑋̅ ≤ 0.508) is an increasing function of 𝑛. Check whether your calculations
are in agreement with this statement and explain why this statement is true.

Exercise 2
A certain tennis player makes a successful first serve 83% of the time. Assume that each
serve is independent of the others. Suppose the tennis player serves 100 times in an match.
Define 𝑋 = number of good first serves.
Use the normal approximation of the binomial distribution to compute/approximate the
following probabilities:
𝑎. 𝑃(𝑋 > 90)
𝑏. 𝑃(𝑋 ≤ 75)
𝑐. 𝑃(75 ≤ 𝑋 < 85)

Exercise 3
The waiting time 𝑋 of patients in some hospital has an exponential distribution with (long run)
average 𝜇 = 14 (unit is minute).
𝑎.
Calculate the probability 𝑃(𝑋 > 20), the probability that an arbitrary patient has to wait more
than 20 minutes.
𝑏.
Consider the total waiting time 𝑇 = 𝑋1 + 𝑋2 + ⋯ + 𝑋70 of 70 patients and assume that the 70
waiting times are independent and exponentially distributed with expectation 14.
Approximate the probability that the total waiting time exceeds 1000 minutes.

Exercise 4
Let us return to the binomial test of the lecture notes. We observe
𝑋 = the number of cured patients by the new medicine
which has the binomial distribution with 𝑛 = 200 and unknown 𝑝,
we test 𝐻0 : 𝑝 = 0.60 against 𝐻1 : 𝑝 > 0.60
and we reject the null hypothesis if 𝑋 ≥ 𝑐 . In this exercise we choose 𝜶 = 𝟐%.
𝑎.
Use Excel and the statistical function BINOM.DIST to determine 𝑐.

17
𝑏.
Calculate the probability 𝑃(𝑋 ≥ 𝑐) for 𝑝 = 0.65 using Excel and BINOM.DIST. This is the
power of the test for 𝑝 = 0.65.
𝑐.
Calculate the probability 𝑃(𝑋 ≥ 𝑐) for 𝑝 = 0.70, 𝑝 = 75, … and sketch the graph of the
probability 𝑃(𝑋 ≥ 𝑐), this is the power of the test as function of 𝑝 > 0.60. You should see a
curve/function that increases.
𝑑.
Imagine that we change the value of 𝑛. We take 𝑛 = 500 instead of 𝑛 = 200. The graph of
the power will change as well. Indicate in which way the graph of the power will change and
explain why.

Exercise 5
Let us now consider the one sample t-test of the lecture notes. We study a group of 𝑛
patients.
We assume that the individual blood pressure reductions 𝑋1 , 𝑋2 , … , 𝑋𝑛 are independent and
are all normally distributed with expectation µ and standard deviation 𝜎.
𝑋̅
We test 𝐻0 : µ = 0 against 𝐻1 : µ > 0 using test statistic 𝑇 = 𝑆⁄ 𝑛. We choose 𝛼 = 𝟏%.
√
Suppose that 𝑋̅ = 0.79 and 𝑆 = 2.70 summarize the data.
𝑎.
Determine the critical value 𝑐 and determine whether we have to reject the null hypothesis in
case of 𝑛 = 25, 𝑛 = 50 and 𝑛 = 100 .
𝑏.
For ‘power calculations’ the t-distribution of the test statistic is approximated by the standard
normal distribution. This can be motivated by the fact that for large 𝑛 the difference between
a t-distribution and the standard normal distribution is rather small.
For large 𝑛 we reject then the null hypothesis if 𝑇 ≥ 2.33 (verify this).

We use the following theoretical result:

For large 𝑛 the test statistic has approximately the normal distribution with
µ
expectation ⁄ = (µ⁄𝜎) × √𝑛 and standard deviation 1.
𝜎 √𝑛
Given a fixed value for 𝑛 we can thus calculate probabilities 𝑃(𝑇 ≥ 2.33) for values of µ⁄𝜎.
Here the parameter µ⁄𝜎 is the average blood pressure reduction in units of the standard
deviation 𝜎 . The probability 𝑃(𝑇 ≥ 2.33) is then the power of the test as function of the
parameter µ⁄𝜎.

Determine the minimal value for 𝑛 such that the power 𝑃(𝑇 ≥ 2.33) is equal to 0.95 for
µ⁄𝜎 = 0.5.

Exercises Ch9 Hypo Tests Mean Proportions
No ratings yet
Exercises Ch9 Hypo Tests Mean Proportions
21 pages
Hypothesis Testing
100% (1)
Hypothesis Testing
60 pages
Y. Conducting Small Sample Tests About A Population Mean Myu PDF
No ratings yet
Y. Conducting Small Sample Tests About A Population Mean Myu PDF
34 pages
AE9-FINAL-MODULE
No ratings yet
AE9-FINAL-MODULE
33 pages
Hypothesis Testing-2 PDF
No ratings yet
Hypothesis Testing-2 PDF
16 pages
1 Vocab Reasoning
No ratings yet
1 Vocab Reasoning
3 pages
04 Hypothesis Testing IITB PDF
No ratings yet
04 Hypothesis Testing IITB PDF
33 pages
chapter 7 hypothesis testing and sample size determination _2
No ratings yet
chapter 7 hypothesis testing and sample size determination _2
69 pages
Inferential Statistics FWACP_035611
No ratings yet
Inferential Statistics FWACP_035611
54 pages
Non_parametric_ranked data
No ratings yet
Non_parametric_ranked data
53 pages
Hypothesis Testing
No ratings yet
Hypothesis Testing
64 pages
W7 Lecture7
No ratings yet
W7 Lecture7
19 pages
Lecture 39 - Hypothesis Testing
No ratings yet
Lecture 39 - Hypothesis Testing
26 pages
MAS250 Ch.08 Solutions
No ratings yet
MAS250 Ch.08 Solutions
5 pages
(10)HT_proportion
No ratings yet
(10)HT_proportion
12 pages
Hypothesis Testing For Binomial Distribution
No ratings yet
Hypothesis Testing For Binomial Distribution
2 pages
Statistics I: Hypothesis Testing, Part II
No ratings yet
Statistics I: Hypothesis Testing, Part II
27 pages
Large Sample
No ratings yet
Large Sample
3 pages
(9)HT_mean
No ratings yet
(9)HT_mean
46 pages
05-Hypothesis Testing T-Test (1) - 54
No ratings yet
05-Hypothesis Testing T-Test (1) - 54
56 pages
bbbbbb
No ratings yet
bbbbbb
6 pages
8.hypo Testing....
No ratings yet
8.hypo Testing....
44 pages
Computational Data Science - Unit 4
No ratings yet
Computational Data Science - Unit 4
18 pages
8.hypothesis testing (2)
No ratings yet
8.hypothesis testing (2)
43 pages
L7-Hypothesis Testing
No ratings yet
L7-Hypothesis Testing
44 pages
Statistical Hypothesis Testing Yp G: Null Hypothesis Null Hypothesis
No ratings yet
Statistical Hypothesis Testing Yp G: Null Hypothesis Null Hypothesis
34 pages
Biostat Hypothesis Testing
No ratings yet
Biostat Hypothesis Testing
67 pages
Statistical Inferences
No ratings yet
Statistical Inferences
46 pages
Week 6 Hypothesis Testing Proportions 1-Sample
No ratings yet
Week 6 Hypothesis Testing Proportions 1-Sample
5 pages
Overview of Hypothesis Testing: Laura Lee Johnson, PH.D
No ratings yet
Overview of Hypothesis Testing: Laura Lee Johnson, PH.D
71 pages
7.Hypothesis testing and Sample size determination
No ratings yet
7.Hypothesis testing and Sample size determination
60 pages
B.Sc. (Hons.) Biotechnology Core Course 13: Basics of Bioinformatics and Biostatistics (BIOT 3013) Biostatistics (BIOT 3013)
No ratings yet
B.Sc. (Hons.) Biotechnology Core Course 13: Basics of Bioinformatics and Biostatistics (BIOT 3013) Biostatistics (BIOT 3013)
29 pages
Hypothesis Test
100% (1)
Hypothesis Test
52 pages
Mas S Mohktar Email: Mas - Dayana@um - Edu.my Phone (Office) : 0379677681
No ratings yet
Mas S Mohktar Email: Mas - Dayana@um - Edu.my Phone (Office) : 0379677681
22 pages
Chapter 4 Lesson 3: Estimating Population Proportion (P) For The Large Sample Size
No ratings yet
Chapter 4 Lesson 3: Estimating Population Proportion (P) For The Large Sample Size
15 pages
DataScience Interview Master Doc
No ratings yet
DataScience Interview Master Doc
120 pages
06_Testing of Hypothesis
No ratings yet
06_Testing of Hypothesis
24 pages
STAB22 Final Exam Review Seminar (WINTER 2021)
No ratings yet
STAB22 Final Exam Review Seminar (WINTER 2021)
65 pages
m09-inference
No ratings yet
m09-inference
20 pages
Hypothesis Testing Ug
No ratings yet
Hypothesis Testing Ug
66 pages
Two Sample Updated Test
No ratings yet
Two Sample Updated Test
35 pages
Eda Research
No ratings yet
Eda Research
11 pages
Test of Hypothesis For 2020
100% (1)
Test of Hypothesis For 2020
62 pages
Lec6 Hypothesis Testing
No ratings yet
Lec6 Hypothesis Testing
8 pages
Chapter Five Hypothesis Testing
No ratings yet
Chapter Five Hypothesis Testing
50 pages
Lec 20 - Testing For One Proportion
No ratings yet
Lec 20 - Testing For One Proportion
12 pages
Chapter 6
No ratings yet
Chapter 6
47 pages
inference
No ratings yet
inference
7 pages
Learning Module - Statistics and Probability
No ratings yet
Learning Module - Statistics and Probability
71 pages
04 Hypothesis Testing
No ratings yet
04 Hypothesis Testing
28 pages
Lecture BDS 8-23-24 Print
No ratings yet
Lecture BDS 8-23-24 Print
11 pages
S244 18 Non Parametric Statistical Techniques
No ratings yet
S244 18 Non Parametric Statistical Techniques
117 pages
mxxssx
No ratings yet
mxxssx
3 pages
Lab 8 - Sampling Techniques 1
No ratings yet
Lab 8 - Sampling Techniques 1
43 pages
03 Fact Sheet HME712 Bos - 3 General Principles of Hypothesis Testing
No ratings yet
03 Fact Sheet HME712 Bos - 3 General Principles of Hypothesis Testing
2 pages
X 24
No ratings yet
X 24
10 pages
Chi Squared for Beginners
From Everand
Chi Squared for Beginners
Stephanie Glen
No ratings yet
BAYES Theorem
From Everand
BAYES Theorem
Jeffery Short
2/5 (5)
Hypothesis Testing: Six Sigma Thinking, #6
From Everand
Hypothesis Testing: Six Sigma Thinking, #6
Sumeet Savant
No ratings yet
Exercises of Statistical Inference
From Everand
Exercises of Statistical Inference
Simone Malacrida
No ratings yet
Proposal Sample
No ratings yet
Proposal Sample
38 pages
Mini-Research_Grammatical Errors in Cebu-Based Online News Articles
No ratings yet
Mini-Research_Grammatical Errors in Cebu-Based Online News Articles
27 pages
Complete Download Essentials of Marketing Research Joseph F. PDF All Chapters
No ratings yet
Complete Download Essentials of Marketing Research Joseph F. PDF All Chapters
55 pages
Research The Journey from Pondering to Publishing 1st Edition Serwan M.J. Baban (Editor) All Chapters Instant Download
100% (11)
Research The Journey from Pondering to Publishing 1st Edition Serwan M.J. Baban (Editor) All Chapters Instant Download
60 pages
Apqp Ybr
No ratings yet
Apqp Ybr
56 pages
John M. Norris and Lourdes Ortega-Research-Synthesis-Div
No ratings yet
John M. Norris and Lourdes Ortega-Research-Synthesis-Div
19 pages
8801-Article Text-18677-1-10-20220801
No ratings yet
8801-Article Text-18677-1-10-20220801
10 pages
Practical Reseach 2 1st Periodical Exam 2024-2025
No ratings yet
Practical Reseach 2 1st Periodical Exam 2024-2025
7 pages
ELT 7 Parts of Language Research
No ratings yet
ELT 7 Parts of Language Research
37 pages
Prokopton Issue 2 2021
No ratings yet
Prokopton Issue 2 2021
117 pages
RM Proposal Components
No ratings yet
RM Proposal Components
73 pages
Primjer Pregleda Literature Kao Seminarskog Rada
No ratings yet
Primjer Pregleda Literature Kao Seminarskog Rada
8 pages
A Professional and Practitioner's Guide To Public ... - (Chapter 6 Qualitative Research Methodologies)
No ratings yet
A Professional and Practitioner's Guide To Public ... - (Chapter 6 Qualitative Research Methodologies)
20 pages
OCM_RES_2 syllabus
No ratings yet
OCM_RES_2 syllabus
7 pages
Practice of Research in Criminology and Criminal Justice 6th Edition Bachman Test Bank - Download Today For Unlimited Reading
100% (2)
Practice of Research in Criminology and Criminal Justice 6th Edition Bachman Test Bank - Download Today For Unlimited Reading
48 pages
2024 ISHS General Maths IA1 PSMT Budget Car - Final-2
No ratings yet
2024 ISHS General Maths IA1 PSMT Budget Car - Final-2
7 pages
WW: - PT: - : Answer Sheet Quarter 4 - Module 5 Statistics and Probability
No ratings yet
WW: - PT: - : Answer Sheet Quarter 4 - Module 5 Statistics and Probability
2 pages
Data Collection Methods and Procedures
No ratings yet
Data Collection Methods and Procedures
23 pages
E-JRA Vol. 08 No. 05 Agustus 2019 Fakultas Ekonomi Dan Bisnis Universitas Islam Malang
No ratings yet
E-JRA Vol. 08 No. 05 Agustus 2019 Fakultas Ekonomi Dan Bisnis Universitas Islam Malang
11 pages
Handbook of Research Methods and Applications in Experimental Economics 1st Edition Arthur Schram - The ebook in PDF format is ready for immediate access
No ratings yet
Handbook of Research Methods and Applications in Experimental Economics 1st Edition Arthur Schram - The ebook in PDF format is ready for immediate access
79 pages
Research Methods Cheat Sheet: by Via
No ratings yet
Research Methods Cheat Sheet: by Via
5 pages
Module Handbook Oct23Cohort
No ratings yet
Module Handbook Oct23Cohort
12 pages
Format For Qualitative Paper 6 1
No ratings yet
Format For Qualitative Paper 6 1
42 pages
Investigatory Project 2
No ratings yet
Investigatory Project 2
4 pages
QUALITATIVE Data Gathering Instruments
No ratings yet
QUALITATIVE Data Gathering Instruments
17 pages
Thesis Title For Public Administration in The Philippines
100% (2)
Thesis Title For Public Administration in The Philippines
8 pages
Ismael
No ratings yet
Ismael
21 pages
Anova 1: Eric Jacobs Hubert Korzilius
No ratings yet
Anova 1: Eric Jacobs Hubert Korzilius
39 pages
Value Proposition As A Framework For Value Cocreation in Crowdfunding Ecosystems
No ratings yet
Value Proposition As A Framework For Value Cocreation in Crowdfunding Ecosystems
17 pages
EVM NCIII 21st Century
No ratings yet
EVM NCIII 21st Century
118 pages