0% found this document useful (0 votes)

121 views

Hypothesis Testing

1. The document describes the procedure for hypothesis testing which involves setting up null and alternative hypotheses, collecting a sample, computing test statistics, and determining whether to reject the null hypothesis. 2. There are three types of alternative hypotheses: upper-tailed tests for increases, lower-tailed tests for decreases, and two-tailed tests for any change. 3. The decision rule for whether to reject the null hypothesis depends on the type of alternative hypothesis, the test statistic, and the level of significance.

Uploaded by

farhaj

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

121 views

Hypothesis Testing

Uploaded by

farhaj

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 13

Hypothesis Testing: Upper-, Lower, and Two

Tailed Tests

The procedure for hypothesis testing is based on the ideas described above. Specifically, we set
up competing hypotheses, select a random sample from the population of interest and compute
summary statistics. We then determine whether the sample data supports the null or alternative
hypotheses. The procedure can be broken down into the following five steps.

Step 1. Set up hypotheses and select the level of significance .

H0: Null hypothesis (no change, no difference);

H1: Research hypothesis (investigator's belief); =0.05

Upper-tailed, Lower-tailed, Two-tailed Tests

The research or alternative hypothesis can take one of three
forms. An investigator might believe that the parameter has
increased, decreased or changed. For example, an investigator
might hypothesize:

1. H1: > 0 , where 0 is the comparator or null value (e.g.,

0 =191 in our example about weight in men in 2006) and
an increase is hypothesized - this type of test is called an
upper-tailed test;

2. H1: < 0 , where a decrease is hypothesized and this is

called a lower-tailed test; or

3. H1: 0, where a difference is hypothesized and this is

called a two-tailed test.

The exact form of the research hypothesis depends on the

investigator's belief about the parameter of interest and
whether it has possibly increased, decreased or is different
from the null value. The research hypothesis is set up by the
investigator before any data are collected.

Step 2. Select the appropriate test statistic.

The test statistic is a single number that summarizes the sample information. An example of a
test statistic is the Z statistic computed as follows:

When the sample size is small, we will use t statistics (just as we did when constructing
confidence intervals for small samples). As we present each scenario, alternative test statistics
are provided along with conditions for their appropriate use.

Step 3. Set up decision rule.

The decision rule is a statement that tells under what circumstances to reject the null hypothesis.
The decision rule is based on specific values of the test statistic (e.g., reject H 0 if Z > 1.645). The
decision rule for a specific test depends on 3 factors: the research or alternative hypothesis, the
test statistic and the level of significance. Each is discussed below.
1. The decision rule depends on whether an upper-tailed, lower-tailed, or two-tailed test is
proposed. In an upper-tailed test the decision rule has investigators reject H 0 if the test
statistic is larger than the critical value. In a lower-tailed test the decision rule has
investigators reject H0 if the test statistic is smaller than the critical value. In a two-tailed
test the decision rule has investigators reject H0 if the test statistic is extreme, either larger
than an upper critical value or smaller than a lower critical value.

2. The exact form of the test statistic is also important in determining the decision rule. If
the test statistic follows the standard normal distribution (Z), then the decision rule will
be based on the standard normal distribution. If the test statistic follows the t distribution,
then the decision rule will be based on the t distribution. The appropriate critical value
will be selected from the t distribution again depending on the specific alternative
hypothesis and the level of significance.

3. The third factor is the level of significance. The level of significance which is selected in
Step 1 (e.g., =0.05) dictates the critical value. For example, in an upper tailed Z test, if
=0.05 then the critical value is Z=1.645.

The following figures illustrate the rejection regions defined by the decision rule for upper-,
lower- and two-tailed Z tests with =0.05. Notice that the rejection regions are in the upper,
lower and both tails of the curves, respectively. The decision rules are written below each figure.

Low
er-
Taile
d
Test

a Z

0. -
Rejection Region for Upper-Tailed Z Test (H1: > 0 ) with
10 1.
=0.05
2
8
The decision rule is: Reject H0 if Z > 1.645.
2

0. -
05 1.
6
4
5

0. -
02 1.
5 9
6
0

0. -
01 2.
0 3
2
6

0. -
00 2.
5 5
7
6

0. -
00 3.
1 0
9
0

0. -
00 3.
01 7
1
9
Uppe
r-
Taile
d
Test

0. 1.
Rejection Region for Lower-Tailed Z Test (H1: < 0 ) with 10 2
=0.05 8
2
The decision rule is: Reject H0 if Z < 1.645.

0. 1.
05 6
4
5

0. 1.
02 9
5 6
0

0. 2.
01 3
0 2
6

0. 2.
00 5
5 7
6
0. 3.
00 0
1 9
0

0. 3.
00 7
01 1
9

Two-
Taile
d
Test

0. 1.
Rejection Region for Two-Tailed Z Test (H1: 0 ) with 20 2
=0.05 8
2
The decision rule is: Reject H0 if Z < -1.960 or if Z > 1.960.

0. 1.
10 6
4
5

0. 1.
05 9
6
0

0. 2.
01 5
0 7
6

0. 3.
00 2
1 9
1

0. 3.
00 8
01 1
9

The complete table of critical values of Z for upper, lower and two-tailed tests can be found in
the table of Z values to the right in "Other Resources."

Critical values of t for upper, lower and two-tailed tests can be found in the table of t values in
"Other Resources."

Step 4. Compute the test statistic.

Here we compute the test statistic by substituting the observed sample data into the test statistic
identified in Step 2.

Step 5. Conclusion.

The final conclusion is made by comparing the test statistic (which is a summary of the
information observed in the sample) to the decision rule. The final conclusion will be either to
reject the null hypothesis (because the sample data are very unlikely if the null hypothesis is true)
or not to reject the null hypothesis (because the sample data are not very unlikely).

If the null hypothesis is rejected, then an exact significance level is computed to describe the
likelihood of observing the sample data assuming that the null hypothesis is true. The exact level
of significance is called the p-value and it will be less than the chosen level of significance if we
reject H0.

Statistical computing packages provide exact p-values as part of their standard output for
hypothesis tests. In fact, when using a statistical computing package, the steps outlined about can
be abbreviated. The hypotheses (step 1) should always be set up in advance of any analysis and
the significance criterion should also be determined (e.g., =0.05). Statistical computing
packages will produce the test statistic (usually reporting the test statistic as t) and a p-value. The
investigator can then determine statistical significance using the following: If p < then reject
H0.
Things to Remember When Interpreting P Values

1. P-values summarize statistical significance and do not address

clinical significance. There are instances where results are both
clinically and statistically significant - and others where they are
one or the other but not both. This is because P-values depend upon
both the magnitude of association and the precision of the estimate
(the sample size). When the sample size is large, results can reach
statistical significance (i.e., small p-value) even when the effect is
small and clinically unimportant. Conversely, with small sample
sizes, results can fail to reach statistical significance yet the effect is
large and potentially clinical important. It is extremely important to
assess both statistical and clinical significance of results.

2. Statistical tests allow us to draw conclusions of significance or not

based on a comparison of the p-value to our selected level of
significance. Remember that this conclusion is based on the
selected level of significance ( ) and could change with a different
level of significance. While =0.05 is standard, a p-value of 0.06
should be examined for clinical importance.

3. When conducting any statistical analysis, there is always a

possibility of an incorrect conclusion. With many statistical
analyses, this possibility is increased. Investigators should only
conduct the statistical analyses (e.g., tests) of interest and not all
possible tests.

4. Many investigators inappropriately believe that the p-value

represents the probability that the null hypothesis is true. P-values
are computed based on the assumption that the null hypothesis is
true. The p-value is the probability that the data could deviate from
the null hypothesis as much as they did or more. Consequently, the
p-value measures the compatibility of the data with the null
hypothesis, not the probability that the null hypothesis is correct.

5. Statistical significance does not take into account the possibility of

bias or confounding - these issues must always be investigated.

6. Evidence-based decision making is important in public health and

in medicine, but decisions are rarely made based on the finding of a
single study. Replication is always important to build a body of
evidence to support findings.
We now use the five-step procedure to test the research hypothesis that the mean weight in men
in 2006 is more than 191 pounds. We will assume the sample data are as follows: n=100,
=197.1 and s=25.6.
Step 1. Set up hypotheses and determine level of significance

H0: = 191 H1: > 191 =0.05

The research hypothesis is that weights have increased, and therefore an upper tailed test is used.

Step 2. Select the appropriate test statistic.

Because the sample size is large (n>30) the appropriate test statistic is

Step 3. Set up decision rule.

In this example, we are performing an upper tailed test (H 1: > 191), with a Z test statistic and
selected =0.05. Reject H0 if Z > 1.645.

Step 4. Compute the test statistic.

We now substitute the sample data into the formula for the test statistic identified in Step 2.

Step 5. Conclusion.

We reject H0 because 2.38 > 1.645. We have statistically significant evidence at a =0.05, to show
that the mean weight in men in 2006 is more than 191 pounds. Because we rejected the null
hypothesis, we now approximate the p-value which is the likelihood of observing the sample
data if the null hypothesis is true. An alternative definition of the p-value is the smallest level of
significance where we can still reject H0. In this example, we observed Z=2.38 and for =0.05,
the critical value was 1.645. Because 2.38 exceeded 1.645 we rejected H 0. In our conclusion we
reported a statistically significant increase in mean weight at a 5% level of significance. Using
the table of critical values for upper tailed tests, we can approximate the p-value. If we select
=0.025, the critical value is 1.96, and we still reject H 0 because 2.38 > 1.960. If we select
=0.010 the critical value is 2.326, and we still reject H 0 because 2.38 > 2.326. However, if we
select =0.005, the critical value is 2.576, and we cannot reject H 0 because 2.38 < 2.576.
Therefore, the smallest where we still reject H 0 is 0.010. This is the p-value. A statistical
computing package would produce a more precise p-value which would be in between 0.005 and
0.010. Here we are approximating the p-value and would report p < 0.010.

Type I and Type II Errors

In all tests of hypothesis, there are two types of errors that can be committed. The first is called a
Type I error and refers to the situation where we incorrectly reject H 0 when in fact it is true. This
is also called a false positive result (as we incorrectly conclude that the research hypothesis is
true when in fact it is not). When we run a test of hypothesis and decide to reject H 0 (e.g.,
because the test statistic exceeds the critical value in an upper tailed test) then either we make a
correct decision because the research hypothesis is true or we commit a Type I error. The
different conclusions are summarized in the table below. Note that we will never know whether
the null hypothesis is really true or false (i.e., we will never know which row of the following
table reflects reality).

Table - Conclusions in Test of Hypothesis

Do Not Reject H0 Reject H0

H0 is True Correct Decision Type I Error

H0 is False Type II Error Correct Decision

In the first step of the hypothesis test, we select a level of significance, , and = P(Type I error).
Because we purposely select a small value for , we control the probability of committing a Type
I error. For example, if we select =0.05, and our test tells us to reject H 0, then there is a 5%
probability that we commit a Type I error. Most investigators are very comfortable with this and
are confident when rejecting H0 that the research hypothesis is true (as it is the more likely
scenario when we reject H0).

When we run a test of hypothesis and decide not to reject H0 (e.g., because the test statistic is
below the critical value in an upper tailed test) then either we make a correct decision because
the null hypothesis is true or we commit a Type II error. Beta () represents the probability of a
Type II error and is defined as follows: =P(Type II error) = P(Do not Reject H 0 | H0 is false).
Unfortunately, we cannot choose to be small (e.g., 0.05) to control the probability of
committing a Type II error because depends on several factors including the sample size, , and
the research hypothesis. When we do not reject H0, it may be very likely that we are committing
a Type II error (i.e., failing to reject H 0 when in fact it is false). Therefore, when tests are run and
the null hypothesis is not rejected we often make a weak concluding statement allowing for the
possibility that we might be committing a Type II error. If we do not reject H 0, we conclude that
we do not have significant evidence to show that H1 is true. We do not conclude that H0 is true

Tender Response Template
50% (2)
Tender Response Template
17 pages
QSP - 530-01 Risk Assessment
No ratings yet
QSP - 530-01 Risk Assessment
20 pages
Dibbern Et Al. 2004
No ratings yet
Dibbern Et Al. 2004
97 pages
HYPOTHESIS TESTING 2023
No ratings yet
HYPOTHESIS TESTING 2023
6 pages
MATH 264 Statistics For Social Sciences: Hypothesis Testing
No ratings yet
MATH 264 Statistics For Social Sciences: Hypothesis Testing
62 pages
BRM-9 updated
No ratings yet
BRM-9 updated
73 pages
Methodology of Hypothesis Testing
No ratings yet
Methodology of Hypothesis Testing
15 pages
Buss. Stat - Chapter 3 Hypothesis Testing
No ratings yet
Buss. Stat - Chapter 3 Hypothesis Testing
10 pages
Eda Research
No ratings yet
Eda Research
11 pages
Buss. Stat - Chapter 3 Hypothesis Testing
No ratings yet
Buss. Stat - Chapter 3 Hypothesis Testing
10 pages
Parametric Testing
No ratings yet
Parametric Testing
127 pages
Hypothesis Testing - New
No ratings yet
Hypothesis Testing - New
43 pages
Hypothesis Testing Hand Notre
No ratings yet
Hypothesis Testing Hand Notre
6 pages
Hypothesis Testting3
No ratings yet
Hypothesis Testting3
7 pages
Statistical Inferences
No ratings yet
Statistical Inferences
46 pages
One-Sample Tests of Hypothesis: Chapter Ten
No ratings yet
One-Sample Tests of Hypothesis: Chapter Ten
40 pages
Hypothesis Testing
No ratings yet
Hypothesis Testing
51 pages
Chapter 5
No ratings yet
Chapter 5
65 pages
An Introduction To Statistical Inference
No ratings yet
An Introduction To Statistical Inference
33 pages
Five Steps of Hypothesis Testing
No ratings yet
Five Steps of Hypothesis Testing
3 pages
Lecture III
No ratings yet
Lecture III
52 pages
Learning Unit 8
No ratings yet
Learning Unit 8
20 pages
Inferential Statistics 1
No ratings yet
Inferential Statistics 1
32 pages
Chapter 10 One Sample Tests of Hypothesis
No ratings yet
Chapter 10 One Sample Tests of Hypothesis
36 pages
Hypothesis Testing
No ratings yet
Hypothesis Testing
78 pages
Fundamentals of Hypothesis Testing: One-Sample Tests
100% (1)
Fundamentals of Hypothesis Testing: One-Sample Tests
105 pages
Fundamentals of Hypothesis Testing: Dr. K. M. Salah Uddin
No ratings yet
Fundamentals of Hypothesis Testing: Dr. K. M. Salah Uddin
59 pages
Hypothesis Testing
100% (1)
Hypothesis Testing
60 pages
Null Vs Alternative Hypothesis, Rejection Region, and Significance Level Type I Error and Type II Error, Test For The Mean. Population Variance Known, P-Value
No ratings yet
Null Vs Alternative Hypothesis, Rejection Region, and Significance Level Type I Error and Type II Error, Test For The Mean. Population Variance Known, P-Value
14 pages
04 Hypothesis Testing IITB PDF
No ratings yet
04 Hypothesis Testing IITB PDF
33 pages
10 Hypothesis Testing
No ratings yet
10 Hypothesis Testing
47 pages
Probability and Statistics Notes
No ratings yet
Probability and Statistics Notes
10 pages
HYPOTHESES
No ratings yet
HYPOTHESES
32 pages
Hawch 11
No ratings yet
Hawch 11
8 pages
Chapter 10 One-Sample Tests of Hypothesis
100% (2)
Chapter 10 One-Sample Tests of Hypothesis
36 pages
DMDA Unit-5 notes (2) (1)
No ratings yet
DMDA Unit-5 notes (2) (1)
35 pages
Testing of Hypotheses
No ratings yet
Testing of Hypotheses
19 pages
Session 12 - Hypothesis Testing-Single Sample Tests
No ratings yet
Session 12 - Hypothesis Testing-Single Sample Tests
56 pages
Module 7 - MAMW100 Hypothesis Testing New
No ratings yet
Module 7 - MAMW100 Hypothesis Testing New
6 pages
7 Step
No ratings yet
7 Step
5 pages
SB K49 Lecture8
No ratings yet
SB K49 Lecture8
51 pages
Applied Statistics: Testing of Hypotheses
No ratings yet
Applied Statistics: Testing of Hypotheses
21 pages
Basic Concepts in Hypothesis Testing (Rosalind L P Phang)
No ratings yet
Basic Concepts in Hypothesis Testing (Rosalind L P Phang)
7 pages
Blood Glucose Levels For Obese Patients Have A Mean of 100 With A Standard Deviation of 15
No ratings yet
Blood Glucose Levels For Obese Patients Have A Mean of 100 With A Standard Deviation of 15
11 pages
HYPOTHESIS TESTING Z Test 1
No ratings yet
HYPOTHESIS TESTING Z Test 1
11 pages
Lecture 09
No ratings yet
Lecture 09
48 pages
Lecture 7 With No Solutions2
No ratings yet
Lecture 7 With No Solutions2
42 pages
Navidi ch6
No ratings yet
Navidi ch6
82 pages
Biostat Hypothesis Testing
No ratings yet
Biostat Hypothesis Testing
67 pages
4 Hypothesis Testing 1 Sample Mean For Students
No ratings yet
4 Hypothesis Testing 1 Sample Mean For Students
18 pages
Buss. Stat - Chapter 3 Hypothesis Testing 2
No ratings yet
Buss. Stat - Chapter 3 Hypothesis Testing 2
26 pages
Buss. Stat - Chapter 3 Hypothesis Testing 2
No ratings yet
Buss. Stat - Chapter 3 Hypothesis Testing 2
26 pages
CVE 303 - 6. Hypothesis Test
No ratings yet
CVE 303 - 6. Hypothesis Test
44 pages
Q4 W2 Hypothesis Testing Using Critical and p Value Method Population Mean
No ratings yet
Q4 W2 Hypothesis Testing Using Critical and p Value Method Population Mean
38 pages
1. Testing
No ratings yet
1. Testing
29 pages
Chapter 10 PowerPoint
No ratings yet
Chapter 10 PowerPoint
36 pages
Hypothesis Testing
No ratings yet
Hypothesis Testing
7 pages
Hypothesis Testing Revised
No ratings yet
Hypothesis Testing Revised
22 pages
Chap 10
No ratings yet
Chap 10
12 pages
ZCVZCVZXVC
No ratings yet
ZCVZCVZXVC
66 pages
Hypothesis Testing
No ratings yet
Hypothesis Testing
45 pages
Bab 5 Fundamentals of Hypothesis
No ratings yet
Bab 5 Fundamentals of Hypothesis
55 pages
Hypothesis Testing: Six Sigma Thinking, #6
From Everand
Hypothesis Testing: Six Sigma Thinking, #6
Sumeet Savant
No ratings yet
CPRJ 2
No ratings yet
CPRJ 2
253 pages
Urban Green Infrastructure A Review On Valuation Toolkits From An Urban Planning Perspective
No ratings yet
Urban Green Infrastructure A Review On Valuation Toolkits From An Urban Planning Perspective
10 pages
Non-Digital Instructional Materials Evaluation Tool
100% (1)
Non-Digital Instructional Materials Evaluation Tool
2 pages
Epy 410 E-Learning Document
No ratings yet
Epy 410 E-Learning Document
95 pages
Department of Education: Republic of The Philippines
No ratings yet
Department of Education: Republic of The Philippines
7 pages
Policy Proposal Summary
No ratings yet
Policy Proposal Summary
4 pages
Verb Use in Different Domains: Cognitive Domain Level Description Action Verbs Describing Learning Outcomes
No ratings yet
Verb Use in Different Domains: Cognitive Domain Level Description Action Verbs Describing Learning Outcomes
3 pages
BV Technical Guide Iso 14001 2015
100% (1)
BV Technical Guide Iso 14001 2015
12 pages
ANPQP Version 2 3 Changes
No ratings yet
ANPQP Version 2 3 Changes
25 pages
Sample Exam Istqb CTFL 2018
No ratings yet
Sample Exam Istqb CTFL 2018
60 pages
Test Construction and Evaluation
100% (1)
Test Construction and Evaluation
34 pages
Warren Leslie Wright Curriculum Vitae
No ratings yet
Warren Leslie Wright Curriculum Vitae
5 pages
Product Information Safe Stretch Wrapping Solutions en Im0072568
No ratings yet
Product Information Safe Stretch Wrapping Solutions en Im0072568
16 pages
GTA 41-01-005 Religious Factors Analysis - US Army
No ratings yet
GTA 41-01-005 Religious Factors Analysis - US Army
60 pages
Ascld Guidance On Traceability of Measurement - 2011
No ratings yet
Ascld Guidance On Traceability of Measurement - 2011
25 pages
Job Opportunity at Kampala Capital City Authority
No ratings yet
Job Opportunity at Kampala Capital City Authority
4 pages
English For Academic and Professional Purposes: Quarter 1-Module 7 Writing A Critique
No ratings yet
English For Academic and Professional Purposes: Quarter 1-Module 7 Writing A Critique
20 pages
"Sericulture " (Study of Different Type of Silkworm)
No ratings yet
"Sericulture " (Study of Different Type of Silkworm)
4 pages
STLC - Software Testing Life Cycle Phases
No ratings yet
STLC - Software Testing Life Cycle Phases
14 pages
WritingSMARTLearningObjectives PDF
No ratings yet
WritingSMARTLearningObjectives PDF
3 pages
Quality Assessment of Food Provided at Indira Canteen in Bangalore
No ratings yet
Quality Assessment of Food Provided at Indira Canteen in Bangalore
13 pages
6087 2013 Syllabus Document
No ratings yet
6087 2013 Syllabus Document
10 pages
Fundamental Principles of School Administration and Supervision
No ratings yet
Fundamental Principles of School Administration and Supervision
38 pages
Final report - Sales Management - Nguyễn Thị Mỹ Huyền - 215221599 - Ca 2
No ratings yet
Final report - Sales Management - Nguyễn Thị Mỹ Huyền - 215221599 - Ca 2
23 pages
Passengers Terminal Operation
No ratings yet
Passengers Terminal Operation
401 pages
WVPT 4
No ratings yet
WVPT 4
52 pages
3247 First Language Urdu: MARK SCHEME For The May/June 2013 Series
No ratings yet
3247 First Language Urdu: MARK SCHEME For The May/June 2013 Series
12 pages

Hypothesis Testing

Uploaded by

Hypothesis Testing

Uploaded by

Hypothesis Testing: Upper-, Lower, and Two

Step 1. Set up hypotheses and select the level of significance .

H0: Null hypothesis (no change, no difference);

H1: Research hypothesis (investigator's belief); =0.05

Upper-tailed, Lower-tailed, Two-tailed Tests

1. H1: > 0 , where 0 is the comparator or null value (e.g.,

2. H1: < 0 , where a decrease is hypothesized and this is

3. H1: 0, where a difference is hypothesized and this is

The exact form of the research hypothesis depends on the

Step 2. Select the appropriate test statistic.

Step 3. Set up decision rule.

Step 4. Compute the test statistic.

1. P-values summarize statistical significance and do not address

2. Statistical tests allow us to draw conclusions of significance or not

3. When conducting any statistical analysis, there is always a

4. Many investigators inappropriately believe that the p-value

5. Statistical significance does not take into account the possibility of

6. Evidence-based decision making is important in public health and

H0: = 191 H1: > 191 =0.05

Step 2. Select the appropriate test statistic.

Step 3. Set up decision rule.

Step 4. Compute the test statistic.

Type I and Type II Errors

Table - Conclusions in Test of Hypothesis

Do Not Reject H0 Reject H0

H0 is True Correct Decision Type I Error

H0 is False Type II Error Correct Decision

You might also like