0% found this document useful (0 votes)

145 views

Hypothesis Testing Lecture

Hypothesis testing involves formulating a null hypothesis (H0) and an alternative hypothesis (Ha). There are two types of errors that can occur: 1) Type I error is rejecting the null hypothesis when it is true. 2) Type II error is failing to reject the null hypothesis when it is false. The test statistic is used to assess how compatible the data are with the null hypothesis. If the p-value is less than the significance level α, then the null hypothesis is rejected in favor of the alternative hypothesis. Choosing a smaller α increases the likelihood of a type I error but decreases the likelihood of a type II error. Power is the probability of rejecting a false null hypothesis. Sample size

Uploaded by

yogibh

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

145 views

Hypothesis Testing Lecture

Uploaded by

yogibh

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 28

Lecture 5

Hypothesis Testing
A hypothesis is a conjecture about the distribution of some random
variables. For example, a claim about the value of a parameter of
the statistical model.
There are two types of hypotheses:
The null hypothesis,

, is the current belief.

The alternative hypothesis,

want to show.

, is your belief; it is what you

Examples: Each of the following situations requires a significance

test about a population mean. State the appropriate null hypothesis
and alternative hypothesis
in each case.
(a) The mean area of the several thousand apartments in a new
development is advertised to be 1250 square feet. A tenant
group thinks that the apartments are smaller than advertised.
They hire an engineer to measure a sample of apartments to
test their suspicion.

(b) Larry's car consume on average 32 miles per gallon on the

highway. He now switches to a new motor oil that is
advertised as increasing gas mileage. After driving 3000
highway miles with the new oil, he wants to determine if his
gas mileage actually has increased.

(c) The diameter of a spindle in a small motor is supposed to be

5 millimeters. If the spindle is either too small or too large,
the motor will not perform properly. The manufacturer
measures the diameter in a sample of motors to determine
whether the mean diameter has moved away from the target.

Guidelines for Hypothesis testing

Hypothesis testing is a proof by contradiction. The testing process
has four steps:
Step 1: Assume

is true.

Step 2: Use statistical theory to make a statistic (function of the

data) that includes . This statistic is called the test statistic.
Step 3: Find the probability that the test statistic would take a
value as extreme or more extreme than that actually observed.
Think of this as: probability of getting our sample assuming
is
true.
Step 4: If the probability we calculated in step 3 is high it means
that the sample is likely under
and so we have no evidence
against . If the probability is low, there are two possibilities:
- we observed a very unusual event, or
- our assumption is wrong

Test Statistic
The test is based on a statistic that estimates the parameter

that appears in the hypotheses. Usually this is the same

estimate we would use in a confidence interval for the
parameter. When
is true, we expect the estimate to take a
value near the parameter value specified in

Values of the estimate far from the parameter value specified by

give evidence against
. The alternative hypothesis
determines which directions count against .
A test statistic measures compatibility between the null
hypothesis and the data.
To assess how far the estimate is from the parameter, standardize
the estimate. In many common situations the test statistics has the
form

Example: An air freight company wishes to test whether or not the

mean weight of parcels shipped on a particular root exceeds 10
pounds. A random sample of 49 shipping orders was examined and
found to have average weight of 11 pounds. Assume that the
standard deviation of the weights is 2.8 pounds.
Solution:

Graphical Representation
Suppose we want to test a set of hypotheses concerning a
parameter based on a random sample
.
vs
is the estimate of our parameter .
Rejection Region (RR) is the specified values of the test statistics
for which we reject .
The probability that defines the critical region is called the size of
the test or level of the significance of the test and is denoted by .

Example: The hourly wages in a particular industry are normally

distributed with mean $13.20 and standard deviation $2.50. A
company employs 40 workers paying them an average of $12.20
per hour. Can this company be accused of paying substandard
wages? Use
.
Solution:

Decision Errors
When we perform a statistical test we hope that our decision will
be correct, but sometimes it will be wrong. There are two possible
errors that can be made in hypothesis test.
Definition: The error made by rejecting the null hypothesis
when in fact
is true is called a type I error.
The error made by failing to reject the null hypothesis
when in
fact
is false is called a type II error.
Note: The level of significance of the test is also the probability of
type I error, denoted by , i.e.

The probability of a type II error is denoted by .

Example: An experimenter has prepared a drug dosage level that
she claims will induce sleep for 80% of people suffering from
insomnia. In an attempt to disprove her claim, we administer her
prescribed dosage to 20 insomniacs and observe X, the number of
people for whom the drug dose induces sleep. We wish to test
vs
. Assume
.

Example: We are interested in testing whether or not a coin is

balanced based on the number of heads, X, on 36 tosses of the coin.
Assume
.
(a) What is the value of ?
(b) What is the value of if p = 0.7?

P-value
Definition: The probability, assuming
is true, that the test
statistic would take a value as extreme or more extreme than that
actually observed is called the P-value of the test. The smaller the
P-value, the stronger the evidence against
provided by the data.
Guideline for how small is small:
P-value > 0.1 provides no evidence against

0.05 < P-value < 0.1 provides weak evidence against

0.01 < P-value < 0.05 provides moderated evidence against

P-value < 0.01 provides strong evidence against

We can compare the P-value we calculate with a fixed value that

we regard as decisive. The decisive value of P is called the
significance level (this is our ). Most common values for are
0.1, 0.05, 0.01.
If the P-value is as small or smaller than , we say that the data are
statistically significant at level . In other words, the P-value is
the smallest level of significance for which the null hypothesis
should be rejected.

Example: 85% of the general public is right-handed. A survey of

300 chief executive officers of large corporations found that 95%
were right-handed. Is this difference in percentages statistically
significant? Use
. Find the P-value for the test.
Solution:

Tests for a Population Mean ( is known)

where

is the specified value of .

Example: In 1999, it was reported that the mean serum cholesterol

level for female undergraduates was 168 mg/dl with a standard
deviation of 27 mg/dl. A recent study at Baylor University
investigated the lipid levels in a cohort of sedentary university
students. The mean total cholesterol level among n = 71 females
was
. Is this evidence that cholesterol levels of sedentary
students differ from the previously reported average?
Solution:

Two-sided significance tests and CIs

Note: A level
two-sided significance test rejects a hypothesis
exactly when the value
falls outside a level
confidence interval for .

Tests for a Population Mean ( is unknown)

Recall: (one-sample t CI)

Example: Founded in 1998, Telephia provides a wide variety of

information on cellular phone use. In 2006, Telaphia reported that,
on average, United Kingdom (U.K.) subscribers with thirdgeneration technology (3G) phones spent an average of 8.3 hours
per month listening to full-track music on their cell phones.
Suppose we want to determine a 95% CI for the U.S. average and
draw the following random sample of size 8 from the U.S.
population of 3G subscribers:
5 6 0 4 11 9 2 3
The sample mean is
and the standard deviation s = 3.63 with
degrees of freedom n - 1 = 7.

Example: Suppose that, for the U.S. data in example before we

want to test whether the U.S. average is different from the reported
U.K. average.

Power
The ability of a test to detect that
is false is measured by the
probability that the test will reject
when an alternative is true.
The higher this probability is, the more sensitive the test is.
Definition: The probability that a fixed size
test will reject
when
is false is called the power of the test.
A powerful test has a large probability of rejecting
false.

when it is

Example: Can a 6-month exercise program increase the total body

bone mineral content (TBBMC) of young women? A team of
researchers is planning a study to examine this question. Based on
the results of a previous study, they are willing to assume that
for the percent change in TBBMC over the 6-month period.
A change in TBBMC of 1% would be considered important, and
the researchers would like to have a reasonable chance of detecting
a change this large or larger. Is 25 subjects a large enough sample
for this project?
Three steps to find the power of the test:
1. State , , the particular alternative we want to detect, and
the significance level .
2. Find the values of (or other estimates) that will lead to reject
.
3. Calculate the probability of observing these values of when
the alternative is true.

Example: Power of the pharmaceutical product test:

What is the power of the test against the specific alternative

How to increase the power?

Back to Error Probabilities

Example: The mean outer diameter of a skateboard bearing is
supposed to be 22.000 millimeters (mm). The outer diameters vary
Normally with standard deviation
mm. When a lot of
bearings arrives, the skateboard manufacturer takes an SRS of 5
bearings from the lot and measures their outer diameters. The
manufacturer rejects the bearings if the sample mean diameter is
significantly different from 22 at the 5% significance level.

Suppose the producer and the manufacturer agree that a lot of

bearings with mean 0.015 mm away from 22 should be rejected.

Significance and Type I error: The significance level

of any
fixed level test is the probability of a Type I error. That is, is the
probability that the test will reject
when
is in fact true.

Power and Type II error: The power of a fixed level test to detect a
particular alternative is 1 minus the probability of a Type II error
for that alternative.

Testing Hypotheses on a Proportion

Example: According to the National Institute for Occupational

Safety and Health, job stress poses a major threat to the health of
workers. A national survey of restaurant employees found that
75% said that work stress had a negative impact on their personal
lives. A sample of 100 employees of a restaurant chain finds that
68 answer Yes when asked, Does work stress have a negative
impact on your personal life? Is this good reason to think that the
proportion of all employees of this chain who would say Yes,
differs from the national proportion p0=0.75?

The Impact of Ict On The Nigerian Economic Growth and Development
No ratings yet
The Impact of Ict On The Nigerian Economic Growth and Development
64 pages
Project
100% (1)
Project
63 pages
Testing of Hypothesis
No ratings yet
Testing of Hypothesis
68 pages
MNSTA Chapter 4
No ratings yet
MNSTA Chapter 4
31 pages
Testing of Hypothesis Hypothesis
No ratings yet
Testing of Hypothesis Hypothesis
32 pages
Eda Research
No ratings yet
Eda Research
11 pages
Hypothesis Testing
No ratings yet
Hypothesis Testing
12 pages
Data Analytics Module 1 Lesson 6 Summary Notes
No ratings yet
Data Analytics Module 1 Lesson 6 Summary Notes
17 pages
Estimation and Hypothesis Testing
No ratings yet
Estimation and Hypothesis Testing
46 pages
HYPOTHESIS TESTING
No ratings yet
HYPOTHESIS TESTING
56 pages
Chapter 3
No ratings yet
Chapter 3
45 pages
Hypothesis Testing About A Mean 3
No ratings yet
Hypothesis Testing About A Mean 3
39 pages
Learning Module - Statistics and Probability
No ratings yet
Learning Module - Statistics and Probability
71 pages
Hypothesis Testing
No ratings yet
Hypothesis Testing
45 pages
BRM-Chapter-10-Hypothesis Testing For Single Populations - Revised
No ratings yet
BRM-Chapter-10-Hypothesis Testing For Single Populations - Revised
28 pages
Chapter10 - One Tailed Test of Hypothesis
No ratings yet
Chapter10 - One Tailed Test of Hypothesis
44 pages
Test of Hypotheses
0% (1)
Test of Hypotheses
26 pages
L15 Testing of Hypothesis
No ratings yet
L15 Testing of Hypothesis
42 pages
Lecture 5 Test of Hypothesis Upload T
No ratings yet
Lecture 5 Test of Hypothesis Upload T
30 pages
Hypotheses Testing
No ratings yet
Hypotheses Testing
5 pages
H T S M: Ypothesis Ests FOR A Ingle EAN
No ratings yet
H T S M: Ypothesis Ests FOR A Ingle EAN
1 page
Hypothesis Testing
No ratings yet
Hypothesis Testing
59 pages
Stat 3 RD
No ratings yet
Stat 3 RD
91 pages
Chapter IX Hypothesis Testing
No ratings yet
Chapter IX Hypothesis Testing
31 pages
Statistics 1
No ratings yet
Statistics 1
34 pages
Statistical Analysis (T-Test)
No ratings yet
Statistical Analysis (T-Test)
61 pages
Testing of Hypotheses PDF
No ratings yet
Testing of Hypotheses PDF
21 pages
Week 14_15 Testing Claims About Means and Proportions
No ratings yet
Week 14_15 Testing Claims About Means and Proportions
74 pages
St. Paul University Philippines
No ratings yet
St. Paul University Philippines
14 pages
Confidence Interval For Population Variance
No ratings yet
Confidence Interval For Population Variance
48 pages
1. Testing
No ratings yet
1. Testing
29 pages
3. Inferential Statistics
No ratings yet
3. Inferential Statistics
48 pages
Module 2 - Hypothesis Testing_afterclass
No ratings yet
Module 2 - Hypothesis Testing_afterclass
37 pages
Testing of Hypothesis
67% (3)
Testing of Hypothesis
37 pages
Week 7 Hypothesis Testing
No ratings yet
Week 7 Hypothesis Testing
37 pages
Hypothesis Testing
No ratings yet
Hypothesis Testing
45 pages
Hypothesis Testing
No ratings yet
Hypothesis Testing
7 pages
Hypothesis Testing MBA
No ratings yet
Hypothesis Testing MBA
39 pages
Infer Ential
No ratings yet
Infer Ential
25 pages
Chapter 10 One-Sample Tests
No ratings yet
Chapter 10 One-Sample Tests
28 pages
Stat
67% (3)
Stat
70 pages
Statistical Inferences
No ratings yet
Statistical Inferences
46 pages
Hypothesis Test
No ratings yet
Hypothesis Test
23 pages
Chapter 9
No ratings yet
Chapter 9
14 pages
Unit5 Hypothesis Testing-1
No ratings yet
Unit5 Hypothesis Testing-1
82 pages
One Sample Procedures
No ratings yet
One Sample Procedures
5 pages
Chapter 4
No ratings yet
Chapter 4
77 pages
PSM 201 Sampling Distributions and Hypothesis Testing
No ratings yet
PSM 201 Sampling Distributions and Hypothesis Testing
31 pages
Padeepz MA3251 Notes-1
No ratings yet
Padeepz MA3251 Notes-1
239 pages
Testing of Hypotheses (1)
No ratings yet
Testing of Hypotheses (1)
24 pages
5 & 6 - BIOSTATISTICS V & VI Inferential Statistics I & II
No ratings yet
5 & 6 - BIOSTATISTICS V & VI Inferential Statistics I & II
68 pages
Chapter 10 One-Sample Tests of Hypothesis
100% (2)
Chapter 10 One-Sample Tests of Hypothesis
36 pages
Chapter 5_ All Methods p Value
No ratings yet
Chapter 5_ All Methods p Value
36 pages
Hypothesis Testing
No ratings yet
Hypothesis Testing
86 pages
UNIT III Testing of Hypothesis
No ratings yet
UNIT III Testing of Hypothesis
54 pages
Hypothesis Testing
No ratings yet
Hypothesis Testing
7 pages
Chapter 4 - One Sample Test of Hypothesis
No ratings yet
Chapter 4 - One Sample Test of Hypothesis
37 pages
Hypothesis I
No ratings yet
Hypothesis I
70 pages
Hypothesis Testing
No ratings yet
Hypothesis Testing
8 pages
Testing of Hypothesis
100% (1)
Testing of Hypothesis
54 pages
Hypothesis Testing: Six Sigma Thinking, #6
From Everand
Hypothesis Testing: Six Sigma Thinking, #6
Sumeet Savant
No ratings yet
Chi Squared for Beginners
From Everand
Chi Squared for Beginners
Stephanie Glen
No ratings yet
SWE5201 Advanced Programming - Assignment 2
No ratings yet
SWE5201 Advanced Programming - Assignment 2
6 pages
Chaltu Hunduma
No ratings yet
Chaltu Hunduma
112 pages
Revised CBC Mses NC Ii
No ratings yet
Revised CBC Mses NC Ii
88 pages
General Information On Measurement Uncertainty
No ratings yet
General Information On Measurement Uncertainty
3 pages
An Essay of Understanding The Methods of Qualitative Research In Social Science
No ratings yet
An Essay of Understanding The Methods of Qualitative Research In Social Science
4 pages
Mix Method Research - Chapter 1 (2010)
100% (1)
Mix Method Research - Chapter 1 (2010)
41 pages
Sprocket Central Pty LTD: Data Analytics Approach
No ratings yet
Sprocket Central Pty LTD: Data Analytics Approach
5 pages
Final Output in Research Report
No ratings yet
Final Output in Research Report
27 pages
Diagnostic Test Practical Research 1
No ratings yet
Diagnostic Test Practical Research 1
9 pages
Lecture5 Philosophy of Research
No ratings yet
Lecture5 Philosophy of Research
15 pages
Research Topics For Economics Thesis
100% (3)
Research Topics For Economics Thesis
7 pages
Research Methodology
No ratings yet
Research Methodology
2 pages
Thesis Proposal in Economics
100% (1)
Thesis Proposal in Economics
6 pages
PR1 PPT 2 Process Ethics of Research
No ratings yet
PR1 PPT 2 Process Ethics of Research
39 pages
Stat For Business Sem Sum 1516 Final Exam PSUT
No ratings yet
Stat For Business Sem Sum 1516 Final Exam PSUT
7 pages
Lesson 12
No ratings yet
Lesson 12
15 pages
Quality Improvement Vs Research Guide
No ratings yet
Quality Improvement Vs Research Guide
4 pages
Understanding Tiktok Usage: Communication Strategy of Bytedance Based On The Background of New Media
No ratings yet
Understanding Tiktok Usage: Communication Strategy of Bytedance Based On The Background of New Media
11 pages
Evaluation In Practice A Methodological Approach 2nd Edition Bingham R download
No ratings yet
Evaluation In Practice A Methodological Approach 2nd Edition Bingham R download
77 pages
Research Paradigms and Christian Worldview - Edited2
No ratings yet
Research Paradigms and Christian Worldview - Edited2
2 pages
Talent Acquisition Strategies A Comprehensive Exam
No ratings yet
Talent Acquisition Strategies A Comprehensive Exam
15 pages
Ep 3
No ratings yet
Ep 3
7 pages
How To Write A Literature Review For A Lab Report
No ratings yet
How To Write A Literature Review For A Lab Report
7 pages
Research-Proposal-SAMPLES GRADE 11 SECOND SEMESTER HUMSS
No ratings yet
Research-Proposal-SAMPLES GRADE 11 SECOND SEMESTER HUMSS
5 pages
Thesis Ideas For Biology
100% (3)
Thesis Ideas For Biology
7 pages
Lecture 9 - Reporting and Sharing Findings
No ratings yet
Lecture 9 - Reporting and Sharing Findings
21 pages
8604 Solved Quiz
No ratings yet
8604 Solved Quiz
13 pages
0.1 Simulation Based Power Analysis For Factorial ANOVA Designs PDF
No ratings yet
0.1 Simulation Based Power Analysis For Factorial ANOVA Designs PDF
11 pages

Hypothesis Testing Lecture

Uploaded by

Hypothesis Testing Lecture

Uploaded by

Lecture 5

, is the current belief.

The alternative hypothesis,

, is your belief; it is what you

Examples: Each of the following situations requires a significance

(b) Larry's car consume on average 32 miles per gallon on the

(c) The diameter of a spindle in a small motor is supposed to be

Guidelines for Hypothesis testing

Step 2: Use statistical theory to make a statistic (function of the

that appears in the hypotheses. Usually this is the same

Values of the estimate far from the parameter value specified by

Example: An air freight company wishes to test whether or not the

Example: The hourly wages in a particular industry are normally

The probability of a type II error is denoted by .

Example: We are interested in testing whether or not a coin is

0.05 < P-value < 0.1 provides weak evidence against

0.01 < P-value < 0.05 provides moderated evidence against

We can compare the P-value we calculate with a fixed value that

Example: 85% of the general public is right-handed. A survey of

Tests for a Population Mean ( is known)

is the specified value of .

Example: In 1999, it was reported that the mean serum cholesterol

Two-sided significance tests and CIs

Tests for a Population Mean ( is unknown)

Recall: (one-sample t CI)

Example: Founded in 1998, Telephia provides a wide variety of

Example: Suppose that, for the U.S. data in example before we

Example: Can a 6-month exercise program increase the total body

Example: Power of the pharmaceutical product test:

What is the power of the test against the specific alternative

How to increase the power?

Back to Error Probabilities

Suppose the producer and the manufacturer agree that a lot of

Significance and Type I error: The significance level

Testing Hypotheses on a Proportion

Example: According to the National Institute for Occupational

You might also like