Hypothesis Testing Lecture
Hypothesis Testing Lecture
Hypothesis Testing
A hypothesis is a conjecture about the distribution of some random
variables. For example, a claim about the value of a parameter of
the statistical model.
There are two types of hypotheses:
The null hypothesis,
is true.
Test Statistic
The test is based on a statistic that estimates the parameter
Graphical Representation
Suppose we want to test a set of hypotheses concerning a
parameter based on a random sample
.
vs
is the estimate of our parameter .
Rejection Region (RR) is the specified values of the test statistics
for which we reject .
The probability that defines the critical region is called the size of
the test or level of the significance of the test and is denoted by .
Decision Errors
When we perform a statistical test we hope that our decision will
be correct, but sometimes it will be wrong. There are two possible
errors that can be made in hypothesis test.
Definition: The error made by rejecting the null hypothesis
when in fact
is true is called a type I error.
The error made by failing to reject the null hypothesis
when in
fact
is false is called a type II error.
Note: The level of significance of the test is also the probability of
type I error, denoted by , i.e.
P-value
Definition: The probability, assuming
is true, that the test
statistic would take a value as extreme or more extreme than that
actually observed is called the P-value of the test. The smaller the
P-value, the stronger the evidence against
provided by the data.
Guideline for how small is small:
P-value > 0.1 provides no evidence against
where
Note: A level
two-sided significance test rejects a hypothesis
exactly when the value
falls outside a level
confidence interval for .
Power
The ability of a test to detect that
is false is measured by the
probability that the test will reject
when an alternative is true.
The higher this probability is, the more sensitive the test is.
Definition: The probability that a fixed size
test will reject
when
is false is called the power of the test.
A powerful test has a large probability of rejecting
false.
when it is
Power and Type II error: The power of a fixed level test to detect a
particular alternative is 1 minus the probability of a Type II error
for that alternative.