Hypothesis Testing (1)
Hypothesis Testing (1)
Critical value:
In hypothesis testing, critical value and critical region are key concepts used to
determine whether to reject the null hypothesis based on the test statistic.
The critical value is a threshold or cutoff point that defines the boundary
between the rejection region and the non-rejection region for the null
hypothesis.
It is determined based on the significance level α (e.g., 0.05 or 0.01) and the
type of test being conducted (one-tailed or two-tailed).
If the calculated test statistic (e.g., z-score or t-score) is more extreme than
the critical value, you reject the null hypothesis.
Critical Value and the Significance Level (α):
For a significance level of α=0.05, the critical value represents the point
beyond which there is only a 5% chance that the test statistic will fall if the
null hypothesis is true.
Example:
For a z-test with α=0.05, the critical value for a two-tailed test is
approximately ±1.96, meaning that if the calculated z-value is less than -
1.96 or greater than 1.96, you reject H 0.
Critical Region:
The critical region (or rejection region) is the area in the tails of the
distribution that corresponds to extreme values of the test statistic. If the
test statistic falls within this region, the null hypothesis is rejected.
The size of the critical region is determined by the significance level α. In a
two-tailed test with α=0.05, the critical region consists of the upper 2.5%
and lower 2.5% of the distribution (together making up 5%).
Example of Critical Region:
For a two-tailed z-test with α=0.05, the critical region lies in both tails of
the standard normal distribution, at z-values greater than 1.96 and less than
-1.96.
For a one-tailed z-test with α=0.05, the critical region would lie in just one
tail (greater than 1.645 or less than -1.645, depending on the direction of
the test).
Z-test:
A Z-test is a type of statistical test used to determine whether there is a
significant difference between sample and population means, or between
the means of two samples, assuming the population variance is known.
It is used when the sample size is large (typically n>30) and the data follows
a normal distribution.
Assumptions of Z-tests:
The population standard deviation (σ) is known.
The data is normally distributed.
Large sample size (for the Central Limit Theorem to hold)
Hypotheses:
Null hypothesis ( H 0): The means of the two samples are equal.
Alternative hypothesis ( H 1): The means of the two samples are not equal
x̄−μ
Z= σ
√n
Where
x̄ = sample mean,
μ= population mean,
σ = population standard deviation,
n = sample size
t – test:
A t-test is a statistical test used to compare the means of two groups to determine
if they are significantly different from each other. It is commonly used when the
data is normally distributed and the sample sizes are relatively small (typically
n<30).
Assumptions of T-Tests
The data is approximately normally distributed.
The variances of the two groups are equal (for independent t-tests).
The data should be independent of each other in the case of independent
samples.
x̄−μ
t= s
√n
Where
x̄ = sample mean,
μ= population mean,
s = sample standard deviation,
n = sample size
Summary Table:
Type 1 Null hypothesis is true Reject the null hypothesis False Positive
Type 2 Null hypothesis is false Fail to reject the null hypothesis False Negative