Unit 4 Hypothesis Testing (One Sample Mean) (SY22)
Unit 4 Hypothesis Testing (One Sample Mean) (SY22)
INTRODUCTION TO HYPOTHESIS
TESTING
Mathematics Department
XAVIER UNIVERSITY-ATENEO DE CAGAYAN
INFERENTIAL
STATISTICS
An area in Statistics that deals with
methods used to make generalizations
or inferences about some
characteristics of the population based
on information contained in a sample
Hypothesis Testing
- one type of inferential analysis
- a decision-making process for testing the claims
about a population parameter based on the
characteristics of a sample randomly taken from
the population
Important Terms
■ The null hypothesis, H0, is a statement that specifies a
particular value (or values) for the parameter being studied. It is
the hypothesis that is being tested.
■ The alternative hypothesis, H1, specifies those values of the
parameter that represent an important change from the null
hypothesis. It opposes the null hypothesis.
■ The probability of committing a Type I error is the level of
significance of the test. It is the maximum probability with which
we are willing to commit a Type I error. In practice, usually set at
0.05 (*significant) or 0.01 (** highly significant)
■ The test statistic is a statistic computed from the sample on
which the decision to reject or not to reject H0 is based. If the
computed test statistic falls in the rejection region, the H0 is
rejected.
■ The critical value is the boundary between the rejection region
and the nonrejection region.
The following is the summary of the steps when
performing hypothesis testing.
The degrees of freedom relate to the number of observations that are free to vary.
Most computer programs do not have this option because Z test
and t test approximately give similar output when the sample size is
large.
One sample t test is available in SPSS.
Z distribution – based on the t distribution
Standard Normal Distribution
Properties of the t Distribution
Properties of a Normal Distribution
It is, like the Z distribution, a
■ The normal curve is symmetrical
about the mean μ; bell-shaped, symmetrical,
continuous distribution.
■ In a standard normal distribution,
the mean is zero. There is not one t distribution
■ The mean is at the middle and but rather a “family” of t
divides the area into halves; distributions. All have the same
mean (center) of zero, but their
■ The total area under the curve is
equal to 1;
standard deviations differ
according to the sample size n.
■ It is completely determined by its
mean and standard deviation σ (or The t distribution is more spread
variance σ2) out and flatter at the center
than the Z distribution.
However, as the sample size
increases, the t distribution
approaches the standard
normal distribution.
Decision of the test
■ i. The null hypothesis is rejected when the computed test
statistic falls in the rejection region.
■ ii. The null hypothesis is not rejected when the computed test
statistic falls in the non-rejection region.
A small p-value basically means that your data are unlikely under some null
hypothesis.
Example 1 (two-tailed Z test).
Lighthouse Electrical Company manufactures light bulbs that
have a lifetime that is approximately normally distributed with a
mean of 750 hours. A random sample 30 bulbs has been
tested and showed a mean lifetime of 738 hours. It has been
known that the population standard deviation is 37.5 hours. At
0.05 level of significance, test the hypothesis that the mean
lifetime of light bulbs is significantly different from 750 hours.
Solution:
Let = mean lifetime of light bulbs in hours
Z Z 0.025 or Z Z 0.025
.
Based from the Z distribution table, the critical value of the test is 1.96
7. Conclusion
There is no sufficient evidence to claim that the mean lifetime of the light
bulbs is significantly different from 750 hours.
Example 2 (one-tailed t test)
A manager of a hotdog company claims that mean time to
pack one dozen of hotdog manually is 11 seconds. Suppose a
random sample of 20 workers spent a mean of 13 seconds
with a standard deviation of 2.3 seconds to pack a dozen of
hotdog. At 0.01 level of significance, do the sample results
provide sufficient evidence to conclude that it takes more than
11 seconds to pack one dozen of hotdog manually? Assume
that the length of time is normally distributed.
Solution:
Let = mean length of time to pack one dozen of hotdog manually
t 3 . 889
6. Statistical decision
Since t = 3.889 is in the rejection region, H0 is rejected.
36 28 21 25 31 17 22 18 18
29 21 26 17 18 30 19 19 28
4. Click OK.
H 0 : 20 yrs.old H 0 : 0 20 yrs.old
Edit the table.
t-Test: One-Sample
t-Test: Two-Sample Assuming Unequal Variances
Decision/Conclusion: Reject Ho. The mean age of patients in NMMC who had
undergone appendectomy is significantly different from 20yrs old.
SPSS: Analyze Compare Means One Sample t test
Ho: µ = 20 yrs old
H1: µ ≠ 20 yrs old
By default, SPSS provides
only the two-tailed
probability. Since the p-value
is less than 0.05, Ho is
rejected. The mean age of
patients in NMMC who had
undergone appendectomy is
significantly different from
20yrs old.
Since p-value (Sig.) 0.316 is not less than 0.05, Ho is not rejected and
conclude that the data come from a normal distribution.
Note: In large samples these tests can be significant even when the scores are only
slightly different from a normal distribution. Therefore, they should always be
interpreted in conjunction with histograms, P–P or Q–Q plots, and the values of
skewness and kurtosis.
Jamovi: Analyses ->
Exploration -> Descriptive ->
Statistics -> Shapiro-Wilk
Example
Since the p-value (Sig.) 0.410 is not less than 0.05, Ho is not rejected
and conclude that the variances are equal.