Hypothesis Testing Exercise 1st August 2023
A production engineer claims that there is no difference in the variance of the diameters of nuts
manufactured by two different methods. The data show the diameters (in centimeters) of nuts
produced by the two methods.
a) At the 5% level of significance, can we conclude that the nut diameters produced by the two
different methods are not normally distributed?
b) At the 5% level of significance, can you reject the production engineer’s claim?
c) At the 5% level of significance, can we conclude that the average nut diameters
produced by the two methods differ?
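Parts (a) to (c) can be worked through outside SPSS as well. The following is a minimal sketch in Python with SciPy, not the worksheet's own solution. The diameters are hypothetical placeholders (the worksheet's data table is not reproduced here), and the function name `compare_two_methods` is an assumption made for illustration. Levene's test is used for part (b) because it is the variance test SPSS reports alongside the independent-samples t test; a classical F test would be an alternative.

```python
from scipy import stats

# Hypothetical nut diameters in cm; NOT the worksheet's data.
method_1 = [1.02, 0.98, 1.01, 1.00, 0.99, 1.03, 0.97, 1.01]
method_2 = [1.05, 0.95, 1.04, 0.96, 1.06, 0.94, 1.03, 0.97]


def compare_two_methods(a, b, alpha=0.05):
    """Run checks (a)-(c) and return p-values with the resulting decisions."""
    # (a) Shapiro-Wilk per sample. H0: the sample comes from a normal population.
    p_normal = [float(stats.shapiro(a).pvalue), float(stats.shapiro(b).pvalue)]
    # (b) Levene's test. H0: the two population variances are equal.
    p_var = float(stats.levene(a, b).pvalue)
    equal_var = p_var > alpha
    # (c) Two-sided t test. H0: equal means. Welch's version is used if (b) rejects.
    p_mean = float(stats.ttest_ind(a, b, equal_var=equal_var).pvalue)
    return {
        "normality_p": p_normal,
        "reject_normality": [p < alpha for p in p_normal],
        "variance_p": p_var,
        "reject_equal_variances": p_var < alpha,
        "mean_p": p_mean,
        "reject_equal_means": p_mean < alpha,
    }


result = compare_two_methods(method_1, method_2)
for key, value in result.items():
    print(key, value)
```

If either normality test rejects, a non-parametric comparison would be the safer choice for part (c).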
Q-1C Homework
A nationwide shipping firm purchased a new computer system to track its shipments, pickups,
and deliveries. Employees were expected to need about 2 hours to learn how to use the system.
In fact, some employees could use the system in very little time, whereas others took
considerably longer. Someone suggested that the reason for this difference might be that only
some employees had experience with this kind of computer system. To test this suggestion,
independent samples of employees with and without such experience were randomly selected.
The times, in minutes, required for these employees to learn how to use the system are given in
Q-1C. At the 5% significance level, do the data provide sufficient evidence to conclude that the
mean learning time for all employees without experience exceeds the mean learning time for all
employees with experience? Assume that the two populations have the same shape.
Normality test.
Equality of means (parametric or non-parametric approach).
The data are given in the Excel file as Q-1C.
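The two steps listed above (normality test, then a parametric or non-parametric test of the means) can be sketched as follows. This is an illustration under stated assumptions: the learning times are hypothetical and do not come from the Q-1C file, and `compare_learning_times` is a made-up helper name. The alternative hypothesis is one-sided (employees without experience take longer), and the Mann-Whitney U test relies on the same-shape assumption stated in the question.

```python
from scipy import stats

# Hypothetical learning times in minutes; NOT the Q-1C data.
without_experience = [142, 155, 160, 138, 171, 149, 166, 158]
with_experience = [98, 110, 105, 121, 93, 115, 102, 108]


def compare_learning_times(without_exp, with_exp, alpha=0.05):
    """Pick the test from the normality check, then test Ha: without > with."""
    # Step 1: Shapiro-Wilk on each sample. H0: the sample is from a normal population.
    both_normal = all(
        stats.shapiro(sample).pvalue > alpha for sample in (without_exp, with_exp)
    )
    # Step 2: one-sided test of the two groups.
    if both_normal:
        method = "independent-samples t test"
        # Pooled-variance t test (SciPy's default, equal_var=True).
        p_value = stats.ttest_ind(without_exp, with_exp, alternative="greater").pvalue
    else:
        method = "Mann-Whitney U test"
        p_value = stats.mannwhitneyu(
            without_exp, with_exp, alternative="greater"
        ).pvalue
    return method, float(p_value), bool(p_value < alpha)


method, p_value, reject_h0 = compare_learning_times(
    without_experience, with_experience
)
print(method, p_value, reject_h0)
```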
The baseball coach suggests that a baseball clinic will help players raise their batting averages.
The data shows the batting averages of 14 players before participating in the clinic and two
months after participating in the clinic.
a) At the 5% significance level, do the data provide evidence to conclude that the differences in
batting scores are normally distributed?
As a first step, create a difference variable and check its normality. If the differences
are normally distributed, we proceed with the parametric approach; otherwise we
apply the non-parametric method.
Ho: Differences in batting scores are normally distributed
Tests of Normality (Kolmogorov-Smirnov, Shapiro-Wilk)
Ha: µ1 < µ2
Test Statistics (After - Before)
Z = -0.175
Asymp. Sig. (2-tailed) = 0.861
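The procedure described for this question (create the difference variable, test it for normality, then choose the paired t test or its non-parametric counterpart) can be sketched in Python with SciPy. The batting averages below are hypothetical, so the output will not match the SPSS figures above, and `paired_comparison` is an illustrative name. The sketch assumes µ1 is the mean before the clinic and µ2 the mean after, so Ha: µ1 < µ2 becomes a one-sided test that the After - Before differences are positive.

```python
from scipy import stats

# Hypothetical batting averages for 14 players; NOT the worksheet's data.
before = [0.250, 0.265, 0.240, 0.300, 0.280, 0.255, 0.270,
          0.245, 0.290, 0.260, 0.275, 0.235, 0.285, 0.310]
after = [0.258, 0.262, 0.251, 0.298, 0.292, 0.260, 0.266,
         0.259, 0.289, 0.270, 0.269, 0.248, 0.294, 0.303]


def paired_comparison(before, after, alpha=0.05):
    """Test normality of the differences, then run the matching paired test."""
    # Difference variable: After - Before, rounded to remove float noise.
    diffs = [round(a - b, 3) for a, b in zip(after, before)]
    # H0: the differences are normally distributed.
    p_normal = float(stats.shapiro(diffs).pvalue)
    if p_normal > alpha:
        method = "paired-samples t test"
        p_value = stats.ttest_rel(after, before, alternative="greater").pvalue
    else:
        method = "Wilcoxon signed-rank test"
        p_value = stats.wilcoxon(diffs, alternative="greater").pvalue
    return method, p_normal, float(p_value), bool(p_value < alpha)


method, p_normal, p_value, reject_h0 = paired_comparison(before, after)
print(method, p_normal, p_value, reject_h0)
```

Note that the SPSS output above reports a two-tailed significance, whereas this sketch requests the one-sided p-value directly.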
A medical researcher claims that a new drug affects the number of headache hours experienced
by headache sufferers. The number of headache hours (per day) experienced by eight randomly
selected patients before and after taking the drug are shown in the table. Use α = 0.05 to test whether the
differences are normally distributed. Based on your results from part (a), do you support the
researcher’s claim?
Normality test
Ho: Differences in number of headache hours suffered are normally distributed
Ha: Differences in number of headache hours suffered are not normally distributed
Tests of Normality (Kolmogorov-Smirnov, Shapiro-Wilk)
Note: As the normality test is passed, the parametric approach can be applied (i.e., the
paired-samples t test).
Ho: µ1 = µ2
Ha: µ1 > µ2
Paired Samples Test (confidence interval of the difference: lower and upper limits)
At the 5% significance level, the data provide sufficient evidence to conclude that the medicine is
effective in reducing the number of headache hours suffered.
c) Formulate and interpret the 90% confidence interval for the difference in the number
of headache hours suffered.
We are 90% confident that the mean difference in the number of headache hours suffered
lies between 0.4042 and 1.296 hours.
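A confidence interval of this kind is the mean of the paired differences plus or minus the t critical value times the standard error. The sketch below shows that calculation in Python. The headache hours are hypothetical, so the limits will not reproduce 0.4042 and 1.296, and `mean_difference_ci` is an illustrative helper name, not an SPSS or SciPy function.

```python
import math

from scipy import stats

# Hypothetical headache hours per day for 8 patients; NOT the worksheet's data.
before = [2.1, 3.9, 3.8, 2.5, 2.4, 3.6, 3.4, 2.4]
after = [2.2, 2.8, 2.5, 2.4, 1.9, 1.8, 3.0, 1.8]


def mean_difference_ci(before, after, confidence=0.90):
    """Return (mean, lower, upper) for the mean of the Before - After differences."""
    diffs = [b - a for b, a in zip(before, after)]
    n = len(diffs)
    mean = sum(diffs) / n
    # Sample standard deviation of the differences (n - 1 in the denominator).
    sd = math.sqrt(sum((d - mean) ** 2 for d in diffs) / (n - 1))
    # Two-sided critical value of Student's t with n - 1 degrees of freedom.
    t_crit = float(stats.t.ppf(1 - (1 - confidence) / 2, df=n - 1))
    half_width = t_crit * sd / math.sqrt(n)
    return mean, mean - half_width, mean + half_width


mean, lower, upper = mean_difference_ci(before, after)
print(mean, lower, upper)
```

An interval that lies entirely above zero is consistent with rejecting Ho: µ1 = µ2 in favor of Ha: µ1 > µ2.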
Non-parametric approach to examine the association between two categorical variables
Q-3)
The data on gender and the favorite way to eat ice cream are given. Test the hypothesis at
the 0.05 significance level that there is a significant association between the favorite way to eat
ice cream and gender.
Ho:
Ha:
It is a non-parametric test.
SPSS does not allow you to change the level of significance.
To apply the chi-square test for independence, the frequencies of the categorical variables
must be given.
To apply the chi-square test for independence in SPSS, you need to weight the cases by the frequency variable.
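For comparison, the chi-square test for independence can be run in Python directly on the table of frequencies, so no case weighting is needed. The counts and the three column categories below are hypothetical placeholders; the worksheet's own table is not reproduced here.

```python
from scipy import stats

# Hypothetical frequency table; NOT the worksheet's data.
# Rows: gender (two groups). Columns: three hypothetical ways to eat ice cream.
observed = [
    [40, 35, 25],
    [30, 45, 25],
]
ALPHA = 0.05

# H0: gender and favorite way to eat ice cream are independent.
# chi2_contingency returns the statistic, the p-value, the degrees of
# freedom and the table of expected frequencies under H0.
chi2, p_value, dof, expected = stats.chi2_contingency(observed)
reject_h0 = bool(p_value < ALPHA)

print("chi-square:", chi2)
print("p-value:", p_value)
print("degrees of freedom:", dof)
print("reject H0:", reject_h0)
```

The degrees of freedom equal (rows - 1) × (columns - 1), which is 2 for this hypothetical 2 × 3 table.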
Q-4)
The distribution of the opinions of U.S. parents on whether a college education is worth the
expense is given. An economist believes that the distribution of the opinions of U.S. teenagers is
different from the distribution for U.S. parents. The economist randomly selects 200 U.S.
teenagers and asks each whether a college education is worth the expense. The results are shown
in the table. At the 5% level of significance, are the distributions different?
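This question is a chi-square goodness-of-fit test: the parents' distribution supplies the expected proportions, and the 200 teenagers supply the observed counts. A minimal sketch follows. The three opinion categories, the parent proportions and the teenager counts are all hypothetical stand-ins, since the worksheet's tables are not reproduced here.

```python
from scipy import stats

# Hypothetical parent distribution over three hypothetical opinion categories.
parent_proportions = [0.55, 0.30, 0.15]
# Hypothetical counts for the 200 sampled teenagers; NOT the worksheet's data.
teen_observed = [86, 62, 52]
ALPHA = 0.05

n = sum(teen_observed)
# Expected counts if teenagers followed the parents' distribution (H0).
teen_expected = [p * n for p in parent_proportions]

# H0: the teenagers' distribution equals the parents' distribution.
statistic, p_value = stats.chisquare(f_obs=teen_observed, f_exp=teen_expected)
reject_h0 = bool(p_value < ALPHA)

print("chi-square:", statistic)
print("p-value:", p_value)
print("reject H0:", reject_h0)
```

The test has (number of categories - 1) degrees of freedom, and each expected count should be at least 5 for the chi-square approximation to be reasonable.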
Q-5A
a) The number of grams of fiber per serving for a random sample of three different
kinds of foods is listed. Is there sufficient evidence at the 0.05 level of significance to
conclude that there is a difference in mean fiber content among breakfast cereals,
fruits, and vegetables?
b) At the 0.05 level of significance, is there evidence that variances of fiber content
differ?
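Parts (a) and (b) correspond to a one-way ANOVA and a test of homogeneity of variances. The sketch below uses SciPy's `f_oneway` and `levene`; the fiber values are hypothetical and are not the worksheet's data. Levene's test is chosen here as an assumption, since the worksheet does not name the variance test.

```python
from scipy import stats

# Hypothetical grams of fiber per serving; NOT the worksheet's data.
cereals = [3, 4, 6, 4, 10, 5, 6, 8, 5]
fruits = [5.5, 2, 4.4, 1.6, 3.8, 4.5, 2.8]
vegetables = [10, 1.5, 3.5, 2.7, 2.5, 6.5, 4, 3]
ALPHA = 0.05

groups = [cereals, fruits, vegetables]

# (a) One-way ANOVA. H0: the three population means are equal.
anova = stats.f_oneway(*groups)
# (b) Levene's test. H0: the three population variances are equal.
levene = stats.levene(*groups)

reject_equal_means = bool(anova.pvalue < ALPHA)
reject_equal_variances = bool(levene.pvalue < ALPHA)

print("ANOVA F:", anova.statistic, "p:", anova.pvalue)
print("Levene W:", levene.statistic, "p:", levene.pvalue)
print("reject equal means:", reject_equal_means)
print("reject equal variances:", reject_equal_variances)
```

Because ANOVA assumes equal variances, part (b) is worth checking before relying on the result of part (a).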
Q-5B The data describe the salaries of undergraduate workers from four regions. Test the
hypothesis at the 0.05 significance level that the mean salary of undergraduates differs across the four
regions.
Kruskal-Wallis (KW) test
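Assuming the note above refers to the Kruskal-Wallis test (the non-parametric alternative to one-way ANOVA for three or more independent groups), it can be sketched as follows. The salaries and region labels are hypothetical and do not come from the worksheet's data file.

```python
from scipy import stats

# Hypothetical salaries in thousands for four regions; NOT the worksheet's data.
north = [42.5, 39.0, 45.2, 41.8, 38.6, 44.1]
south = [36.4, 40.2, 35.9, 37.7, 39.5, 34.8]
east = [47.3, 43.6, 49.1, 45.8, 46.2, 44.9]
west = [40.7, 42.3, 38.9, 43.4, 41.1, 39.8]
ALPHA = 0.05

# H0: the salary distributions are the same in all four regions.
result = stats.kruskal(north, south, east, west)
reject_h0 = bool(result.pvalue < ALPHA)

print("H statistic:", result.statistic)
print("p-value:", result.pvalue)
print("reject H0:", reject_h0)
```

The test ranks all observations together, so it compares the groups' locations without assuming normality.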
A researcher believes that the mean earnings of top-paid actors, athletes, and musicians are the
same. The earnings (in millions of dollars) for several randomly selected people from each
category are shown in the table. Assume that the populations are normally distributed,
the samples are independent, and the population variances are equal. At α = 0.10, can you reject
the claim that the mean earnings are the same for the three categories?
The data are given in the Excel file as Q-5c.
a) At the 0.05 level of significance, is there evidence of a linear relationship between the
number of customers and the waiting time on the checkout line?
b) At the 0.05 level of significance, can we conclude that the number of customers is a
significant determinant of waiting time on the checkout line?
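Both parts can be examined with a simple linear regression of waiting time on the number of customers. The sketch below uses SciPy's `linregress`; the data points are hypothetical, and the units (minutes) are an assumption. In a simple regression with one predictor, the test of the slope and the test of the linear correlation give the same p-value, so parts (a) and (b) lead to the same decision.

```python
from scipy import stats

# Hypothetical data; NOT the worksheet's data.
customers = [1, 2, 3, 4, 5, 6, 7, 8]
waiting_time = [2.1, 3.9, 6.2, 7.8, 10.1, 12.2, 13.8, 16.1]  # assumed minutes
ALPHA = 0.05

# Fit waiting_time = intercept + slope * customers.
# The reported p-value tests H0: slope = 0 (no linear relationship).
fit = stats.linregress(customers, waiting_time)
reject_h0 = bool(fit.pvalue < ALPHA)

print("slope:", fit.slope)
print("intercept:", fit.intercept)
print("correlation r:", fit.rvalue)
print("p-value:", fit.pvalue)
print("reject H0:", reject_h0)
```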