upload_f1520c99-c1f4-4534-a66e-112eafe7561c
upload_f1520c99-c1f4-4534-a66e-112eafe7561c
parametric alternatives
HYPOTHESIS TESTING
A large research project has been going on for several years. In this project, new babies are
given a particular vitamin, and then the research team follows their development during the
first 2 years of life. So far, the vitamin has not speeded up the development of the babies. The
mean is 14 months , the standard deviation is 3 months, and the ages follow a normal curve.
Only about 2% of babies start walking before 8 months of age; these are the babies who are
more than 2 standard deviations below the mean. A newborn in the project is then randomly
selected to take the highly purified version of the vitamin, and the researchers then follow this
baby’s progress for 2 years. This is a hypothesis-testing problem. The researchers want to
draw a general conclusion about whether the purified vitamin allows babies in general to walk
earlier. The conclusion will be about babies in general (a population of babies). However, the
conclusion will be based on results of studying a sample.
• Null hypothesis –
• Alternate hypothesis –
• Ho=.
• Ha =
• Df =
• Critical value - 2.043
• t = 10.77
• If t value
t test for dependent means
• hypothesis-testing procedure in which there are two scores for each person
and the population variance is not known; it determines the significance of
a hypothesis that is being tested using difference or change scores from a
single group of people.
• This kind of research situation is called a repeated measures design (also
known as a within-subjects design) or a matched group design
• A common example is when you measure the same people before and
after some psychological or social intervention.
• difference scores difference between a person’s score on one testing and
the same person’s score on another testing; often an after-score minus a
before score, in which case it is also called a change score.
• Df = n - 1
t – test for independent means
• Hypothesis-testing procedure in which there are two separate
groups of people tested and in which the population variance is
not known.
• When the scores in one group are for different people than the
scores in the other group, what you can compare is the mean of
one group to the mean of the other group.
• The goal of a t test for independent means is to decide whether
the difference between the means of your two actual samples is
a more extreme difference than the cutoff difference on this
distribution of differences between means.
• Df = (n1-1) + (n2-1) = N -2
Df = df1 + df2
= (N1-1) + (N2-1)
Questions from previous year papers
• In matched two-group design with 30 subjects per group, the ‘t’ test
would be based on degrees of freedom.
1)58 2) 29 3) 30 4) 59
• A report claims that a t-value of 2.85 is significant (p < .01) when the
number of people in a repeated measures design was 11. What would be
the df?
• A researcher wanted to test the effect of drug in reducing anxiety.
For this purpose he used two groups of individuals, experimental
(10) and control (10) matched in pairs. He made use of an anxiety
scale for the measurement of anxiety among the subjects of the
group. What would be the df?
• A group of 20 students were given an achievement test under two
conditions – when tense and when relaxed. What would be the df to test
the significance of difference between means of the conditions.
• Five students take AP Calculus AB one year and AP Calculus BC the next
year. Their overall course grades (%) are listed below for both courses.
Which of the following statistical procedures would be most appropriate
to test the claim that student overall course grades are the same in both
courses? Assume that any necessary normality requirements hold.
Student 1 2 3 4 5
AP CAL AB 80.0% 72.6% 99.0% 91.3% 68.9%
AP CAL BC 85.5% 71.0% 93.2% 93.0% 74.8%
Girls 10 15 5 15 15 60
QUESTIONS FROM PREVIOUS YEAR PAPERS
• The chi square computed for a contingency table, was based on six
degrees of freedom. If the contingency table had three rows, how
many columns would it have?
a)2 b) 3 c) 4 d) 6
• In a contingency table, one of the cell has an obtained frequency of 20
and an expected frequency of 30. What would be the contribution of
this cell towards the total Chi-square value?
1) 3.33 2) 5.00 3) 10.00 4) 20.00
• How many subjects would be required in a 3 x 3 x 2 factorial
design with subjects 10 per cell?
a)10 b) 18 c) 90 d)180
a) 3 7
b) 12 18
c) 9 5
d) 7 7
e) 8 12
f) 1 5
g) 15 16
h) 10 12
i) 11 15
j) 10 17
S=
DF = total number of signs =
CV = 2
THE WILCOXON (T) MATCHED PAIRS SIGNED
RANKS TEST
• Ordinal-level significance test for differences between two related
sets of data.
• More efficient and powerful non-parametric test
• Takes in to account both magnitude and direction
• Suppose we ask students to rate two methods of learning that they
have experienced on two different modules. Method A is a traditional
lecture-based approach while method B is an active assignment-
based method.
Participant No. Rating of Rating of D (B – A) RANK
traditional lecture assignment based
method (A) method (B)
1 23 33
2 14 22
3 35 38
4 26 30
5 28 31
6 19 17
7 42 42
8 30 25
9 26 34
10 31 24
11 18 21
12 25 46
13 23 29
14 31 40
15 30 41
• ADD POSITIVE RANK SCORES –
• ADD NEGATIVE RANK SCORES
•
• THAT WILL BE THE VALUE OF T
• LOWER T IS THE T VALUE =
• DF =
• CV = 20
• SO NULL HYPOTHESIS =
THE MANN-WHITNEY U TEST
• Useful non-parametric alternative to the t test for assessing the
difference between two independent samples having uncorrelated
data, especially in the circumstances when the assumptions and
conditions for applying the t test are not met.
• This test too is used to find out whether or not the two independent
samples have been drawn from the same population.
• Data at least at ordinal level.
• In general, H0 is that the populations from which the two samples have
been randomly selected are identical. In most cases it is specifically
that the two population medians are equal.
Stereotype scores for children whose mothers had
Full time jobs No jobs
Scores Scores
17 9 19 6
32 7 63 0
39 6.5 78 0
27 8 29 4
58 6 39 1.5
25 8 59 0
31 7 77 0
81 0
68 0
TOTAL U1 = 51.5 U2= 11.5
U VALUE = 11.5
CV = 12
WE CHECK THE VALUE AT N1 AND N2
Questions from previous years papers
• A researcher wanted to test the hypothesis that a given drug would
adversely affect the rate of learning. A group of 40 subjects was employed
in the research. Each subject was required to learn a task upto two
errorless trials. Then each subject was given a specified drug dosage
(1mg/body weight in kilogram) and thirty minutes after, the subject was
required to learn another equated learning task upto two errorless trials.
The number of trials required to learn each task followed a normal
distribution. The mean number of trials required to learn the tasks were 24
and 20 only for normal and ‘drugged conditions’, respectively. The two
conditions had comparable standard deviations. The mean difference was
tested by a suitable statistical test and the difference was found to be
significant at .01 level.
• What can you conclude about the researcher’s hypothesis?
a). The hypothesis has been verified
b). The hypothesis has been rejected
c). The hypothesis has been partially verified
d). The data are inadequate to evaluate researcher’s hypothesis.