T-And F-Tests: Testing Hypotheses
T-And F-Tests: Testing Hypotheses
Testing hypotheses
Overview
Distribution& Probability Standardised normal distribution t-test F-Test (ANOVA)
Starting Point
Central aim of statistical tests:
Determining the likelihood of a value in a sample, given that the Null hypothesis is true: P(value|H0)
H0: no statistically significant difference between sample & population (or between samples) H1: statistically significant difference between sample & population (or between samples)
Types of Error
Population H0 H1
b-error (Type II error) 1-b
H0
Sample H1
1-a
a-error (Type I error)
x
a/2
x
i 1
( xi x ) 2
i 1
xi x zi s
xz 0 sz 1
the z-score represents a value on the x-axis for which we know the p-value 2-tailed: z = 1.96 is 2SD around mean = 95% significant 1-tailed: z = +-1.65 is 95% from plus or minus infinity
t-tests:
Testing Hypotheses About Means
x1 x2 t s x1 x2
t
s x1 x2
s1 s2 n1 n2
difference s _ between _ sample _ means estimated _ standard _ error _ of _ difference s _ between _ means
Kinds of t-tests
Formula is slightly different for each:
Single-sample:
tests whether a sample mean is significantly different from a pre-existing value (e.g. norms)
Paired-samples:
tests the relationship between 2 linked samples, e.g. means obtained in 2 conditions by a single group of participants
Independent-samples:
tests the relationship between 2 independent populations
formula see previous slide
df = (n1-1) + (n2-1) = 18
x1 x2 19 26 t 7 s x1 x2 1
t ( 0.05,18) 2.101 t t ( 0.05,18)
Reject H0
Bonferroni correction
To control for false positives:
p pc n
0.05 pc 0.0125 4
ANOVA controls this error by testing all means at once - it can compare k number of means. Drawback = loss of specificity
Assumptions observations within each sample were independent samples must be normally distributed samples must have equal variances
F=
20
30 35 s2=58.3
28
30 31 s2=2.33
These 2 variances provide a relatively accurate representation of the size of the differences
Between treatments variance ---------------------------Measures differences due to: 1. 2. Treatment effects Chance
F=
MSbetween
MSwithin
When the treatment does have an effect then the between-treatment differences (numerator) should be larger than chance (denominator). F-ratio should be noticeably larger than 1.00
There is a difference somewhere - have to use post-hoc tests (essentially t-tests corrected for multiple comparisons) to examine further
Participants have to rate 5 types of crackers, dependent variable is how many they eat This expt is a 2x2 factorial design - 2 factors x 2 levels
22
15
= 37
Result: No main effect for factor A (normal/obese) No main effect for factor B (empty/full)
Obese
17
= 39
18
= 33
= 35
22
15
obese normal
Full Stomach
Obese
17
18
References Gravetter & Wallnau - Statistics for the behavioural sciences Last years presentation, thank you to: Louise Whiteley & Elisabeth Rounis
http://www.fil.ion.ucl.ac.uk/spm/doc/mfd-2004.html