Reliability and Validity
Reliability and Validity
&
Validity
1
Goals in questionnaire design
Warwick and Linninger(1975) point out that there are two
basic goals in questionnaire design
2
Reliability
3
Assessment of reliability
4
Test-retest reliability
Measured by having the same respondents complete a
survey at two different points in time to see how stable the
responses are
Usually quantified with a correlation coefficient (r value)
In case of same results on the two administrations of the
instrument (questionnaire) then reliability coefficient will be
one
Normally, correlation of measurements across time will be
less than perfect due to different experiences and attitidus
that that respondents have encountered from time of the
first test.
5 In general, r values are considered good if r 0.70
Test-retest reliability
6
Problems with Test-retest reliability
7
Alternate-form reliability
9
Alternate-form reliability
10
Example: Assessment of depression
Version B:
During the past 4 weeks, I have felt downhearted:
Never 1
Some days 2
11 Every day 3
Alternate-form reliability
12
Example: Assessment of loneliness
Version A:
How often in the past month have you felt alone in the world?
Every day
Some days
Occasionally
Never
Version B:
During the past 4 weeks, how often have you felt a sense of
loneliness?
All of the time
Sometimes
13 From time to time
Example of nonequivalent item rewording
Version A:
When your boss blames you for something you did not do, how often do
you stick up for yourself?
All the time
Some of the time
None of the time
Version B:
When presented with difficult professional situations where a superior
censures you for an act for which you are not responsible, how
frequently do you respond in an assertive way?
All of the time
Some of the time
None of the time
14
Internal consistency
(Split-half method)
2r
PXX
1 r
15
PROBLEM
16
SPLIT half method
Spilt-half can be viewed as a one-test equivalent to alternate form and test-
retest, which use two tests.
In spilt-half, you treat one single test as two tests by dividing the items into
two subsets.
Reliability is estimated by computing the correlation between the two
subsets.
For example, let's assume that you calculate the subtotal scores of all even
numbered items and the subtotal of all odd numbered items. The two sets
of scores are as the following:
Calculate the correlation of these two sets of scores to check the internal
consistency.
If the correlation of the two sets of scores is low, it implies that some
people received high scores on odd items but received low scores on even
items while other people received high scores on even items but received
low scores on odd items. In other words, the response pattern is
inconsistent.
17
Example: Calculate SPLIT HALF reliability for the
following data of 5 students on 4 test items
Total Total
Stude Q Q Q Q Different Correlation Split half
(Q1+Q2) (Q3+Q4)
nt 1 2 3 4 X Y halves coefficient reliability
S XY 8 .5
r 0.7863
S 2X S 2y (10.3)(11 .5)
Average of split half reliability coefficients=0.9244
2(0.7863)
PXX 0.8763
1 0.7863
18
Methods to split items into two halves
Assign the odd numbered items to one half and the even numbered
items to the other half of the test.
Divide the items from center (discard center item, if necessary) into
two halves
Drawback:
Correlation between the two halves is dependent upon the method used
to divide the items
Solution:
Calculate correlation coefficients between every possible division of test
into two halves and find average of these correlation coefficients
Problem:
Incase of large number of test items difficult to calculate correlation
between every possible split of the test items into two halves
Solution:
Calculate CORNBACH’s ALPHA
19
Internal consistency
(Cornbach’s alpha)
Studen Q1 Q2 Q3 Q4 TOTAL
t
1 2 1 1 3 7
2 6 4 5 6 21
3 3 2 1 1 7 S2total=31.04
4 6 3 3 3 15
k
5 6 4 4 3 17 Si2
k i 1 4 9.52
1 2 1 0.9244
(x x) 2
15.2 6.8 12.8 12.8
k1 Stotal 4 1 31.04
3.04 1.36 2.56 2.56
Si2 9.52
21
Example: Calculate Cornbach’s Alpha/ KR-20 for the
following data of 5 students on 4 test
items(TRUE/FALSE)
Student Q1 Q2 Q3 Q4 TOTAL
1 1 1 0 0 2
2 1 0 0 0 1
3 1 1 1 1 4
S2total=1.44
4 1 1 1 1 4 k
5 0 1 1 0 2 k
S i2
4 1 0.8 0.5926
1 i 12
k1 S total 4 1 1.44
0.8 0.8 1.2 1.2
(x x) 2
Si2 0.16 0.16 0.24 0.24 0.8 k
k
pi qi
4 1 0.8 0.5926
KR 20 1 i 1
k 1 2
S total 4 1 2
S total
p
0.8 0.8 0.6 0.6
q
0.2 0.2 0.4 0.4 As the value of reliability coefficient
0.16 0.16 0.24 0.24
is less than recommended standard
pq
0.8 of 0.7 so the test is not reliable
22
Validity
23
Definition
24
Assessment of validity
25
Face validity
26
Content validity
27
Content validity (2)
28
Criterion validity
29
Construct validity
30
Construct validity (2)
31
Construct validity (3)
32