Reliability and Validity

The document discusses the importance of reliability and validity in questionnaire design, highlighting that the main goals are to gather relevant information and ensure consistent measurements. It outlines various methods to assess reliability, including test-retest, alternate-form, and internal consistency, as well as different forms of validity such as face, content, criterion, and construct validity. The document emphasizes that reliability and validity are crucial for ensuring that survey instruments accurately measure what they intend to measure.

Reliability & Validity
Goals in questionnaire design

Warwick and Lininger (1975) point out that there are two basic
goals in questionnaire design:

- To obtain information relevant to the purpose of the survey
- To collect this information with maximal RELIABILITY and
  VALIDITY

How can a researcher be sure that the data-gathering instrument
being used will measure what it is supposed to measure, and will
do so in a consistent manner?
Reliability

- The degree of stability exhibited when a measurement is
  repeated under identical conditions
- Reliability is the consistency of your measurement, or the
  degree to which an instrument measures the same way each time
  it is used under the same conditions with the same subjects.
  In short, it is the repeatability of your measurement. A
  measure is considered reliable if a person's score on the same
  test given twice is similar.
Assessment of reliability

Reliability is assessed in three forms:
- Test-retest reliability
- Alternate-form reliability
- Internal consistency reliability
Test-retest reliability

- Measured by having the same respondents complete a survey at
  two different points in time to see how stable the responses
  are
- Usually quantified with a correlation coefficient (r value)
- If the two administrations of the instrument (questionnaire)
  give the same results, the reliability coefficient will be one
- Normally, the correlation of measurements across time will be
  less than perfect because of the different experiences and
  attitudes respondents have encountered since the first test
- In general, r values are considered good if r >= 0.70
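As a sketch of how this is computed, the Pearson correlation between two administrations of the same test can be calculated directly from its definition. The respondent scores below are hypothetical illustration data, not taken from these slides:

```python
# Pearson correlation between two administrations of the same test.
def pearson_r(x, y):
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5

first = [12, 15, 9, 20, 17]    # scores at time 1 (hypothetical)
second = [13, 14, 10, 19, 18]  # same respondents at time 2 (hypothetical)
r = pearson_r(first, second)
print(round(r, 4))  # values of r >= 0.70 are considered good
```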
Test-retest reliability

- You can test-retest specific questions or the entire survey
  instrument
- Be careful about test-retest with items or scales that measure
  variables likely to change over a short period of time, such
  as energy, happiness, or anxiety
- If you do it, make sure that you test-retest over very short
  periods of time
Problems with test-retest reliability

- A potential problem with test-retest is the practice effect
  – Individuals become familiar with the items and simply answer
    based on their memory of the last answer
- Researchers may not have the resources for multiple
  administrations
- What effect does this have on your reliability estimates?
  It inflates the reliability estimate
Alternate-form reliability

- Unlike the retest method, alternate-form reliability requires
  two similar tests given to the same respondents, instead of
  the same test given twice
- Use differently worded forms to measure the same attribute
- Questions or responses are reworded, or their order is
  changed, to produce two items that are similar but not
  identical
Alternate-form vs. retest

- The alternate-form method is viewed as superior to the retest
  method because a respondent's memory of test items is not as
  likely to play a role in the data received
- In practice, however, it is difficult to develop similar test
  items that are consistent in the measurement of a specific
  phenomenon
Alternate-form reliability

- Be sure that the two items address the same aspect of behavior
  with the same vocabulary and the same level of difficulty
  – Items should differ in wording only
- It is common to simply change the order of the response
  alternatives
  – This forces respondents to read the response alternatives
    carefully and thus reduces the practice effect
Example: Assessment of depression

Circle one item.

Version A:
During the past 4 weeks, I have felt downhearted:
  Every day    1
  Some days    2
  Never        3

Version B:
During the past 4 weeks, I have felt downhearted:
  Never        1
  Some days    2
  Every day    3
Alternate-form reliability

- You could also change the actual wording of the question
  – Be careful to make sure that the two items are equivalent
  – Items with different degrees of difficulty do not measure
    the same attribute
  – What might they measure instead? Reading comprehension or
    cognitive function
Example: Assessment of loneliness

Version A:
How often in the past month have you felt alone in the world?
  Every day
  Some days
  Occasionally
  Never

Version B:
During the past 4 weeks, how often have you felt a sense of
loneliness?
  All of the time
  Sometimes
  From time to time
Example of nonequivalent item rewording

Version A:
When your boss blames you for something you did not do, how
often do you stick up for yourself?
  All the time
  Some of the time
  None of the time

Version B:
When presented with difficult professional situations where a
superior censures you for an act for which you are not
responsible, how frequently do you respond in an assertive way?
  All of the time
  Some of the time
  None of the time
Internal consistency (split-half method)

- In this method the total number of items is divided into two
  halves and the correlation is taken between the two halves
- The Spearman-Brown prophecy formula then estimates the
  full-test reliability:

    P_XX = 2r / (1 + r)

  where r is the correlation between the two halves
PROBLEM

I am a graduate student who is conducting a research project for
my thesis. I can't wait to graduate! I would like to find out
whether my instrument is reliable in order to proceed with my
experiment. I have heard about using alternate forms and
test-retest to estimate reliability. But due to lack of
resources, I cannot afford to write two tests or administer the
same test at two different times. With only one test result,
what should I do to evaluate the reliability of my measurement
tool?
Split-half method

- Split-half can be viewed as a one-test equivalent to the
  alternate-form and test-retest methods, which use two tests
- In split-half, you treat one single test as two tests by
  dividing the items into two subsets
- Reliability is estimated by computing the correlation between
  the two subsets
- For example, assume that you calculate the subtotal scores of
  all even-numbered items and the subtotal of all odd-numbered
  items, then calculate the correlation of these two sets of
  scores to check the internal consistency
- If the correlation of the two sets of scores is low, it
  implies that some people received high scores on odd items but
  low scores on even items, while other people received high
  scores on even items but low scores on odd items. In other
  words, the response pattern is inconsistent
Example: Calculate split-half reliability for the following data
of 5 students on 4 test items

  Student   Q1   Q2   Q3   Q4   X (Q1+Q2)   Y (Q3+Q4)
  1          2    1    1    3        3           4
  2          6    4    5    6       10          11
  3          3    2    1    1        5           2
  4          6    3    3    3        9           6
  5          6    4    4    3       10           7

  Split into halves   Correlation coefficient   Split-half reliability
  1,2 and 3,4                0.7863                    0.8763
  1,3 and 2,4                0.9011                    0.9278
  1,4 and 2,3                0.9423                    0.9691

For the first split (X = Q1+Q2, Y = Q3+Q4):

    r = S_XY / sqrt(S²_X · S²_Y) = 8.5 / sqrt((10.3)(11.5)) = 0.7863

    P_XX = 2(0.7863) / (1 + 0.7863) = 0.8763

Average of the split-half reliability coefficients = 0.9244
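As a minimal sketch, the first split can be reproduced in code using the slide's data. Note that because of rounding in the slide's intermediate figures, the values computed here (r ≈ 0.781, reliability ≈ 0.877) differ slightly from the printed 0.7863 and 0.8763:

```python
# Split-half reliability for the slide's data: halves X = Q1+Q2 and
# Y = Q3+Q4, sample (n-1) variances/covariance, then Spearman-Brown.
scores = [  # Q1, Q2, Q3, Q4 for each of the 5 students
    [2, 1, 1, 3],
    [6, 4, 5, 6],
    [3, 2, 1, 1],
    [6, 3, 3, 3],
    [6, 4, 4, 3],
]
X = [row[0] + row[1] for row in scores]  # first-half totals
Y = [row[2] + row[3] for row in scores]  # second-half totals

n = len(X)
mx, my = sum(X) / n, sum(Y) / n
sxy = sum((a - mx) * (b - my) for a, b in zip(X, Y)) / (n - 1)  # S_XY = 8.5
sx2 = sum((a - mx) ** 2 for a in X) / (n - 1)                   # S²_X = 10.3
sy2 = sum((b - my) ** 2 for b in Y) / (n - 1)                   # S²_Y = 11.5
r = sxy / (sx2 * sy2) ** 0.5   # correlation between the two halves
rel = 2 * r / (1 + r)          # Spearman-Brown full-test reliability
print(round(r, 4), round(rel, 4))
```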
Methods to split items into two halves

- Assign the odd-numbered items to one half and the
  even-numbered items to the other half of the test
- Divide the items at the center (discarding the center item, if
  necessary) into two halves

Drawback:
  The correlation between the two halves depends on the method
  used to divide the items
Solution:
  Calculate correlation coefficients between every possible
  division of the test into two halves and average these
  correlation coefficients
Problem:
  With a large number of test items it is difficult to calculate
  the correlation for every possible split of the test items
  into two halves
Solution:
  Calculate Cronbach's alpha
Internal consistency (Cronbach's alpha)

- The most common internal consistency measure is Cronbach's
  alpha, which is usually interpreted as the mean of all
  possible split-half coefficients
- Cronbach's alpha is a generalization of an earlier form of
  estimating internal consistency, the Kuder-Richardson Formula
  20 (for test items with only two possible outcomes, e.g.
  Yes/True or No/False)
  – Interpret it like a correlation coefficient (>= 0.70 is good)
Example: Calculate Cronbach's alpha for the following data of
5 students on 4 test items

  Student   Q1   Q2   Q3   Q4   TOTAL
  1          2    1    1    3      7
  2          6    4    5    6     21
  3          3    2    1    1      7
  4          6    3    3    3     15
  5          6    4    4    3     17

Per-item sums of squared deviations, Σ(x − x̄)²:
  Q1: 15.2   Q2: 6.8   Q3: 12.8   Q4: 12.8
Per-item variances S_i²:
  Q1: 3.04   Q2: 1.36   Q3: 2.56   Q4: 2.56   (ΣS_i² = 9.52)
Variance of the totals: S²_total = 31.04

    α = [k / (k − 1)] · (1 − ΣS_i² / S²_total)
      = (4 / 3) · (1 − 9.52 / 31.04) = 0.9244
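The same calculation can be sketched in a few lines of Python; population (divide-by-n) variances are used here, matching the slide's worked figures:

```python
# Cronbach's alpha for the slide's 5-student, 4-item data.
scores = [  # Q1, Q2, Q3, Q4 for each of the 5 students
    [2, 1, 1, 3],
    [6, 4, 5, 6],
    [3, 2, 1, 1],
    [6, 3, 3, 3],
    [6, 4, 4, 3],
]

def pvar(xs):
    """Population variance (divide by n, as on the slide)."""
    m = sum(xs) / len(xs)
    return sum((x - m) ** 2 for x in xs) / len(xs)

k = len(scores[0])                     # number of items
items = list(zip(*scores))             # per-item score lists
totals = [sum(row) for row in scores]  # per-student totals
alpha = (k / (k - 1)) * (1 - sum(pvar(i) for i in items) / pvar(totals))
print(round(alpha, 4))  # 0.9244, matching the slide
```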
Example: Calculate Cronbach's alpha / KR-20 for the following
data of 5 students on 4 test items (TRUE/FALSE)

  Student   Q1   Q2   Q3   Q4   TOTAL
  1          1    1    0    0      2
  2          1    0    0    0      1
  3          1    1    1    1      4
  4          1    1    1    1      4
  5          0    1    1    0      2

Per-item sums of squared deviations, Σ(x − x̄)²:
  Q1: 0.8   Q2: 0.8   Q3: 1.2   Q4: 1.2
Per-item variances S_i²:
  Q1: 0.16   Q2: 0.16   Q3: 0.24   Q4: 0.24   (ΣS_i² = 0.8)
Variance of the totals: S²_total = 1.44

    α = [k / (k − 1)] · (1 − ΣS_i² / S²_total)
      = (4 / 3) · (1 − 0.8 / 1.44) = 0.5926

For binary items, S_i² = p_i·q_i, where p is the proportion
answering the item correctly and q = 1 − p:

  p:    0.8    0.8    0.6    0.4
  q:    0.2    0.2    0.4    0.6
  pq:   0.16   0.16   0.24   0.24   (Σp_i·q_i = 0.8)

    KR-20 = [k / (k − 1)] · (1 − Σp_i·q_i / S²_total)
          = (4 / 3) · (1 − 0.8 / 1.44) = 0.5926

As the value of the reliability coefficient is less than the
recommended standard of 0.70, the test is not reliable.
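A sketch of the KR-20 computation for this binary data; since each 0/1 item's population variance equals p·q, KR-20 and Cronbach's alpha coincide here:

```python
# KR-20 for the slide's true/false (0/1) data. p is the proportion
# answering each item correctly, q = 1 - p.
scores = [  # Q1, Q2, Q3, Q4 for each of the 5 students
    [1, 1, 0, 0],
    [1, 0, 0, 0],
    [1, 1, 1, 1],
    [1, 1, 1, 1],
    [0, 1, 1, 0],
]
n = len(scores)
k = len(scores[0])
totals = [sum(row) for row in scores]
mean_t = sum(totals) / n
s2_total = sum((t - mean_t) ** 2 for t in totals) / n  # population variance
pq = [(sum(item) / n) * (1 - sum(item) / n) for item in zip(*scores)]
kr20 = (k / (k - 1)) * (1 - sum(pq) / s2_total)
print(round(kr20, 4))  # 0.5926 -- below 0.70, so the test is not reliable
```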
Validity

Definition

- How well a survey measures what it sets out to measure
Assessment of validity

Validity is measured in four forms:
- Face validity
- Content validity
- Criterion validity
- Construct validity
Face validity

- A cursory review of survey items by untrained judges
  – e.g. showing the survey to untrained individuals to see
    whether they think the items look okay
  – Very casual, soft
  – Many don't really consider this a measure of validity at all
Content validity

- A subjective measure of how appropriate the items seem to a
  set of reviewers who have some knowledge of the subject matter
  – Usually consists of an organized review of the survey's
    contents to ensure that it contains everything it should and
    doesn't include anything that it shouldn't
  – Still very qualitative
Content validity (2)

- Who might you include as reviewers?
- How would you incorporate these two assessments of validity
  (face and content) into your survey instrument design process?
Criterion validity

- A measure of how well one instrument stacks up against another
  instrument or predictor
  – Concurrent: assess your instrument against a "gold standard"
  – Predictive: assess the ability of your instrument to
    forecast future events, behavior, attitudes, or outcomes
  – Assess with a correlation coefficient
Construct validity

- The most valuable and most difficult measure of validity
- Basically, it is a measure of how meaningful the scale or
  instrument is when it is in practical use
Construct validity (2)

- Convergent: implies that several different methods for
  obtaining the same information about a given trait or concept
  produce similar results
  – Evaluation is analogous to alternate-form reliability,
    except that it is more theoretical and requires a great deal
    of work, usually by multiple investigators with different
    approaches
Construct validity (3)

- Divergent: the ability of a measure to estimate the underlying
  truth in a given area; the measure must be shown not to
  correlate too closely with similar but distinct concepts or
  traits
