Validity and Reliability
There are two aspects of the quality of a measurement instrument (questionnaire or observation):
- Validity asks whether we measure what we want to measure, i.e. whether the instrument
measures the underlying construct (no systematic error in the measurements).
- Reliability concerns the accuracy of the measurements: whether we get the same result when
the measurement is repeated (no influence of random/non-systematic errors).
There are different types of validity (for questionnaires) and two ways of assessing validity:
- Theoretical: face validity and content validity.
- Empirical (statistics): criterion validity and construct validity.
- Face validity asks whether, at first sight, the test measures what it intends to measure. For
example, a test of mathematical ability should contain math items. Other types of validity
are still needed.
- Content validity examines whether the test items are representative of the construct. For this, we
have to identify the relevant dimensions/aspects of the construct that should be
reflected in the items, and the irrelevant aspects that should not be, as judged by (clinical)
experts and by consulting the literature. For example, different aspects of problem behavior (of
a child) should be measured using different items that pertain to different aspects, and we
have to avoid irrelevant aspects like “do you beat your child?”.
- Criterion validity examines whether the scores are related to an external criterion, for example a
test of the school maturity of a child (a small numerical sketch follows after this list). There are two types:
o Concurrent validity shows the relation with a “measurement” taken at the same
moment. For example, the teacher’s opinion regarding the maturity of the child.
o Predictive validity shows the relation with a “measurement” taken later in time. For
example, school results in the first grade.
- Construct validity explores the degree to which a test measures the construct it claims, or
purports, to be measuring.
o Convergent validity: the scores should correlate with other measures of the same or
a closely related construct.
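To make the empirical (statistical) side concrete, the following is a minimal sketch, with invented data, of how criterion validity could be checked for the school-maturity example: concurrent validity as the correlation of the test scores with a teacher rating collected at the same moment, and predictive validity as the correlation with first-grade results obtained later. All numbers and variable names are purely illustrative.

```python
import numpy as np
from scipy.stats import pearsonr

# Hypothetical data for 8 children
maturity_test  = np.array([55, 62, 48, 70, 66, 52, 59, 74])          # school-maturity test scores
teacher_rating = np.array([3, 4, 2, 5, 4, 3, 3, 5])                  # teacher's judgement, same moment
first_grade    = np.array([6.0, 7.5, 5.5, 8.5, 8.0, 6.5, 7.0, 9.0])  # school results one year later

r_concurrent, _ = pearsonr(maturity_test, teacher_rating)   # concurrent validity
r_predictive, _ = pearsonr(maturity_test, first_grade)      # predictive validity
print(f"concurrent validity r = {r_concurrent:.2f}")
print(f"predictive validity r = {r_predictive:.2f}")
```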
Reliability
Reliability refers to the accuracy and consistency of the measurements, i.e. the extent to which
they are free of non-systematic/random errors.
Random errors show up, for example, as fluctuations in reaction times caused by accidental factors
that yield inaccurate measurements: there could be problems with the measurement device
(the computer), or the participant may be distracted for a moment. The goal is to control these
accidental factors. Errors also occur in questionnaires when, for example, a participant skips a
question or does not understand it as the researcher intended.
Still, there are different degrees of inaccuracy of a measurement instrument, which we can quantify
by a correlation. There are two types:
a) Internal consistency of the measurement instrument (consistency of the different
items), typically quantified by Cronbach's alpha. It is a function of the number of items in a test,
the average covariance between item pairs, and the variance of the total score (a
computational sketch follows after this list).
b) Stability of the measurement instrument: test-retest reliability and parallel reliability.
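As an illustration of point a), here is a minimal sketch of the usual formula for Cronbach's alpha, alpha = k/(k-1) * (1 - sum of the item variances / variance of the total score), applied to an invented score matrix; the data and the function name are only illustrative.

```python
import numpy as np

def cronbach_alpha(item_scores):
    """Cronbach's alpha for a (respondents x items) score matrix."""
    k = item_scores.shape[1]                               # number of items
    item_variances = item_scores.var(axis=0, ddof=1)       # variance of each item
    total_variance = item_scores.sum(axis=1).var(ddof=1)   # variance of the total score
    return (k / (k - 1)) * (1 - item_variances.sum() / total_variance)

# Hypothetical answers of 5 respondents to 4 Likert-type items
scores = np.array([[4, 5, 4, 4],
                   [2, 2, 3, 2],
                   [5, 4, 5, 5],
                   [3, 3, 3, 4],
                   [1, 2, 2, 1]])
print(round(cronbach_alpha(scores), 2))
```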
a) Test-retest reliability.
It involves administering the test twice (at different times) to the same people. The higher the
correlation between the two administrations, the greater the reliability (a small numerical sketch
follows after the two types below).
A problem is choosing the time interval, because we don't know how long it should be. If it is too
short, the estimate is biased because people remember their answers; if it is too long, people can
change (the construct can change over time, less so for cognitive skills). There can also be a
learning effect.
So this is not the best option.
b) Parallel reliability.
It consists of administering two parallel tests at the same time to the same individuals. The higher
the correlation between the parallel tests, the greater the reliability.
The advantage is that there is no time effect. However, the tests must truly be parallel. For example, if
we measure reading ability by the number of words a child can read in one minute (ordered
from easy to difficult), the parallel test should have the same order. This is not possible for all
constructs.
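Both test-retest and parallel reliability come down to the same computation: the correlation between two sets of scores from the same individuals. A minimal sketch with invented scores:

```python
import numpy as np
from scipy.stats import pearsonr

# Hypothetical scores of the same 6 people on two administrations (or two parallel forms)
first_administration  = np.array([12, 18, 25, 30, 22, 15])
second_administration = np.array([14, 17, 27, 29, 20, 16])

r, _ = pearsonr(first_administration, second_administration)
print(f"reliability estimate r = {r:.2f}")
```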
2.2. OBSERVATIONS/JUDGEMENTS
a) Inter-rater reliability: proportion of agreement between two raters regarding the same set of items.
b) Intra-rater reliability: proportion of agreement within one rater when evaluating the same items twice.
Example: case of dichotomous (binary) items, cross-tabulating the judgements of the two raters
(e.g. rater 1 in rows, rater 2 in columns):

            passed   failed   total
  passed      87       23      110
  failed      14       95      109
  total      101      118      219
Ratio of number of agreements to total number of items: (87 + 95) / (87 + 23 + 95 + 14) = 182 /
219 = .8311
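The same proportion of agreement can be checked directly from the table; a short sketch in Python with the numbers above:

```python
import numpy as np

# Agreement table from the example above: rows = first rater, columns = second rater
table = np.array([[87, 23],
                  [14, 95]])

agreement = np.trace(table) / table.sum()            # (87 + 95) / 219
print(f"proportion of agreement = {agreement:.4f}")  # 0.8311
```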
Test manuals always contain information regarding validity and reliability. When constructing your
own instrument, you should give arguments supporting the validity and reliability of the instrument.
• Face/content validity: evaluated by experts.
• Criterion/construct validity: relation with other tests or with information regarding the
construct under study.
• Reliability: administer the test twice to the same people.