
Types of Validity

Validity may seem a unified concept but, according to Ferguson (2006), it has different aspects, reflecting different ways of viewing validity. Alderson, Clapham and Wall (1995) support this view and suggest that it is useful to validate tests in several ways, since each type of validation provides additional evidence. In this section I will broadly follow this approach, dividing the types of validity into two main groups, internal validity and external validity, before concluding with construct validity as an umbrella for all other forms.

2.1 Internal Validity:
There are three kinds of internal validity: face validity, content (or rational) validity and response validity.
2.1.1 Face Validity:
Face validity refers, as Alderson, Clapham and Wall (1995:172) mention, to a test's 'surface
credibility or public acceptability'. Bachman (1990:307) states that 'face validity is the
appearance of real life'. Thus, face validity is normally assessed by people who are not
necessarily experts.

The objective of such an assessment, Alderson, Clapham and Wall (1995:172) say, is to ensure
that the test is considered face valid, since this belief leads users to take the test seriously
and to perform to the best of their ability.

2.1.2 Content Validity:
Content validity is, according to Kerlinger (1973:458), cited in Alderson, Clapham and Wall
(1995:173), 'the representativeness or sampling adequacy of the content (the substance, the
matter, the topic) of a measuring instrument'. Usually, content validity is based on experts'
assessment.
In the light of this definition, content validity can be assessed, Ferguson (2006:2) suggests,
by attempting to answer the following questions:
 Is what candidates are asked to do relevant to their future work?
 Is there a match between the characteristics of the test takers and the characteristics
of the target language use situation?
 Does the test test what is contained in the syllabus?
 How relevant is the test content to the needs of the students and to the syllabus?
 Does the test content offer a good basis for inferences about candidates' ability in the
target language use domain?

The key issue for content validity, according to Messick (1999), is the specification of what is
to be assessed. Thus, a common way of carrying out such an assessment, Ferguson (2006:2)
notes, is to analyse the actual test content, using a prepared checklist or instrument, and
then compare it with a statement of what the content should be.

Content validity has useful applications. For example, a rating scale can be developed on
which experts rate the test according to set criteria. The Test Method Characteristics (TMC)
scale was adapted by Clapham (1992), cited in Alderson, Clapham and Wall (1995:174), when
she evaluated the content of three reading comprehension tests by asking three teachers to
rate aspects of the test input.

In practice, judging content validity is not easy. McNamara (2000) makes this point, giving
the example of a test of the ability to read academic texts, where the question is likely to
arise: does it matter from which academic field the texts are drawn?

2.1.3 Response Validity:

This type of validation has emerged as a result of the growing range of qualitative
techniques, such as self-observation on the part of test takers, used, as Alderson, Clapham
and Wall (1995) state, to establish exactly how test takers respond to test items and why.

How candidates respond to test components cannot be established simply. According to
Ferguson (2006), however, research methods such as think-aloud protocols have shown
promise, and the answers they yield may help to identify what the test is actually testing,
as opposed to what the testers think it is testing.

2.2 External Validity:

There are three types of external validity: concurrent, predictive and consequential validity.
2.2.1 Concurrent Validity:

Concurrent validity, as in Alderson, Clapham and Wall (1995:177), 'involves the comparison
of the test scores with some other measure of the same candidates taken at roughly the same
time as the test'. The other measure should, according to Ferguson (2006:3), already be
known to be valid. For example:
 Scores on an older test whose validity is already established.
 Scores on a parallel version of the same test.
 Teachers' rankings and estimates of students' language ability.

Alderson, Clapham and Wall (1995) and Ferguson (2006) note that when a previously
validated test is used as the measure for a new one, a high correlation, say .90, is taken as
evidence of the new test's validity. However, correlation alone cannot establish that two
tests measure the same language abilities. Shohamy (1994), for example, examined this claim
empirically, comparing a direct and a semi-direct oral test, and concluded that
"… concurrent validation, using correlation, cannot provide sufficient evidence that two
tests actually test the same language and therefore comparable…" (Shohamy 1994:120).

2.2.2 Predictive Validity:

Although this type is similar to the previous one, there is a significant difference between
them in terms of when the external measures are collected. In predictive validity, as
Alderson, Clapham and Wall (1995) and Ferguson (2006) point out, the measures are
gathered later, after the test has been given.

Predictive validity, according to Alderson, Clapham and Wall (1995) and Ferguson (2006),
receives special attention in the field of proficiency testing (e.g. IELTS, TOEFL, etc.),
where the aim is to predict how someone will do in the future (e.g. at university or in a job).

Nevertheless, there are two difficulties, described by Ferguson (2006:3) as follows:

i. The first is the truncated sample problem. Because the test has already been used to
screen out low-scoring applicants, there are no longer low-scoring students in the
sample. The sample is thus restricted to a narrower range of scores, and the resulting
correlation is accordingly depressed.
ii. The second major problem is the suitability of the criterion itself. Clearly, academic
results are influenced by more than language ability, which means that one would not
necessarily expect a high correlation coefficient.

2.2.3 Consequential Validity:

Messick (1996:251) states that consequential validity "…includes evidence and rationales for
evaluating the intended and unintended consequences of score interpretation and use in both
the short- and long-term, especially those associated with bias in scoring and interpretation,
with unfairness in test use, and with positive or negative washback effect on teaching and
learning".

Most work on consequential validity focuses on washback, a term which Alderson & Wall
(1993:117) consider common in language teaching and testing and which "…can be related
to 'influence'. If the test is poor, then the washback may be felt to be negative…"

2.3 Construct Validity

As I have mentioned above, construct validity is a superordinate form of validity (see
Bachman and Palmer (1996), Messick (1996), Alderson, Clapham and Wall (1995)).
Bachman and Palmer (1996:21) state that construct validity "…refers to the extent to which
we can interpret a given test score as an indicator of the ability(ies) or construct(s) we want
to measure". Such an interpretation should be based on evidence that the test score reflects
the area(s) of language ability we want to measure (see Bachman and Palmer (1996),
Ferguson (2006)), in order to ensure that test scores mean what we expect them to mean
(see Alderson, Clapham and Wall (1995)).
2.3.1 Assessing construct validity:

There are many ways of assessing construct validity. Ferguson (2006:4) summarises them as
follows:

i. Studying the internal correlations between sub-tests. The rationale is that the point of
having different components in a test (e.g. reading, grammar, writing) is that they
measure something different from each other, so we should expect the correlations
between sub-tests to be fairly moderate, say between 0.3 and 0.6. If the correlation
between two sub-tests were very high, we might wonder whether they were testing the
same thing and whether, in turn, one of them was redundant. It is also necessary to
correlate each sub-test with the overall test score; according to classical theory, these
correlations might be expected to be higher, around 0.7.

ii. Comparing the test with theory: according to some writers, construct validation may also
involve assessing the extent to which the test is successfully based on its underlying
theory, in other words, whether it is a successful operationalisation of that theory. This
is assessed by experts who, having examined the test and been informed of the
underlying theory, reach an informed judgment as to its construct validity.

iii. Comparison with students' biodata and psychological characteristics: another form of
construct validation involves comparing test performance with biodata from test takers.
The aim is to detect any bias for or against a particular group of students defined in
terms of age, nationality, first language, etc. An alternative is to compare test scores
with theoretically relevant psychological measures.

There are also other ways of assessing construct validity, such as multitrait-multimethod
analysis with convergent-divergent validation, and factor analysis (e.g. Bachman & Palmer
1989), but, according to Alderson, Clapham and Wall (1995) and Ferguson (2006), these
methods are complex and involve sophisticated statistical procedures. They are therefore
beyond the concerns of this short essay.

2.3.2 Some difficulties facing construct validity:


Fulcher (1999:225) points out that there are two major threats to construct validity and
score interpretation:

i. The first is construct under-representation, where the test fails to represent the
construct it is supposed to be measuring.
ii. The second is construct-irrelevant variance, where the test ignores the construct we
wish to measure and instead captures something unrelated, even though such a test may
nonetheless be reliable.

Most validity research, Fulcher (1996) says, attempts to reduce the negative impact of these
two threats. Such concerns, according to Messick (1996), are linked to negative washback
(mentioned above under consequential validity). For instance, if a test under-represents an
important construct, teachers might overemphasise the well-represented constructs and
downplay others.

2.4 Suitable data for test validity:

To indicate which data are suitable for each type of test validity, Alderson, Clapham and
Wall (1995:193-194) provide a useful checklist:

Face validity:
Questionnaires to, and interviews with, candidates, administrators and other users.

Content validity:
a) Compare test content with specifications/syllabus.
b) Questionnaires to, and interviews with, 'experts' such as teachers, subject specialists
and applied linguists.
c) Expert judges rate test items and texts according to precise criteria.

Response validity:
Students introspect on their test-taking procedures, either concurrently or retrospectively.

Concurrent validity:
a) Correlate students' test scores with their scores on another test.
b) Correlate students' test scores with teachers' rankings.
c) Correlate students' test scores with other measures of ability, such as teachers' ratings.

Predictive validity:
a) Correlate students' test scores with their scores on tests taken some time later.
b) Correlate students' test scores with success in final exams.
c) Correlate students' test scores with other measures of their ability taken some time
later, such as teachers' assessments.
d) Correlate students' test scores with the success of later placement.

Construct validity:
a) Correlate each sub-test with the other sub-tests.
b) Correlate each sub-test with the total test score.
c) Correlate each sub-test with the total minus self.
d) Compare students' test scores with students' biodata and psychological characteristics.
e) Multitrait-multimethod studies.
f) Factor analysis.
