Properties of Assessment Method: Validity

This document discusses various types of validity and reliability in assessment methods. It describes content validity, criterion-related validity including concurrent and predictive validity, construct validity, and face validity. It also covers different measures of reliability such as test-retest, parallel forms, internal consistency using KR21 and Spearman Brown, and discusses how to interpret reliability coefficients. Fairness, practicality, and ethics in assessment are important considerations as well.

Properties of Assessment Method: Validity

Validity
- The appropriateness, correctness, meaningfulness, and usefulness of the specific conclusions a teacher draws about the teaching-learning situation
- (For testing) The extent to which an instrument measures what it intends to measure
Types of Validity

Content Validity
- Evidence that the test items represent the proper domain
- Teachers should give emphasis to: the adequacy of students' experience, and coverage of sufficient material to assess the domain

How do we establish Content Validity?

1. Show that the number of items for each content area matches the relative importance of these areas, as reflected in a survey of the domain.
2. Show that the content of the test matches what was found in the survey of the domain.
3. Have the match confirmed by a subject-matter expert.
Example:
Professor Elle G. Beaty gave a preliminary examination for her Test and Measurement class covering the content discussed over the last three weeks.
Item Validity:
Rate each item (Item No. 1-7) against the following criteria:
1. Material covered sufficiently
2. Students have prior experience with the type of task
3. Most students are able to answer them correctly
4. Decision
Entire Test:
For each skill, compare the estimated percent of instruction with the percentage of test items covering it (a quick numeric check of this match is sketched below):
1. Knowledge
2. Comprehension
3. Application
4. Analysis
5. Synthesis
6. Evaluation
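A minimal sketch of this blueprint check in Python; all percentages here are assumed for illustration, and the 5-point tolerance is an arbitrary choice, not a standard:

# Compare the share of instruction each skill received with the share of
# test items that cover it. A close match supports content validity.

instruction_pct = {  # estimated percent of instruction (assumed values)
    "Knowledge": 20, "Comprehension": 20, "Application": 25,
    "Analysis": 15, "Synthesis": 10, "Evaluation": 10,
}
items_pct = {  # percentage of items covered in the test (assumed values)
    "Knowledge": 30, "Comprehension": 25, "Application": 20,
    "Analysis": 10, "Synthesis": 10, "Evaluation": 5,
}

for skill in instruction_pct:
    gap = items_pct[skill] - instruction_pct[skill]
    flag = "OK" if abs(gap) <= 5 else "check coverage"
    print(f"{skill:13s} instruction {instruction_pct[skill]:3d}%  "
          f"items {items_pct[skill]:3d}%  gap {gap:+3d}% -> {flag}")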
Face Validity
- The superficial appearance of the test; the mere appearance that a test measures its target construct
- Tests whose purpose is clear, even to naïve respondents, are said to have high face validity. Accordingly, tests whose purpose is unclear have low face validity (Nevo, 1985).
- Usually established by the test takers themselves, for example through a Likert-scale rating

CRITERION-RELATED VALIDITY
The use of an established criterion to validate a new measurement of the construct you are interested in. The test and the criterion are theoretically related.
Asks: what is the relationship between a test and a criterion (an external source) that the test should be related to?

Example of Criterion-Related Validity:
• An existing measurement procedure for depression (valid and reliable) is available, but it is too long (say, 100 items) and its response rate is low; a shorter new test can be validated against it.
FORMS OF CRITERION-RELATED VALIDITY

CONCURRENT VALIDITY - the relationship between test scores and another currently obtainable benchmark
EXAMPLE: Scores obtained from a class achievement test correlate highly with scores on the school-wide achievement test (a correlation sketch follows below).

PREDICTIVE VALIDITY - the relationship between test scores and a future standard
How well does the test predict future performance?
EXAMPLE: College entrance exams
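A minimal sketch of how a concurrent validity coefficient might be computed, assuming made-up scores for six students on both tests; the pearson helper implements the product-moment formula shown later in this deck:

from statistics import mean

def pearson(x, y):
    # Pearson product-moment correlation between two score lists.
    mx, my = mean(x), mean(y)
    num = sum((a - mx) * (b - my) for a, b in zip(x, y))
    den = (sum((a - mx) ** 2 for a in x) * sum((b - my) ** 2 for b in y)) ** 0.5
    return num / den

class_test = [78, 85, 62, 90, 70, 88]   # hypothetical class achievement scores
school_test = [75, 88, 60, 93, 68, 85]  # same students, school-wide test

print(f"concurrent validity coefficient: {pearson(class_test, school_test):.2f}")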
CONSTRUCT VALIDITY
Seeks agreement between a theoretical concept and a specific measuring procedure.
The extent to which an assessment corresponds to other variables, as predicted by some rationale or theory.

Example: A test that tries to establish the degree of egocentrism across different age groups
Reliability
- The extent to which an assessment is consistent; its dependability and stability
- The degree of freedom from measurement error, i.e., the consistency of test scores
- The degree to which test scores are free from errors of measurement
Factors that can cause measurement error:
- Poorly worded questions
- Poor test-taking instructions
- Test-taker anxiety
- Distractions in the testing room
Test-Retest Reliability
- The relationship between scores from one test given at two different administrations
- Measures stability
- The same measuring instrument is administered twice to the same group of people, and the correlation coefficient between the two sets of scores is determined.
LIMITATIONS OF THE TEST-RETEST METHOD:
1. When the time interval is short, respondents may recall their previous responses, and this tends to make the correlation coefficient high.
2. When the time interval is long, factors such as unlearning and forgetting, among others, may occur and may result in a low correlation for the measuring instrument.
3. Regardless of the time interval, testing conditions such as noise, temperature, lighting, and other factors may affect the correlation coefficient of the measuring instrument.
Inter-Rater Reliability
- The degree to which different raters or observers give consistent scores to the same performance
Parallel/Alternate/Equivalent Forms Reliability
- The relationship between scores from two similar versions of the same test
- A challenge in this kind of reliability is assuring that both forms of the test use the same or very similar directions, format, and number of questions, and are equal in difficulty and content.
Split-Half Reliability
- Correlating one half of the test against the other half (a sketch follows below)
- Requires only one form and one administration of the test; the test is split in half, and the scores on one half are correlated with the scores on the other half.
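A minimal split-half sketch, assuming a hypothetical matrix of right/wrong (1/0) item responses and an odd-even split; any other split into equal halves would work the same way:

from statistics import mean

def pearson(x, y):
    # Pearson product-moment correlation between two score lists.
    mx, my = mean(x), mean(y)
    num = sum((a - mx) * (b - my) for a, b in zip(x, y))
    den = (sum((a - mx) ** 2 for a in x) * sum((b - my) ** 2 for b in y)) ** 0.5
    return num / den

# Hypothetical responses: rows = students, columns = items (1 = correct).
responses = [
    [1, 1, 0, 1, 1, 0, 1, 1],
    [1, 0, 0, 1, 0, 0, 1, 0],
    [1, 1, 1, 1, 1, 1, 1, 1],
    [0, 0, 1, 0, 0, 1, 0, 0],
    [1, 1, 0, 1, 1, 0, 0, 1],
]

odd_half = [sum(row[0::2]) for row in responses]   # items 1, 3, 5, 7
even_half = [sum(row[1::2]) for row in responses]  # items 2, 4, 6, 8

r_half = pearson(odd_half, even_half)
print(f"split-half correlation: {r_half:.2f}")
# This coefficient reflects only half the test's length; the Spearman-Brown
# formula (later in this deck) corrects it back to full length.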
Internal Consistency
- Reliability measured statistically by going "within the test"
- How scores on individual items relate to each other or to the test as a whole
- Example: Individuals who score high on a test of depression should, on average, respond to all items on the test in a manner that indicates depressive ideation.

Kuder-Richardson KR21

KR21 = [K / (K - 1)] × [1 - M(K - M) / (K × V)]

K = total number of items
M = mean of the scores
V = variance of the test scores (the sum of squared differences between each score and the mean, divided by n - 1), i.e., the square of the standard deviation
n = number of test takers

(Best for dichotomous items, e.g., multiple choice with one right answer)
So how do we interpret the scores? (Index of Reliability)

• .50 or below = Questionable reliability
• .50 - .60 = May need revision; other measures may need to be supplemented
• .60 - .70 = Somewhat low; some items may need improvement
• .70 - .80 = Good for a classroom test; few items need revision
• .80 - .90 = Very good for a classroom test
• .90 and above = Excellent reliability (usually standardized tests)
Example: Try to find the KR21 Reliability Index
• 8 students took a 10-item multiple choice test; their scores were (a worked computation follows below):

Student: 1  2  3  4  5  6  7  8
Score:   7  7  8  9  3  4  5  6
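A minimal sketch that plugs the eight scores above into the KR21 formula, using the n - 1 variance defined earlier (Python's statistics.variance):

from statistics import mean, variance  # variance() divides by n - 1

def kr21(k, scores):
    # KR21 = [K/(K-1)] * [1 - M(K-M)/(K*V)], as given above.
    m, v = mean(scores), variance(scores)
    return (k / (k - 1)) * (1 - (m * (k - m)) / (k * v))

scores = [7, 7, 8, 9, 3, 4, 5, 6]  # the eight students' scores
print(f"KR21 = {kr21(10, scores):.2f}")
# About 0.47: questionable reliability per the interpretation scale above.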
Pearson Product-Moment Correlation
• In our case, X = one person's score on the first half of items, X̄ = the mean score on the first half of items, Y = one person's score on the second half of items, Ȳ = the mean score on the second half of items.

r_xy = Σ(X - X̄)(Y - Ȳ) / √{ [Σ(X - X̄)²] × [Σ(Y - Ȳ)²] }
Spearman-Brown Prophecy Formula
• A degree of correlation between the two sets of scores must first be established to determine the index of reliability via Spearman-Brown.

SB = (2 × r_half) / (1 + r_half)

r_half = reliability of half of the test (the Pearson correlation between the two halves)
Try for Yourself
• A 50-item test was administered to a group of 20 students. The mean score was 35 and the standard deviation was 5.5. What is the KR21 index of reliability? (A sketch for checking both answers follows below.)
• Compute the internal consistency (using Spearman-Brown) for the following split-half scores: x = 1, 3, 4, 4; y = 2, 5, 5, 8
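A minimal sketch for checking both answers; kr21_from_summary and pearson are hypothetical helper names, and the formulas are the ones given above:

from statistics import mean

def kr21_from_summary(k, m, sd):
    # KR21 from summary statistics: K items, mean M, standard deviation SD.
    v = sd ** 2  # variance is the square of the standard deviation
    return (k / (k - 1)) * (1 - (m * (k - m)) / (k * v))

def pearson(x, y):
    mx, my = mean(x), mean(y)
    num = sum((a - mx) * (b - my) for a, b in zip(x, y))
    den = (sum((a - mx) ** 2 for a in x) * sum((b - my) ** 2 for b in y)) ** 0.5
    return num / den

# Exercise 1: 50 items, mean 35, standard deviation 5.5.
print(f"KR21 = {kr21_from_summary(50, 35, 5.5):.2f}")  # about 0.67

# Exercise 2: split-half scores, corrected with Spearman-Brown.
x, y = [1, 3, 4, 4], [2, 5, 5, 8]
r_half = pearson(x, y)
sb = (2 * r_half) / (1 + r_half)
print(f"r_half = {r_half:.2f}, Spearman-Brown = {sb:.2f}")  # about 0.87 and 0.93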
Can a test be reliable and not valid? (Yes: a test can yield consistent scores while still measuring the wrong thing.)
Fairness, Practicality, Efficiency
• Assessment should be fair
• Assessment should be viewed as an opportunity to learn
• Assessment should be free from stereotyping
• Assessment should be practical, easy to use, and should not take too much time
Ethics in Assessment
• Sexual fantasies
• Sensitive information
• Using unreliable tests
• "Will any physical or psychological harm come to anyone as a result of this assessment/testing?"
• Confidentiality of results
• Deception
