0% found this document useful (0 votes)

415 views

Reliability & Validity

This document discusses the concepts of reliability and validity in research. Reliability refers to the consistency and dependability of measurement tools or procedures. There are several types of reliability, including test-retest reliability, internal consistency, and inter-rater reliability. Validity refers to whether a measurement tool accurately measures what it is intended to measure. Types of validity include construct validity, face validity, content validity, predictive validity, and concurrent validity. Together, reliability and validity are important for establishing the quality and accuracy of research measurements and results.

Uploaded by

wajahatroomi

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

415 views

Reliability & Validity

Uploaded by

wajahatroomi

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 15

Reliability & Validity

What is Reliability?
Reliability: Consistency and dependability.
If a measurement device or procedure consistently
assigns the same score to individuals or objects with
equal values, the device is considered reliable.
Researchers must establish the reliability of their
measurement devices in order to be certain that
they are obtaining a systematic and consistent
record of the variation in X and Y.

Types of Reliability
Several types:
Test-retest reliability and alternate reliability
Inter-item reliability and internal consistency
Split-half reliability
Inter-rater reliability
Scorer reliability

Test-retest Reliability

Measure

the

scores

twice

with

the

same

instrument. Reliable measures should produce

very similar scores. Examples:
IQ tests typically show high test-retest reliability.
The reliability of a bathroom scale can be tested
by recording your weight 2-3 times within a
minute or two.

Alternate Forms Reliability

Test-retest
participants

procedures
may

may
able

not
to

recall

useful
their

when

responses and simply repeat them upon retesting. In

cases where administering the exact same test will not
necessarily be a good test of reliability, we may use
alternate forms reliability. As the name implies, two
or more versions of the test are constructed that are
equivalent in content and level of difficulty. Professors use
this technique to create makeup or replacement exams
because students may already know the questions from
the earlier exam.

Inter-item reliability
Inter-item reliability: The degree to which different
items measuring the same variable attain
consistent results.
Scores on different items designed to measure the
same construct should be highly correlated. It also
goes by the name internal consistency.
Example: Math tests often ask you to solve several
examples of the same type of problem. Your
scores on these questions will normally represent
your ability to solve this type of problem, and the
test would have high inter-item reliability.

Inter-rater reliability
When observers must use their own judgment to
interpret the events they are interpreting
(including live or videotaped behaviors and
written answers to open-ended interview
questions), scorer reliability must be measured.
Have different observers take measurements of
the same responses; the agreement between
their measurements is called inter-rater reliability.
Their results can be compared statistically and
represent the scorers reliability.

A measure is valid if it measures what it is supposed to

measure, and does so cleanly without accidentally
including other factors.
Most experiments are designed to measure
hypothetical constructs such as intelligence, learning,
or love. The experimenter must create an operational
definition of the dependent variable because one
cannot measure these hypothetical constructs directly.
A valid measure is one that measures this hypothetical
construct accurately (such as intelligence) without
being influenced by other factors (such as motivation).

Types of Validity
Validity: (actually studying the
variables that we wish to study)
Construct validity
Face validity
Content validity
Criterion validity -- 2 types:
Predictive validity
Concurrent validity

Construct Validity
Do my dependent variables actually
measure the hypothetical construct that I
want to test?
Does my IQ test really measure IQ, and
nothing else?
Do my procedures actually measure
learning, (without being influenced by
motivation)?
Does my personality test really measure
personality traits without including fatigue?

Face Validity
The consensus (usually by experts in the field) that a
measure represents a particular concept. It is the least
stringent type of validity. Because most psychological
variables require indirect measures (like the intelligence
example before), the validity of a measured definition
may not be self-evident.
Does rate of eating really reflect hunger? In rats, does the
rate of lever pressing actually measure learning?
Does talking measure extroversion?
Does GPA or SAT score
really reflect intelligence?

Comparing face validity with

construct validity
Face validity: The consensus that a measure
represents a particular concept the face
value of the measure. (Would a 130-pound
53 college student be a good football or
basketball player?)
Construct validity: The accuracy with which
a measure represents the particular concept,
without influence of additional factors.
Construct validity implies that other
operational definitions of the same construct
will yield correlated results.

Content Validity
Does the content of our measure fairly reflect the
content of the thing we are measuring?
Example: Do the questions on an exam accurately
reflect what you have learned in the course, or were
the exam questions sampled from only a subsection of the material?
A test to measure your knowledge of mathematics
should not be limited to addition problems, nor
should it include questions about French literature.
It should cover the entire range appropriate math
problems you are trying to measure.

Criterion Validity
A powerful indicator of the validity of a
measure is its ability to accurately predict
performance on other, independent outcome
measures (referred to as criterion measures).
The extent to which your SAT score predicts
your college GPA is an indication of the SATs
criterion validity.
There are two approaches to criterion
validity: Concurrent validity and Predictive
validity.

Concurrent vs. Predictive

Validity
In concurrent validity, the SAT test scores and
criterion measures (high school GPA) are
obtained at roughly the same time
(concurrent).
If the SAT shows high concurrent validity, it will
be highly correlated with GPA obtained at the
same time the SAT is taken.
Predictive validity, however, would be high if
your SAT score accurately predicted your
college GPA, which is obtained long after taking
the SAT.

Jombay Pshychometric Test
38% (8)
Jombay Pshychometric Test
19 pages
Marketing Scales Handbook
No ratings yet
Marketing Scales Handbook
27 pages
Grey and Beige Vintage Timeline History Archeology Infographic
100% (1)
Grey and Beige Vintage Timeline History Archeology Infographic
1 page
Worry Domains Questionnaire
No ratings yet
Worry Domains Questionnaire
2 pages
Erin Kyle WK 2 Behaviorist Lesson Plan
No ratings yet
Erin Kyle WK 2 Behaviorist Lesson Plan
5 pages
Learners Who Are Gifted and Talented (Visual Arts & Music)
No ratings yet
Learners Who Are Gifted and Talented (Visual Arts & Music)
14 pages
Combining Scores Multi Item Scales
No ratings yet
Combining Scores Multi Item Scales
41 pages
Taxonomy of Aptitude Test Items: A Guide For Item Writers
100% (5)
Taxonomy of Aptitude Test Items: A Guide For Item Writers
13 pages
Assessment Tools
No ratings yet
Assessment Tools
7 pages
Behaviorism: What Role Does Technology Play in This?
No ratings yet
Behaviorism: What Role Does Technology Play in This?
14 pages
A Systematic Review of Teachers Perceptions Towards Effective Teaching-Learning of Students With Intellectual Disability
No ratings yet
A Systematic Review of Teachers Perceptions Towards Effective Teaching-Learning of Students With Intellectual Disability
6 pages
Rules in Creating A Multiple Choice Test
No ratings yet
Rules in Creating A Multiple Choice Test
38 pages
Course Outline Educ 103 Advanced Stat
No ratings yet
Course Outline Educ 103 Advanced Stat
4 pages
PowerPoint 2013 - Applying Transitions
No ratings yet
PowerPoint 2013 - Applying Transitions
6 pages
CHAPTER 9 Students With Blindness or Low Vision
No ratings yet
CHAPTER 9 Students With Blindness or Low Vision
4 pages
Values Orientation and Performance
No ratings yet
Values Orientation and Performance
16 pages
Observing The Outer Person: Behavioral Perspective
No ratings yet
Observing The Outer Person: Behavioral Perspective
7 pages
Multigenerational Family Therapy
No ratings yet
Multigenerational Family Therapy
12 pages
One-Way ANOVA PDF
No ratings yet
One-Way ANOVA PDF
20 pages
Assess 2 Module 1 Lesson 1
No ratings yet
Assess 2 Module 1 Lesson 1
5 pages
Top - Ch17 (Bandura) Reviewer
No ratings yet
Top - Ch17 (Bandura) Reviewer
6 pages
T282 Stages in Implementing Portfolio Assessment - Written Report 2 Checked
No ratings yet
T282 Stages in Implementing Portfolio Assessment - Written Report 2 Checked
10 pages
Validity and Reliability
No ratings yet
Validity and Reliability
2 pages
Spects of Ducational Anagement: Ducational Lanning AND Management
No ratings yet
Spects of Ducational Anagement: Ducational Lanning AND Management
11 pages
Purposes of Assessment
No ratings yet
Purposes of Assessment
5 pages
Reliability and Validity
No ratings yet
Reliability and Validity
11 pages
Unn Edu Family Back Project
No ratings yet
Unn Edu Family Back Project
86 pages
Ed 101 Part 2
No ratings yet
Ed 101 Part 2
67 pages
Sse115 M5
No ratings yet
Sse115 M5
5 pages
Reliability Vs Validity
No ratings yet
Reliability Vs Validity
27 pages
Chapter 8 Test Development
100% (1)
Chapter 8 Test Development
3 pages
Attitude of Secondary School Teachers Towards The Use of ICT in Teaching Learning Process
100% (1)
Attitude of Secondary School Teachers Towards The Use of ICT in Teaching Learning Process
4 pages
Module 1
No ratings yet
Module 1
16 pages
ED 106 - Module 7
No ratings yet
ED 106 - Module 7
8 pages
Reviewer (Midterm) : Assessment of Learning 1
No ratings yet
Reviewer (Midterm) : Assessment of Learning 1
3 pages
Reliability Vs Validity
No ratings yet
Reliability Vs Validity
11 pages
Assessment of Learning Summary / Keyword Assessment
No ratings yet
Assessment of Learning Summary / Keyword Assessment
26 pages
Test Construction and Validation
100% (2)
Test Construction and Validation
88 pages
10 Non Virtues
No ratings yet
10 Non Virtues
7 pages
Paired Ttest
No ratings yet
Paired Ttest
1 page
Intellectual Skill
No ratings yet
Intellectual Skill
3 pages
Savings and Investments
No ratings yet
Savings and Investments
6 pages
Dependent T Test Paired Sample
No ratings yet
Dependent T Test Paired Sample
2 pages
CM6.3 Authentic Assessment and Assessment of Process and Product
No ratings yet
CM6.3 Authentic Assessment and Assessment of Process and Product
4 pages
7 Principles of Student
No ratings yet
7 Principles of Student
8 pages
Assignment
No ratings yet
Assignment
23 pages
Dissertation Report
No ratings yet
Dissertation Report
26 pages
Importance of Outcome Based Education (OBE) To Advance Educational Quality and Enhance Global Mobility
No ratings yet
Importance of Outcome Based Education (OBE) To Advance Educational Quality and Enhance Global Mobility
10 pages
Pointers For Assessment
No ratings yet
Pointers For Assessment
9 pages
A Study On Teachers' Perception About Inclusive Education
No ratings yet
A Study On Teachers' Perception About Inclusive Education
80 pages
The Effect of Mental Health On The Academic Performance of Senior High Students in Pinagbuhatan High School
No ratings yet
The Effect of Mental Health On The Academic Performance of Senior High Students in Pinagbuhatan High School
2 pages
CHAPTER 3 lESSON 2 Political Aspect
No ratings yet
CHAPTER 3 lESSON 2 Political Aspect
8 pages
Chapter 1 Module
No ratings yet
Chapter 1 Module
21 pages
09 - Data Analysis - Descriptive Statistics
No ratings yet
09 - Data Analysis - Descriptive Statistics
23 pages
Situated Learning Theory
No ratings yet
Situated Learning Theory
7 pages
Defining Children and Childhood Part 2 - Prof Ed 1
No ratings yet
Defining Children and Childhood Part 2 - Prof Ed 1
21 pages
8081 Characteristics of Good Measurement Instruments
No ratings yet
8081 Characteristics of Good Measurement Instruments
3 pages
Descriptive Statistics
No ratings yet
Descriptive Statistics
58 pages
Executive Summary
No ratings yet
Executive Summary
4 pages
How To Calculate The Mean, Mode and Median LA 2016
No ratings yet
How To Calculate The Mean, Mode and Median LA 2016
22 pages
Program Evaluation
No ratings yet
Program Evaluation
29 pages
Alphy Biju BAP.21.440 Assignment On Validity Psychology
No ratings yet
Alphy Biju BAP.21.440 Assignment On Validity Psychology
6 pages
Validity and Reliability
100% (1)
Validity and Reliability
6 pages
Validity and Reliability
No ratings yet
Validity and Reliability
6 pages
ElliottScreeningWholeSEChildTSassessmentsSPR2021
No ratings yet
ElliottScreeningWholeSEChildTSassessmentsSPR2021
17 pages
Score: Name - Grade 10 - Date
No ratings yet
Score: Name - Grade 10 - Date
2 pages
Development and Psychometric Properties of The Children's Assertive Behavior Scale
No ratings yet
Development and Psychometric Properties of The Children's Assertive Behavior Scale
11 pages
New Scoring Methodology Improves The Adas Cog
No ratings yet
New Scoring Methodology Improves The Adas Cog
17 pages
A Normative Study of The Raven Coloured Progressive Matrices Test For Omani Children Aged 5-11 Years
No ratings yet
A Normative Study of The Raven Coloured Progressive Matrices Test For Omani Children Aged 5-11 Years
15 pages
Jurnal Tata Rias Dan Kecantikan: Ilmi Fadila, Prima Minerva, Murni Astuti
No ratings yet
Jurnal Tata Rias Dan Kecantikan: Ilmi Fadila, Prima Minerva, Murni Astuti
9 pages
PSYCHOMETRIC PROPERTIES of CRI
100% (1)
PSYCHOMETRIC PROPERTIES of CRI
5 pages
Alfred Binet
No ratings yet
Alfred Binet
7 pages
Psychological Assessment Q - A Topic 6 F.
No ratings yet
Psychological Assessment Q - A Topic 6 F.
85 pages
Interobserver Agreement in Behavioral Research: Importance and Calculation
No ratings yet
Interobserver Agreement in Behavioral Research: Importance and Calculation
8 pages
Validity, Reliability, Data Collection, Data Analysis
No ratings yet
Validity, Reliability, Data Collection, Data Analysis
81 pages
Principles of Language Assessment
100% (1)
Principles of Language Assessment
26 pages
Dementia Search
No ratings yet
Dementia Search
112 pages
CABS - Difazio-2018-Item Generation and Content Valid
No ratings yet
CABS - Difazio-2018-Item Generation and Content Valid
21 pages
Satisfaction Scale For Athlete (SSA) : A Study of Validity and Reliability
No ratings yet
Satisfaction Scale For Athlete (SSA) : A Study of Validity and Reliability
14 pages
Notes PSYCH ASSESSMENT
No ratings yet
Notes PSYCH ASSESSMENT
163 pages
Parreira Jaco
No ratings yet
Parreira Jaco
97 pages
Alpha Max
No ratings yet
Alpha Max
36 pages
Soane Et Al. (2012) - Development and Application of A New Measure of Empoyee Engagement
No ratings yet
Soane Et Al. (2012) - Development and Application of A New Measure of Empoyee Engagement
21 pages
Mako Ijgor
No ratings yet
Mako Ijgor
14 pages
Educational Measurement From Foundations to Future 1st Edition Craig S. Wells Phd 2024 scribd download
100% (1)
Educational Measurement From Foundations to Future 1st Edition Craig S. Wells Phd 2024 scribd download
55 pages
Download Full Cognitive Diagnostic Assessment for Education Theory and Applications 1st Edition Jacqueline Leighton PDF All Chapters
No ratings yet
Download Full Cognitive Diagnostic Assessment for Education Theory and Applications 1st Edition Jacqueline Leighton PDF All Chapters
51 pages
Statistical Methods in Psychology Journals: Guidelines and Explanations
No ratings yet
Statistical Methods in Psychology Journals: Guidelines and Explanations
17 pages
Dass Translation and Validation Procedure in The Greek Language
No ratings yet
Dass Translation and Validation Procedure in The Greek Language
15 pages
[FREE PDF sample] Questionnaires in Second Language Research 3rd Edition Zoltán Dörnyei ebooks
100% (1)
[FREE PDF sample] Questionnaires in Second Language Research 3rd Edition Zoltán Dörnyei ebooks
65 pages
Chapter 01
No ratings yet
Chapter 01
21 pages

Reliability & Validity

Uploaded by

Reliability & Validity

Uploaded by

Reliability & Validity

instrument. Reliable measures should produce

Alternate Forms Reliability

responses and simply repeat them upon retesting. In

A measure is valid if it measures what it is supposed to

Comparing face validity with

Concurrent vs. Predictive

You might also like