Week4 2 Testing

Uploaded by

seyfelizeliha

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

17 views21 pages

Week4 2 Testing

Uploaded by

seyfelizeliha

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 21

Using and Interpreting Information

about Test Reliability

Chapter 7

05/10/24 1
Aim is...
Using the information about reliability in
evaluating, interpreting and improving
psychological tests.
Reliability information alone is NOT
enough
Relationship between reliability and
validity

05/10/24 2
Using the Reliability Coefficient
It provides a relative measure of the accuracy of
test scores.
It doesn’t provide an indication of how accurate
test scores really are, in absolute terms.
A score of 110 from an intelligence test. Is it
really higher than the average score (i.e.100)?
How much variability should we expect on the
basis of measurement error? RC doesn’t say this
in concrete terms!
So, we need to know the size of the standard
error of measurement.
05/10/24 3
Using the Reliability Coefficient vs
SEM
Reliability coefficients are most useful in
comparing the scores produced by different tests.
The standard error of measurement (SEM) is
more useful when interpreting test scores.

05/10/24 4
SEM – Standard Error of
Measurement
SEM – measure of how much the
individual’s score is likely to differ from the
individual’s true test score.

SEM – 2 factors;
The reliability of the test (rxx)
The variability of test scores (X )

05/10/24 5
SEM – Standard Error of
Measurement
A spelling test has a reliability coefficient of .84
and a standard deviation of 10, then

SEM=

05/10/24 6
SEM – Standard Error of
Measurement
For testing purposes, SEM is more useful
than reliability coefficients.
We can use SEM to create a confidence
interval around a users score.
confidence interval= SEM X 1.96 (confidence
level %95)
As reliability increases, SEM decreases
As a test becomes more reliable, we can feel
more confident that an individual’s observed
score is close to the individual’s true score.
05/10/24 7
SEM – Standard Error of
Measurement

A mean of 100, a standard error of 4.7

4.7 x 1.96= 9.2 (confidence interval)
Range is 90.8 and 109.2
05/10/24 8
Confidence Intervals
Confidence intervals reflect a range that
contains the examinee’s true score.
Confidence intervals are calculated using the
SEM and the SD of the scores.
As reliability increases, SEM and confidence
intervals get smaller.

05/10/24 9
Confidence Intervals
Example of Confidence Interval:
“Johnny’s FSIQ is 113 (between 108 and 118
with 95% confidence).”
The SEM and confidence intervals remind us
that scores are not perfect.

05/10/24 10
SEM – Standard Error of
Measurement
If a person's true score is 110 on a test with a standard
error of measurement of 3.7 and a mean of 100, we
would expect 95% of the person's test scores to fall
within
a) 102.75 - 117.25
b) 92.75 - 107.25
c) 90.75 - 120.25
d) 100-110
A mean of 100, a standard error of 3.7
3.7 x 1.96= 7.25
Range is 92.75 and 107.25
05/10/24 11
Reliability and Validity
A reliable test is NOT necessarily valid.
A test can be reliable, not yet valid.

05/10/24 12
Reliability and Validity
Measurement errors would decrease the correlation
between two tests, X and Y
(in other words, the validity of predictions.)
‘correction for attenuation’
A method of estimating the true correlation between X and Y
given the correlation between two unreliable measures of
X and Y is by using the correction for attenuation.

05/10/24 13
Reliability and Validity
If the reliability of tests are increased,
the validity of tests would also
be expected to increase.

The aim is to increase the correlation between two tests

05/10/24 14
Reliability and Validity

 Example (a) shows what an unreliable test would look like. Example (b) shows what a
reliable but invalid test would look like. It is similar to a rifle that has its sights mis-
aligned. The high degree of reliability is shown by the consistency of the strikes. The
lack of validity is shown by the fact that the missiles are missing their target, the
bullseye. For example, a job satisfaction test given to unskilled workers may measure
literacy skills rather than job satisfaction if the test is written in complex language. In
psychometric terms, the test is not measuring what it was intended to measure.
Example (c) is what a valid and reliable test would look like: the missiles hit the mark
and they hit it consistently.
05/10/24 15
Special Issues
Speed test vs. power tests
Speed:A test in which items are trivially easy
60 seconds for a 100-item test.

Power: 20-item test with no time limit.

05/10/24 16
Special Issues
Speed test vs. power tests
A pure speed test should have an odd-even split-half
reliability of about 1.0

The most useful method of assessing the reliability of

highly speeded tests is the test-retest method.

A participant may be slow in the speed test and cannot

finish all the questions on time. Some test items may be
poorly constructed, but not responded by some
participants.

05/10/24 17
Selecting a Reliability
Coefficient
If a test is to be administered multiple times:
Test-Retest Reliability
Tests to be administered one time:
Homogeneous content – coefficient alpha
Heterogeneous content – split-half coefficient

05/10/24 18
How Reliable Should Tests Be?

A lower level of reliability are acceptable when tests

are used for preliminary rather than final decisions.

05/10/24 19
Reliability of Composite and
Difference Scores
Composite scores
When scores are combined
to form a composite
For example, IQs are
typically composite scores
The reliability of composite
scores is typically better
than the individual scores
in composite

05/10/24 20
Reliability of Composite and
Difference Scores
Difference scores
Involves calculating the difference between two
scores
The reliability of difference scores is typically
lower than the individual scores

05/10/24 21

1968 Fisher Body Service Manual
100% (2)
1968 Fisher Body Service Manual
575 pages
The Official Girlfriend Application: Basic Information
No ratings yet
The Official Girlfriend Application: Basic Information
3 pages
Psychological Testing and Assessment 9th Edition Cohen Solutions Manual 1
100% (76)
Psychological Testing and Assessment 9th Edition Cohen Solutions Manual 1
9 pages
Patrick Meyer Reliability Understanding Statistics 2010
100% (2)
Patrick Meyer Reliability Understanding Statistics 2010
160 pages
CISA EXAM-Testing Concept-Knowledge of Compliance & Substantive Testing Aspects
From Everand
CISA EXAM-Testing Concept-Knowledge of Compliance & Substantive Testing Aspects
Hemang Doshi
3/5 (4)
Jodi Picoult - Leaving Time (Extract)
No ratings yet
Jodi Picoult - Leaving Time (Extract)
39 pages
Handbook of Psychological Assessment Fourth Edition
100% (1)
Handbook of Psychological Assessment Fourth Edition
9 pages
Chapter 4: Reliability
No ratings yet
Chapter 4: Reliability
40 pages
Chapter 3: Understanding Test Quality-Concepts of Reliability and Validity
No ratings yet
Chapter 3: Understanding Test Quality-Concepts of Reliability and Validity
10 pages
Reability Test Table Interpretation
No ratings yet
Reability Test Table Interpretation
6 pages
Nature of Reliability and Other Desired Characteristics: Report By: Marrione Eubert M. Estepa
100% (1)
Nature of Reliability and Other Desired Characteristics: Report By: Marrione Eubert M. Estepa
14 pages
Readings Psy211
No ratings yet
Readings Psy211
23 pages
Reliability Reviewer
No ratings yet
Reliability Reviewer
5 pages
Psychometric Properties
No ratings yet
Psychometric Properties
3 pages
Lecture 7 Wednesday 19 Feb Measurement Psychometric properties (1)
No ratings yet
Lecture 7 Wednesday 19 Feb Measurement Psychometric properties (1)
31 pages
Week 4.2 - The Importance of Reliability
No ratings yet
Week 4.2 - The Importance of Reliability
33 pages
CC04 PA Reliability
No ratings yet
CC04 PA Reliability
10 pages
RELIABILITY Show - PPSX
No ratings yet
RELIABILITY Show - PPSX
33 pages
reliability
No ratings yet
reliability
2 pages
TYPESOFRELIABILITY
No ratings yet
TYPESOFRELIABILITY
5 pages
Reliability & Validity
No ratings yet
Reliability & Validity
6 pages
Psychometrics KS
No ratings yet
Psychometrics KS
33 pages
PSY211_READINGS
No ratings yet
PSY211_READINGS
12 pages
test constrcution
No ratings yet
test constrcution
39 pages
Measurement Concepts & Interpretation
No ratings yet
Measurement Concepts & Interpretation
21 pages
Concept of Reliability, Validity and Norms (AutoRecovered)
No ratings yet
Concept of Reliability, Validity and Norms (AutoRecovered)
10 pages
Validity and Reliability
100% (1)
Validity and Reliability
22 pages
Reliability
No ratings yet
Reliability
113 pages
Essentials of A Good Psychological Test
No ratings yet
Essentials of A Good Psychological Test
6 pages
Chapter 5 Reliability
No ratings yet
Chapter 5 Reliability
33 pages
U5_Measurement, Reliability n Validity
No ratings yet
U5_Measurement, Reliability n Validity
9 pages
Psych-Testing-Reviewer-Midterm
No ratings yet
Psych-Testing-Reviewer-Midterm
9 pages
Reviewer Test Measurement Midterms
No ratings yet
Reviewer Test Measurement Midterms
6 pages
Group 4 (Reliability)
No ratings yet
Group 4 (Reliability)
78 pages
Chracteristics of A Good Test
No ratings yet
Chracteristics of A Good Test
58 pages
Module 2 Week2
No ratings yet
Module 2 Week2
60 pages
Psycass Reviewer
No ratings yet
Psycass Reviewer
19 pages
Week4 1 Testing
No ratings yet
Week4 1 Testing
28 pages
RELIABILITY 2024
No ratings yet
RELIABILITY 2024
30 pages
KPD Validity & Realibility
No ratings yet
KPD Validity & Realibility
25 pages
Httpseclass.yorku.capluginfile.php5802518mod Resourcecontent7Week202 Reliability20and20Validity Student.pptx
No ratings yet
Httpseclass.yorku.capluginfile.php5802518mod Resourcecontent7Week202 Reliability20and20Validity Student.pptx
1 page
Psyc 385 Exam 2 Study Guide
No ratings yet
Psyc 385 Exam 2 Study Guide
17 pages
PSYCH ASSESSMENT - MIDTERMS
No ratings yet
PSYCH ASSESSMENT - MIDTERMS
16 pages
3 - Reliability
No ratings yet
3 - Reliability
38 pages
Paprint
No ratings yet
Paprint
3 pages
Week 7reliability
No ratings yet
Week 7reliability
25 pages
Essentials of A Good Test
No ratings yet
Essentials of A Good Test
6 pages
Characteristics of Effective Selection Techniques
No ratings yet
Characteristics of Effective Selection Techniques
17 pages
Lesson 9A_Reliability
No ratings yet
Lesson 9A_Reliability
9 pages
PSYCH STATS SEMI
No ratings yet
PSYCH STATS SEMI
11 pages
Reliability PPT Presentation
100% (1)
Reliability PPT Presentation
9 pages
Reliability Psychometrics
No ratings yet
Reliability Psychometrics
7 pages
Testing and Assessment - Reliability and Validity
No ratings yet
Testing and Assessment - Reliability and Validity
5 pages
Strructures
No ratings yet
Strructures
28 pages
QUALITY OF A TEST
No ratings yet
QUALITY OF A TEST
7 pages
What To Look For in A Psychological Test
No ratings yet
What To Look For in A Psychological Test
32 pages
RELIABILITY
No ratings yet
RELIABILITY
4 pages
Psy 112 Handout 6
No ratings yet
Psy 112 Handout 6
6 pages
Language Test Reliability
No ratings yet
Language Test Reliability
20 pages
Good Psychometric Properties
No ratings yet
Good Psychometric Properties
44 pages
9 Reliability
No ratings yet
9 Reliability
10 pages
Certified Lean Six Sigma Green Belt (ICGB) Practice Questions And Exam Tests ICGB Exam Guidebook And Updated Questions
From Everand
Certified Lean Six Sigma Green Belt (ICGB) Practice Questions And Exam Tests ICGB Exam Guidebook And Updated Questions
Idea Link
No ratings yet
Evaluating a Psychometric Test as an Aid to Selection
From Everand
Evaluating a Psychometric Test as an Aid to Selection
Zuzana Robertson C.Psychol
5/5 (1)
Identity in Digital Age
No ratings yet
Identity in Digital Age
25 pages
Soci̇al İnfluence
No ratings yet
Soci̇al İnfluence
136 pages
The Intake Report Outline
No ratings yet
The Intake Report Outline
3 pages
Week - Lecture Notes
No ratings yet
Week - Lecture Notes
7 pages
Chapter 3
No ratings yet
Chapter 3
56 pages
Chapter 4
No ratings yet
Chapter 4
47 pages
Chapter 1
No ratings yet
Chapter 1
27 pages
Chapter 2
No ratings yet
Chapter 2
29 pages
Primax Platform-Brochure 2016-02 en PDF
No ratings yet
Primax Platform-Brochure 2016-02 en PDF
19 pages
Comparison of Reclaimer Types - Rev. 0
No ratings yet
Comparison of Reclaimer Types - Rev. 0
5 pages
Class 8 Holidays Homework
No ratings yet
Class 8 Holidays Homework
6 pages
wheat campagain
No ratings yet
wheat campagain
8 pages
From Mending The Life-Richard Rolle
No ratings yet
From Mending The Life-Richard Rolle
5 pages
ECPE SampleTest 1003 Test Booklet
No ratings yet
ECPE SampleTest 1003 Test Booklet
28 pages
stationary_objects_detection
No ratings yet
stationary_objects_detection
40 pages
The Secret of Writing Multiple-Choice Test Items
100% (1)
The Secret of Writing Multiple-Choice Test Items
11 pages
Research Proposal: Effects of Working Capital Management On Sme Profitability
0% (1)
Research Proposal: Effects of Working Capital Management On Sme Profitability
3 pages
Cshperspectmed ADD A039610
No ratings yet
Cshperspectmed ADD A039610
15 pages
Natural Disasters
No ratings yet
Natural Disasters
2 pages
Half Yearly Revision Assignment
No ratings yet
Half Yearly Revision Assignment
28 pages
film-directing-shot-by-shot
No ratings yet
film-directing-shot-by-shot
20 pages
Pembagian Halaqoh Ma 2023-2024.
No ratings yet
Pembagian Halaqoh Ma 2023-2024.
3 pages
ENGLISH IV - STUDENT - S TEXTBOOK Ultima Version
No ratings yet
ENGLISH IV - STUDENT - S TEXTBOOK Ultima Version
84 pages
Test Bank For Pharmacotherapeutics For Advanced Practice Nurse Prescribers 5th Edition Teri Moser Woo Marylou V Robinson
100% (39)
Test Bank For Pharmacotherapeutics For Advanced Practice Nurse Prescribers 5th Edition Teri Moser Woo Marylou V Robinson
8 pages
May 4 Movement, 1919: System, and Foreign Imperialism
No ratings yet
May 4 Movement, 1919: System, and Foreign Imperialism
4 pages
Curriculum Vitae of Md. Enamul: Career Objective
No ratings yet
Curriculum Vitae of Md. Enamul: Career Objective
2 pages
CETM47-Ass1 Tofun
No ratings yet
CETM47-Ass1 Tofun
12 pages
NASSCOM Annual Report 2010-11
0% (1)
NASSCOM Annual Report 2010-11
40 pages
Wa0137.
No ratings yet
Wa0137.
3 pages
Kesepakatan Perjanjian Jual Beli Melalui Mesin Jual Otomatis (Vending Machine) Ditinjau Dari Aspek Hukum Perjanjian
No ratings yet
Kesepakatan Perjanjian Jual Beli Melalui Mesin Jual Otomatis (Vending Machine) Ditinjau Dari Aspek Hukum Perjanjian
16 pages
Journal of Transport Geography: Jiaoe Wang, David Bonilla, David Banister
No ratings yet
Journal of Transport Geography: Jiaoe Wang, David Bonilla, David Banister
12 pages
Passage 1: Set 1 Questions
No ratings yet
Passage 1: Set 1 Questions
3 pages
Finger Math Presentation
No ratings yet
Finger Math Presentation
18 pages
'S Tips On Backing Irish Music: Henrik Norbeck
No ratings yet
'S Tips On Backing Irish Music: Henrik Norbeck
3 pages
Microeconomics 9th Edition Michael Parkin Solutions Manual - Free Access To All Available Content For Download
100% (4)
Microeconomics 9th Edition Michael Parkin Solutions Manual - Free Access To All Available Content For Download
53 pages