0% found this document useful (0 votes)

20 views

Applying Reliability Information

This document discusses applying reliability information from tests, including the standard error of measurement and how it relates to test reliability. It also covers how to use the standard error of measurement to calculate confidence intervals around scores and evaluate differences between scores.

Uploaded by

Gasai Yuno

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

20 views

Applying Reliability Information

Uploaded by

Gasai Yuno

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 34

APPLYING RELIABILITY

INFORMATION
APPLYING RELIABILITY INFORMATION
○ To determine the extent of measurement error in a test
○ The presence of error leads to two conclusions about test
scores
■ Should always be viewed as estimates of an individual’s
knowledge or characteristics
■ Decisions based on test scores should always take into
consideration the possibility of such chance variation
STANDARD OF ERROR
OF MEASUREMENT
(SEm)
STANDARD OF ERROR OF MEASUREMENT
● Standard deviation of errors of measurement that are
associated with test scores
● Allows us to quantify the extent to which a test provides
accurate scores
● For example, a student gets an IQ score of 80.
● How confident are we that the person’s true IQ score is 80?
STANDARD OF ERROR OF MEASUREMENT

SEM = SD x √(1-r)
● Standard deviation of the sample scores multiplied by the
square root of 1 minus the reliability (precision) of the scores
STANDARD OF ERROR OF MEASUREMENT
● Directly related to test reliability
● Uses the reliability coefficient to determine the average
number of points by which test scores and true scores differ
○ The larger the SEM, the lower the test’s reliability.
■ If test reliability = 0, the SEM will equal the standard
deviation of the observed test scores.
■ If test reliability = 1.00, the SEM is zero .
STANDARD OF ERROR OF MEASUREMENT
● Consider an test with an SD of 10
● Let’s quantify the SEM for a score of 100 under four different
conditions
○ Reliability = 0.9
○ Reliability = 0.8
○ Reliability = 0.5
○ Reliability = 0.2
r = 0.9 SD = 10 good measure

Equation SEM = SD x √(1-r)

Substitute SEM = 10 x √(1-0.9)

Calculate the SEM SEM = 3.16

good measure

r = 0.8 SD = 10 good measure

Equation SEM = SD x √(1-r)

Substitute SEM = 10 x √(1-0.8)

Calculate the SEM SEM = 4.47

r = 0.5 SD = 10 poor measure

Equation SEM = SD x √(1-r)

Substitute SEM = 10 x √(1-0.5)

Calculate the SEM SEM = 7.07

r = 0.2 SD = 10 really poor measure

Equation SEM = SD x √(1-r)

Substitute SEM = 10 x √(1-0.2)

Calculate the SEM SEM = 8.94

STANDARD OF ERROR OF MEASUREMENT

SCORE

55 65 75 85 95

10 10
CONFIDENCE INTERVALS
CONFIDENCE INTERVALS
● We use the SEM to calculate confidence intervals around
obtained scores
● Because all test scores include measurement error and only
estimate true scores, it is useful to convert individual test
scores to ranges within which the true score is likely to fall
● We use the person’s test score and SEM to infer what would
happen if the person were tested repeatedly
CONFIDENCE INTERVALS
● Common SEM confidence intervals and their formulas:

68%CI = Score ±SEM

95%CI = Score ±(1.96*SEM)
99%CI = Score ±(2.58*SEM)
CONFIDENCE INTERVALS
SEM = 15 x √(1-.70) 95%CI = 1.96 x 8.22
SEM = 8.22 95%CL = 16.22

100 95%CI = 100 + 16.11 = 116.11

100 95%CI = 100 - 16.11 = 83.89

68%CI = 8.22 99%CI = 2.58 x 8.22

100 68%CI = 100 + 8.22 = 108.22 99%CI = 21.21

100 68%CI = 100 - 8.22 = 91.78 100 68%CI = 100 +21.21 = 121.21

100 68%CI = 100 - 21.21= 78.79

CONFIDENCE INTERVALS
SEM = 15 x √(1-.90) 95%CI = 1.96 x 4.74
SEM = 4.74 95%CL = 9.29

100 95%CI = 100 + 9.29 = 109.29

100 95%CI = 100 -9.29 = 90.71

68%CI = 4.74 99%CI = 2.58 x 4.74

100 68%CI = 100 + 4.74 = 104.74 99%CI = 12.22

100 68%CI = 100 - 4.74 = 95.26 100 68%CI = 100 + 12.22 = 112.22

100 68%CI = 100 - 12.22 = 112.22

CONFIDENCE INTERVALS
SEM = 15 x √(1-.70) 95%CI = 1.96 x 8.22
SEM = 8.22 95%CL = 16.11

100 95%CI = 100 + 16.11 = 116.11

100 95%CI = 100 - 16.11 = 83.89

68%CI = 8.22 99%CI = 2.58 x 8.22

100 68%CI = 100 + 8.22 = 108.22 99%CI = 21.21

100 68%CI = 100 - 8.22 = 91.78 100 68%CI = 100 +21.21 = 121.21

100 68%CI = 100 - 21.21= 78.79

CONFIDENCE INTERVALS
● Just because an instrument produces scores with a high level of
accuracy does not mean those scores are valid
● The size of the confidence interval increases as the extent of
error in test scores increases
EVALUATING THE
DIFFERENCES BETWEEN
TWO SCORES
EVALUATING THE DIFFERENCE BETWEEN TWO SCORES

● Since all test scores reflect some measurement error, we must

be careful when we compare test scores
○ For example, a person took an SAT and received a total
score (verbal + math) of 980. The person then enrolls in a
cram course designed to increase SAT scores, retakes SAT,
and receives a total score of 1100.
○ The test score has improved but why?
SEM OF A DIFFERENCE

SEMdiff = √(SEM test1)2 + (SEM test2)2

● The SEM of a difference is the standard deviation of the set of

possible difference scores that could occur on a set of tests

● Indicates the average amount by which test scores can be

expected to differ on the basis of chance

● How likely it is that a particular difference score will occur by

chance
SEM OF A DIFFERENCE
A student receives a score of 63 on a unit exam and a score of 72
on a retake using an alternative form of the exam

Observed score difference = 72 - 63 = 9

SEM form 1 = 4 points SEM form 2 = 5 points

SEMdiff = √42 + 52 = √16+25 = √41 = 6.4 points

68%CI = Score of a difference of ±6.4

95%CI = Score of a difference ±12.544

SEM OF A DIFFERENCE
● EXAMPLE
○ Mary Jones has T scores of 45 on one test and 60 on
another. The first test A has a reliability of 0.80 and the
second B of 0.90. Was her score on test B statistically
significantly better than her score on test A? The scale of
SD is 10 on both tests because both are T score scales. To
answer this question we must use four steps.
SEM OF A DIFFERENCE
● First step: calculating the SEM

SEM = 15 x √(1-.70)
Test A SEM = 10 x √(1-0.80) = 4.47
Test B SEM = 10 x √(1-0.90) = 3.16
SEM OF A DIFFERENCE
● Second step: Calculate the SEdiff

SEMdiff = √(SEM test1)2 + (SEM test2)2

SEMdiff = √(4.47)2 + (3.16)2
SEMdiff = 5.47
SEM OF A DIFFERENCE
● Third step: we need to evaluate the difference (or distance)
between scores and see if this is greater or less the two
SEdiffs.
● One score is 45 and the other is 60. So the difference between
the two scores is 60 - 45. This is a distance of 15 (SEMdiffAB)
points which is greater than two SEdiffs (twice 5.47) another
method is to divide the distance by the SEMdiffAB to see if
the answer comes out bigger than 2

15 ፥ 5.47 = 2.74 SEdiffs

SEM OF A DIFFERENCE
● Fourth step: we have to decide on the confidence level
● If two scores differ by one SEdiff we can only be 68%
confident that the true scores are different. If they differ by
two or more SEdiffs we can be 95% confident the true scores
really differ. In this case they have a difference of 2.74 Sediffs
which is greater than the 95% confidence interval. So if the
difference is 2.74 SEdiffs we can be sure that the two scores
are statistically significantly different with 95% confidence
Therefore, Mary really is better on test B than on test A
EVALUATING COMPOSITE
OR AVERAGE SCORES
EVALUATING COMPOSITE OR AVERAGE SCORES

● In certain conditions, a person is evaluated not by a single test

score, but on the basis of a series of tests.
● It is a good idea to calculate a confidence interval around a
student’s total or average score before deciding on a final
grade.
EVALUATING COMPOSITE OR AVERAGE SCORES

● SEM of a total or average score

SEMean = SD/√N
EVALUATING COMPOSITE OR AVERAGE SCORES

A student receives the following five grades: 76. 74, 78, 83, 84
X = 79
SD = 6.8987
N=5

SEMean = 3.899/√5 = 1.744

95%CI = 79 ± (1.96)(1.744) = 79 ± 3.418 = 75.582 to 82.418

EVALUATING COMPOSITE OR AVERAGE SCORES

● Based on these test scores, our best estimate (95% confidence

interval) is that this student’s true average lies between 75.58
and 82.42. Now the grade can be compared with other
students who have averages of 79 or 80 to see what grade
seems most appropriate.
ref
http://www.fldoe.org/core/fileparse.php/7567/urlt/y1996-7.pdf

https://home.apu.edu/~bsimmerok/WebTMIPs/Session6/TSes6.html

https://www.statisticssolutions.com/composite-scoring-and-reliability/

https://www.statisticshowto.datasciencecentral.com/standard-error-of-measurement/

Handwriting Without Tears
100% (8)
Handwriting Without Tears
63 pages
Customer Loyalty
No ratings yet
Customer Loyalty
39 pages
Standard Error of Measurement and Confidence Intervals PATOSS Updated June 2020
No ratings yet
Standard Error of Measurement and Confidence Intervals PATOSS Updated June 2020
8 pages
Classroom Assessment For K To 12 Basic Education Program: (Deped Order # 8, S. 2015)
No ratings yet
Classroom Assessment For K To 12 Basic Education Program: (Deped Order # 8, S. 2015)
53 pages
ExtraExercises Week1-2 WithAnswers 2023-2024
No ratings yet
ExtraExercises Week1-2 WithAnswers 2023-2024
3 pages
May 7 2024
No ratings yet
May 7 2024
31 pages
Confidence Intervals, Limits, and Levels? Confidence Intervals, Limits, and Levels?
No ratings yet
Confidence Intervals, Limits, and Levels? Confidence Intervals, Limits, and Levels?
5 pages
Week 4.2 - The Importance of Reliability
No ratings yet
Week 4.2 - The Importance of Reliability
33 pages
TYPESOFRELIABILITY
No ratings yet
TYPESOFRELIABILITY
5 pages
5 Reliability
No ratings yet
5 Reliability
29 pages
Statistical Significance Using Confidence Intervals (2020)
No ratings yet
Statistical Significance Using Confidence Intervals (2020)
49 pages
Week4 2 Testing
No ratings yet
Week4 2 Testing
21 pages
Readings Psy211
No ratings yet
Readings Psy211
23 pages
Psychological Testing 2018 PDF
No ratings yet
Psychological Testing 2018 PDF
74 pages
Handbook -Sampling and Sampling Distributions
No ratings yet
Handbook -Sampling and Sampling Distributions
10 pages
4.1. Estimation
No ratings yet
4.1. Estimation
30 pages
Nature of Reliability and Other Desired Characteristics: Report By: Marrione Eubert M. Estepa
100% (1)
Nature of Reliability and Other Desired Characteristics: Report By: Marrione Eubert M. Estepa
14 pages
APP601S Chapter 4 - Data Handling in Analytical Chem
No ratings yet
APP601S Chapter 4 - Data Handling in Analytical Chem
42 pages
Reliability Note
No ratings yet
Reliability Note
5 pages
Reliability
No ratings yet
Reliability
11 pages
Theory of Estimation- April 2021
No ratings yet
Theory of Estimation- April 2021
21 pages
CHAPTER 4 Norms and Reliability - PPT
No ratings yet
CHAPTER 4 Norms and Reliability - PPT
54 pages
Chapter 4: Reliability
No ratings yet
Chapter 4: Reliability
40 pages
Psy 112 Handout 6
No ratings yet
Psy 112 Handout 6
6 pages
Data Anlalysis
No ratings yet
Data Anlalysis
6 pages
Confidence Intervals Concept
No ratings yet
Confidence Intervals Concept
10 pages
Statistics-and-Probability-Q3-SSLM-
No ratings yet
Statistics-and-Probability-Q3-SSLM-
6 pages
PSY211_READINGS
No ratings yet
PSY211_READINGS
12 pages
SEM & Confidence Interval
No ratings yet
SEM & Confidence Interval
39 pages
APP601S Chapter 4- Data Handling in Anal Chem
No ratings yet
APP601S Chapter 4- Data Handling in Anal Chem
42 pages
Statistics 1 Revision Sheet
No ratings yet
Statistics 1 Revision Sheet
9 pages
3NUEPIBIODispersionandProbability2T24-25
No ratings yet
3NUEPIBIODispersionandProbability2T24-25
108 pages
Statistics Notes
No ratings yet
Statistics Notes
23 pages
distribution of data
No ratings yet
distribution of data
32 pages
Lab 4: Alpha and Standard Error of Measurement
No ratings yet
Lab 4: Alpha and Standard Error of Measurement
16 pages
227 - ch7 HW Soln
No ratings yet
227 - ch7 HW Soln
13 pages
One Tailed Test
No ratings yet
One Tailed Test
16 pages
HMEF5053 Topic 8 Reliability Validity
No ratings yet
HMEF5053 Topic 8 Reliability Validity
20 pages
Chapter 8 Confidence Intervals
No ratings yet
Chapter 8 Confidence Intervals
34 pages
Lecture 3 PDF
100% (2)
Lecture 3 PDF
77 pages
Exercises CI and HT
No ratings yet
Exercises CI and HT
5 pages
SSCK 1203 Data Analysis 090214 Students 02
No ratings yet
SSCK 1203 Data Analysis 090214 Students 02
36 pages
Submitted By: Fuenteblanca Gyka J. Lebeco Joanne Submitted To: Sir Ubenia ENG 106.1
No ratings yet
Submitted By: Fuenteblanca Gyka J. Lebeco Joanne Submitted To: Sir Ubenia ENG 106.1
26 pages
L14 Estimation
No ratings yet
L14 Estimation
50 pages
Structural Equation Modelling
No ratings yet
Structural Equation Modelling
36 pages
Formulas
No ratings yet
Formulas
3 pages
10.1 Power Point
No ratings yet
10.1 Power Point
17 pages
New Vison
No ratings yet
New Vison
5 pages
10.1 Power Point Part 2
No ratings yet
10.1 Power Point Part 2
20 pages
1991 HARVILL Standard Error Measurement
No ratings yet
1991 HARVILL Standard Error Measurement
9 pages
Quantitative Methods - I ST1
No ratings yet
Quantitative Methods - I ST1
10 pages
Review of Basic Concepts/Solutions and Their Concentrations
No ratings yet
Review of Basic Concepts/Solutions and Their Concentrations
109 pages
Presentations Tatta of Ee Q Ah
No ratings yet
Presentations Tatta of Ee Q Ah
13 pages
Biostat Lecture seven
No ratings yet
Biostat Lecture seven
59 pages
Biostat 6&7
No ratings yet
Biostat 6&7
6 pages
Psychological Assessmet SEM
No ratings yet
Psychological Assessmet SEM
2 pages
05 Lecture4 - Estimation
No ratings yet
05 Lecture4 - Estimation
37 pages
Stat For Business Spring2017 Final Exam PSUT
No ratings yet
Stat For Business Spring2017 Final Exam PSUT
5 pages
Solutions Manual to accompany Introduction to Linear Regression Analysis
From Everand
Solutions Manual to accompany Introduction to Linear Regression Analysis
Douglas C. Montgomery
1/5 (1)
GCSE Maths Revision: Cheeky Revision Shortcuts
From Everand
GCSE Maths Revision: Cheeky Revision Shortcuts
Scool Revision
3.5/5 (2)
This is The Statistics Handbook your Professor Doesn't Want you to See. So Easy, it's Practically Cheating...
From Everand
This is The Statistics Handbook your Professor Doesn't Want you to See. So Easy, it's Practically Cheating...
S. Deviant
4.5/5 (6)
Acceptance-Rejection Sampling and Multi-dimensional Monte Carlo Integrations Utilizing Mathematica®
From Everand
Acceptance-Rejection Sampling and Multi-dimensional Monte Carlo Integrations Utilizing Mathematica®
SUJAUL CHOWDHURY
No ratings yet
Achievement Tests: Gregory J. Cizek
No ratings yet
Achievement Tests: Gregory J. Cizek
6 pages
10 1108 - BFJ 07 2021 0796
No ratings yet
10 1108 - BFJ 07 2021 0796
28 pages
Suggested Format For Writing A Psychological Testing Report
No ratings yet
Suggested Format For Writing A Psychological Testing Report
26 pages
Customer Orientation
No ratings yet
Customer Orientation
15 pages
Reviewer
No ratings yet
Reviewer
68 pages
Perkins 2008
No ratings yet
Perkins 2008
23 pages
Jurnal - Job Insecurity - Eist, Cuyper, Witte, 2014
No ratings yet
Jurnal - Job Insecurity - Eist, Cuyper, Witte, 2014
19 pages
Lesson 7 The Basics of Experimentation
100% (1)
Lesson 7 The Basics of Experimentation
48 pages
Construction and Validation of A Scale On The Different Sources of Conceptual Understanding of Conic Sections
No ratings yet
Construction and Validation of A Scale On The Different Sources of Conceptual Understanding of Conic Sections
7 pages
Fcps Community Medicine
100% (1)
Fcps Community Medicine
679 pages
Chapter 5 Reliability
No ratings yet
Chapter 5 Reliability
9 pages
Service Quality Customer Satisfaction 3 PDF
No ratings yet
Service Quality Customer Satisfaction 3 PDF
17 pages
Kondili Et Al (2022) - Predictors of Cultural Humility in Counselors-In-Training
No ratings yet
Kondili Et Al (2022) - Predictors of Cultural Humility in Counselors-In-Training
13 pages
Kyriakides BVQ06
No ratings yet
Kyriakides BVQ06
22 pages
Name: Shifa Ambreen Mughal ID Number: 0000395127 Course: Classroom Assessment (6407) (6407) Level: ADE/B.Ed Semester: Autumn, 2023
No ratings yet
Name: Shifa Ambreen Mughal ID Number: 0000395127 Course: Classroom Assessment (6407) (6407) Level: ADE/B.Ed Semester: Autumn, 2023
18 pages
Nevid CH03 TB
No ratings yet
Nevid CH03 TB
75 pages
Assessing Travel Time Reliability of Public Transport in Kolkata: A Case Study
No ratings yet
Assessing Travel Time Reliability of Public Transport in Kolkata: A Case Study
15 pages
Full CDS
No ratings yet
Full CDS
41 pages
Psychosocial Support Activities on Learners’ Psychosocial Readiness and Well-Being in Elementary Education
100% (1)
Psychosocial Support Activities on Learners’ Psychosocial Readiness and Well-Being in Elementary Education
8 pages
Ejes Pipih
No ratings yet
Ejes Pipih
16 pages
Anderson-Reidy 2012 - FE en Escolares
No ratings yet
Anderson-Reidy 2012 - FE en Escolares
16 pages
CEBQ
No ratings yet
CEBQ
359 pages
Intelligence Cycle: Icitap Philippines
100% (4)
Intelligence Cycle: Icitap Philippines
47 pages
Financial Resource Availability and Implementation of Child Protection and Safeguarding Programs in Kwale County, Kenya
No ratings yet
Financial Resource Availability and Implementation of Child Protection and Safeguarding Programs in Kwale County, Kenya
14 pages
Research Methodology: Research 2 Ma. Daniela Anne B. Samaniego, RPM
No ratings yet
Research Methodology: Research 2 Ma. Daniela Anne B. Samaniego, RPM
43 pages
Relationship Between Attitude Toward Aging, Health Literacy, and Utilization of Healthcare Services Among Older Adults in Suburban Areas in Iligan City
No ratings yet
Relationship Between Attitude Toward Aging, Health Literacy, and Utilization of Healthcare Services Among Older Adults in Suburban Areas in Iligan City
9 pages
Classical Item and Test Analysis Report User Test 1
No ratings yet
Classical Item and Test Analysis Report User Test 1
48 pages
Feduc 08 1097993
No ratings yet
Feduc 08 1097993
12 pages

Applying Reliability Information

Uploaded by

Applying Reliability Information

Uploaded by

APPLYING RELIABILITY

Equation SEM = SD x √(1-r)

Substitute SEM = 10 x √(1-0.9)

Calculate the SEM SEM = 3.16

r = 0.8 SD = 10 good measure

Equation SEM = SD x √(1-r)

Substitute SEM = 10 x √(1-0.8)

Calculate the SEM SEM = 4.47

Equation SEM = SD x √(1-r)

Substitute SEM = 10 x √(1-0.5)

Calculate the SEM SEM = 7.07

Equation SEM = SD x √(1-r)

Substitute SEM = 10 x √(1-0.2)

Calculate the SEM SEM = 8.94

68%CI = Score ±SEM

100 95%CI = 100 + 16.11 = 116.11

100 95%CI = 100 - 16.11 = 83.89

68%CI = 8.22 99%CI = 2.58 x 8.22

100 68%CI = 100 + 8.22 = 108.22 99%CI = 21.21

100 68%CI = 100 - 21.21= 78.79

100 95%CI = 100 + 9.29 = 109.29

100 95%CI = 100 -9.29 = 90.71

68%CI = 4.74 99%CI = 2.58 x 4.74

100 68%CI = 100 + 4.74 = 104.74 99%CI = 12.22

100 68%CI = 100 - 12.22 = 112.22

100 95%CI = 100 + 16.11 = 116.11

100 95%CI = 100 - 16.11 = 83.89

68%CI = 8.22 99%CI = 2.58 x 8.22

100 68%CI = 100 + 8.22 = 108.22 99%CI = 21.21

100 68%CI = 100 - 21.21= 78.79

● Since all test scores reflect some measurement error, we must

SEMdiff = √(SEM test1)2 + (SEM test2)2

possible difference scores that could occur on a set of tests

● Indicates the average amount by which test scores can be

expected to differ on the basis of chance

● How likely it is that a particular difference score will occur by

Observed score difference = 72 - 63 = 9

SEM form 1 = 4 points SEM form 2 = 5 points

SEMdiff = √42 + 52 = √16+25 = √41 = 6.4 points

68%CI = Score of a difference of ±6.4

95%CI = Score of a difference ±12.544

SEMdiff = √(SEM test1)2 + (SEM test2)2

15 ፥ 5.47 = 2.74 SEdiffs

● In certain conditions, a person is evaluated not by a single test

● SEM of a total or average score

SEMean = 3.899/√5 = 1.744

95%CI = 79 ± (1.96)(1.744) = 79 ± 3.418 = 75.582 to 82.418

● Based on these test scores, our best estimate (95% confidence

You might also like