Test Construction Slides
Test Construction
Introduction
Outline of Topics
• Item Analysis
• Reliability
• Methods for Assessing Reliability
• Factors that Affect the Reliability Coefficient
• Standard Error of Measurement
• Validity
• Content Validity
• Construct Validity
• Criterion-Related Validity
Study Strategies
• Study Emphasis:
• master basic terms and concepts first
• if you have time, become familiar with some of the more advanced
concepts
• Memorization Strategies:
• use strategies that will ensure that information is adequately encoded
• use multiple modalities and multiple study strategies
• schedule ample time for review
• study only one content domain at a time
Test Construction
Item Analysis
Item Difficulty
• Description:
• refers to the proportion of examinees in the tryout sample who
answered the item correctly
• is of concern for tests designed to measure an examinee’s knowledge
or skill level
Item Difficulty
• Item Difficulty Index (cont.):
• for most tests, a test developer wants items with p values close to .50
• if the goal of testing is to choose a certain number of top performers, the
optimal p value corresponds to the proportion of examinees to be
chosen
• the optimal value is also affected by the likelihood that examinees can
select the correct answer by guessing, with the preferred difficulty level
being halfway between 100% of examinees answering the item correctly
and the probability of answering correctly by guessing
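To make the p value and the guessing adjustment concrete, here is a minimal Python sketch; the response data, function names, and the four-option example are illustrative, not from the slides.

```python
# Minimal sketch: item difficulty (p value) and guessing-adjusted optimal p.
# Responses are coded 1 = correct, 0 = incorrect; all data are made up.

def item_difficulty(responses):
    """p value: proportion of examinees who answered the item correctly."""
    return sum(responses) / len(responses)

def optimal_p(chance_level):
    """Halfway between 1.00 and the probability of a correct guess."""
    return (1.0 + chance_level) / 2

tryout = [1, 1, 0, 1, 0, 1, 1, 0, 1, 0]  # one item, ten examinees
print(item_difficulty(tryout))           # 0.6
print(optimal_p(0.25))                   # 0.625 for a four-option multiple-choice item
```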
Item Difficulty
• Item Difficulty Index (cont.):
• the optimal p value also depends on the test's ceiling and floor
• a test has adequate ceiling when it can distinguish between examinees
with high levels of the attribute being measured
• ceiling is maximized by including a large proportion of items with a low p
value
• a test has adequate floor when it can distinguish between examinees with
low levels of the attribute being measured
• floor is maximized by including a large proportion of items with a high p
value
Item Discrimination
• Description:
• refers to the extent to which an item discriminates between examinees
who obtain low or high scores on the test or an external criterion
Item Discrimination
• Item Discrimination Index (cont.):
• D ranges in value from -1 to +1
• when D equals +1, all examinees in the upper-scoring group answered the item
correctly while all examinees in the lower-scoring group answered the item
incorrectly
• when D equals 0, the same percent of examinees in both groups answered the
item correctly
• when D equals -1, all examinees in the lower-scoring group answered the item
correctly while all examinees in the upper-scoring group answered the item
incorrectly
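The slides describe D's endpoints but not its computation; the sketch below uses the standard formula consistent with those endpoints (D = proportion correct in the upper group minus proportion correct in the lower group), with made-up data.

```python
# Sketch of the item discrimination index: D = p(upper) - p(lower).
# A positive D means high scorers pass the item more often than low scorers.

def discrimination_index(upper, lower):
    p_upper = sum(upper) / len(upper)  # proportion of upper group answering correctly
    p_lower = sum(lower) / len(lower)  # proportion of lower group answering correctly
    return p_upper - p_lower

upper_group = [1, 1, 1, 1, 0]  # top-scoring examinees' responses (1 = correct)
lower_group = [1, 0, 0, 0, 0]  # bottom-scoring examinees' responses
print(round(discrimination_index(upper_group, lower_group), 2))  # 0.8 - 0.2 = 0.6
```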
Test Construction
Test Reliability
(Session #1)
Reliability
• Classical Test Theory:
• variability in test scores reflects a combination of true score variability
and variability due to measurement (random) error
X = T + E
Reliability
• Estimating Reliability:
• reliability is estimated by evaluating consistency in scores over time or
across different forms of the test, different test items, or different raters
• most methods for estimating reliability produce a reliability coefficient
• the reliability coefficient is symbolized as rxx
• reliability coefficients range in value from 0 to +1
• they are interpreted directly (without being squared) as the proportion of
observed score variability that reflects true score variability
Reliability
• Test-Retest Reliability:
• provides a measure of test score consistency or stability over time
• is calculated by administering the test to the same examinees on two
occasions and correlating the two sets of scores
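As a quick sketch, the coefficient is just the Pearson correlation between the two administrations; the scores below are invented.

```python
# Sketch: test-retest reliability as the Pearson r between two administrations.
import numpy as np

time1 = np.array([12, 15, 9, 20, 17, 11])   # scores at the first administration
time2 = np.array([13, 14, 10, 19, 18, 12])  # the same examinees retested later
r_xx = np.corrcoef(time1, time2)[0, 1]
print(round(r_xx, 2))  # values near +1 indicate stable scores over time
```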
Reliability
• Alternate Forms Reliability:
• provides a measure of test score consistency over two forms of the test
• is calculated by correlating the scores obtained by a sample of examinees
on the two forms
Reliability
• Internal Consistency Reliability:
• indicates the degree of consistency across different test items
• is appropriate for tests that measure a single content or behavior
domain
• is useful for estimating the reliability of tests that measure
characteristics that fluctuate over time or are susceptible to memory or
practice effects
Reliability
• Internal Consistency Reliability (cont.):
• split-half reliability involves splitting the test in half and correlating examinees'
scores on the two halves
• tends to underestimate the test's reliability
• consequently, the split-half reliability coefficient is corrected using the Spearman-
Brown prophecy formula
• the Spearman-Brown formula can also be used more generally to estimate the
effect of shortening or lengthening a test on its reliability coefficient
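A sketch of the prophecy formula follows; with n = 2 it corrects a split-half coefficient to full-test length, and other values of n estimate the effect of lengthening or shortening the test. The numbers are illustrative.

```python
# Spearman-Brown prophecy formula: projected reliability when test length
# changes by a factor of n (n = 2 corrects a split-half coefficient).

def spearman_brown(r, n):
    return (n * r) / (1 + (n - 1) * r)

split_half = 0.70
print(round(spearman_brown(split_half, 2), 2))  # 0.82: full-length estimate
print(round(spearman_brown(0.82, 0.5), 2))      # 0.69: halving the test lowers reliability
```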
Reliability
• Internal Consistency Reliability (cont.):
• Cronbach's coefficient alpha is the “mean of all possible split-half
correlation coefficients”
• Kuder-Richardson Formula 20 (KR-20) can be used as a substitute for
coefficient alpha when test items are scored dichotomously
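Here is a sketch of the coefficient alpha computation on a small made-up item-score matrix; with dichotomous (0/1) items like these, the result matches KR-20.

```python
# Cronbach's alpha: k/(k-1) * (1 - sum of item variances / total-score variance).
import numpy as np

scores = np.array([  # rows = examinees, columns = dichotomously scored items
    [1, 0, 1, 1],
    [1, 1, 1, 0],
    [0, 0, 1, 0],
    [1, 1, 1, 1],
    [0, 0, 0, 0],
])
k = scores.shape[1]
item_vars = scores.var(axis=0, ddof=1)      # variance of each item
total_var = scores.sum(axis=1).var(ddof=1)  # variance of examinees' total scores
alpha = (k / (k - 1)) * (1 - item_vars.sum() / total_var)
print(round(alpha, 2))  # 0.79 for this sample matrix
```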
Reliability
• Inter-Rater Reliability:
• is important for measures that are subjectively scored, such as essay
and projective tests
• can be evaluated using percent agreement, but this tends to
overestimate inter-rater reliability
• alternatively, a special correlation coefficient can be used
• Cohen’s kappa statistic is used to measure agreement between two raters
when scores represent a nominal scale
• Kendall’s coefficient of concordance is used to measure agreement
between three or more raters when scores are reported as ranks
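Below is a from-scratch sketch of Cohen's kappa for two raters and nominal categories; it shows how kappa corrects observed percent agreement for chance agreement. The ratings are invented.

```python
# Cohen's kappa: (observed agreement - chance agreement) / (1 - chance agreement).
from collections import Counter

def cohens_kappa(rater1, rater2):
    n = len(rater1)
    observed = sum(a == b for a, b in zip(rater1, rater2)) / n
    counts1, counts2 = Counter(rater1), Counter(rater2)
    chance = sum(counts1[c] * counts2[c] for c in counts1) / (n * n)
    return (observed - chance) / (1 - chance)

r1 = ["yes", "yes", "no", "no", "yes", "no"]
r2 = ["yes", "no",  "no", "no", "yes", "yes"]
print(round(cohens_kappa(r1, r2), 2))  # 0.33: agreement is modest once chance is removed
```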
Test Construction
Test Reliability
(Session #2)
Reliability
• Factors that Affect the Reliability Coefficient:
• longer tests are generally more reliable than shorter tests
• a wide range of scores increases the size of the reliability coefficient
• the more homogeneous a test is with regard to content, the higher its
reliability coefficient
• the more difficult it is to pick the right answer by guessing, the larger the
reliability coefficient
Reliability
• Confidence Intervals:
• because tests are not totally reliable, an examinee's obtained score may
or may not be his/her true score
• consequently, it’s always best to interpret an examinee’s obtained score
in terms of a confidence interval
• a confidence interval indicates the range within which an examinee’s true
score is likely to fall given his/her obtained score
• it is derived using the standard error of measurement (SEM)
Reliability
• Confidence Intervals (cont.):
• for the 68% confidence interval, one SEM is added to and subtracted
from the obtained score
• for the 95% confidence interval, two SEMs are added to and subtracted
from the obtained score
• for the 99% confidence interval, three SEMs are added to and
subtracted from the obtained score
Reliability
• Standard Error of Measurement:
SEM = SDx √(1 - rxx)

Example:
SEM = 10 √(1 - .91)
    = 10(.3)
    = 3
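The sketch below ties the formula to the confidence-interval rules from the previous slide, reproducing the slide's numbers (SDx = 10, rxx = .91); the obtained score of 100 is hypothetical.

```python
# Standard error of measurement and the confidence intervals built from it.
import math

def sem(sd_x, r_xx):
    return sd_x * math.sqrt(1 - r_xx)  # SEM = SDx * sqrt(1 - rxx)

s = sem(10, 0.91)                                        # 10 * 0.3 = 3.0
obtained = 100                                           # hypothetical obtained score
print(round(obtained - s, 1), round(obtained + s, 1))            # 97.0 103.0 -> 68% CI
print(round(obtained - 2 * s, 1), round(obtained + 2 * s, 1))    # 94.0 106.0 -> 95% CI
```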
Test Construction
Content and Construct Validity
(Session #1)
Validity
• Definition:
• refers to a test's accuracy in terms of the extent to which the test
measures what it was designed to measure
• Types of Validity:
• content validity is important for tests designed to measure a specific
content or behavior domain
• construct validity is important for tests designed to measure a
hypothetical trait or construct
• criterion-related validity is important for tests that will be used to predict
or estimate an examinee’s status on an external criterion
Validity
• Content Validity:
• is of concern when a test is designed to measure a content or behavior
domain
• is built into the test while it’s being constructed
• after a test has been developed, content validity is evaluated by subject
matter experts who determine if test items are an adequate and
representative sample of the content or behavior domain
• content validity is not the same as face validity
• face validity refers to whether or not test items “look like” they’re
measuring what the test is designed to measure
Validity
• Construct Validity:
• is important for tests designed to measure a hypothetical trait or
construct
• several methods are used to evaluate construct validity
• the multitrait-multimethod matrix is a table of correlation coefficients
that provides information about a test's convergent and divergent
(discriminant) validity
• factor analysis also provides information about convergent and
divergent validity but is a more complex technique
Validity
• Multitrait-Multimethod Matrix:
• its use requires a minimum of four measures – the measure being
validated; a measure of the same trait using a different method; a
measure of an unrelated trait using the same method; and a measure of
the same unrelated trait using a different method
• the correlation between the test we're validating and the measure of the
same trait using a different method provides information about the test's
convergent validity
• the correlations between the test we’re validating and the measures of
unrelated traits provide information about the test’s divergent validity
Validity
• Multitrait-Multimethod Matrix (cont.):
[matrix of correlations among the Assertiveness Test, Assertiveness Rating,
Aggressiveness Test, and Aggressiveness Rating]
Test Construction
Content and Construct Validity
(Session #2)
Validity
• Steps in Factor Analysis:
1. Administer tests to a sample of examinees
2. Derive and interpret the correlation matrix
3. Extract the initial factor matrix
4. Rotate the factor matrix
5. Name the factors
Validity
• Rotated Factor Matrix:
                                   Factor I   Factor II   Communality
Interpersonal Assertiveness Test     .78        .16          .64
Global Assertiveness Rating          .69        .14          .49
Behavioral Assertiveness Scale       .59        .12          .36
Aggressiveness Self-Rating           .14        .69          .49
Global Aggressiveness Rating         .12        .59          .36
Social Aggressiveness Scale          .10        .49          .25
Validity
• Rotated Factor Matrix (cont.):
                                   Factor I   Factor II   Communality
Interpersonal Assertiveness Test     .78        .16            ?
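To answer the "?" above: a variable's communality is the sum of its squared factor loadings, so the missing value can be recomputed directly (the tabled .64 reflects rounding in the loadings).

```python
# Communality = sum of squared factor loadings for a variable.
loadings = [0.78, 0.16]                      # Factor I and Factor II loadings
communality = sum(l ** 2 for l in loadings)  # 0.6084 + 0.0256 = 0.634
print(round(communality, 2))                 # 0.63, i.e., the tabled .64 within rounding
```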
Validity
• Factor Analysis:
• the rotation of a factor matrix can be orthogonal or oblique
• orthogonal means uncorrelated, while oblique means correlated
• a researcher decides which is appropriate based on his/her theory about
the characteristics measured by the tests included in the analysis
Test Construction
Criterion-Related Validity
(Session #1)
Validity
• Criterion-Related Validity:
• is important when test scores will be used to predict or estimate status
on a criterion
• is evaluated by correlating scores on the test (predictor) with scores on
the criterion for a sample of examinees to obtain a criterion-related
validity coefficient
• a concurrent validity study involves obtaining scores on the predictor
and criterion at about the same time
• a predictive validity study involves obtaining predictor scores prior to
obtaining criterion scores
Validity
• Confidence Intervals:
• because the relationship between a predictor and criterion is never
perfect, there's some degree of error whenever a predictor is used to
predict or estimate status on a criterion
• consequently, the standard error of estimate is used to construct a
confidence interval around a predicted criterion score
• the procedure for constructing a confidence interval around a
predicted criterion score is the same as the procedure for
constructing a confidence interval around an obtained test score
Validity
• Standard Error of Estimate:
SEest = SDy √(1 - rxy²)

Example:
SEest = 10 √(1 - .60²)
      = 10(.8)
      = 8
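A sketch paralleling the SEM example: the standard error of estimate uses the criterion's standard deviation and the squared validity coefficient. The predicted criterion score of 70 is hypothetical.

```python
# Standard error of estimate: SEest = SDy * sqrt(1 - rxy**2).
import math

def se_est(sd_y, r_xy):
    return sd_y * math.sqrt(1 - r_xy ** 2)

s = se_est(10, 0.60)  # 10 * 0.8 = 8.0
predicted = 70        # hypothetical predicted criterion score
print(round(predicted - s, 1), round(predicted + s, 1))  # 62.0 78.0 -> 68% CI
```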
Validity
• Relationship Between Reliability and Validity:
• reliability is a necessary but not sufficient condition for validity
• as indicated by the following formula, reliability places an upper limit on
validity:
rxy ≤ √rxx

Example:
if rxx = .81, the validity coefficient can be no larger than √.81 = .90
Test Construction
Criterion-Related Validity
(Session #2)
Validity
• Steps in Validating a Predictor:
1. Conduct a job analysis
2. Select/develop the predictor and criterion
3. Obtain and correlate scores on the predictor and criterion
4. Check for adverse impact
5. Evaluate incremental validity
6. Cross-validate
Validity
• Incremental Validity:
• refers to the increase in decision-making accuracy that use of a
predictor provides
• even when a predictor has a large validity coefficient, it may not
increase decision-making accuracy beyond the current level
• is evaluated by comparing the number of correct decisions made with
and without the new predictor
Validity
• Incremental Validity (cont.):
• calculated by subtracting the base rate from the positive hit rate
Incremental Validity = Positive Hit Rate – Base Rate
• Example:
Positive Hit Rate = 9/10 = 90%
Base Rate = 15/30 = 50%
Incremental Validity = 90% - 50% = 40%
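The slide's numbers as a one-line computation (the variable names are illustrative):

```python
# Incremental validity = positive hit rate - base rate, using the slide's example.
positive_hit_rate = 9 / 10   # 90% of predictor-selected examinees succeed
base_rate = 15 / 30          # 50% succeed without the new predictor
print(positive_hit_rate - base_rate)  # 0.4 -> a 40% gain in decision accuracy
```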
Test Construction
Test Score Interpretation
• Norm-Referenced Interpretation:
• involves comparing an examinee's test score to scores obtained in a
standardization sample or other comparison group
• an examinee’s raw score is converted to a score that indicates his/her
relative standing in the comparison group
Norm-Referenced Interpretation
• Percentile Ranks:
• range from 1 to 99 and express an examinee's score in terms of the
percentage of examinees who achieved lower scores
• the distribution of percentile ranks is always flat (rectangular) regardless
of the shape of the raw score distribution
• because the transformation changes the shape of the original raw score
distribution, it is categorized as a nonlinear transformation
• a limitation of percentile ranks is that they indicate an examinee’s
relative position in a distribution but do not provide information about
absolute differences between examinees in terms of their raw scores
Norm-Referenced Interpretation
• Standard Scores:
• indicate the examinee's relative standing in the comparison group in
terms of standard deviations from the mean
• the z-score distribution has a mean of 0 and standard deviation of 1
• a z-score is calculated by subtracting the mean of the distribution from the
examinee's score to obtain a deviation score and dividing the deviation
score by the distribution's standard deviation
• if an examinee obtains a score of 110 on a test that has a mean of 100 and
standard deviation of 10, his/her z-score is +1.0
Norm-Referenced Interpretation
• Standard Scores (cont.):
• the T-score distribution has a mean of 50 and a standard deviation of 10
• an examinee whose raw score is one standard deviation above the mean
will have a T-score of 60
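A sketch combining the two transformations above, using the slide's example (raw score 110, mean 100, SD 10); since the T distribution has mean 50 and SD 10, T = 50 + 10z.

```python
# Convert a raw score to a z-score, then to a T-score (mean 50, SD 10).
def z_score(raw, mean, sd):
    return (raw - mean) / sd

def t_score(z):
    return 50 + 10 * z  # T = 50 + 10z

z = z_score(110, 100, 10)
print(z, t_score(z))  # 1.0 60.0
```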
Criterion-Referenced Interpretation
• Description:
• involves interpreting an examinee's score in terms of a predefined
standard
• a percent correct (percentage) score indicates the percent of test
content the examinee answered correctly
• when percentage scores are used as the method of score interpretation,
a cutoff score is usually set
• another method involves interpreting an examinee’s score in terms of
his/her likely status on an external criterion, which might involve using a
regression equation or expectancy table
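As a final sketch, estimating an examinee's criterion status from a test score with a simple regression equation; the predictor and criterion data are made up.

```python
# Predict criterion status from a test score with a fitted regression line.
import numpy as np

test_scores = np.array([55, 60, 70, 80, 90])  # predictor (test) scores
criterion = np.array([50, 58, 65, 78, 88])    # external criterion scores
slope, intercept = np.polyfit(test_scores, criterion, 1)
print(round(intercept + slope * 75, 1))       # predicted criterion score for a test score of 75
```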