Workshop: Polytomous IRT Models (# 144, Remo Ostini and Michael L. Nering)
Workshop: Polytomous IRT Models (# 144, Remo Ostini and Michael L. Nering)
Jorge Tendeiro
16 April 2014
Workshop Polytomous IRT models, (# 144, Remo Ostini and Michael L. Nering) 1/49
Literature
Presentation based on the book:
Ostini, R., & Nering, M. L. (2006). Polytomous item
response theory models. Sage University Paper Series
QASS.
(“Little green book” # 144)
Workshop Polytomous IRT models, (# 144, Remo Ostini and Michael L. Nering) 2/49
Overview
1 Introduction
3 Model selection
4 Software
Workshop Polytomous IRT models, (# 144, Remo Ostini and Michael L. Nering) 3/49
Introduction
Introduction
Workshop Polytomous IRT models, (# 144, Remo Ostini and Michael L. Nering) 4/49
Introduction
Workshop Polytomous IRT models, (# 144, Remo Ostini and Michael L. Nering) 5/49
Introduction
0.8
P(Xi = 1)
0.6
0.5
0.4
0.2
0.0
b
−4 −2 0 2 4
θ (person ability)
• Dichotomous items:
Xi = 0 (incorrect, false) or Xi = 1 (correct, true).
• Most common models (logistic): 1PLM, 2PLM, 3PLM
• These models typically relate θ and Pi1 (θ):
Workshop Polytomous IRT models, (# 144, Remo Ostini and Michael L. Nering) 7/49
Introduction
1PLM
1
Pi (θ) =
1 + exp[−(θ − bi )]
• bi = difficulty param.
1.00
b1= −1
b2= 0.5
0.75 b3= 1.5
P(Xi = 1)
0.50
0.25
0.00
b1 b2 b3
−4 −3 −2 −1 0 1 2 3 4
θ (person ability)
Workshop Polytomous IRT models, (# 144, Remo Ostini and Michael L. Nering) 8/49
Introduction
2PLM
1
Pi (θ) =
1 + exp[−ai (θ − bi )]
0.50
0.25
0.00
b1 b2 b3
−4 −3 −2 −1 0 1 2 3 4
θ (person ability)
Workshop Polytomous IRT models, (# 144, Remo Ostini and Michael L. Nering) 9/49
Introduction
3PLM
1
Pi (θ) = ci + (1 − ci )
1 + exp[−ai (θ − bi )]
0.50
0.25
0.00
b1
−4 −3 −2 −1 0 1 2 3 4
θ (person ability)
Workshop Polytomous IRT models, (# 144, Remo Ostini and Michael L. Nering) 10/49
Introduction
Now we need to define models which allow estimating each Pic (θ),
c = 0, 1, . . . , m:
Pi0 (θ) = f1 (θ)
··· .
Pim (θ) = fm (θ)
These are the item category response functions (ICRFs).
Workshop Polytomous IRT models, (# 144, Remo Ostini and Michael L. Nering) 11/49
Introduction
Workshop Polytomous IRT models, (# 144, Remo Ostini and Michael L. Nering) 12/49
(Some) Polytomous IRT models Nominal response model (NRM)
Workshop Polytomous IRT models, (# 144, Remo Ostini and Michael L. Nering) 13/49
(Some) Polytomous IRT models Nominal response model (NRM)
Workshop Polytomous IRT models, (# 144, Remo Ostini and Michael L. Nering) 14/49
(Some) Polytomous IRT models Nominal response model (NRM)
exp(λic θ + ζic )
Pic (θ) = Pm .
h=0 exp(λih θ + ζih )
Workshop Polytomous IRT models, (# 144, Remo Ostini and Michael L. Nering) 15/49
(Some) Polytomous IRT models Nominal response model (NRM)
0.8
Pi (X = c)
0.6
Response A
Response B
0.4 Response C
Response D
0.2
0.0
−4 −3 −2 −1 0 1 2 3 4
θ (person ability)
Workshop Polytomous IRT models, (# 144, Remo Ostini and Michael L. Nering) 16/49
(Some) Polytomous IRT models Nominal response model (NRM)
Workshop Polytomous IRT models, (# 144, Remo Ostini and Michael L. Nering) 17/49
(Some) Polytomous IRT models Partial credit model (PCM)
Workshop Polytomous IRT models, (# 144, Remo Ostini and Michael L. Nering) 18/49
(Some) Polytomous IRT models Partial credit model (PCM)
Xi = 0 Xi = 1 Xi = 2 Xi = 3
Workshop Polytomous IRT models, (# 144, Remo Ostini and Michael L. Nering) 20/49
(Some) Polytomous IRT models Partial credit model (PCM)
• The higher the δij , the more difficult a particular step is.
• The δij ’s aren’t necessarily ordered in the same sequence as the
categories (reversals; such a case indicates that the item is
probably not functioning as intended).
Workshop Polytomous IRT models, (# 144, Remo Ostini and Michael L. Nering) 21/49
(Some) Polytomous IRT models Partial credit model (PCM)
0.6 Category 5
0.4
0.2
0.0
δi1 δi2 δi3 δi4
−4 −3 −2 −1 0 1 2 3 4
θ (person ability)
Workshop Polytomous IRT models, (# 144, Remo Ostini and Michael L. Nering) 22/49
(Some) Polytomous IRT models Partial credit model (PCM)
Workshop Polytomous IRT models, (# 144, Remo Ostini and Michael L. Nering) 23/49
(Some) Polytomous IRT models Partial credit model (PCM)
1.0
Category 1
0.8 Category 2
Category 3
Category 4
Pi (X = c)
0.6 Category 5
0.4
0.2
0.0
δi1 δi2 δi3 δi4
−4 −3 −2 −1 0 1 2 3 4
θ (person ability)
Workshop Polytomous IRT models, (# 144, Remo Ostini and Michael L. Nering) 23/49
(Some) Polytomous IRT models Generalized partial credit model (GPCM)
Workshop Polytomous IRT models, (# 144, Remo Ostini and Michael L. Nering) 24/49
(Some) Polytomous IRT models Generalized partial credit model (GPCM)
Workshop Polytomous IRT models, (# 144, Remo Ostini and Michael L. Nering) 25/49
(Some) Polytomous IRT models Generalized partial credit model (GPCM)
Xi = 0 Xi = 1 Xi = 2 Xi = 3
Workshop Polytomous IRT models, (# 144, Remo Ostini and Michael L. Nering) 26/49
(Some) Polytomous IRT models Generalized partial credit model (GPCM)
Workshop Polytomous IRT models, (# 144, Remo Ostini and Michael L. Nering) 27/49
(Some) Polytomous IRT models Generalized partial credit model (GPCM)
Response category
Item Content 0 1 2 3 4
5 Feels tense and jittery 17 111 97 101 24
6 Sometimes feels worthless 72 89 52 94 43
9 Feels discouraged, like giving up 27 128 66 95 34
Workshop Polytomous IRT models, (# 144, Remo Ostini and Michael L. Nering) 28/49
(Some) Polytomous IRT models Generalized partial credit model (GPCM)
0.8
P6 (X = c)
Category 0
0.6 Category 1
Category 2
0.4 Category 3
Category 4
0.2
0.0
δ61 δ63 δ62 δ64
−4 −3 −2 −1 0 1 2 3 4
θ (person ability)
Workshop Polytomous IRT models, (# 144, Remo Ostini and Michael L. Nering) 29/49
(Some) Polytomous IRT models Generalized partial credit model (GPCM)
0.6 Category 4
0.4
0.2
0.0
δ51 δ52 δ53 δ54
−4 −3 −2 −1 0 1 2 3 4
θ (person ability)
Workshop Polytomous IRT models, (# 144, Remo Ostini and Michael L. Nering) 30/49
(Some) Polytomous IRT models Generalized partial credit model (GPCM)
0.8
Category 0
P9 (X = c)
0.6 Category 1
Category 2
Category 3
0.4
Category 4
0.2
0.0
δ91 δ93 δ92 δ94
−4 −3 −2 −1 0 1 2 3 4
θ (person ability)
Workshop Polytomous IRT models, (# 144, Remo Ostini and Michael L. Nering) 31/49
(Some) Polytomous IRT models Rating scale model (RSM)
Workshop Polytomous IRT models, (# 144, Remo Ostini and Michael L. Nering) 32/49
(Some) Polytomous IRT models Rating scale model (RSM)
Workshop Polytomous IRT models, (# 144, Remo Ostini and Michael L. Nering) 33/49
(Some) Polytomous IRT models Rating scale model (RSM)
δ1 δ1
δ2 δ2
δ3 δ3
λ1 λ2 θ scale
Item 1 (4 cats.) Item 2 (4 cats.)
Workshop Polytomous IRT models, (# 144, Remo Ostini and Michael L. Nering) 34/49
(Some) Polytomous IRT models Rating scale model (RSM)
Workshop Polytomous IRT models, (# 144, Remo Ostini and Michael L. Nering) 35/49
(Some) Polytomous IRT models Rating scale model (RSM)
0.8
Category 0
P1 (X = c)
0.6 Category 1
Category 2
Category 3
0.4
Category 4
0.2
0.0
−4 −3 −2 −1 0 1 2 3 4
0.8
P11 (X = c)
0.6
0.4
0.2
0.0
−4 −3 −2 −1 0 1 2 3 4
θ (person ability)
Workshop Polytomous IRT models, (# 144, Remo Ostini and Michael L. Nering) 36/49
(Some) Polytomous IRT models Graded response model (GRM)
Workshop Polytomous IRT models, (# 144, Remo Ostini and Michael L. Nering) 37/49
(Some) Polytomous IRT models Graded response model (GRM)
Workshop Polytomous IRT models, (# 144, Remo Ostini and Michael L. Nering) 38/49
(Some) Polytomous IRT models Graded response model (GRM)
Workshop Polytomous IRT models, (# 144, Remo Ostini and Michael L. Nering) 39/49
(Some) Polytomous IRT models Graded response model (GRM)
Xi = c − 1 Xi = c
Category responses
βic
Category thresholds
(θ scale) P(Xi ≤ c) = .50 P(Xi ≥ c) = .50
Workshop Polytomous IRT models, (# 144, Remo Ostini and Michael L. Nering) 40/49
(Some) Polytomous IRT models Graded response model (GRM)
0.6 Category 4
0.5
0.4
0.2
0.0
β41 β44
−4 −3 −2 −1 0 1 2 3 4
θ (person ability)
Workshop Polytomous IRT models, (# 144, Remo Ostini and Michael L. Nering) 41/49
Model selection
Model selection
Workshop Polytomous IRT models, (# 144, Remo Ostini and Michael L. Nering) 42/49
Model selection
Model selection
2 Measurement philosophy
Workshop Polytomous IRT models, (# 144, Remo Ostini and Michael L. Nering) 43/49
Model selection
Model selection
1 Data characteristics
• Dichotomous vs polytomous item scores.
• Nominal vs ordinal categories.
• Number of response categories.
E.g.: The RSM requires the same number across items.
2 Measurement philosophy
• Does the model reflect the the psychological reality that
produced the data?
E.g.: Can one conceptualize the answer to an item as being an
ordered sequence of subtasks for which awarding partial credit
to each is meaningful (i.e., PCM)?
Workshop Polytomous IRT models, (# 144, Remo Ostini and Michael L. Nering) 44/49
Model selection
Model selection
Workshop Polytomous IRT models, (# 144, Remo Ostini and Michael L. Nering) 45/49
Model selection
Model selection
Model selection
Workshop Polytomous IRT models, (# 144, Remo Ostini and Michael L. Nering) 47/49
Model selection
Model selection
Some problems of statistical fit tests:
• The sampling distributions are often unknown.
• Some tests require very large sample sizes (on the hundreds),
specially for χ2 -based tests.
• Unknown influence of using estimated parameters or of mild
model violations on the performance of the tests.
• Too large sample sizes invariably lead to rejections of the null
hypothesis (effect size?).
A final reassurence:
Some comparative studies of polytomous IRT models suggest that
results don’t vary much between models.
(E.g., Dodd, 1984; Maydeu-Olivares et al., 1994; Ostini, 2001; van Engelenburg,
1997; Verhelst et al., 1997.)
Workshop Polytomous IRT models, (# 144, Remo Ostini and Michael L. Nering) 48/49
Software
Software
• IRTPRO
• R: Several packages worth checking
(see http://cran.r-project.org/web/views/Psychometrics.html)
ltm, eRm, TAM, mcIRT, pcIRT,. . .
Workshop Polytomous IRT models, (# 144, Remo Ostini and Michael L. Nering) 49/49