The Basics of Item Response Theory Using R 1st Edition Frank B. Baker 2024 scribd download
The Basics of Item Response Theory Using R 1st Edition Frank B. Baker 2024 scribd download
com
https://textbookfull.com/product/the-basics-of-item-
response-theory-using-r-1st-edition-frank-b-baker/
OR CLICK BUTTON
DOWNLOAD NOW
https://textbookfull.com/product/handbook-of-item-response-theory-
three-volume-set-first-edition-van-der-linden/
textboxfull.com
https://textbookfull.com/product/handbook-of-item-response-theory-
volume-three-applications-1st-edition-wim-j-van-der-linden-editor/
textboxfull.com
https://textbookfull.com/product/land-subsidence-mitigation-aquifer-
recharge-using-treated-wastewater-injection-1st-edition-frank-r-
spellman/
textboxfull.com
https://textbookfull.com/product/the-legal-environment-of-
business-10th-edition-frank-b-cross/
textboxfull.com
Afropessimism 1st Edition Frank B. Wilderson Iii
https://textbookfull.com/product/afropessimism-1st-edition-frank-b-
wilderson-iii/
textboxfull.com
https://textbookfull.com/product/the-handbook-of-nature-2nd-edition-
frank-r-spellman/
textboxfull.com
https://textbookfull.com/product/the-science-of-water-4th-edition-
frank-r-spellman/
textboxfull.com
https://textbookfull.com/product/the-science-of-environmental-
pollution-third-edition-frank-r-spellman/
textboxfull.com
https://textbookfull.com/product/using-corpora-to-analyze-gender-paul-
baker/
textboxfull.com
Statistics for Social and Behavioral Sciences
Frank B. Baker
Seock-Ho Kim
The Basics of
Item Response
Theory Using R
Statistics for Social and Behavioral Sciences
Series editor
Stephen E. Fienberg
Carnegie Mellon University
Pittsburgh
Pennsylvania
USA
Statistics for Social and Behavioral Sciences (SSBS) includes monographs and
advanced textbooks relating to education, psychology, sociology, political science,
public policy, and law.
123
Frank B. Baker Seock-Ho Kim
Educational Psychology Educational Psychology
University of Wisconsin-Madison University of Georgia
Madison, WI, USA Athens, GA, USA
This book is an update of the original book, The Basics of Item Response Theory, by
the first author. The original book by Frank B. Baker was based on the course given
during his tenure at the University of Wisconsin–Madison. It appeared in 1985.
The second edition of the original book by Frank B. Baker appeared in 2001 with
a publisher’s note by Lawrence A. Rudner. About 15 years have passed since the
last revision. So much has happened in the fields of educational measurement and
psychometrics as well as in the statistical computing technology. In the meantime,
we felt that the usefulness of the book would be increased by some further changes.
The main alterations are due to the use of the computing package R for the
illustration purpose and especially for the computer sessions. The treatment of the
original topics over eight chapters has not been changed.
The original object of the book was to make the book to be a tutorial for item
response theory suited to those who possess only a limited knowledge of educational
measurement and psychometrics. We have never lost sight of such an object. The
amendments in this book are not due to any alteration in the original object but
they are necessitated by the development of the statistical computing technology. In
particular, the book now aims at covering both the basics of item response theory
and the use of R for preparing graphical presentation in the item response theory
related writings.
We will be indebted to any reader who calls our attention to errors or obscurities.
v
Acknowledgments of the Original Book (1985)
Over the past century, many people have contributed to the development of item
response theory. Three persons deserve special recognition. D. N. Lawley of the
University of Edinburgh published a paper in 1943 showing that many of the
constructs of classical test theory could be expressed in terms of parameters of the
item characteristic curve. This paper marks the beginning of item response theory
as a measurement theory. Dr. F. M. Lord of the Educational Testing Service has
been the driving force behind both the development of theory and its application
for the past 35 years. Over this period, he has systematically defined, expanded,
and explored the theory as well as developed the computer programs needed to put
the theory into practice. This effort culminated in his recent book on the practical
applications of item response theory. In the late 1960s, Dr. B. D. Wright of the
University of Chicago recognized the importance of the measurement work by the
Danish mathematician Georg Rasch. Since that time, he has played a key role in
bringing item response theory, the Rasch model in particular, to the attention of
practitioners. Without the work of these three individuals, the level of development
of item response theory would not be where it is today.
I am indebted to Mr. T. Seavey of Heinemann Educational Books for first
suggesting that I do a small book on item response theory. This suggestion allowed
me to fulfill a long-standing desire to develop an instructional software package
dealing with item response theory for a microcomputer. I must also acknowledge
the technical assistance of Mr. W. Vilberg in squeezing the maximum capability out
of the Apple II computer. Without his help, the computer software would be much
less sophisticated. Finally, the manuscript was prepared using the Screenwriter II
word processor program written by Mr. R. Kidwell of Sierra On-Line, Inc. Without
this marvelous package, the present book would never be written.
vii
Acknowledgements of the Second Edition of the
Original Book (2001)
Over the past century, many people have contributed to the development of item
response theory. Three persons deserve special recognition. D. N. Lawley of the
University of Edinburgh published a paper in 1943 showing that many of the
constructs of classical test theory could be expressed in terms of parameters of
the item characteristic curve. This paper marks the beginning of item response
theory as a measurement theory. The work of Dr. F. M. Lord of the Educational
Testing Service has been the driving force behind both the development of theory
and its application for the past 50 years. Dr. Lord systematically defined, expanded,
and explored the theory as well as developed the computer programs needed to
put the theory into practice. This effort culminated in his classic books (with
Dr. Melvin Novick, 1968; 1980) on the practical applications of item response
theory. In the late 1960s, Dr. B. D. Wright of the University of Chicago recognized
the importance of the measurement work by the Danish mathematician Georg
Rasch. Since that time, he has played a key role in bringing item response theory,
the Rasch model in particular, to the attention of practitioners. Without the work of
these three individuals, the level of development of item response theory would not
be where it is today.
I am indebted to Mr. T. Seavey of Heinemann Educational Books for first
suggesting that I do a small book on item response theory, which resulted in the
first edition of this book in 1985. This suggestion allowed me to fulfill a long-
standing desire to develop an instructional software package dealing with item
response theory for the then-state-of-the-art Apple II and IBM PC computers. An
upgraded version of this software has now been made available on the World Wide
Web (http://wricae.net/irt).
ix
Contents
A R Introduction. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 137
C Putting the Three Tests on a Common Ability Scale: Test Equating . . . 167
References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 171
Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 173
xi
Introduction
When the original book was first published in 1985, the fields of educational
measurement and psychometrics were in a transitional period. The majority of
practice was based upon the classical test theory developed during the 1920s.
However, a new test theory had been developing over the past 40 years that was
conceptually more powerful than classical test theory. Based upon items rather than
test scores, the new approach was known as item response theory. While the basic
concepts of item response theory are straightforward, the underlying mathematics
is somewhat advanced compared to that of classical test theory. As a result, it is
difficult to examine some of these concepts without performing a large number of
calculations to obtain usable information. The original book was designed to provide
the reader access to the basic concepts of item response theory freed of the tedious
underlying calculations through an Apple II computer program. The second edition
of the original published in 2001 used a version of computer program written in
Visual Basic 5.0 that could be obtained at http://ericae.net/irt. Readers accustomed
to sophisticated statistical and graphics packages might have found it utilitarian,
but nevertheless helpful in understanding various facets of the theory. This book
now uses R that is a freely available programming language for applied statistics
and data visualization. The file folder accompanying the book contains a set of R
functions that implement various facets of the theory. These R functions allow the
reader to explore the theory at the conceptual level.
The book is organized in a building block fashion. It proceeds from the simple to
the complex with each new topic building on the preceding topics. Within each
of the eight chapters, a basic concept is presented, the corresponding computer
session is explained, and a set of exploratory exercises are defined. Readers are then
strongly encouraged to use the computer session to explore the concept through a
series of exercises. A final section of each chapter, called “Things to Notice,” lists
some of the characteristics of the concept that you should have noticed and some
of the conclusions that you should have reached. If you do not understand the logic
underlying something in this section, you can return to the computer session and try
new variations and explorations until clarity is achieved.
xiii
xiv Introduction
When finished with the book and the computer sessions, the reader should have
a good working knowledge of the fundamentals of item response theory. This book
emphasizes the basics, minimizes the amount of mathematics, and does not pursue
technical details that are of interest only to the specialist. In some sense, you will
be shown only “what you need to know” rather than all the glorious details of the
theory. Upon completion of this book, the reader should be able to interpret test
results that have been analyzed under item response theory by means of programs
such as WINSTEPS (Linacre 2015), BILOG-MG (Zimowski et al. 2002), and
PCLOGIST (Wingersky et al. 1999). Note that WINSTEPS is a current descendant
of BICAL (Wright and Mead 1976), BILOG-MG is the extended version of BILOG
(Mislevy and Bock 1984), and PCLOGIST is the personal computer version of
LOGIST (Wingersky et al. 1982). In order to employ the theory in a practical setting,
the reader should study more advanced books on the applications of the theory such
as Baker and Kim (2004), de Ayala (2009), Embretson and Reise (2000), Nering and
Ostini (2010), Reckase (2009), Thissen and Wainer (2001), and van der Linden and
Glas (2000) as well as some earlier books including Hambleton and Swaminathan
(1984), Hambleton et al. (1991), Wright and Stone (1979), and Hulin et al. (1983).
Getting Started
1.1 Introduction
ability a given person has, and the ability of several persons could be compared.
While the theoretical range of ability is from negative infinity to positive infinity,
practical considerations usually limit the range of values from, say, 3 to C3.
Consequently, the discussions in the text and the computer sessions will only deal
with ability values within this range. However, you should be aware that values
beyond this range are possible.
The usual approach taken to measure an ability is to develop a test consisting
of a number of items (i.e., questions). Each of these items measures some facet of
the particular ability of interest. From a purely technical point of view such items
should be free response items where the examinee can write any response that seems
appropriate. The person scoring the test then must decide whether the response
is correct or not. When the item response is determined to be correct, the examinee
receives a score of one, an incorrect answer receives a score of zero, that is, the item
is dichotomously scored. Under classical test theory, the examinee’s raw test score
would be the sum of the scores received on the items in the test. Under item response
theory, the primary interest is in whether an examinee got each individual item
correct or not rather than in the raw test score. This is because the basic concepts of
item response theory rest upon the individual items of a test rather than upon some
aggregate of the item responses such as a test score.
From a practical point of view, free response items are difficult to use in a test.
In particular, they are difficult to score in a reliable manner. As a result, most tests
used under item response theory consist of multiple-choice items. These are scored
dichotomously with the correct answer receiving a score of one and each of the
distractors yielding a score of zero. Items scored dichotomously are often referred
to as binary items.
1.0
Probability of Correct Response, P(θ)
0.5
0.0
−3 −2 −1 0 1 2 3
Ability, θ
The item characteristic curve is the basic building block of item response theory
and all the other constructs of the theory depend upon this curve. Because of this,
considerable attention will be devoted to this curve and its role within the theory.
There are two technical properties of an item characteristic curve that are used to
describe it. The first is the difficulty of the item. Under item response theory, the
difficulty of an item describes where the item functions along the ability scale. For
example, an easy item functions among the low-ability examinees while a hard
item would function among the high-ability examinees; thus, item difficulty is a
location index. The second technical property is the discrimination of an item, which
describes how well an item can differentiate between examinees having abilities
below the item location and those having abilities above the item location. This
property essentially reflects the steepness of the item characteristic curve in its
middle section. The steeper the curve the better the item can discriminate. The flatter
the curve the less the item is able to discriminate since the probability of correct
response at low ability levels is nearly the same as it is at high ability levels. Using
these two descriptors, one can describe the general form of the item characteristic
curve. These descriptors are also used to discuss the technical properties of an item.
It should be noted that these two properties say nothing about whether the item really
measures some facet of the underlying ability or not; that is a question of validity.
These two properties simply describe the form of the item characteristic curve.
The idea of item difficulty as a location index will be examined first. In Fig. 1.2,
three item characteristic curves are presented on the same graph. All have the same
level of item discrimination but differ with respect to item difficulty. The left-hand
4 1 The Item Characteristic Curve
1.0
Probability of Correct Response
0.5
0.0
−3 −2 −1 0 1 2 3
Ability
Fig. 1.2 Three item characteristic curves with the same item discrimination but different levels of
item difficulty
curve represents an easy item because the probability of correct response is high
for low-ability examinees and approaches 1 for high-ability examinees. The center
curve represents an item of medium item difficulty because the probability of correct
response is low at the lower ability levels, around 0:5 in the middle of the ability
scale, and near 1 at the highest ability level. The right-hand curve represents a hard
item. The probability of correct response is low for most of the ability scale and
increases only when the higher ability levels are reached. Even at the highest ability
level shown (i.e., D 3) the probability of correct response is only 0:8 for the most
difficult item.
The concept of item discrimination is illustrated in Fig. 1.3. This figure contains
three item characteristic curves having the same item difficulty but differing with
respect to item discrimination. The upper curve on the positive ability range has a
high level of item discrimination since the curve is quite steep in the middle where
the probability of correct response changes very rapidly as ability increases. Just
a short distance to the left of the middle of the curve, the probability of correct
response is much less than 0:5; and a short distance to the right, the probability is
much greater than 0:5. The middle curve represents an item with a moderate level
of item discrimination. The slope of this curve is much less than the previous curve
and the probability of correct response changes less dramatically than the previous
curve as the ability level increases. However, the probability of correct response
is near zero for the lowest-ability examinees and near unity for the highest-ability
examinees. The third curve represents an item with low item discrimination. The
curve has a very small slope and the probability of correct response changes slowly
over the full range of abilities shown. Even at low ability levels, the probability of
correct response is reasonably large and it increases only slightly when high ability
levels are reached. The reader should be warned that although the figures only show
a range of ability from 3 to C3, the theoretical range of ability is from negative
1.3 Item Difficulty and Item Discrimination 5
1.0
Probability of Correct Response
0.5
0.0
−3 −2 −1 0 1 2 3
Ability
Fig. 1.3 Three item characteristic curves with the same item difficulty but different levels of item
discrimination
1.0
Probability of Correct Response
0.5
0.0
−3 −2 −1 0 1 2 3
Ability
infinity to positive infinity. Thus, all item characteristic curves of the type used here
actually become asymptotic to a probability of zero at one tail and to unity at the
other tail. The restricted range employed in the figures is necessary to fit the curves
on the computer screen reasonably and to provide a uniform frame of reference.
One special case is of interest; namely, that of an item with perfect discrimina-
tion. The item characteristic curve of such an item is a vertical line at some point
along the ability scale. Figure 1.4 shows such an item. To the left of the vertical line
at D 1:5, the probability of correct response is zero and to the right of the line
the probability of correct response is unity. Thus, the item discriminates perfectly
6 1 The Item Characteristic Curve
between examinees whose abilities are above and below an ability score of 1:5. Such
items would be ideal for distinguishing between examinees with abilities just above
and below 1:5. However, such an item makes no distinction among those examinees
with abilities above 1:5 nor among those examinees with abilities below 1:5.
At the present point in the presentation of item response theory, the goal is to allow
you to develop an intuitive understanding of the item characteristic curve and its
properties. In keeping with this goal, item difficulty and item discrimination will be
defined in verbal terms. Item difficulty will have the following levels:
very easy
easy
medium
hard
very hard
Item discrimination will have the following levels:
none
low
moderate
high
perfect
These terms will be used in the computer session to specify item characteristic
curves.
The purpose of this session is to enable you to develop a sense of how the shape of
the item characteristic curve is related to item difficulty and item discrimination.
To accomplish this, you will be able to select verbal terms describing the item
difficulty and item discrimination of an item. The computer program R will then
calculate and display the corresponding item characteristic curve on the screen. You
should do examples in this section and exercises in the next section, then try various
combinations of levels of item difficulty and item discrimination and relate these to
the resulting curves. After a bit of such exploratory practice, you should be able to
predict what the item characteristic curve will look like for a given combination of
item difficulty and item discrimination.
1.5 Computer Session 7
When R is ready for input through the R console window, it prints out its prompt
character with an invisible, horizontal space after it:
>
A command line in R will be executed by pressing the enter key:
Enter
-
Such a special character that indicates the end of a command line, usually entered
by pressing the enter or return key will be treated as an invisible character here.
When an incomplete command line (e.g., the end of the expression cannot have
occurred yet) gets the enter key, R prints out the continuation prompt character with
an invisible space after it:
+
To improve readability and because a rather long command line can be typed in
without pressing the enter key in the middle, a long command line will not be
separated by the continuation prompt character but will be continued to the next
line with indentation.
The followings are the simplest command lines, each with the R prompt in front,
to display an item characteristic curve for an item with medium item difficulty and
moderate item discrimination:
> theta <- seq(-3, 3, .1)
> bmedium <- 0
> amoderate <- 1
> P <- 1 / (1 + exp(-amoderate * (theta - bmedium)))
> plot(theta, P, type="l")
By pressing the enter key in the end of each line, five times as a total, the computer
will display an item characteristic curve shown in Fig. 1.5 in the graphics window.
By pressing the enter key in the end of the first line, a sequence of numbers (i.e., a
vector) will be created with 3 as a starting number and 3 as an ending number with
an increment of 0:1. The length or the total number of elements of the sequence is
61. The name of such a sequence is given as theta, and it is to be done by using the
assign function <- for which the less than character < and the hyphen character -
cannot be separated with a space. You can read the first line as “theta gets a sequence
of numbers . . .” or “a sequence of numbers . . . is saved under the name theta.” The
first line is equivalent to:
> assign("theta", seq(-3,3,.1))
or
> seq(-3, 3, .1) -> theta
8 1 The Item Characteristic Curve
0.8
0.6
P
0.4
0.2
−3 −2 −1 0 1 2 3
theta
diff, length, mean, sd, pi, range, rank, time, tree, var, etc.) should
not be used as the names of variables or functions of your own. Note that R is case
sensitive, so the variable P will be different from the variable p unless both are
defined to be equivalent.
The command line
P <- 1 / (1 + exp(-amoderate * (theta - bmedium))))
will create a vector of 61 values of the probabilities of correct response based on the
respective 61 ability points, item difficulty, and item discrimination. We will explore
the exact meaning of this function in the subsequent chapter.
The plot based on the 61 sets of points from the ability variable as the horizontal
axis, abscissa, and the probability of correct response as a vertical axis, ordinate,
can be constructed via:
> plot(theta, P)
With the enter key pressed, the above line can open an R graphics window that
contains the plot of the two variables. The default setting of the function plot will
create a plot with a symbol ı as each point. Because we want to have a plot with
connected lines that ultimately yield a curve, the optional argument type="l"
was added in the earlier command line. Whenever R opens up its graphics window,
it treats the graphics window as a current window. If you want to continue to use
command lines, you may click the R console window (especially the caption on the
top of the R console window or any inside portion of the R console window) to
make it current before you type in a new command line.
You may notice that the number of ticks used in the default setting of the function
plot may not be an appropriate one you want to use in your own figure. The
numbers of ticks in the horizontal and vertical axes can be modified with the use
of the graphical parameters function par and its labels argument lab, for example:
> par(lab=c(7,3,3))
The labels argument was defined by the three parameters in the above line. The set
of the three parameters (i.e., the three numbers separated with two commas in the
combine function c) specifies the number of ticks on the horizontal axes to be 7,
that on the vertical axes to be 3, and the length of characters in the labels to be 3
(but the character length will be most likely ignored in R). If you want to explore
or read the full description of the R function, for example par, you can obtain it by
typing:
> ?par
or
> help(par)
Assuming that your computer is connected to Internet, such a command line will
open up a file in html, the HyperText Markup Language, that explains the function
in a default browser you are using.
10 1 The Item Characteristic Curve
It can be noticed in Fig. 1.5 that the variable names are appeared as the respective
labeling texts along the horizontal and vertical axes. You can change them with the
arguments of, for example, xlab="Ability" and ylab="Probability of
Correct Response" in the function plot. The ranges of the horizontal and
vertical variables can be precisely controlled with the use of the arguments and
parameters of xlim=c(-3,3) and ylim=c(0,1), by specifying the lower limit
number and the upper limit number in the combine or concatenate function c.
The following command lines can be used to obtain Fig. 1.1:
> par(lab=c(7,3,3))
> theta <- seq(-3, 3, .1)
> b <- 0
> a <- 1
> P <- 1 / (1 + exp(-a * (theta - b)))
> plot(theta, P, type="l", xlim=c(-3,3), ylim=c(0,1),
xlab=expression(paste("Ability, ",theta)),
ylab=expression(paste(
"Probability of Correct Response, P(",theta,")")))
As shown in Fig. 1.1, you may use the function expression to add characters in
Greek. We may simply use xlab="Ability" and ylab="Probability of
Correct Response". You can add the main heading on the top of the plot and
the subheading to the bottom of the plot with the arguments in the function plot.
These will be helpful when you are making figures for a presentation purpose. For
example, you may construct a figure (n.b., a figure is not displayed here) that is more
elaborate but nevertheless similar to Fig. 1.5 using the following command lines:
> par(lab=c(7,3,3))
> theta <- seq(-3, 3, .1)
> bmedium <- 0
> amoderate <- 1
> P <- 1 / (1 + exp(-amoderate * (theta - bmedium)))
> plot(theta, P, type="l", xlim=c(-3,3), ylim=c(0,1),
xlab="Ability", ylab="Probability of Correct Response",
main="Figure 1. An Item Characteristic Curve with
Medium Item Difficulty and Moderate Item Discrimination",
sub="See Baker and Kim (2016).")
It is possible to create your own R functions. Each time the plot of the item
characteristic curve is created, you may notice that a nearly identical set of R
command lines are executed. To avoid the repetition of typing in the same command
1.5 Computer Session 11
After defining the function, by specifying the numerical values of item difficulty
and item discrimination, a plot of the item characteristic curve can be constructed
in the R graphics window. For example, you may use the following line to obtain
a plot of item characteristic curve with medium item difficulty and moderate item
discrimination:
> iccplot(0, 1)
The two arguments, b and a, are the named actual arguments in R. When the
numerical values are specified without the arguments, R recognizes the first number
to be the value of item difficulty and the second number to be the value of item
discrimination. The arguments in the function can be specified in arbitrary order by
exactly defining them with names. You can obtain the same plot by typing:
> iccplot(a=1, b=0)
Instead of using the numerical values to define item difficulty and item discrim-
ination, the verbal terms described earlier can be used to plot item characteristic
curves. The numerical definitions of the verbal terms for item difficulty are as
follows:
> bveryeasy <- -2.625
> beasy <- -1.5
> bmedium <- 0
> bhard <- 1.5
> bveryhard <- 2.625
The numerical definitions of the verbal terms for item discrimination are as follows:
> anone <- 0
> alow <- 0.4
> amoderate <- 1
> ahigh <- 2.1
> aperfect <- 999
The following command lines now can display two item characteristic curves
in the R graphics window (see Fig. 1.6). Note that you should click the R console
window after pressing the enter key in the end of the first line, that is, after creating
the R graphics window.
12 1 The Item Characteristic Curve
1.0
Probability of Correct Response
0.5
0.0
−3 −2 −1 0 1 2 3
Ability
clicking the console window after the first line and the third line, respectively:
> iccplot(beasy, amoderate)
> par(new=T)
> iccplot(bmedium, amoderate)
> par(new=T)
> iccplot(bhard, amoderate)
1.6 Exercises
For the exercises, it is assumed that you have defined the function iccplot and
the numerical values of the verbal terms of item difficulty and item discrimination
by typing in the respective command lines.
1. An item with easy item difficulty and high item discrimination is to be plotted.
(a) Use the function iccplot to plot an item characteristic curve of an item
with easy item difficulty and high item discrimination.
(b) From the graph it can be seen that the probability of correct response will be
rather high over most of the ability scale. The item characteristic curve will
be steep in the lower part of the ability scale.
(c) After you have studied the curve, make sure to click the R console window
to make it current and to type in a new command line.
(d) The next graph will be plotted in a new graphics window.
2. An item with hard item difficulty and low item discrimination is to be plotted.
(a) Use the function iccplot to plot an item characteristic curve of an item
with hard item difficulty and low item discrimination.
(b) From the graph it can be seen that the probability of correct response will
have a low general level over most of the ability scale. The item characteristic
curve will not be very steep.
(c) After you have studied the curve, make sure to click the R console window
to make it current and to type in a new command line.
(d) The next graph will be plotted in a new graphics window.
3. An item with medium item difficulty and low item discrimination is to be
plotted.
(a) Use the function iccplot to plot an item characteristic curve of an item
with medium item difficulty and low item discrimination.
(b) From the graph it can be seen that the probability of correct response will be
between 0.2 and 0.8 over the range of ability shown. The item characteristic
curve will be nearly linear over the range of ability employed.
(c) After you have studied the curve, make sure to click the R console window
to make it current and to type in a new command line.
(d) The next graph will be plotted in a new graphics window.
14 1 The Item Characteristic Curve
4. In this exercise, all the items will have the same level of item difficulty but
different levels of item discrimination. The intent is to relate the steepness of
the curve to the level of item discrimination.
(a) Use the function iccplot to plot an item characteristic curve of an item
with medium item difficulty and moderate item discrimination.
(b) From the graph it can be seen that the probability of correct response will
be small at low ability levels and large at high ability levels. The item
characteristic curve will be moderately steep in the middle part of the ability
scale.
(c) After you have studied the curve, make sure to click the R console window
to make it current and to type in a new command line.
(d) The next graph will be plotted on the same graph in the graphics window.
Type in:
> par(new=T)
(e) Now repeat steps a through d several times using medium item difficulty
for each item and item discrimination levels of your choice (e.g., none, low,
high, perfect).
(f) The next graph will be plotted in a new graphics window.
5. In this exercise, all the items will have the same level of item discrimination but
different levels of item difficulty. The intent is to relate the location of the item
on the ability scale to its level of item difficulty.
(a) Use the function iccplot to plot an item characteristic curve of an item
with very easy item difficulty and moderate item discrimination.
(b) From the graph it can be seen that the probability of correct response will be
reasonably large over most of the ability scale. The item characteristic curve
will be moderately steep in the lower part of the ability scale.
(c) After you have studied the curve, make sure to click the R console window
to make it current and to type in a new command line.
(d) The next graph will be plotted on the same graph in the graphics window.
Type in:
> par(new=T)
(e) Now repeat steps a through d several times using items with moderate item
discrimination and item difficulty levels of your choice (e.g., easy, medium,
hard, very hard).
(f) The next graph will be plotted in a new graphics window.
6. Experiment with various combinations of item difficulty and item discrimination
of your own choice until you are confident that you can predict the shape of
the item characteristic curve corresponding to the levels chosen. You may find it
useful to make a rough sketch of what you think the curve will look like before
you have the computer display it on the screen.
1.7 Things to Notice 15
1. When item discrimination is less than moderate, the item characteristic curve is
nearly linear and appears rather flat.
2. When item discrimination is greater than moderate, the item characteristic curve
is S-shaped and rather steep in its middle section.
3. When item difficulty is less than medium, most of the item characteristic curve
has a probability of correct response that is greater than 0:5.
4. When item difficulty is greater than medium, most of the item characteristic
curve has a probability of correct response less than 0:5.
5. Regardless of the level of item discrimination, item difficulty locates the item
along the ability scale. Therefore item difficulty and item discrimination are
independent of each other.
6. When an item has no item discrimination, all choices of item difficulty yield the
same horizontal line at a value of P. / D 0:5. This is because the value of item
difficulty for an item with no item discrimination is undefined.
7. If you have been very observant, you may have noticed the point at which P. / D
0:5 corresponds to item difficulty. When an item is easy, this value occurs at a
low ability level. When an item is hard, this value corresponds to a high ability
level.
Chapter 2
Item Characteristic Curve Models
2.1 Introduction
In the first chapter the properties of the item characteristic curve were defined in
terms of verbal descriptors. While this is useful to obtain an intuitive understanding
of item characteristic curves, it lacks the precision and rigor needed by a theory.
Consequently, in this chapter the reader will be introduced to three mathematical
models for the item characteristic curve. These models provide mathematical
equations for the relation of the probability of correct response to ability. Each
model employs one or more item parameters whose numerical values define a
particular item characteristic curve. Such mathematical models are needed if one
is to develop a measurement theory that can be rigorously defined and is amenable
to further growth. In addition, these models and their parameters provide a vehicle
for communicating information about an item’s technical properties. For each of the
three models, the mathematical equation will be used to compute the probability of
correct response at several ability levels. Then the graph of the corresponding item
characteristic curve will be shown. The goal of the chapter is to have you develop a
sense of how the numerical values of the item parameters for a given model relate
to the shape of the item characteristic curve.
Under item response theory the standard mathematical model for the item char-
acteristic curve is the cumulative form of the logistic function. It defines a family
of curves having the general shape of the item characteristic curves shown in the
first chapter. The logistic function was first derived in 1844 by Pierre François
Verhulst and has been widely used in the biological sciences to model the growth
of plants and animals from birth to maturity. It was first used as a model for the
item characteristic curve in the late 1950s and, because of its simplicity, has become
the preferred model. The equation for the two-parameter logistic model is given in
Eq. (2.1) below:
1 1
P. / D L
D a.b/
; (2.1)
1Ce 1Ce
where
e is the base of the natural logarithm that is a constant 2.718,
b is the item difficulty parameter,
a is the item discrimination parameter,1
L D a. b/ is the logistic deviate (logit), and
is an ability level.
This equation represents a family of curves whose individual members are defined
by specific numerical values of the item parameters b and a; hence, it is called the
two-parameter model. It is the model that was used in Chap. 1.
The item difficulty parameter, denoted by b, is defined as the point on the ability
scale at which the probability of correct response to the item is 0.5. The theoretical
range of the values of this parameter is 1 b C1 (i.e., a set of extended real
numbers). However, typical values have the range of 3 b C3.
Due to the shape of the item characteristic curve, the slope of the curve changes
as a function of the ability level and reaches a maximum value when the ability
level equals the item difficulty parameter. Because of this, the item discrimination
parameter does not represent the general slope of the item characteristic curve as was
indicated in Chap. 1. The technical definition of the item discrimination parameter is
beyond the level of this book. However, a usable definition is that this parameter is
proportional to the slope of the item characteristic curve at D b. The actual slope
at D b is a=4 under the two-parameter model, but considering a to be the slope at b
is an acceptable approximation that makes interpretation of the item discrimination
parameter easier in practice. The theoretical range of the values of this parameter is
1 a C1, but the usual range seen in practice is 2:80 to C2:80.
1
In much of the item response theory literature, the logistic value of the item discrimination
parameter a is divided by 1:702 or 1:7 to obtain the corresponding normal ogive model value.
This is done to make the two-parameter logistic ogive similar to the normal ogive. However, this
was not done in this book as it introduces two frames of reference for interpreting the numerical
values of the item discrimination parameter. All item parameters in this book and the associated
computer programs are interpreted in terms of the logistic function.
2.2 The Two-Parameter Model 19
To illustrate how the two-parameter model is used to compute the points on an item
characteristic curve, consider the following example problem. The values of the item
parameters are:
b D 1:0 is the item difficulty parameter
a D 0:5 is the item discrimination parameter
The illustrative computation is performed at the ability level:
D 3:0
The first term to be computed is the logistic deviate (logit), L, where:
L D a. b/
Substituting the appropriate values yields:
L D 0:5.3:0 1:0/ D 2:0
The next term computed is e raised to the power L. If you have a pocket calculator
that can compute ex or exp.x/ you can verify this calculation. Substituting yields:
eL exp.L/ D exp.2:0/ D 7:389
Now the denominator of Eq. (2.1) can be computed as:
1 C exp.L/ D 1 C 7:389 D 8:389
Finally, the value of P. / is:
P. / D 1=Œ1 C exp.L/ D 1=8:389 D 0:119
Thus, at an ability level of D 3:0, the probability of responding correctly to this
item is 0:119.
From the above, it can be seen that computing the probability of correct response
at a given ability level is very easy using the logistic model. Table 2.1 shows the
calculations for this item at seven ability levels evenly spaced over the range of
ability levels from 3 to C3. You should perform the computations at several of
these ability levels to become familiar with the procedure.
1.0
Probability of Correct Response
0.5
0.0
−3 −2 −1 0 1 2 3
Ability
Fig. 2.1 The item characteristic curve for the two-parameter model with b D 1:0 and a D 0:5
The item characteristic curve for the item of Table 2.1 is shown in Fig. 2.1. The
vertical dotted line corresponds to the value of the item difficulty parameter.
The next model of interest was first published by the Danish mathematician Georg
Rasch in the 1960s. Rasch approached the analysis of test data from a probability
theory point of view. Although he started from a very different frame of reference,
the resultant item characteristic curve model was a logistic model. In Chap. 8,
Rasch’s approach will be explored in greater detail; our present interest is only
in his item characteristic curve model. Under this model, the item discrimination
parameter of the two-parameter model is fixed at a value of a D 1 for all items and
only the item difficulty parameter can take on different values. Because of this, the
Rasch model is often referred to as the one-parameter logistic model.
The equation for the Rasch model is given by the following:
1
P. / D ; (2.2)
1C e1.b/
where
b is the item difficulty parameter and
is the ability level.
It should be noted that the item discrimination parameter was used in Eq. (2.2). But
because it always has a value of 1 it usually is not shown in the formula.
Exploring the Variety of Random
Documents with Different Content
KASTEEN HELMEITÄ
Our website is not just a platform for buying books, but a bridge
connecting readers to the timeless values of culture and wisdom. With
an elegant, user-friendly interface and an intelligent search system,
we are committed to providing a quick and convenient shopping
experience. Additionally, our special promotions and home delivery
services ensure that you save time and fully enjoy the joy of reading.
textbookfull.com