An Introduction To Psychometrics
An Introduction To Psychometrics
An Introduction to Psychometrics
Sourced from:
Psychometric Theory & Assessment
Professor Jack Demick, Harvard Extension School
The field also has two major research foci, namely: (a) the development and
refinement of theoretical approaches to measurement most generally (psychometric
theory); and (b) the construction of instruments (tasks, tests) and procedures for the
measurement of specific characteristics with the two most widely researched
characteristics being intelligence and personality (partly because both are
multidimensional in nature). From its inception, psychometrics has been controversial
for various reasons, including the notion that the construction of standardized tests
(tests scored in a standard or consistent manner, making it possible to compare scores
of individuals and/or of groups of individuals) has been suggested as a means that
induces bias toward some groups and not others. Indeed, at one point in its history, the
field included proponents of eugenics (who argued that innate human qualities could
be improved through, e.g., limiting childbirth via sterilization among the poor, the
disabled, and others).
Relevant to the first major research focus of the field, psychometric theory refers
to the large body of theory used in the development of psychological tests and in the
analysis of data collected from these tests. It is important to note that the word test has
multiple dictionary meanings but that the term psychological test has a very specific
meaning (i.e., a systematic procedure for obtaining and evaluating samples of
behavior relevant to cognitive, affective, or interpersonal functioning in light of two
standards, namely, uniformity in test administration and comparison of results to
normative or standardization samples). Tests that sample people’s knowledge, skills,
or cognitive functions are often designated as ability tests. Tests that sample
individuals’ attitudes, interests, opinions, emotional makeup, and characteristic
reactions to people, situations, and other stimuli fall under personality tests with self-
report tasks (e.g., inventories, surveys, questionnaires) and observations of behavior
Psychometrics 20-21 2
(e.g., checklists, schedules, projective techniques) included if they adhere to the two
standards.
On the most general level, psychometric theory has been divided into classical
test theory (CTT) and more recently item response theory (IRT). CTT (Gulliksen,
1950) begins with the assumption that every person’s observed or obtained score on a
test is a function of a true score (error-free score) plus an error score (measurement
error from random noise within the individual and/or test situation), the latter of which
is assumed to be of equal magnitude for all test takers. Thus, CTT (often referred to as
true score theory) typically compares the overall test scores (sum of the item scores)
of a group of test takers to those of a normative group randomly selected from the
population toward improving the test’s psychometric properties. Psychometric
properties refer to a test’s reliability (consistency of measurement of overall test
scores) and validity (the ability of a test overall to measure what it is supposed to
measure). Over the years, researchers have identified several different kinds of
reliability (e.g., inter-rater, test-retest, parallel forms, split-half) and of validity (e.g.,
content, face, predictive, concurrent, construct). The establishment of both these
psychometric properties predominantly employs Spearman’s (1904) statistical
technique of correlation (two variables are said to be correlated when variations in the
value of one variable are synchronized with variations in the value of the other).
With respect to the field’s second major research focus, similar reasoning that
was applied to the development of the first psychometric tests of intelligence
(Stanford-Binet, Wechsler scales) has been employed to develop other psychometric
tests and instruments within all subfields of psychology. Most notably, these include
those related to intelligence (e.g., those inherent in aptitude testing, achievement
testing, educational testing, and neuropsychological testing) and personality testing.
More recently, the use of psychological testing (primarily biographical data
instruments, cognitive ability/aptitude tests, and personality tests) has become
increasingly prominent in industrial-organizational psychology toward assessing
aspects of workplace functioning (both pre- and post-employment) that complements
the development of earlier vocational testing. The practical application of
psychometrics has been most consistently evident when a clinical psychologist is
asked to conduct a psychological assessment of which psychological testing is a part.
A psychological assessment takes place when a psychologist is asked to answer a
specific question about a patient’s functioning (e.g., differential diagnosis,
determination of functional vs. organic factors underlying symptoms, identification of
functional issues, recommendations for therapy and/or medication) based on his or her
observations of behavior, review of records, interviews (with the person and/or
Psychometrics 20-21 5