100% found this document useful (1 vote)

882 views

An Introduction To Psychometrics

Psychometrics is the quantitative measurement of psychological characteristics like abilities and personality traits. It involves developing tests and procedures to measure traits like intelligence and personality. Psychometric theory includes classical test theory and item response theory. Classical test theory focuses on overall test scores, while item response theory analyzes individual item performance and uses mathematical models to relate latent traits to item responses. Item response theory allows for computer adaptive testing where subsequent test items are tailored based on a test-taker's estimated ability level.

Uploaded by

Destina Warner

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

100% found this document useful (1 vote)

882 views

An Introduction To Psychometrics

Uploaded by

Destina Warner

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 5

Psychometrics 20-21 1

An Introduction to Psychometrics

Sourced from:
Psychometric Theory & Assessment
Professor Jack Demick, Harvard Extension School

Psychometrics is a subfield of psychology, which concerns the quantitative

measurement of psychological characteristics (e.g., attitudes, knowledge, abilities,
personality traits). The subfield is primarily concerned with the quantitative
measurement of individual differences (variations along dimensions thought to reside
within individuals and not within situations, for example, age, sex, and racial
differences).

The field also has two major research foci, namely: (a) the development and
refinement of theoretical approaches to measurement most generally (psychometric
theory); and (b) the construction of instruments (tasks, tests) and procedures for the
measurement of specific characteristics with the two most widely researched
characteristics being intelligence and personality (partly because both are
multidimensional in nature). From its inception, psychometrics has been controversial
for various reasons, including the notion that the construction of standardized tests
(tests scored in a standard or consistent manner, making it possible to compare scores
of individuals and/or of groups of individuals) has been suggested as a means that
induces bias toward some groups and not others. Indeed, at one point in its history, the
field included proponents of eugenics (who argued that innate human qualities could
be improved through, e.g., limiting childbirth via sterilization among the poor, the
disabled, and others).

Relevant to the first major research focus of the field, psychometric theory refers
to the large body of theory used in the development of psychological tests and in the
analysis of data collected from these tests. It is important to note that the word test has
multiple dictionary meanings but that the term psychological test has a very specific
meaning (i.e., a systematic procedure for obtaining and evaluating samples of
behavior relevant to cognitive, affective, or interpersonal functioning in light of two
standards, namely, uniformity in test administration and comparison of results to
normative or standardization samples). Tests that sample people’s knowledge, skills,
or cognitive functions are often designated as ability tests. Tests that sample
individuals’ attitudes, interests, opinions, emotional makeup, and characteristic
reactions to people, situations, and other stimuli fall under personality tests with self-
report tasks (e.g., inventories, surveys, questionnaires) and observations of behavior
Psychometrics 20-21 2

(e.g., checklists, schedules, projective techniques) included if they adhere to the two
standards.

On the most general level, psychometric theory has been divided into classical
test theory (CTT) and more recently item response theory (IRT). CTT (Gulliksen,
1950) begins with the assumption that every person’s observed or obtained score on a
test is a function of a true score (error-free score) plus an error score (measurement
error from random noise within the individual and/or test situation), the latter of which
is assumed to be of equal magnitude for all test takers. Thus, CTT (often referred to as
true score theory) typically compares the overall test scores (sum of the item scores)
of a group of test takers to those of a normative group randomly selected from the
population toward improving the test’s psychometric properties. Psychometric
properties refer to a test’s reliability (consistency of measurement of overall test
scores) and validity (the ability of a test overall to measure what it is supposed to
measure). Over the years, researchers have identified several different kinds of
reliability (e.g., inter-rater, test-retest, parallel forms, split-half) and of validity (e.g.,
content, face, predictive, concurrent, construct). The establishment of both these
psychometric properties predominantly employs Spearman’s (1904) statistical
technique of correlation (two variables are said to be correlated when variations in the
value of one variable are synchronized with variations in the value of the other).

In contrast to CTT whose interest centers mainly on overall test scores, an

alternative approach to assessing a test’s reliability is to focus on examinees’
performance on individual items, which may be either qualitative or quantitative in
nature. A qualitative item analysis examines items on the basis of their content
coverage (e.g., number of items falling into different content categories, often found in
test manuals) and content form and relevance (e.g., number of items written according
to effective item-writing guidelines). Quantitative item analysis primarily concerns
statistical assessment of the difficulty and the discriminative value of the items. Item
difficulty is typically defined as the percentage of persons who answer the item
correctly (on ability tests) or in the keyed direction (on personality tests), expressed in
standard scores (with the most suitable items spread over a moderate difficulty range
around the 50% level). Item discrimination refers to the relation between performance
on an item and standing on the trait under consideration (e.g., by comparing those who
pass vs. fail an item on an external criterion or on the total test score through the use
of a biserial correlation with each item). Toward providing comparable scales across
samples (e.g., tested at different times of the year or in different years), some variant
of Thurstone’s absolute scaling (generating common anchor points for different
samples) was characteristically employed.
Psychometrics 20-21 3

However, with the advent of computers, precise mathematical procedures began

to be developed for sample-free measurement scales for use in the construction of
psychological tests. In this context arose a group of procedures initially grouped under
latent trait theory, which models the relations between individuals’ latent
(unobservable) traits (e.g., intelligence) and responses to test items (e.g., whether they
succeed on an item of specified difficulty). These are simply statistical constructs
derived mathematically from empirically observed relations among test responses.
Early proponents of this approach (e.g., Lord, 1980) did not want others to confuse
these mathematical constructs with physical or psychological ones (as possibly
implicated in the term latent trait) so they named the approach item response theory
instead.

IRT is a collection of measurement models that are mathematical equations

describing the association between test takers’ levels on a latent variable and the
probability of a particular response to an item, using nonlinear functions. IRT item
parameters are estimated directly using logistic models instead of proportions (item
difficulty), item-to-scale correlations (item discrimination), or simple independent
probabilities (guessing parameter corresponding to a correct response occurring by
chance). Thus, there are a number of IRT models, which vary in number of parameters
and in whether they handle dichotomous-only or polytomous items more generally.
The item characteristic curve or ICC (in some contexts referred to as a category-
response curve) is the basic unit in IRT and can be understood as the probability of
endorsing an item for individuals with a given level of the attribute. Depending on the
IRT model employed, these curves indicate which items are more difficult, are better
discriminators of the attribute, and/or are likely to have been guesses. In contrast to
the correlation coefficient employed as the predominant technique of CTT, IRT has
proposed more complex statistical methods for working with large matrices of
correlations and co-variances including factor analysis (reducing data to its basic
underlying dimensions), multidimensional scaling (finding a simple representation for
high-dimensional data), data clustering (finding objects that are like each other),
structural equation modeling (analyzing causal relations in non-experimental data),
and path analysis (evaluating the contribution of any path or combination of paths to
an overall model). With such multivariate methods, proponents of IRT attempt to
simplify large amounts of data, which allow statistically sophisticated models to be
fitted to an individual’s data (individual item responses) and tested to determine if
they are adequate fits. Further, whereas CTT relies on the use of representative
samples, IRT employs test data from large samples known to differ on the construct
being examined but they need not be representative of defined populations.
Psychometrics 20-21 4

One of the most important applications of IRT is found in computer adaptive

testing (CAT) in which a test taker does not need to answer every item on a test for
adequate assessment. By presenting the examinee with a few items that cover the
range of difficulty of the test (e.g., 10 items comprising a routing test), it is possible to
identify an individual’s approximate level of ability and then ask only questions that
will further refine his or her position within that ability level. That is, following the
routing test, subsequent items are different based on how well test takers score on the
routing test. New items are calibrated to curves on large-scale data from one or more
IRT models to represent both item characteristics (item difficulty, discrimination) and
test taker characteristics (probability of guessing) In contrast to CTT that views an
individual’s test performance as a function solely of the test, IRT sees it as a joint
function of the person and the environment (more in line with everyday life and
modern conceptions of psychology). Finally, whereas CTT assumes that more
complete (longer) tests strengthens a test’s reliability, IRT and its subsequent use of
CAT assume that shorter tests can be more reliable than longer ones. Although IRT
produces more sophisticated information for test development that CTT, there has
been hesitancy to switch to the former since it is mathematically more complex,
unfamiliar to many psychologists, and without user friendly computer programs to run
the procedures. Please don’t despair: We will demonstrate and make easy both of
these two approaches to test construction.

With respect to the field’s second major research focus, similar reasoning that
was applied to the development of the first psychometric tests of intelligence
(Stanford-Binet, Wechsler scales) has been employed to develop other psychometric
tests and instruments within all subfields of psychology. Most notably, these include
those related to intelligence (e.g., those inherent in aptitude testing, achievement
testing, educational testing, and neuropsychological testing) and personality testing.
More recently, the use of psychological testing (primarily biographical data
instruments, cognitive ability/aptitude tests, and personality tests) has become
increasingly prominent in industrial-organizational psychology toward assessing
aspects of workplace functioning (both pre- and post-employment) that complements
the development of earlier vocational testing. The practical application of
psychometrics has been most consistently evident when a clinical psychologist is
asked to conduct a psychological assessment of which psychological testing is a part.
A psychological assessment takes place when a psychologist is asked to answer a
specific question about a patient’s functioning (e.g., differential diagnosis,
determination of functional vs. organic factors underlying symptoms, identification of
functional issues, recommendations for therapy and/or medication) based on his or her
observations of behavior, review of records, interviews (with the person and/or
Psychometrics 20-21 5

significant others), and administration of standardized rating scales (e.g., Autism

Quotient, Beck Depression Inventory) and standardized psychological tests.

There is reason to believe that training is psychometrics is both important and

professionally relevant. First, the advent of the field of psychometrics was intricately
connected to the birth of the field of psychology itself. James McKeen Cattell (whose
dissertation was entitled Psychometric Investigation) studied at the University of
Leipzig with Wilhelm Wundt who in 1897 established the first psychological
laboratory to become the father of psychology. In 1889, Cattell became the first
professor of psychology in the United States, teaching at the University of
Pennsylvania and helping to establish psychology as a legitimate science by initiating
the first mental testing efforts in the United States. Second, psychometrics has
constituted a significant part of psychology since its inception and continues to do so
to this very day. Representing psychology’s preferred experimental method,
psychometrics continues to generate much interest in newer subfields (e.g., industrial-
organizational psychology) and can be expected to do so in the future. Third, the
ability to conduct psychological testing is unique to psychologists. Members of no
other professional discipline can engage in the practice of psychological testing.

Fourth, the field of psychometrics offers numerous professional opportunities for

differing roles and responsibilities. For example, a psychometrist is one who
administers tests and a psychometrician is one who constructs tests. There is no
designated educational level for the former (although most typically hold bachelors or
masters degrees) while most of the latter are psychologists (one who is trained in a
wide variety of courses on researching, teaching, writing, or practicing clinically,
leading to a doctorate from a university or school of professional psychology). A
practicing clinical psychologist requires post-Ph.D. licensing for the purpose of
protecting the public, which is mandatory for legal practice (leading him or her to be
designated as a licensed psychology provider or LPP). In contrast, a psychometrician
may or may not obtain credentialing as a LPP and may also obtain voluntary
certification as a certified specialist in psychometry (CSP), earned by passing the
minimum competency examination of the foundational knowledge in psychometry.

Health History Assignment
No ratings yet
Health History Assignment
6 pages
Script-Jungle Cruise
No ratings yet
Script-Jungle Cruise
24 pages
Rollo May
No ratings yet
Rollo May
3 pages
Principles of Mirror and Curtain
No ratings yet
Principles of Mirror and Curtain
2 pages
Aryan Marriage Raghunatha Rao R. - Text
No ratings yet
Aryan Marriage Raghunatha Rao R. - Text
310 pages
Interpersonal Relationship
No ratings yet
Interpersonal Relationship
48 pages
Johari Window Model
No ratings yet
Johari Window Model
4 pages
Autonomic Nervous System Activity in Emotion: A Review: Sylvia D. Kreibig
No ratings yet
Autonomic Nervous System Activity in Emotion: A Review: Sylvia D. Kreibig
52 pages
Seminar ON Standardised: Tests
No ratings yet
Seminar ON Standardised: Tests
27 pages
Gordon Allport's Personality Theory - Video & Lesson Transcript
No ratings yet
Gordon Allport's Personality Theory - Video & Lesson Transcript
3 pages
Personality 1
No ratings yet
Personality 1
11 pages
Counselling Services (NEdu)
No ratings yet
Counselling Services (NEdu)
81 pages
Indian Psychology - Sankhya
No ratings yet
Indian Psychology - Sankhya
39 pages
Theories of Nursing Management
No ratings yet
Theories of Nursing Management
23 pages
Organizational Structure
No ratings yet
Organizational Structure
61 pages
Introduction of Human Values
No ratings yet
Introduction of Human Values
4 pages
Transactional Analysis
No ratings yet
Transactional Analysis
4 pages
Motivation: Presented By, Aparna Antony
No ratings yet
Motivation: Presented By, Aparna Antony
8 pages
Notes On Murray
100% (2)
Notes On Murray
8 pages
Altruism
100% (1)
Altruism
18 pages
Developmental Characteristics of Children and Adolescence SARAH&JAYBERT
No ratings yet
Developmental Characteristics of Children and Adolescence SARAH&JAYBERT
7 pages
Non-Parametric Tests
No ratings yet
Non-Parametric Tests
1 page
Personality Development: TOPIC: Biological Determinants of Personality: Heredity, Endocrine Glands
No ratings yet
Personality Development: TOPIC: Biological Determinants of Personality: Heredity, Endocrine Glands
10 pages
Self - Care Deficit Theory (Orem Theory)
No ratings yet
Self - Care Deficit Theory (Orem Theory)
15 pages
Pot On Standardized Test
No ratings yet
Pot On Standardized Test
35 pages
B.P. Koirala Institute of Health Sciences College of Nursing Dharan, Sunsari, Nepal
No ratings yet
B.P. Koirala Institute of Health Sciences College of Nursing Dharan, Sunsari, Nepal
15 pages
Motivational Theory
No ratings yet
Motivational Theory
38 pages
Chapter 6-PERCEPTION AND COMMUNICATION
No ratings yet
Chapter 6-PERCEPTION AND COMMUNICATION
22 pages
4.kurt Lewin Model
0% (1)
4.kurt Lewin Model
4 pages
Anecdotal Records: DEFINITION: An Anecdotal Record Is An Observational Method Used Frequently in Classroom or
No ratings yet
Anecdotal Records: DEFINITION: An Anecdotal Record Is An Observational Method Used Frequently in Classroom or
8 pages
Weidenback Theory
No ratings yet
Weidenback Theory
26 pages
16 Personality Factors Summery
No ratings yet
16 Personality Factors Summery
10 pages
Personality Development
No ratings yet
Personality Development
14 pages
What Are Anxiety Disorders
No ratings yet
What Are Anxiety Disorders
4 pages
Johari Window
No ratings yet
Johari Window
4 pages
Mediation
No ratings yet
Mediation
77 pages
Personality Ppt Riyas Indu-1
No ratings yet
Personality Ppt Riyas Indu-1
79 pages
Counseling at Workplace: Diya Gautam Part II, Sem IV
No ratings yet
Counseling at Workplace: Diya Gautam Part II, Sem IV
16 pages
5 Roles of A Physical Therapist - 2019 S
No ratings yet
5 Roles of A Physical Therapist - 2019 S
19 pages
Reality Therapy: Dr. Arra PSY 201
100% (2)
Reality Therapy: Dr. Arra PSY 201
20 pages
Fields of Psychology
No ratings yet
Fields of Psychology
14 pages
Learning
No ratings yet
Learning
67 pages
EXPERIMENTAL & Non-Experimental Research
No ratings yet
EXPERIMENTAL & Non-Experimental Research
1 page
Life Skill Education
No ratings yet
Life Skill Education
3 pages
Effects of Stress Project
No ratings yet
Effects of Stress Project
33 pages
What Is Watson's Theory of Transpersonal Caring?
No ratings yet
What Is Watson's Theory of Transpersonal Caring?
5 pages
Taste and Smell: - by Group 3B
No ratings yet
Taste and Smell: - by Group 3B
11 pages
Kohlberg's Theory of Moral Development
No ratings yet
Kohlberg's Theory of Moral Development
4 pages
A Comparative Study of Emotional Intelligence Among B.A, B. Com and B. Sc. College Students
No ratings yet
A Comparative Study of Emotional Intelligence Among B.A, B. Com and B. Sc. College Students
10 pages
Psychosocial Aspects of Aging: By: LGS, RN MN
No ratings yet
Psychosocial Aspects of Aging: By: LGS, RN MN
18 pages
Staffing Philosophy Norms
No ratings yet
Staffing Philosophy Norms
26 pages
Mental Set
No ratings yet
Mental Set
2 pages
Mental Hygiene: - Ankita Telwane
No ratings yet
Mental Hygiene: - Ankita Telwane
22 pages
Directing in Management
No ratings yet
Directing in Management
5 pages
Age Related Geriatrics Problem
100% (1)
Age Related Geriatrics Problem
11 pages
Sampling 2
No ratings yet
Sampling 2
9 pages
Dr. Geoffrey Wango Department of Psychology University of Nairobi
100% (1)
Dr. Geoffrey Wango Department of Psychology University of Nairobi
61 pages
Factors Affecting Faculty Staff RS
No ratings yet
Factors Affecting Faculty Staff RS
25 pages
Stress and Coping Notes
100% (1)
Stress and Coping Notes
10 pages
Intelligence Test (By-Shyam)
No ratings yet
Intelligence Test (By-Shyam)
13 pages
Group 8 - PPT Theory of Human Needs
No ratings yet
Group 8 - PPT Theory of Human Needs
40 pages
Work Psychology Draft Final Assessment
No ratings yet
Work Psychology Draft Final Assessment
15 pages
Tmpa291 TMP
No ratings yet
Tmpa291 TMP
11 pages
Who Belies Deen by G A Parwez Published by Idara Tulu-E-Islam
No ratings yet
Who Belies Deen by G A Parwez Published by Idara Tulu-E-Islam
25 pages
Modul 3 Mathematical Modeling of Dynamic Systems
No ratings yet
Modul 3 Mathematical Modeling of Dynamic Systems
12 pages
Heading 1: Not The Nine O'Clock News The Young Ones
No ratings yet
Heading 1: Not The Nine O'Clock News The Young Ones
2 pages
Lbsim PGDM - Fin: Financial Institutions & Markets
No ratings yet
Lbsim PGDM - Fin: Financial Institutions & Markets
110 pages
Metabolic Bone Disease
No ratings yet
Metabolic Bone Disease
37 pages
Strategic Management-Unit: 1 Definition - Strategic Management
No ratings yet
Strategic Management-Unit: 1 Definition - Strategic Management
14 pages
Brosur Dan Etikel RL Dan Salep Mata
No ratings yet
Brosur Dan Etikel RL Dan Salep Mata
1 page
Unit 2: The Consequences of Fraud: 1.what Happened?
No ratings yet
Unit 2: The Consequences of Fraud: 1.what Happened?
3 pages
Lac Reflection Journal
No ratings yet
Lac Reflection Journal
4 pages
Fostoria Intermediate: From The Desk of Mrs. Matz
No ratings yet
Fostoria Intermediate: From The Desk of Mrs. Matz
7 pages
Practical Work Past Tense
No ratings yet
Practical Work Past Tense
1 page
En Barozzi-Veiga Portfolio-2019
No ratings yet
En Barozzi-Veiga Portfolio-2019
13 pages
VP Communications Marketing Executive in Washington DC Resume Woody Goulart
No ratings yet
VP Communications Marketing Executive in Washington DC Resume Woody Goulart
2 pages
The Heroic Cycle Heroic Cycle
No ratings yet
The Heroic Cycle Heroic Cycle
6 pages
I Suggest (Hold) Another Meeting Next Week.: The Gerund
No ratings yet
I Suggest (Hold) Another Meeting Next Week.: The Gerund
4 pages
CRIM Law Quiz
No ratings yet
CRIM Law Quiz
1 page
Tactics Vs Man United
No ratings yet
Tactics Vs Man United
2 pages
Bayes Theorem AI
No ratings yet
Bayes Theorem AI
5 pages
B44 City of Iloilo v. Contreras-Besana
100% (1)
B44 City of Iloilo v. Contreras-Besana
2 pages
Elizabethan & Jacobean Drama
No ratings yet
Elizabethan & Jacobean Drama
3 pages
The Dance Divine
No ratings yet
The Dance Divine
4 pages
Hezekiah and Archaeology. The Answer For Nadav Na'aman
No ratings yet
Hezekiah and Archaeology. The Answer For Nadav Na'aman
16 pages
Active and Passive Voice Present Continuous Grammar
67% (3)
Active and Passive Voice Present Continuous Grammar
4 pages
The Purpose of The Oracle Retail Sales Audit
No ratings yet
The Purpose of The Oracle Retail Sales Audit
22 pages
B.E - Mechanical and Automation Engineering - R2016
0% (1)
B.E - Mechanical and Automation Engineering - R2016
150 pages
Ch16-Equilibria Weak Acids Bases
No ratings yet
Ch16-Equilibria Weak Acids Bases
71 pages
EMS: Network Analysis Functions
No ratings yet
EMS: Network Analysis Functions
10 pages

An Introduction To Psychometrics

Uploaded by

An Introduction To Psychometrics

Uploaded by

Psychometrics 20-21 1

Psychometrics is a subfield of psychology, which concerns the quantitative

In contrast to CTT whose interest centers mainly on overall test scores, an

However, with the advent of computers, precise mathematical procedures began

IRT is a collection of measurement models that are mathematical equations

One of the most important applications of IRT is found in computer adaptive

significant others), and administration of standardized rating scales (e.g., Autism

There is reason to believe that training is psychometrics is both important and

Fourth, the field of psychometrics offers numerous professional opportunities for

You might also like