Prsentaion of Pyschometrics
Prsentaion of Pyschometrics
So, measurement is, much like the rest of research that we’ve
learned about so far, all about representation !!!
• Scale (Test) - all items are “put together” to get a single score
Population Norms
Validity
Reliability
Standardization
Standardization
Administration – test is “given” the same way every time
• who administers the instrument
• specific instructions, order of items, timing, etc.
• Varies greatly - multiple-choice classroom test hand it out)
- WAIS -- 100+ page administration
manual
So, the real question is, “Is this test a valid measure of this
construct for this population in this application?” That
question can be answered!
Face Validity
Does the test “look like” a measure of the construct of interest?
• “looks like” a measure of the desired construct to a member
of the target population
• will someone recognize the type of information they are
responding to?
• Possible advantage of face validity ..
• If the respondent knows what information we are looking for,
they can use that “context” to help interpret the
questions and provide more useful, accurate answers
• Possible limitation of face validity …
• if the respondent knows what information we are looking for,
they might try to “bend & shape” their answers to what
they think we want -- “fake good” or “fake bad”
Content Validity
Does the test contain items from the desired “content domain”?
• Based on assessment by “subject matter experts” (SMEs) in
that content domain
• Is especially important when a test is designed to have low face
validity
• e.g., tests of “honesty” used for hiring decisions
• Is generally simpler for “achievement tests” than for
“psychological constructs” (or other “less concrete” ideas)
• e.g., it is a lot easier for “math experts” to agree whether or
not an item should be on an algebra test than it is for
“psychological experts” to agree whether or not an items
should be on a measure of depression.
• Content validity is not “tested for”. Rather it is “assured” by the
informed item selections made by experts in the domain.
Criterion-related Validity
Do the test scores correlate with criterion behavior scores??
Discriminant Validity
• Does the test show the “right” pattern of interrelationships with
other variables? -- has two parts
• Convergent Validity -- test correlates with other measures of
similar constructs
• Divergent Validity -- test isn’t correlated with measures of
“other, different constructs”
• e.g., a new measure of depression should …
• have “strong” correlations with other measures of “depression”
• have negative correlations with measures of “happiness”
• have “substantial” correlation with measures of “anxiety”
• have “minimal” correlations with tests of “physical health”,
“faking bad”, “self-evaluation”, etc.
Population Norms
In order to interpret a score from an individual or group, you must
know what scores are typical for that population
• Requires a large representative sample of the target population
• preferably random, research-selected & stratified
• Requires solid standardization both administrative & scoring
• Requires great inter-rater reliability if subjective
The Result ??
A scoring distribution of the population.
• lets us identify “normal,” “high” and “low” scores
• lets us identify “cutoff scores” to define important populations
and subpopulations
Desirable Properties of Psychological Measures
Interpretability of Individual and Group Scores
Population Norms
Scoring Distribution & Cutoffs
Validity
Face, Content, Criterioin-Related, Construct
Reliability
Interrater, Internal Consistency, Test-Retest & Alternate Forms
Standardization
Administration & Scoring
Reverse Keying
We want the respondents to carefully read an separately respond
to each item of our scale/test. One thing we do is to write the
items so that some of them are “backwards” or “reversed” …
Consider these items from a depression measure…
1. It is tough to get out of bed some mornings. disagree 1 2 3 4 5
agree
2. I’m generally happy about my life. 1 2 3 4 5
3. I sometimes just want to sit and cry. 1 2 3 4 5
4. Most of the time I have a smile on my face. 1 2 3 4 5