Chapter 05 Validity
Chapter 05 Validity
Class: Date:
Chapter 05: Validity
1. How does the current edition of Standards for Educational and Psychological Testing conceptualize validity?
a. Validity is a non-statistical measure for non-quantitative tests.
b. Validity should only be assessed when reliability is low.
c. Validity is a unitary concept that represents all evidence.
d. Only one type of validity must be high and the others can be low.
ANSWER: c
2. As a result of the Griggs v. Duke Power decision, employers must provide evidence that a test used for employee
selection or promotion has
a. test-retest reliability.
b. inter-rater reliability.
c. specific meaning.
d. internal consistency.
ANSWER: c
3. Evidence for validity should include what information about construct, criterion, and content validation?
a. any form suffices
b. two forms of necessary
c. two forms are necessary plus information about face validity
d. all three need be presented in unified form.
ANSWER: d
4. Why does the current edition of Standards for Educational and Psychological Testing reject face validity?
a. By definition, all tests have face validity.
b. Face validity in not technically a type of validity.
c. No reliable measures of face validity have been found.
d. Face validity is irrelevant.
ANSWER: b
5. Which type of validity is most important for educational tests?
a. face
b. content
c. concurrent
d. predictive
ANSWER: b
Chapter 05: Validity
6. Why is evidence for criterion-related validity gathered?
a. It is legally required.
b. It is critical for reliability.
c. Tests serve as a substitute for the real measure of interest.
d. Criterion validity subsumes all other validity.
ANSWER: c
7. Which of the following should be considered when evaluating evidence for validity?
a. Whether the cause of the relationship between the test and the criterion has changed since the test was
validated.
b. Excessively high validity may indicate a need for improving reliability.
c. Validity is only of concern when legal challenges are expected.
d. Tests that are highly valid for at least two distinct populations are likely to be valid for all populations.
ANSWER: a
8. Which of the following is an important validation consideration in industrial settings?
a. pay grade
b. occupational classification
c. attrition
d. education
ANSWER: c
9. If the results obtained in a validity study are not specific to the validation population, what quality does the validation
have?
a. differential validation
b. convergence
c. population crossover
d. generalizability
ANSWER: d
10. Under what circumstances is construct validation essential?
a. when the test has some type of novel approach
b. when no criterion or universe of content is accepted as adequate to define the quality
c. when insurance reimbursement is going to be sought
d. when a single, large-scale validation study is to be performed
ANSWER: b
Chapter 05: Validity
11. According to Campbell and Fiske, what types of evidence are required for a test to be meaningful?
a. reliability and validity
b. content and construct
c. convergent and divergent
d. quantitative and qualitative
ANSWER: c
12. Which of the following is true of the relationship between reliability and validity?
a. It is possible to create a highly reliable test that lacks validity.
b. If a test lacks reliability, it is important to emphasize its validity.
c. High levels of validity are often associated with low reliability.
d. There is an inverse relationship between reliability and validity.
ANSWER: a
13. The agreement between a test score and the construct it is presumed to measure is referred to as its
a. stability.
b. alpha level.
c. reliability.
d. validity.
ANSWER: d
14. Validity refers to
a. the stability of a score.
b. the alpha level.
c. the agreement between a test score and the quality it is believed to measure.
d. item to total correlation.
ANSWER: c
15. "Face validity" is
a. the correlation between a test score and some poorly established criterion.
b. the best evidence for validity.
c. acceptable for research studies but not for tests used in clinical practice.
d. not really a form of validity because it offers no evidence to support conclusions.
ANSWER: d
Chapter 05: Validity
16. Which of the following was not one of the major categories recognized in the booklet on Standards for Educational
and Psychological Testing?
a. content validity
b. criterion validity
c. convergent validity
d. construct validity
ANSWER: c
17. Concurrent and predictive validity are both subcategories of
a. criterion validity.
b. empirical validity.
c. face validity.
d. convergent validity.
ANSWER: a
18. Which type of validity is unique because it is not statistically-based?
a. predictive
b. concurrent
c. content
d. construct
ANSWER: c
19. Asking patients where their injured back hurts illustrates ____ validity.
a. predictive
b. face
c. construct
d. content
ANSWER: b
20. When Ms. Anderson applied for an administrative assistant job in a factory, she was given a test that examined her
typing ability, political affiliation, and filing ability. This test clearly violates what kind of validity?
a. predictive
b. construct
c. criterion
d. content
ANSWER: d
Chapter 05: Validity
21. Many psychologists no longer believe this to be a separate type of evidence for validity because of the lack of clear-
cut boundaries between it and other types of evidence for validity. What is it?
a. content
b. criterion
c. constructive
d. predictive
ANSWER: a
22. Which type of validity requires that test items provide an adequate representation of the conceptual domain they are
designed to cover?
a. face validity
b. content validity
c. predictive validity
d. construct validity
ANSWER: b
23. If scores on a test of intelligence are influenced by eye color, this is an example of
a. face validity
b. construct underrepresentation.
c. aggregate validity.
d. construct-irrelevant variance.
ANSWER: d
24. Both convergent and discriminant evidence are essential for
a. content validity.
b. predictive validity.
c. concurrent validity.
d. construct validity.
ANSWER: d
25. If the test score forecasts some criterion, the test has
a. face validity.
b. content validity.
c. predictive validity.
d. concurrent validity.
ANSWER: c
Chapter 05: Validity
26. Seaman Sanchez had a hearing impairment, but passed the test to become a radio operator because the test did not
measure hearing acuity. What does this represent?
a. criterion/predictor confusion
b. construct underrepresentation
c. construct-irrelevant variance
d. criterion-related evidence for validity
ANSWER: b
27. A test that has a high correlation with some well-accepted outcome measure provides evidence for
a. face validity.
b. criterion validity.
c. construct validity.
d. content validity.
ANSWER: b
28. The advantage of using a test that is "face valid" is that
a. the appearance of validity can help motivate test-takers because they can see that the test is relevant.
b. it assures that the items are properly sampled from the domain of content.
c. face validity is an important component of criterion validity.
d. it boosts the construct validity coefficient.
ANSWER: a
29. A student confronts a professor and says, "You assigned Chapters 7 through 10 but nearly all of the items came from
Chapter 7. How can you evaluate whether we know anything about the other material we were supposed to read?"
The student is challenging the test on the basis of
a. face validity.
b. content validity.
c. criterion validity.
d. construct validity.
ANSWER: b
30. The evidence is logical rather than statistical for
a. face validity and predictive validity.
b. criterion validity and content validity.
c. content validity and face validity.
d. content validity and concurrent validity.
ANSWER: c
Chapter 05: Validity
31. Job samples exemplify the use of
a. criterion validity.
b. predictive validity.
c. concurrent validity.
d. content validity.
ANSWER: c
32. Which of the following is a legal requirement for the validity of a test that is used to select employees for a job?
a. The test must focus on job-related tasks.
b. The test must include multiple choice, short-answer, and essay questions.
c. The test must include items taken directly from other, related tests.
d. The test must be uncorrelated with similar tests.
ANSWER: a
33. The extent to which a test is valid for making statements about the criterion is represented by its
a. content validity.
b. validity coefficient.
c. squared validity coefficient.
d. construct-irrelevant variance.
ANSWER: b
34. The proportion of variation in the criterion that is knowable in advance from knowledge of the test scores can be
found by
a. construct-irrelevant variance.
b. the validity coefficient.
c. squaring the validity coefficient.
d. the square root of the validity coefficient.
ANSWER: c
35. A validity coefficient is
a. the correlation between a test and the validity criterion.
b. the squared correlation between the test and the validity criterion.
c. the square root of the correlation between the test and the validity criterion.
d. one minus the correlation between a test and the validity criterion.
ANSWER: a
Chapter 05: Validity
36. Hypothetically, 9% of the variation of skill among dentists is accounted for by the test that they took in order to pass
the dental board examination. What is the validity coefficient of the test?
a. 0.03
b. 0.3
c. 0.9
d. 3.0
ANSWER: b
37. Which of the following is one of the eight recommended criteria for evaluating validity coefficients?
a. Determine the statistical sophistication of the researchers who conducted the validation.
b. Do not confuse the criterion with the predictor.
c. Make sure the results obtained from the validity study are situation specific.
d. Perform a small-scale validation study with the population of interest to you.
ANSWER: b
38. Which of the following would be a cause for concern when assessing the validity of an employment test?
a. The validity study was done on a somewhat small sample.
b. The test correlates highly with other tests of known validity.
c. The test was only validated on an all white, male population.
d. There is great variability in both predictor and criterion.
ANSWER: c
39. A careful test constructor obtained the correlation between her test and a well-defined validity criterion. She then
repeated the study to determine whether this same relationship held in another sample. This exercise is known as
a. correction for attenuation.
b. construct validation.
c. cross validation.
d. criterion/predictor confusion.
ANSWER: c
40. Some studies have failed to demonstrate that the Graduate Record Exam (GRE) is a good predictor of grade point
average in graduate school. One of the difficulties with these studies is that
a. the Graduate Record Exam is rarely used as a predictor of graduate school success.
b. the Graduate Record Exam was validated on a very small sample of students.
c. both grade point average and GRE scores have a restricted range for graduate students.
d. the Graduate Record Exam has no reliability.
ANSWER: c
Chapter 05: Validity
41. If a validity coefficient of .3 is found between a written test of baseball rules and actual skills playing baseball, what
percentage of the variation in baseball skills would be accounted for?
a. 3%
b. 6%
c. 9%
d. 30%
ANSWER: c
42. The problem with using the GRE to determine whether students who have already been admitted to graduate school
should be given their degrees is that
a. the criterion validity of the GRE is not well enough established.
b. the sample size is too small.
c. the causes of performance at the school where this procedure is used might be different than the causes of
performance at the schools where the GRE was validated.
d. this policy confuses the predictor and the criterion.
ANSWER: d
43. Construct validity is used when
a. a test can be validated against a single criterion.
b. we cannot clearly define what we want to measure.
c. we want to construct a new test for a specific purpose.
d. we draw samples of items from some clearly defined.
ANSWER: b
44. Suppose you wanted to develop a measure of jealousy, but were unable to define precisely the concept. What type of
validation studies would you most likely use?
a. face validation
b. reliability validation
c. criterion validation
d. construct validation
ANSWER: d
45. In Rubin's study on the Love Scale, the amount of time lovers spent gazing into each other's eyes was used as
evidence for the scale's
a. face validity.
b. criterion validity.
c. convergent validity.
d. content validity.
ANSWER: c
Chapter 05: Validity
46. To develop a test with construct validity, what types of validity evidence are needed?
a. face validity and predictive validity
b. discriminant validity and predictive validity
c. face validity and content validity
d. discriminant validity and convergent validity
ANSWER: d
47. An exclusive preschool program has strict enrollment criteria and only accepts children with very high marks on the
enrollment test. Unfortunately, they have found that the enrollment test does not correlate highly with the grades the
children make. The probable reason is
a. predictor/criterion confusion.
b. an invalid criterion.
c. convergent evidence.
d. range restriction.
ANSWER: d
48. If a variable has a "restricted range", it is difficult to estimate a validity coefficient due to a lack of
a. criterion validity.
b. variability.
c. adequate sample size.
d. cross validation.
ANSWER: b
49. A researcher seeking to develop a measure of depression cites a moderate correlation between her measure and the
Beck Depression Inventory as evidence of the test's validity. What type of validity documentation does this provide?
a. convergent validity
b. discriminant validity
c. content validity
d. face validity
ANSWER: a
50. Discriminant and convergent evidence provide evidence for what type of validity?
a. predictive
b. criterion
c. content
d. construct
ANSWER: d
Chapter 05: Validity
51. Dr. Tong is trying to define 'life satisfaction' and to create a test that will assess the level of a person's satisfaction
with their life. She is planning to find out how well her test correlates with other tests believed to measure life
satisfaction. This is an example of
a. convergent evidence.
b. divergent evidence.
c. discriminant evidence.
d. differential prediction.
ANSWER: a
52. Several measures are already available that assess coping skills. In order to prove that a new measure of coping is
needed, what type of validity evidence should be provided?
a. convergent
b. discriminant
c. discrete
d. predictive
ANSWER: b
53. Cronbach and other authors have argued that all types of validity are really categories of
a. face validity.
b. criterion validity.
c. content validity.
d. construct validity.
ANSWER: d
54. At what point would we assume that the construct validity of a test has been established and no more studies should
be done?
a. when we have shown that it correlates with several other measures designed to tap the same construct
b. when we have shown that it does not correlate with measures designed to tap different constructs
c. never – construct validation is an ongoing process
d. when the validity coefficients are above .8
ANSWER: c
55. In construct validation,
a. a single variable can serve as the criterion.
b. multiple studies are not required.
c. no single variable serves as the criterion.
d. the criterion remains static once defined.
ANSWER: c
Chapter 05: Validity
56. Which of the following illustrates lack of convergent validity?
a. demonstrating that tests A, B, and C all measure the same things for the same purpose
b. demonstrating a particular relationship between a test of job satisfaction and an employee's stated job
satisfaction
c. demonstrating a relationship between the results of a physical exam and a health index score
d. demonstrating that a score on the SAT and ACT have no correlation
ANSWER: d
57. Evidence that indicates a measure does not represent a construct other than the one for which it was devised is ____
validity.
a. convergent
b. discriminant
c. content
d. predictive
ANSWER: b
58. The type of validity that subsumes all other types of validity is
a. construct.
b. discriminant.
c. predictive.
d. content.
ANSWER: a
59. Which of the following statements is true?
a. We can always demonstrate that a reliable test has meaning.
b. It is logically impossible to demonstrate that a totally unreliable test is valid.
c. A test cannot be reliable unless it is also valid.
d. Reliability and validity are basically unrelated concepts.
ANSWER: b
60. A test with low reliability
a. will not have a high validity coefficient.
b. may or may not have a high validity coefficient.
c. is likely to be face valid.
d. can be used in clinical practice but not research.
ANSWER: a
Chapter 05: Validity
61. The validity coefficient of an aggression scale is .5. Using a score on this scale as a predictor of domestic violence,
how much of the variability in domestic violence can be attributed to scores on the aggression scale?
a. 25%
b. 50%
c. none
d. indeterminate
ANSWER: b
62. Criterion-related validity is meaningful when
a. the predictor is stable.
b. the criterion is valid and reliable.
c. the criterion is valid but not necessarily reliable.
d. the predictor is reliable.
ANSWER: b
63. When evidence obtained in one situation is not found in other similar situations, the evidence lacks
a. generalizability.
b. criterion.
c. predictor.
d. inter-situation reliability.
ANSWER: a
64. A professor considers the difference of validity of his mid-term exam between male and female students. This
professor is dealing with
a. range restriction.
b. confusion between predictor and criterion.
c. differential prediction.
d. face validity.
ANSWER: c
65. A student stays up all night studying for a test. She is so exhausted that she does not perform well even though she
has learned the content for the test. This is an example of
a. poor construct validity.
b. predictive validity evidence.
c. construct-irrelevant invariance.
d. construct underrepresentation.
ANSWER: c
Chapter 05: Validity
66. The different categories of validity
a. are important distinctions with little overlap.
b. have little use when determining overall validity.
c. must each be considered in order to determine validity of a scale.
d. are convenient grouping methods, but not distinctly different from each other.
ANSWER: d
67. Most aspects of validity can be seen in terms of these categories.
a. face-related, predictive-related, and construct-related
b. construct-related, criterion-related, and content-related
c. concurrent-related, discriminant-related, and content-related
d. convergent-related, concurrent-related, and discriminant-related
ANSWER: b
68. A professor is developing a final exam for her course. She goes through each chapter of the textbook for the
semester and develops questions from each to ensure they are all sufficiently covered in the exam. Which type of
validity is this professor ensuring for her exam?
a. content validity
b. construct validity
c. predictive validity
d. criterion validity
ANSWER: a
69. Which of the following is a predictor variable for the criterion of college GPA?
a. College test scores
b. High school graduation rates
c. The SAT
d. Graduate school admittance rates
ANSWER: c
70. Which of the following is equivalent to the percentage of variation in the criterion that we can expect to know in
advance because of our knowledge of the test scores?
a. the validity coefficient
b. the validity coefficient squared
c. the correlation coefficient
d. the correlation coefficient squared
ANSWER: b
Chapter 05: Validity
71. Assembling evidence about what a test means is referred to as
a. cross validation
b. prediction
c. range restriction
d. construct validation
ANSWER: d
72. A new measure for anxiety correlates highly with the Beck Anxiety Inventory. This new measure is demonstrating
a. discriminant validity.
b. convergent validity.
c. construct validity.
d. predictive validity.
ANSWER: b
73. According to your book, in some settings–particularly in education–tests have been criticized simply because of
improper interpretation or over-reliance on a single measure. Discuss the topic and suggest ways to improve the
situation.
ANSWER: Answer not provided.
74. It is important to consider several important factors when interpreting the meaning of a validity coefficient. Outline
and discuss what those factors are. Give examples when appropriate.
ANSWER: Answer not provided.
75. When your colleague tries to develop a scale to measure stress, what recommendations should you give the colleague
in terms of validity? Give examples.
ANSWER: Answer not provided.
76. Explain how reliability is related to validity, and/or vice versa.
ANSWER: Answer not provided.
77. Suppose you are consulting for a company that uses a test to predict how successful job applicants might be on the
job. The test is being challenged in the courts. Discuss the types of evidence you would use to defend the test.
ANSWER: Answer not provided.
78. Discuss the factors a university should consider in deciding whether or not to eliminate the SAT or ACT as an
admissions requirement.
ANSWER: Answer not provided.
79. Explain the process of construct validation.
ANSWER: Answer not provided.