Presentation Outline

This document outlines the key aspects of ensuring the validity and reliability of measurement instruments. It discusses three types of validity: content validity, criterion-related validity (which includes concurrent and predictive validity), and construct validity. It also describes several aspects of reliability, including test-retest reliability, parallel-form reliability, internal consistency reliability (measured by Cronbach's alpha and split-half reliability), and inter-rater reliability. Establishing the validity and reliability of measures is important to ensure the instrument is accurately measuring the intended concept.


PRESENTATION OUTLINE

I – OVERVIEW: GOODNESS OF MEASURES

II – RELIABILITY
a. Stability of Measures
Test-Retest Reliability
Parallel Form Reliability
b. Internal Consistency of Measures
Interitem Consistency Reliability
Split-Half Reliability

III – VALIDITY
a. Content Validity
Face Validity
b. Criterion-Related Validity
Concurrent Validity
Predictive Validity
c. Construct Validity
Convergent Validity
Discriminant Validity

SOURCE MATERIAL:
“Research Methods for Business: A Skill Building Approach 4th Edition” by Uma Sekaran
GOODNESS OF MEASURES

It is important to make sure that the instrument that we develop to measure a particular concept is indeed
accurately measuring the variable, and that in fact, we are actually measuring the concept that we set out
to measure. This ensures that in operationally defining perceptual and attitudinal variables, we have not
overlooked some important dimensions and elements or included some irrelevant ones. The scales
developed could often be imperfect, and errors are prone to occur in the measurement of attitudinal
variables. The use of better instruments will ensure more accuracy in results, which in turn, will enhance
the scientific quality of the research. Hence, in some way, we need to assess the "goodness" of the
measures developed. That is, we need to be reasonably sure that the instruments we use in our research do
indeed measure the variables they are supposed to, and that they measure them accurately.

Let us now examine how we can ensure that the measures developed are reasonably good. First, an item
analysis of the responses to the questions tapping the variable is done, and then the reliability and validity
of the measures are established, as described below.

ITEM ANALYSIS is done to see if the items in the instrument belong there or not. Each item is
examined for its ability to discriminate between those subjects whose total scores are high, and those with
low scores. In item analysis, the means between the high-score group and the low-score group are tested
to detect significant differences through the t-values. The items with high t-values (i.e., the highly
discriminating items) are then included in the instrument. Thereafter,
tests for the reliability of the instrument are done and the validity of the measure is established.
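A minimal sketch of this item analysis, assuming item responses are held in a pandas DataFrame with one column per item; the top/bottom 27% grouping and the Welch t-test are common conventions, not prescribed by the source:

import pandas as pd
from scipy import stats

def item_discrimination(responses: pd.DataFrame, tail: float = 0.27) -> pd.Series:
    # Total score per respondent, then form high- and low-scoring groups.
    total = responses.sum(axis=1)
    high = responses[total >= total.quantile(1 - tail)]
    low = responses[total <= total.quantile(tail)]
    # t-value of each item comparing the two groups; larger values mean the
    # item discriminates better between high and low scorers.
    t_values = {
        item: stats.ttest_ind(high[item], low[item], equal_var=False).statistic
        for item in responses.columns
    }
    return pd.Series(t_values).sort_values(ascending=False)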

Very briefly, reliability tests how consistently a measuring instrument measures whatever concept it is
measuring. Validity tests how well an instrument that is developed measures the particular concept it is
intended to measure. In other words, validity is concerned with whether we measure the right concept,
and reliability with stability and consistency of measurement. Validity and reliability of the measure attest
to the scientific rigor that has gone into the research study. These two criteria will now be discussed.

RELIABILITY

The reliability of a measure indicates the extent to which it is without bias (error free) and hence ensures
consistent measurement across time and across the various items in the instrument. In other words, the
reliability of a measure is an indication of the stability and consistency with which the instrument
measures the concept and helps to assess the "goodness" of a measure.

Stability of Measures
The ability of a measure to remain the same over time—despite uncontrollable testing conditions or the
state of the respondents themselves—is indicative of its stability and low vulnerability to changes in the
situation. This attests to its "goodness" because the concept is stably measured, no matter when it is done.
Two tests of stability are test–retest reliability and parallel-form reliability.

Test–Retest Reliability

The reliability coefficient obtained with a repetition of the same measure on a second occasion is
called test–retest reliability. That is, when a questionnaire containing some items that are
supposed to measure a concept is administered to a set of respondents now, and again to the same
respondents, say several weeks to 6 months later, then the correlation between the scores obtained
at the two different times from one and the same set of respondents is called the test–retest
coefficient. The higher it is, the better the test–retest reliability, and consequently, the stability of
the measure across time.

Parallel-Form Reliability
When responses on two comparable sets of measures tapping the same construct are highly
correlated, we have parallel-form reliability. Both forms have similar items and the same
response format, the only changes being the wordings and the order or sequence of the questions.
What we try to establish here is the error variability resulting from wording and ordering of the
questions. If two such comparable forms are highly correlated (say .8 and above), we may be
fairly certain that the measures are reasonably reliable, with minimal error variance caused by
wording, ordering, or other factors.
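Both tests of stability come down to correlating two score vectors obtained from the same respondents: scores from two administrations of the same form (test–retest) or from two comparable forms (parallel-form). A minimal sketch, with made-up scores for illustration:

import numpy as np

def stability_coefficient(scores_1: np.ndarray, scores_2: np.ndarray) -> float:
    # Pearson correlation between the two sets of scores; the closer to 1,
    # the more stable the measure.
    return float(np.corrcoef(scores_1, scores_2)[0, 1])

# Hypothetical scores of five respondents at time 1 and time 2 (or on form A and form B).
time_1 = np.array([12, 18, 25, 30, 22])
time_2 = np.array([14, 17, 27, 29, 20])
print(stability_coefficient(time_1, time_2))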

Internal Consistency of Measures

The internal consistency of measures is indicative of the homogeneity of the items in the measure that tap
the construct. In other words, the items should "hang together" as a set, and be capable of independently
measuring the same concept so that the respondents attach the same overall meaning to each of the items.
This can be seen by examining if the items and the subsets of items in the measuring instrument are
correlated highly. Consistency can be examined through the inter-item consistency reliability and split-
half reliability tests.

Interitem Consistency Reliability

This is a test of the consistency of respondents' answers to all the items in a measure. To the
degree that items are independent measures of the same concept, they will be correlated with one
another. The most popular test of interitem consistency reliability is Cronbach's coefficient
alpha (Cronbach's alpha; Cronbach, 1946), which is used for multipoint-scaled items, and the
Kuder–Richardson formulas (Kuder & Richardson, 1937), used for dichotomous items. The
higher the coefficients, the better the measuring instrument.
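Cronbach's alpha can be computed directly from the item scores using its standard formula. A minimal sketch, assuming a two-dimensional array with one row per respondent and one column per multipoint-scaled item:

import numpy as np

def cronbach_alpha(items: np.ndarray) -> float:
    # Standard formula: alpha = k/(k-1) * (1 - sum of item variances / variance of total score).
    items = np.asarray(items, dtype=float)
    k = items.shape[1]
    item_variances = items.var(axis=0, ddof=1)
    total_variance = items.sum(axis=1).var(ddof=1)
    return (k / (k - 1)) * (1 - item_variances.sum() / total_variance)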

Split-Half Reliability

Split-half reliability reflects the correlations between two halves of an instrument. The estimates
vary depending on how the items in the measure are split into two halves. Split-half
reliabilities can be higher than Cronbach's alpha only when there is more than one underlying
response dimension tapped by the measure and certain other conditions are met as well (for
complete details, refer to Campbell, 1976). Hence, in almost all cases, Cronbach's alpha can be
considered a perfectly adequate index of interitem consistency reliability.
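A split-half estimate correlates respondents' scores on two halves of the instrument. The odd/even split below is only one of the possible splits, and the Spearman-Brown step-up used to project the half-length correlation to the full instrument is a standard convention not discussed in the text:

import numpy as np

def split_half_reliability(items: np.ndarray) -> float:
    items = np.asarray(items, dtype=float)
    half_1 = items[:, 0::2].sum(axis=1)   # odd-numbered items
    half_2 = items[:, 1::2].sum(axis=1)   # even-numbered items
    r = np.corrcoef(half_1, half_2)[0, 1]
    # Spearman-Brown correction estimates the reliability of the full-length instrument.
    return 2 * r / (1 + r)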

It should be noted that the consistency of the judgment of several raters on how they view a
phenomenon or interpret some responses is termed interrater reliability, and should not be
confused with the reliability of a measuring instrument. As we had noted earlier, interrater
reliability is especially relevant when the data are obtained through observations, projective tests,
or unstructured interviews, all of which are liable to be subjectively interpreted.
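The text does not prescribe a statistic for interrater reliability; one common choice for two raters giving categorical judgments is Cohen's kappa, which corrects raw agreement for chance. A small illustration with hypothetical codings:

from sklearn.metrics import cohen_kappa_score

# Hypothetical categorical judgments by two raters on the same five observations.
rater_1 = ["agree", "agree", "neutral", "disagree", "agree"]
rater_2 = ["agree", "neutral", "neutral", "disagree", "agree"]

# Kappa near 1 indicates strong agreement beyond chance; near 0, agreement no better than chance.
print(cohen_kappa_score(rater_1, rater_2))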

It is important to note that reliability is a necessary but not sufficient condition of the test of
goodness of a measure. For example, one could measure a concept very reliably, establishing high
stability and consistency, but it may not be the concept that one had set out to measure. Validity
ensures the ability of a scale to measure the intended concept. We will now discuss the concept of
validity.

VALIDITY

We are now going to examine the validity of the measuring instrument itself. That is, when we ask a set
of questions (i.e., develop a measuring instrument) with the hope that we are tapping the concept, how
can we be reasonably certain that we are indeed measuring the concept we set out to measure and not
something else? This can be determined by applying certain validity tests. Several types of validity tests
are used to test the goodness of measures, and writers use different terms to denote them. For the sake of
clarity, we may group validity tests under three broad headings: content validity, criterion-related validity,
and construct validity.
Content Validity

Content validity ensures that the measure includes an adequate and representative set of items
that tap the concept. The more the scale items represent the domain or universe of the concept
being measured, the greater the content validity. To put it differently, content validity is a
function of how well the dimensions and elements of a concept have been delineated.

A panel of judges can attest to the content validity of the instrument. Kidder and Judd (1986) cite
the example where a test designed to measure degrees of speech impairment can be considered as
having validity if it is so evaluated by a group of expert judges (i.e., professional speech
therapists).

Face validity is considered by some as a basic and very minimum index of content validity.
Face validity indicates that the items that are intended to measure a concept do, on the face of it,
look like they measure the concept. Some researchers do not see fit to treat face validity as a
valid component of content validity.

Criterion-Related Validity

Criterion-related validity is established when the measure differentiates individuals on a criterion
it is expected to predict. This can be done by establishing concurrent validity or predictive
validity, as explained below.

Concurrent validity is established when the scale discriminates individuals who are known to be
different; that is, they should score differently on the instrument as in the example that follows.

If a measure of work ethic is developed and administered to a group of welfare recipients, the
scale should differentiate those who are enthusiastic about accepting a job and glad of an
opportunity to be off welfare, from those who would not want to work even when offered a job.
Obviously, those with high work ethic values would not want to be on welfare and would yearn
for employment to be on their own. Those who are low on work ethic values, on the other hand,
might exploit the opportunity to survive on welfare for as long as possible, deeming work to be a
drudgery. If both types of individuals have the same score on the work ethic scale, then the test
would not be a measure of work ethic, but of something else.

Predictive validity indicates the ability of the measuring instrument to differentiate among
individuals with reference to a future criterion. For example, if an aptitude or ability test
administered to employees at the time of recruitment is to differentiate individuals on the basis of
their future job performance, then those who score low on the test should be poor performers and
those with high scores good performers.
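Predictive validity of such an aptitude test could be checked by correlating test scores at recruitment with a later measure of job performance; the figures below are hypothetical:

import numpy as np
from scipy import stats

aptitude_at_hire = np.array([55, 62, 70, 48, 81, 66])          # hypothetical test scores at recruitment
performance_later = np.array([3.1, 3.4, 4.0, 2.8, 4.5, 3.6])   # hypothetical performance ratings later on

# A substantial positive correlation supports predictive validity.
r, p = stats.pearsonr(aptitude_at_hire, performance_later)
print(f"r = {r:.2f}, p = {p:.3f}")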
Construct Validity

Construct validity testifies to how well the results obtained from the use of the measure fit the
theories around which the test is designed. This is assessed through convergent and discriminant
validity, which are explained below.

Convergent validity is established when the scores obtained with two different instruments
measuring the same concept are highly correlated.

Discriminant validity is established when, based on theory, two variables are predicted to be
uncorrelated, and the scores obtained by measuring them are indeed empirically found to be so.

Validity can thus be established in different ways. Published measures for various concepts
usually report the kinds of validity that have been established for the instrument, so that the user
or reader can judge the "goodness" of the measure. Table 9.1 summarizes the kinds of validity
discussed here.

Some of the ways in which the above forms of validity can be established are through (1)
correlational analysis (as in the case of establishing concurrent and predictive validity or
convergent and discriminant validity), (2) factor analysis, a multivariate technique that would
confirm the dimensions of the concept that have been operationally defined, as well as indicate
which of the items are most appropriate for each dimension (establishing construct validity), and
(3) the multitrait-multimethod matrix of correlations derived from measuring concepts by
different forms and different methods, additionally establishing the robustness of the measure.
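As an illustration of the correlational approach in (1), convergent validity would show up as a high correlation between two instruments measuring the same concept, and discriminant validity as a near-zero correlation with a theoretically unrelated variable. The score vectors below are hypothetical:

import numpy as np

def corr(x, y):
    return float(np.corrcoef(x, y)[0, 1])

scale_a = np.array([10, 14, 18, 22, 9, 16])    # instrument A, concept X (hypothetical)
scale_b = np.array([11, 15, 17, 23, 10, 15])   # instrument B, same concept X (hypothetical)
unrelated = np.array([3, 9, 2, 7, 8, 4])       # theoretically unrelated variable (hypothetical)

print("convergent:", corr(scale_a, scale_b))      # expected to be high
print("discriminant:", corr(scale_a, unrelated))  # expected to be near zero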

In sum, the goodness of measures is established through the different kinds of validity and
reliability depicted in Figure 9.1. The results of any research can only be as good as the measures
that tap the concepts in the theoretical framework. We need to use well-validated and reliable
measures to ensure that our research is scientific. Fortunately, measures have been developed for
many important concepts in organizational research and their psychometric properties (i.e., the
reliability and validity) established by the developers. Thus, researchers can use the instruments
already reputed to be "good", rather than laboriously develop their own measures. When using
these measures, however, researchers should cite the source (i.e., the author and reference) so that
the reader can seek more information if necessary.

It is not unusual that two or more equally good measures are developed for the same concept. For
example, there are several different instruments for measuring the concept of job satisfaction.
One of the most frequently used scales for the purpose, however, is the Job Descriptive Index
(JDI) developed by Smith, Kendall, and Hulin (1969). When more than one scale exists for any
variable, it is preferable to use the measure that has better reliability and validity and is also more
frequently used.

At times, we may also have to adapt an established measure to suit the setting. For example, a
scale that is used to measure job performance, job characteristics, or job satisfaction in the
manufacturing industry may have to be modified slightly to suit a utility company or a health care
organization. The work environment in each case is different and the wordings in the instrument
may have to be suitably adapted. However, in doing this, we are tampering with an established
scale, and it would be advisable to test it for the adequacy of the validity and reliability afresh.

A sample of a few measures used to tap some frequently researched concepts in the management
and marketing areas is provided in the Appendix to this chapter.
Scenario A.

Product Ratings
                A      B          C
Respondent 1    Fair   Excellent  Poor
Respondent 2    Fair   Fair       Poor
Respondent 3    Fair   Excellent  Poor
Respondent 4    Poor   Excellent  Poor
Respondent 5    Fair   Excellent  Poor

Scenario B.

Product Ratings
                A          B          C
Respondent 1    Poor       Excellent  Excellent
Respondent 2    Excellent  Fair       Fair
Respondent 3    Fair       Poor       Fair
Respondent 4    Fair       Poor       Poor
Respondent 5    Poor       Excellent  Poor
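
The source does not explain what the two scenarios are meant to show. Assuming they contrast respondents who rate the products consistently (Scenario A) with respondents who disagree (Scenario B), one way to quantify the contrast is the average pairwise correlation between respondents' rating profiles, after coding Poor/Fair/Excellent as 1/2/3:

import numpy as np
from itertools import combinations

coding = {"Poor": 1, "Fair": 2, "Excellent": 3}

scenario_a = [["Fair", "Excellent", "Poor"],
              ["Fair", "Fair", "Poor"],
              ["Fair", "Excellent", "Poor"],
              ["Poor", "Excellent", "Poor"],
              ["Fair", "Excellent", "Poor"]]

scenario_b = [["Poor", "Excellent", "Excellent"],
              ["Excellent", "Fair", "Fair"],
              ["Fair", "Poor", "Fair"],
              ["Fair", "Poor", "Poor"],
              ["Poor", "Excellent", "Poor"]]

def mean_pairwise_correlation(ratings):
    # Convert the verbal ratings to numbers and correlate every pair of respondents.
    scores = np.array([[coding[r] for r in row] for row in ratings], dtype=float)
    pairs = [np.corrcoef(scores[i], scores[j])[0, 1]
             for i, j in combinations(range(len(scores)), 2)]
    return float(np.mean(pairs))

print("Scenario A:", mean_pairwise_correlation(scenario_a))   # close to 1: respondents largely agree
print("Scenario B:", mean_pairwise_correlation(scenario_b))   # much lower: respondents disagree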
