0% found this document useful (0 votes)

121 views10 pages

Hammersley Some Notes On Reliability

The document discusses differing definitions of validity and reliability found in methodological literature. It notes a lack of standardized terminology and discrepancies between definitions, with terms sometimes used to refer to different things. The author aims to analyze the conceptual issues and clarify meanings of validity and reliability in relation to the goals of measurement.

Uploaded by

APRIL JOY CABUYAO

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

121 views10 pages

Hammersley Some Notes On Reliability

Uploaded by

APRIL JOY CABUYAO

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 10

BERA

Some Notes on the Terms 'Validity' and 'Reliability'

Author(s): Martyn Hammersley
Source: British Educational Research Journal, Vol. 13, No. 1 (1987), pp. 73-81
Published by: Wiley on behalf of BERA
Stable URL: http://www.jstor.org/stable/1501231
Accessed: 14-02-2016 22:35 UTC

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at http://www.jstor.org/page/
info/about/policies/terms.jsp

JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of content
in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new forms of scholarship.
For more information about JSTOR, please contact [email protected].

Wiley and BERA are collaborating with JSTOR to digitize, preserve and extend access to British Educational Research Journal.

http://www.jstor.org

This content downloaded from 137.108.145.45 on Sun, 14 Feb 2016 22:35:59 UTC
All use subject to JSTOR Terms and Conditions
British
Educational
ResearchJournal,
Vol.13,No. 1, 1987 73

Some Noteson the Terms'Validity'

and 'Reliability'[1y

MARTYNHAMMERSLEY, SchoolofEducation,TheOpenUniversity

The problemof measurement is oftenaddressedby meansof the conceptsof

validityand reliablity. Some social scientistsare concernedto showthattheir
measurements are reliableand,less commonly, thattheyare valid.This is more
frequent in 'quantitative'thanin 'qualitative'
research,butthebasicissuesapplyto
both,as is increasingly beingrecognised byqualitativeresearchers(Dobbert,1982;
Evans,1983;Goetz& Le Compte,1984;Kirk& Miller,1985).
Thereis a largeliterature dealingwiththeconceptsof reliability and validity.
However,muchof it concernsthe techniquesby whichtheseproperties can be
measured.Ratherlessattention has beengivento theconceptualissuesinvolved.
And,in fact,whenonelooksat discussions ofreliability
andvalidity onefindsnota
clear set of definitions but a confusing diversityof ideas. Thereare substantial
divergencies amongdifferent authors'definitions,
and thereis evensomeoverlap
betweendefinitions ofthetwoconcepts.Theresultis thatitis oftenunclearwhatis
beingassertedwhenreliability and validityclaimsare made,and therefore it is
difficult
to assesstheirtruth.In thispaperI shallbe concerned almostexclusively
withthemeanings giventothosetermsandwithhowtheserelatetothegoalsofthe
measurement process.

Variations
in Definition
Here is a sampleof the definitionsof the terms'reliability'
and 'validity'to be
foundin themethodologicalliterature[2]:
(1) Reliability
is theagreement betweentwoefforts to measurethesame
traitthrough maximally similarmethods.Validityis represented in the
agreement betweentwo attemptsto measurethe same traitthrough
maximally differentmethods. (Campbell& Fiske,1967:277)
(2) The validityofa measuring instrument is defined
as theproperty ofa
measurethatallowstheresearcher to say thattheinstrument measures
whathe saysit measures.. . The reliabilityof a measuring instrumentis

This content downloaded from 137.108.145.45 on Sun, 14 Feb 2016 22:35:59 UTC
All use subject to JSTOR Terms and Conditions
74 M. Hammersley

oftheinstrument
definedas theability tomeasure thepheno-
consistently
menon tomeasure.
itis designed (Black& Champion,1976:222 and 234)
(3) Reliability
refersto thereproducibility of themeasurements. Can we
relyon ourownabilityto obtainverysimilardataagain;thatis,howgood
is our intra-observerreliability?
Otherobserversshouldalso be able to
replicateour measurements whichmeans,in part,thatwe shouldhave
goodinter-observer reliability.
Thisis,ofcourse,oftendifficult, sinceskill
in observationdevelopsthrough practice . . . Hollenbeck (1978)concluded
thatreliabilityconsistsof bothstability and accuracy.However,thisis
trueonlyof interobserver . . an observermaybe reliable
reliability. but
stillhave poor accuracyas long as precison (stability)is maintained.
Therefore, intra-observerreliabilityis solelya measureof stability (or
precision)whereasaccuracyaffectsvalidity... However,accuracywill
almostcertainly affectinter-observerreliability since fewobserversare
likelyto havethesamebiases.Anaccuracy criterion canbe established by
usingan 'expert'observer or theconsensusofseveralobservers. (Lehner,
1979: 130)
(is)theextentto whichrepetition
(4) Reliability ofthestudywouldresult
in thesamedataand conclusions. (Goode & Hatt,1952: 153)
(5) The goal of anyscientificmeasurement operationor procedure is to
arriveat thebestpossibleestimate ofthetruevalueofsomedimensional
qualityof a naturalphenomenon. To theextentthatthisgoalis achieved
it is saidthatthemeasurement is accurateorvalid.Accuracy orvalidity of
theresultstherefore becomestheyardstick forgauging thequalityofany
measurement procedure.For purposesof clarity, accuracy (or validity)
maybe definedas the extentto whichobtainedmeasuresapproximate
valuesof the'true'stateof nature.. . Reliabilityrefersto thecapacityof
theinstrument to yieldthesame measurement valuewhenbrought into
repeatedcontactwiththe same stateof nature.Thus,thismeaningof
reliabilityis concerned withthestability of measuredvaluesundercon-
stantconditions. (Johnston & Pennypacker, 1980: 190 and 191)
is theaccuracy
(6) Reliability orprecisionof a measuring instrument...
The commonest definition ofvalidityis epitomized by thequestion:Are
we measuring whatwe thinkwe are measuring? The emphasisin this
questionis on whatis being measured.For example,a teacherhas
constructed a testto measureunderstanding procedures
of scientific and
has includedon thetestonly factual
itemsaboutscientific procedures.The
testis notvalidbecause,whileit mayreliably measurethepupils'factual
knowledge procedures,
of scientific it does notmeasuretheirunderstand-
ingof suchprocedures. In otherwords,it maymeasurewhatit measures
quite well,but it does not measurewhat the teacherintendedit to
measure.(Kerlinger, 1964:430 and 444-5)
(7) A measureis reliableto theextentthattheaveragedifference between
two measurements independently obtainedin the same classroomis
smallerthantheaveragedifference betweentwomeasurements obtained
classrooms
in different . . . A measureis validto theextent thatdifferences
in scoresyieldedby it reflect actualdifferencesin behaviour notdiffer-

This content downloaded from 137.108.145.45 on Sun, 14 Feb 2016 22:35:59 UTC
All use subject to JSTOR Terms and Conditions
'Validity'
and'Reliability' 75
ences in impressions made on different observers.(Medley& Mitzel,
1963: 150)
One sourceof problems is thelackof a standardized terminology, so thatseveral
termsareusedto refer to eachofthedifferent aspectsofthemeasurement process.
Indeed,sometimes different authorsuse thesametermto refer to different things;
and eventhesame authormayuse a termto denotedifferent thingson different
occasions.An exampleis the term'measure'whichcan referto a measuring
instrument or to a particularmeasurement score.Forthepurposesofmyargument
hereI shalluse thefollowing termsand definitions:
measurement: theprocessbywhichan observer appliesan instrument to objects
in orderto gaugethepresence/magnitude ofa property [3];
property: thefeature ofobjectswhichis to be measured;
instrument:a proceduredevelopedto measurethe presence/magnitude of a
property in theobjects;
objects: the phenomena(people,lessons,tasks,etc.)whosepossessionof
theproperty is to be assessed;
scores: theresultsofthemeasurement process;
occasion: thetimeand placewheretheinstrument is appliedto producethe
scores;
observer: thepersonwhocarriesoutthemeasurement.
Usingthisterminology, let us look now at the majordiscrepancies amongthe
definitionsofvalidityand reliability citedabove.
(a) Arereliability and validityconcerned withall aspectsof a studyor do they
relateonlyto theprocessof measurement? Whilemostdefinitions takethelatter
position,someimplytheformer. For example,Goode & Hatt(1952: 153) define
reliability
as "theextentto whichrepetition of thestudywouldresultin thesame
data and conclusions".In otherwordstheyidentify it withreplication, and this
clearlyinvolvesmorethanmeasurement.
In thecase oftheterm'validity', thereis theproblem oftherelationship between
twotypologies: criterion,
predictive, concurrent, content,face,and construct valid-
ityon theone hand;internal, external, populationand ecologicalvalidityon the
other.The formerrefersto measurement, the latterto the whole processof
assessingthetruthof explanatory claims.In addition,thetermvalidityis some-
timesused to referto the assessmentof arguments in termsof whetherthey
conform to legitimatedeductive canons.
(b) Arevalidityand reliability properties ofinstruments, observers, orofparticu-
lar scores?Goode & Hatttreatreliability as a feature ofdataand conclusions. For
themostpart,though, reliability seemsto be viewedas a property ofinstruments
and/orobservers. Validityis sometimes ascribedto instruments (Black& Cham-
pion,Kerlinger, Medley& Mitzel),sometimes to observers (Lehner),sometimes to
scores(Johnston & Pennypacker).
(c) Arevalidityand reliability to be defined in termsoftherelationship between
scoresand variationin theproperty beingmeasured? (Call theserealistdefinitions).
Or are theyto be definedin termsoftherelationships amongscoresproducedby
thesameand/ordifferent instruments? (Call thesenominalist definitions)
[4]. Most
definitionsofvalidityarerealist, claiming, forexample,thatvalidity represents the
extentto whichan instrument measurestheproperty it is intendedto measure.
However, there are exceptions.For instance,"validityis representedin

This content downloaded from 137.108.145.45 on Sun, 14 Feb 2016 22:35:59 UTC
All use subject to JSTOR Terms and Conditions
76 M. Hammersley

theagreement betweentwoattempts to measurethesametraitthrough maximally

different methods"(Campbell& Fiske,^ 1967:277).
By contrast, mostdefinitions are nominalist,
of reliability referring to thescores
producedby repeatedefforts to measurethesameproperty by meansofthesame
instrument. Once again, though,thereare apparentexceptions.For example,
Kerlinger (1964: 430) claimsthatreliability is "the accuracyor precisionof a
measuring instrument", and Black& Champion(1976:234) define itas "theability
of theinstrument to measureconsistently thephenomenon it is designed tomea-
sure"(myemphasis).
(d) If we accept a realistdefinition of validity,are we concernedwiththe
relationship betweenthescoresand theproperty or thatbetween thescoresandthe
objects.In shortis it theproperty or theobjectswhichare beingmeasured? Most
definitions referto measurement of the property, but some takethe alternative
view. For instance,Johnston & Pennypacker (1980: 190) define'accuracy(or
validity)'as "the extentto whichobtainedmeasuresapproximate valuesof the
'true'stateofnature."Medley& Mitzel's(1963: 150)definition is another example:
"A measureis valid to the extentthatdifferences in scoresyieldedby it reflect
actualdifferences in behaviour.. ." Thisis probably justa terminological problem,
butit is a potential sourceofconfusion.
(e) If we accepta nominalist definition are we concerned
of reliability, withthe
relationship betweenscoresproducedbythesameinstrument appliedto thesame
objecton thesame occasionby different observers, or thoseproducedby similar
instruments appliedto differentobjectson different occasionsby thesameobser-
ver,or to some otherset of permutations of observer,instrument, objectand
occasion?
Atone extreme is Lehner's(1979: 130)definition:
Reliability Can we rely
ofthemeasurements.
to thereproducibility
refers
on ourownabilityto obtainverysimilardataagain.. .? Otherobservers
ourmeasurements.
shouldalso be ableto replicate
Hereit seemsthatreliability relatesto scoresproducedby anyobserver on any
occasionusinganyinstrument to measuresomeset of objects.Medley& Mitzel
includevariations
(1963: 150)explicitly in themagnitude ofa property in an object
betweenoccasionsas a sourceof unreliability. By contrast, Johnston & Penny-
packer(1980: 191) adopta rathermorerestricted definition:"reliabilityrefers
to
thecapacityoftheinstrument to yieldthesamemeasurement valuewhenbrought
intorepeatedcontactwiththesamestateof nature".Mostdefinitions are unclear
aboutwhatpermutations of instrument, observer,objectsand occasionproduce
scores.And,indeed,Campbell& Fiske(following
reliability Thurstone, 1939)have
arguedthatvariationin thesecomponents producesa continuum fromreliablityto
validity.Even if this is so, however,it is important to get clearwhatis being
measured, howand why.
Thereare someimportant variations betweenauthorsin thewaytheyuse the
and 'validity',
terms'reliability' then.Moreover,sometimes thesameauthorwill
movebetweendifferent definitionsof thesetermswithout warning. For example,
despitethe nominalistic definitionof validityquoted above, laterin the same
articleCampbell& Fiske(1967) implicitly discussing
adopta realistdefinition, the
differentinterpretations thatcan be made of a failureto findany correlation
betweenthe scoresproducedby two methodsintendedto measurethe same

This content downloaded from 137.108.145.45 on Sun, 14 Feb 2016 22:35:59 UTC
All use subject to JSTOR Terms and Conditions
'Validity'
and'Reliability' 77
property. If theywereto be consistent withtheirnominalist definition,
no such
problemof interpretation wouldarise,theconclusion wouldbe thatthevalidityof
thescoresis lowor zero.In a similarway,Kerlinger (1964) movesthrough various
definitionsof reliability
withoutaddressing the issueof therelationships among
them.
Anothercommonpracticeis to conflatedefinitions of reliability
in termsof
consistency ofscoreswithdefinitionsin termsofrandomerror:
Reliability
concerns theextentto whichmeasurements areconsistent
and
repeatable.Thus,a highlyreliablemeasureis one thatdoes notfluctuate
greatlybecauseofrandomerror.(Zeller& Carmines,1980: 17)
We havetwodefinitions ofreliability
herewhichdo notmatchone another. While
randomerrorwillproduceinconsistency in scores,so willcertainkindsofsystema-
tic error.For example,wherescoresproducedby two observers are affectedby
biaseswhichoperatein oppositedirections inconsistencies betweenthescoresof
theobservers forthesameobjectswillresult.

AnAttempt
at Clarification
I have triedto showthatthereis some inconsistency in theusageof theterms
'reliability'
and 'validity'.At theriskof addingto theconfusion, I wantto tryto
clarify theconceptsunderlying theseterms.
It is important to beginbymakinga cleardistinction betweengoalsand means,
betweenwhatit is aboutthemeasurement processwe are tryingto assessand the
strategies we use to assessit. Onlywhenwe are clearaboutwhatit is we wantto
assesscan we deviseeffective strategiesforachieving
that.
Ourprimary concernin measurement mustsurelybe whether thesetofscoreswe
haveproducedaccurately reflectsthepresence/magnitude ofthetarget propertyin
theobjectswe havemeasured.Thisis whatmostwriters seemto meanbyvalidity
[5]. Thereare a numberof typesof threatto measurement validity,but we can
distinguish two main sources.If we thinkof measurement as involving, at its
simplest, a relationshipbetweena variablewhichis notdirectly observable andone
thatis, theremay be inaccuraciesin the recording of scoresof the observable
variable(we mightreferto thisas theproblemof 'accuracy')and theremaybe
errors arisingfromimperfect correlation betweentheobservedand theunobserved
variables(thisis oftenreferred to as theproblemof'constructvalidity').
However,validityis not our onlygoal. We are also ofteninterested in the
precision withwhichany particular scorecapturesthe magnitudeof the target
property in an object.Precisionconcernsthedelicacyof themeasurement scale
employed.We can measurethe lengthof a largeobject in termsof metres,
centimetres or evenmillimetres. In thatorderthesescalesrepresent an increasing
degreeof precision. Notethatthisis independent oftheaccuracy of themeasure-
ment.On thisusagea scoremaybe veryprecisebuthighly inaccurate. How precise
we wantour measurement to be willdependupon our purposes,but it willalso
dependupon the level of validitywhichcan be obtainedat different levelsof
precision. Otherthings beingequal,themoreprecisethescale,themoredifficult it
is to achievehighlevelsofvalidity.And,indeed,thereis oftena temptation to be
moreprecisethanthe level of validitywithwhichan objectcan be measured
justifies.

This content downloaded from 137.108.145.45 on Sun, 14 Feb 2016 22:35:59 UTC
All use subject to JSTOR Terms and Conditions
78 M. Hammersley

a thirdgoal?It maybe,butonlyifdefined
Is reliability in realistterms.Achieving
consistency ofscoresacrossoccasionsis ofno valuein itself, itonlyhasvalueas an
indicatorof validity.If, on the otherhand,we treatreliability as a property of
instruments notof scores,and defineit as theabilityofan instrument consistently
to producevalidscores,thenassessing ofinstruments
thereliability anddeveloping
reliableinstruments are clearlyimportant goals.On thisdefinition we can havea
scoreofhighvaliditywithout theinstrument whichproduceditbeingreliable, but
we cannothavea reliableinstrument producing invalidscores[6].However, itmay
be difficultto knowthatwe have a scoreofhighvaliditywithout also findingout
whether we have a reliableinstrument sincethesame strategies are involvedin
assessing bothvalidityand reliability,on thedefinitions usedhere.
Validityand appropriate precisionof scores,and reliabilityof instruments, are
ourgoals,then.Butofcoursethecentralproblemin measurement is thatgenerally
we haveno directaccesstotheproperty we aretrying to measure, andthuswehave
no straightforward meansofassessing thevalidity ofanyparticular score.Ifwedid
have directaccess we wouldpresumably have no need of anymeasuring instru-
ment.In assessingthevalidityof scoresand thereliability ofinstruments we have
to relyupon comparisons of the scoresproducedunderdifferent circumstances,
circumstances systematically variedin orderto assesstheeffects ofdifferent types
of threatto measurement validity.To theextentthatscoresare consistent across
thesedifferent circumstances, we can haveincreasedconfidence thattheyarevalid
and thattheinstrument is reliable.

TypesandSourcesofError
random,
between
& Costner's(1977:24-6) distinction
Mueller,Schuessler constant
erroris moreuseful,I believe,thanthe morecommontwofold
and correlated
betweenrandomand systematic
distinction error,sincethetwokindsofsystematic
[7]. Herearetheauthors'definitions:
characteristics
errorhavedifferent

Randomerrors: of
Randomerrorsbehaveas iftheamountand direction
by drawingsignednumbers
errorweredetermined froma hat,withone
andthe
inthehatbeingpositiveandonehalfnegative
halfofthenumbers
beingzero.(p. 24)
averageofthenumbers

Constanterrors: It is as iftheerrorweredeterminedbydrawingnumbers
froma hat, but the averageof the numbersin the hat is not zero;
consequentlyeach scoreis inflated bythesameamounton
(or deflated)
theaverage.(p. 25)

errors:
Correlated Correlatederrorbehavesas iftheerrorweredetermined
by drawingnumbersfromhats,but a different different
hat(containing
numbers)was used formalesand females,or forrichand poor,or for
(p. 25)
groupings.
otherdifferentiated

typesoferror,
sourcesof errorare likelyto lead to different
Different thoughas
yetweknowtoolittletobe abletotieparticular typeswithany
sourcestoparticular
We can onlymakesuggestions
certainty. as to likelylinks.For example.

This content downloaded from 137.108.145.45 on Sun, 14 Feb 2016 22:35:59 UTC
All use subject to JSTOR Terms and Conditions
'Validity'
and'Reliability' 79

Sourcesoferror Probabletypesoferror
Observer observation
and coding random,constant
or
inaccuracies correlated
calculation
mistakes randomor constant
interpretational
bias constant
or correlated
Instrument contaminationofscoresby
factors
otherthantheproperty
beingmeasured random,constant
or
correlated
Comparisonsof scoresproducedunderdifferentcircumstances
mayallowus to
assesstheeffects
ofdifferentsourcesand typesoferror:

Observer By comparing scoresforthesame objectsproducedby thesame

observer usingthesameinstrument on differentoccasionswe can
make some estimateof the level of errorderivingfromintra-
observervariation, so longas we can assumethattheproperty is
stable in the object across occasions(or at least betweenthe
occasionson whichmeasurement occurred) andthattheobserver's
secondscoreis independent oftheearlierone.
By comparing scoresforthesame objectsproducedby different
observers usingthesameinstrument on thesameoccasionwe can
assess the level of inter-observer variation,so long as we can
assumethatall observers used theinstrument in an appropriate
way,and thattheirscoreswereproducedindependently.
Instrument By comparing thescoresforthesameobjectsproducedbytwoor
moreinstruments intended tomeasure thesameproperty usedbythe
same observeron different occasions,we can estimatetheerror
arisingfromthe instrument, so longas we can assumethatthe
property is stableacrossoccasionsand thatintra-observer erroris
low.
By comparing thescoresforthesameobjectsproducedbytwoor
moreinstruments intendedto measurethesameproperty usedby
differentobservers on thesameoccasion,wecanestimate theerror
arisingfromtheinstrument so longas we can assumethatinter-
observer erroris low[8].
I shallnot discussthestatistical techniquesavailableto estimatethedifferent
typesoferroron thebasisofscoresfromcomparisons oftheseand otherkinds.A
numberof different approachesare available,thoughnone is unproblematic
(Tryon,1957;Lord & Novick,1968;Heise & Bohrnstedt, 1970;Cronbachet al.,
1972;Zeller& Carmines,1980).And thereare otherkindsofcomparison which
can provideevidenceaboutthevalidityof a set of scoresor thereliability ofan
instrument. For example,we mightinvestigate the extentto whichthe scores
correlatewith scoresproducedby anotherinstrument designedto measurea
variablewhichis knownto correlatestrongly withthevariablewe are trying to
measure(Cronbach& Meehl,1955).Again,ifwehavegoodreasontobelievethata
particularset of objectshas a substantially higherlevel of the property we are

This content downloaded from 137.108.145.45 on Sun, 14 Feb 2016 22:35:59 UTC
All use subject to JSTOR Terms and Conditions
80 M. Hammersley

by
seekingto measurethananothersetofobjects,thenwe can testourinstrument
measuringthesetwo sets of objectsto discoverwhetheror not the expected
is to be found.
difference

Conclusion
My concernin thispaperhas beenwiththeconceptual issuesinvolvedin defining
I have proposeddefinitions
validityand reliability. of validity,precisionand
as goalsofthemeasurement
reliability process.Theseareto be distinguished from
the strategieswhichwe use to achievethem,whichinvolvethe comparisonof
scoresproducedunderdifferent circumstances.These comparisons allow us to
assesstheeffects typesand sourcesoferror,
ofdifferent and theyprovideus witha
Considerable
basisforassessingbothvalidityand reliability. workis stillrequired
in developing and applying However,a prerequisite
thesestrategies. foreffective
workin thisarea,it seemsto me,is to be clearaboutwhatit is we are aimingto
achieve.I havetriedto showthatat presentourusageofconceptslikevalidity and
and thispaperhas been directedtowardsa
is vagueand inconsistent,
reliability
ofthesemeasurement
clarification goals,and theirrelation designed
to strategies to
assessthem.

Correspondence:
M. Hammersley, School of Education,The Open University,
WaltonHall,MiltonKeynes,BucksMK7 6AA,England.

NOTES
[1] I am obliged to John Scarth,Donald MacKinnon,BarryCooper and JohnBynnerfor
comments on earlierdraftsofthisarticle.The errorsare ofcoursemine.
[2] This is a haphazardsample,butit does illustrate therangeofvariationin usage.
[3] I put on one side the questionof whetherit is legitimateto talk of classification as
measurement.
[4] The terms'realist'and 'nominalist'are used in a varietyof waysby philosophers. I use the
termsheresimplyas shorthand.
[S] We probablyneed to use some adjectivelike 'measurement' validityhereto
or 'descriptive'
distinguish whatwe are referring to fromlogicalvalidityand frominternalvalidity.
[6] Incorrect use ofa reliableinstrument wouldproduceinvalidscores,butitis betterto treatthis
as use ofa different instrument.
[7] It is also important to recognise,as Cronbachet al., 1972 emphasise,thatwhatis systematic
errorgivenone focusmay be variationin the targetproperty fromanotherpointof view.
Identification ofsystematic erroris relativeto thepropertybeingmeasured.
[8] Thesevariouscomparisons are ofcourseproducedsimplybycombining conventionalreliabil-
ity and validitychecks.There are additionalpossibilitiesin researchemployingtestsor
inventories, suchas theuse ofthesplithalftechniqueor Cronbach'scoefficient alpha.

REFERENCES
BLACK,J.A. & CHAMPION, D. J.(1976) Methodsand Issuesin Social Research(New York,Wiley).
CAMPBELL,D. T. & FISKE,D. W. (1967) Convergent and discriminant validationbythemultitrait-
multimethod matrix,in: W. A. MEHRENS & R. L. EBEL(Eds) Principlesof Educationaland
Psychological Measurement (Chicago,Rand McNally).
CRONBACH, L. J. & MEEHL,P. E. ( 1955) Constructvalidityin psychological tests,Psychological
Bulletin,52, pp. 281-302.
CRONBACH, ofBehavioural
L. J.et al. (1972) TheDependability Measurements (New York,Wiley).

This content downloaded from 137.108.145.45 on Sun, 14 Feb 2016 22:35:59 UTC
All use subject to JSTOR Terms and Conditions
'Validity'
and'Reliability' 81

DOBBERT, M. (1982) Ethnograpic Research:theory and application formodernschoolsand society

(New York,Praeger).
EVANS, J.(1983) Criteriaofvalidityin socialresearch: exploringtherelationship betweenethnogra-
phic and quantitativeapproaches,in: M. HAMMERSLEY (Ed.) The Ethnography of Schooling
(Driffield,Nafferton).
GOETZ,J. & LECOMPTE, M. (1984) Ethnography and QualitativeDesignin EducationalResearch
(New York,AcademicPress).
GOODE,W. & HATT,P. K. (1952) Methodsin Social Research(New York,McGrawHill).
HEISE,D. & BOHRNSTEDT, G. (1970) Validity,invalidityand reliability, in: E. BORGATTA & G.
BOHRNSTEDT (Eds) SociologicalMethodology (San Francisco,Jossey-Bass).
HOLLENBECK, A. R. (1978) Problemsofreliabilityin observational research,in: G. P. SACKETT (Ed.)
Observing Behaviour,Vol.2, Data Collectionand AnalysisMethods(Baltimore, University Park
Press).
JOHNSTON, J. M. & PENNYPACKER, H. S. (1980) Strategiesand Tacticsof Human Behavioural
Research(Hillsdale,New Jersey, Erlbaum).
KERLINGER, F. (1964) Foundations ofBehaviouralResearch(NewYork,Holt,Rinehart & Winston).
KIRK,J. & MILLER,M. L. (1985) Reliability and Validityin QualitativeResearch(BeverleyHills,
Sage).
LEHNER, P. N. (1979) HandbookofEthological Methods(New York,GarlandSTPM Press).
LORD,F. M. & NOVICK, M. R. (1968) StatisticalTheoriesofMentalTestScores(Reading,Mass.,
Addison-Wesley).
MEDLEY,D. M. & MITZEL,H. E. (1963) Measuring classroombehaviourbysystematic observation,
in: N. L. GAGE(Ed.) HandbookofResearchon Teaching(Chicago,Rand McNally).
MUELLER, J.H., SCHUESSLER, K. F. & COSTNER, H. L. (1977) Statistical
ReasoninginSociology, 3rd
edn (Boston,HoughtonMiflin).
THURSTONE, L. L. (1937) TheReliabilityand Validity of Tests(AnnArbor,Edwards).
TRYON,R. C. (1957) Reliabilityand behaviourdomainvalidity,Psychological Bulletin,54, pp.
229-49.
ZELLER, R. A. & CARMINES, E. G. (1980) Measurement in theSocial Sciences(London,Cambridge
University Press).

This content downloaded from 137.108.145.45 on Sun, 14 Feb 2016 22:35:59 UTC
All use subject to JSTOR Terms and Conditions

Introduction To Validity and Reliability
No ratings yet
Introduction To Validity and Reliability
6 pages
Rasch Measurement Theory: A Complete Course
From Everand
Rasch Measurement Theory: A Complete Course
Tejas Thakur
No ratings yet
Allama Iqbal Open University Islamabad: USER ID:0000901747 COURSE CODE: 8602-2 Semester: Autumn 2024
No ratings yet
Allama Iqbal Open University Islamabad: USER ID:0000901747 COURSE CODE: 8602-2 Semester: Autumn 2024
56 pages
COM351 Week 7 Establishing The Validity and Reliability of A Research Instrument
No ratings yet
COM351 Week 7 Establishing The Validity and Reliability of A Research Instrument
14 pages
Reliability & Validity
No ratings yet
Reliability & Validity
18 pages
The Effects of Classroom Environment in Engagement and Academic Performance of Grade 12 Abm Students at Notre Dame of Trece MARTIREZ S.Y. 2023-2024
No ratings yet
The Effects of Classroom Environment in Engagement and Academic Performance of Grade 12 Abm Students at Notre Dame of Trece MARTIREZ S.Y. 2023-2024
25 pages
Food Traceability PPT R-1.
No ratings yet
Food Traceability PPT R-1.
26 pages
4 Measurement Validity
No ratings yet
4 Measurement Validity
17 pages
VALIDITY and Reliability
No ratings yet
VALIDITY and Reliability
23 pages
Contoh Soal Narrative Text Kelas 9 Beserta Jawabannya, Pilihan Ganda Dan Essay
0% (1)
Contoh Soal Narrative Text Kelas 9 Beserta Jawabannya, Pilihan Ganda Dan Essay
36 pages
8602 (2nd Assignment) PDF
No ratings yet
8602 (2nd Assignment) PDF
24 pages
Tour Commentary 1
100% (2)
Tour Commentary 1
22 pages
50 Studies Every Plastic Surgeon Should Know Full Download
No ratings yet
50 Studies Every Plastic Surgeon Should Know Full Download
411 pages
Reliability & Validity
No ratings yet
Reliability & Validity
12 pages
Qualitative and Quantitative Measurement - Part 3 1 17062024 025643am
No ratings yet
Qualitative and Quantitative Measurement - Part 3 1 17062024 025643am
29 pages
L9 Qualities of A Good Measuring Instrument
No ratings yet
L9 Qualities of A Good Measuring Instrument
22 pages
Research Chapter 05 HMM
No ratings yet
Research Chapter 05 HMM
32 pages
Chapter 5
No ratings yet
Chapter 5
20 pages
Adobe Scan 26-Jun-2021
No ratings yet
Adobe Scan 26-Jun-2021
18 pages
Unit 2
No ratings yet
Unit 2
26 pages
Chapter 3 Answer Cost Accounting
100% (1)
Chapter 3 Answer Cost Accounting
17 pages
A Process of Conducting Research
From Everand
A Process of Conducting Research
Prof Gideon C. Mwanza
No ratings yet
Science Week 5
No ratings yet
Science Week 5
6 pages
Reliability and Validity Mha1
No ratings yet
Reliability and Validity Mha1
13 pages
Validity and Reliability
No ratings yet
Validity and Reliability
29 pages
Meai 21
No ratings yet
Meai 21
11 pages
Relibility and Validity of Instruments 31102024 085817am
No ratings yet
Relibility and Validity of Instruments 31102024 085817am
14 pages
Validity and Reliability Lesson 3.
No ratings yet
Validity and Reliability Lesson 3.
48 pages
Concept of Reliability and Validity
100% (1)
Concept of Reliability and Validity
6 pages
Pilot Study Validity Ans Reliability
No ratings yet
Pilot Study Validity Ans Reliability
23 pages
Qualities of Good Measuring Instruments
56% (9)
Qualities of Good Measuring Instruments
4 pages
Validity and Reliability
100% (1)
Validity and Reliability
6 pages
66cee8ee676c720018ba7acb - ## - Research Aptitude 02 - Daily Classnotes
No ratings yet
66cee8ee676c720018ba7acb - ## - Research Aptitude 02 - Daily Classnotes
13 pages
RM Lesson 7
No ratings yet
RM Lesson 7
13 pages
Validity Reliability
No ratings yet
Validity Reliability
25 pages
Chapter 4 Assessment & Evaluation
No ratings yet
Chapter 4 Assessment & Evaluation
10 pages
Characteristics Good Measuring Tool
No ratings yet
Characteristics Good Measuring Tool
4 pages
Chp. 4 - Hazards Identification: Faculty of Chemical Engineering Universiti Teknologi MARA
No ratings yet
Chp. 4 - Hazards Identification: Faculty of Chemical Engineering Universiti Teknologi MARA
39 pages
Validity and Reliability
No ratings yet
Validity and Reliability
6 pages
Reliability & Validity: Dr. Nitu Singh Sisodia
No ratings yet
Reliability & Validity: Dr. Nitu Singh Sisodia
20 pages
Topic 3 Characteristics and Principles of Assessment
100% (1)
Topic 3 Characteristics and Principles of Assessment
45 pages
Judgnomics: Applying Measurements to Certainty
From Everand
Judgnomics: Applying Measurements to Certainty
Robert Burbank
No ratings yet
Validity and Reliability in Education
No ratings yet
Validity and Reliability in Education
5 pages
An Application-Based Discussion of Construct Validity and Internal Consistency Reliability
No ratings yet
An Application-Based Discussion of Construct Validity and Internal Consistency Reliability
25 pages
Unit 2 Reliability and Validity (External and Internal) : Structure
No ratings yet
Unit 2 Reliability and Validity (External and Internal) : Structure
19 pages
Validity and Reliability
No ratings yet
Validity and Reliability
3 pages
Reliability and Validity in Research
No ratings yet
Reliability and Validity in Research
5 pages
Cri 228 Environmental Law
No ratings yet
Cri 228 Environmental Law
54 pages
8602 Assignment 2
No ratings yet
8602 Assignment 2
21 pages
Claremont COURIER 6-16-17
No ratings yet
Claremont COURIER 6-16-17
27 pages
234 Solution
No ratings yet
234 Solution
89 pages
Validity Transferability Quanti: Quali
No ratings yet
Validity Transferability Quanti: Quali
15 pages
Determinationof Reliabilityand Validitymeasuresofaquestionnaire
No ratings yet
Determinationof Reliabilityand Validitymeasuresofaquestionnaire
11 pages
Reliability and Validity of Research Instruments: Correspondence To
No ratings yet
Reliability and Validity of Research Instruments: Correspondence To
19 pages
Validity and Reliability
No ratings yet
Validity and Reliability
6 pages
Syllabus - Technology For Teaching and Learning 1
No ratings yet
Syllabus - Technology For Teaching and Learning 1
9 pages
RMM Lecture 17 Criteria For Good Measurement 2006
No ratings yet
RMM Lecture 17 Criteria For Good Measurement 2006
31 pages
What Is Questionnaire?
No ratings yet
What Is Questionnaire?
4 pages
Appsc
No ratings yet
Appsc
19 pages
Valadity and Reliability
100% (1)
Valadity and Reliability
12 pages
Crimpro Syllabus
No ratings yet
Crimpro Syllabus
10 pages
123 Easy Essay
100% (2)
123 Easy Essay
4 pages
Reliability & Validity
No ratings yet
Reliability & Validity
9 pages
Reliability and Validity
100% (1)
Reliability and Validity
9 pages
Reliability and Validity of Research
100% (1)
Reliability and Validity of Research
7 pages
05 - Designing Valid Communication Research PDF
No ratings yet
05 - Designing Valid Communication Research PDF
9 pages
Reliability and Validity in Research
100% (2)
Reliability and Validity in Research
2 pages
QRM 19th Topic
No ratings yet
QRM 19th Topic
8 pages
@CryptoCred - Trade Management
No ratings yet
@CryptoCred - Trade Management
15 pages
Adapting Hermetically Sealed Compressor Technology To Deal With Sour and Corrosive Gases Marcel Buse Mark Van Aarsen
No ratings yet
Adapting Hermetically Sealed Compressor Technology To Deal With Sour and Corrosive Gases Marcel Buse Mark Van Aarsen
23 pages
Hedging in Academic Writing
No ratings yet
Hedging in Academic Writing
6 pages
International Handbook of Inquiry and Learning Ravit Golan Duncan and Clark A Chinn PDF Download
No ratings yet
International Handbook of Inquiry and Learning Ravit Golan Duncan and Clark A Chinn PDF Download
81 pages
Validity
No ratings yet
Validity
13 pages
PD 9175 Chain Saw Act of 2002 Report
No ratings yet
PD 9175 Chain Saw Act of 2002 Report
14 pages
Validity & Reliability
No ratings yet
Validity & Reliability
27 pages
Validity, Reability, and Qualitative Research
No ratings yet
Validity, Reability, and Qualitative Research
6 pages
Is Reliability Validity
No ratings yet
Is Reliability Validity
17 pages
601 4 Research Reliability & Validity
No ratings yet
601 4 Research Reliability & Validity
13 pages
1098 Programming Manual Pulsar Altair PDF
No ratings yet
1098 Programming Manual Pulsar Altair PDF
136 pages
CRI 300 Research
No ratings yet
CRI 300 Research
105 pages
My Brothers Famous Bottom Takes Off Strong Jeremy Instant Download
No ratings yet
My Brothers Famous Bottom Takes Off Strong Jeremy Instant Download
26 pages
Lesson 8
No ratings yet
Lesson 8
1 page
Validity
No ratings yet
Validity
4 pages
References
No ratings yet
References
13 pages
Slope Stability Assessment of The Letlhakane (DK1) Mine
No ratings yet
Slope Stability Assessment of The Letlhakane (DK1) Mine
10 pages
Detailed Lesson Plan 5
No ratings yet
Detailed Lesson Plan 5
7 pages
Concrete Slump Test or Slump Cone Test
No ratings yet
Concrete Slump Test or Slump Cone Test
4 pages
AV LÍNGUA INGLESA ASPECTOS MORFOSSINTÁTICOS - Prova 2 - AGOSTO
No ratings yet
AV LÍNGUA INGLESA ASPECTOS MORFOSSINTÁTICOS - Prova 2 - AGOSTO
5 pages
Cri417-Lesson 24
No ratings yet
Cri417-Lesson 24
7 pages
Baraza Mwingi Road Land
No ratings yet
Baraza Mwingi Road Land
3 pages
Cri417-Lesson 23
No ratings yet
Cri417-Lesson 23
4 pages
Golden Rules
No ratings yet
Golden Rules
1 page
Quadratic Oscillations
No ratings yet
Quadratic Oscillations
10 pages
2017-18 Microwave Engineering 6th Sem
No ratings yet
2017-18 Microwave Engineering 6th Sem
2 pages
Project Probability
No ratings yet
Project Probability
6 pages
Animasjn
No ratings yet
Animasjn
7 pages
Four Quadrant Speed Control of DC Motor
No ratings yet
Four Quadrant Speed Control of DC Motor
7 pages
Online Assessment Sample Maria
No ratings yet
Online Assessment Sample Maria
3 pages
2
No ratings yet
2
1 page
Best Teacher Award 1
No ratings yet
Best Teacher Award 1
1 page

Hammersley Some Notes On Reliability

Uploaded by

Hammersley Some Notes On Reliability

Uploaded by

BERA

Some Notes on the Terms 'Validity' and 'Reliability'

Some Noteson the Terms'Validity'

The problemof measurement is oftenaddressedby meansof the conceptsof

theagreement betweentwoattempts to measurethesametraitthrough maximally

Observer By comparing scoresforthesame objectsproducedby thesame

DOBBERT, M. (1982) Ethnograpic Research:theory and application formodernschoolsand society

You might also like