W2 - Reliability in ESL Research
W2 - Reliability in ESL Research
Research:
and
proposal
trustworthy
and replicable
Reliability
RELIABILITY
IN ESL
RESEARCH
I n s t r u c t o r : D r. N g u y e n H u u
Cuong
Presenter: Le Do Ngoc Hang
content 1
.
Definition of Reliability
2. "Reliability in ESL research is the extent to which the data collection methods and
instruments used produce consistent, dependable, and replicable results."
- Citation: Dörnyei, Z. (2007). Research methods in applied linguistics: Quantitative,
qualitative, and mixed methodologies. Oxford University Press.
3. "Reliability in ESL research entails the extent to which assessments, tests, or research
procedures yield accurate and consistent results, unaffected by measurement error or
external factors."
- Citation: Brown, J. D. (2004). Research methods in applied linguistics: A practical
resource. Cambridge University Press.
DEFINITION OF
RELIABILITY
4. "Reliability in ESL research refers to the stability and consistency of measurement or
assessment outcomes, ensuring that the results are not influenced by random errors but
reflect true performance or characteristics."
- Citation: Shavelson, R. J., & Webb, N. M. (1991). Generalizability theory: A primer. Sage
Publications.
CONSISTEN
CY
TRUE SCORE THEORY -
Psychometrics
Systematic Error is
caused by any factors that
systematically affect
measurement of the
variable across the sample.
Random error
• Variability in Participant Responses
• Measurement Instrument Fluctuations
• Sampling Variability
Systematic error
• Bias in Test Items
• Rater Bias
• Measurement Instrument Biases
Reliability Coefficient
The reliability of a test or research instrument is commonly
expressed as a value between 0 and 1.
How is it assessed? By checking the consistency of results By checking how well the results
across time, across different observers, correspond to established
and across parts of the test itself. theories and other measures of
the same concept.
How do they relate? A reliable measurement is not always A valid measurement is generally
valid: the results might be reproducible, reliable: if a test produces accurate
but they’re not necessarily correct. results, they should be
reproducible.
RELIABILITY VS VALIDITY
https://conjointly.com/kb/reliability-and-validity/
Types of What does it assess? Example
reliability
Test-retest reli A group of participants complete a
recap
ability
and final
The consistency of a advice questionnaire designed to measure
personality traits. If they repeat the
measure across time questionnaire days, weeks or months
apart and give the same answers,
this indicates high test-retest
reliability.
Interrater relia Based on an assessment criteria checklist,
bility five examiners submit substantially
The consistency of a different results for the same student
measure across raters project. This indicates that the assessment
or observers checklist has low inter-rater reliability (for
example, because the criteria are too
subjective).
Internal consis You design a questionnaire to measure self-
tency
esteem. If you randomly split the results
The consistency of the into two halves, there should be a
measurement itself strong correlation between the two sets of
results. If the two results are very different,
this indicates low internal consistency.
Types of What does it assess? Example
reliability
Test-retest reli The consistency of a A group of participants complete a
recap
ability
and
measure final
across time:
do you get the same
advice questionnaire designed to measure
personality traits. If they repeat the
questionnaire days, weeks or months apart
results when you repeat
and give the same answers, this indicates
the measurement? high test-retest reliability.
Conclusion If reliability and validity were a big problem for your findings , it
might be helpful to mention this here.
An example
Adams et al. (2011) discussed their scoring and
coding procedure as follows: ‘The oral tests were
scored by two of the researchers; the few
discrepancies were discussed until 100 percent
agreement was reached. The written tests were
scored by an independent rater and then the scores
were reviewed by two of the researchers. Interrater
reliability was calculated to be 98 percent.’
Challenges in esl research
Language Variability: Diverse linguistic
backgrounds among participants.
Cultural Diff erences: Varied cultural
interpretations of language constructs.
Contextual Factors: Influence of contextual
factors on language usage and understanding.
Reliability
100%