W2 - Reliability in ESL Research

RELIABILITY IN ESL RESEARCH
Validity and Reliability: making a research proposal trustworthy and replicable
Instructor: Dr. Nguyen Huu Cuong
Presenter: Le Do Ngoc Hang
Contents
1. Definition of Reliability
2. True Score Theory
3. Measurement Error
4. Reliability vs Validity
5. Types of Reliability
6. Ensuring Reliability
DEFINITION OF RELIABILITY
1. "Reliability refers to the consistency or stability of measurement results or research
findings over time and across different conditions or settings."
- Citation: Fraenkel, J. R., Wallen, N. E., & Hyun, H. H. (2012). How to design and evaluate
research in education (8th ed.). McGraw-Hill.

2. "Reliability in ESL research is the extent to which the data collection methods and
instruments used produce consistent, dependable, and replicable results."
- Citation: Dörnyei, Z. (2007). Research methods in applied linguistics: Quantitative,
qualitative, and mixed methodologies. Oxford University Press.

3. "Reliability in ESL research entails the extent to which assessments, tests, or research
procedures yield accurate and consistent results, unaffected by measurement error or
external factors."
- Citation: Brown, J. D. (2004). Research methods in applied linguistics: A practical
resource. Cambridge University Press.
4. "Reliability in ESL research refers to the stability and consistency of measurement or
assessment outcomes, ensuring that the results are not influenced by random errors but
reflect true performance or characteristics."
- Citation: Shavelson, R. J., & Webb, N. M. (1991). Generalizability theory: A primer. Sage
Publications.

5. "Reliability in ESL research is the extent to which a measurement or assessment tool provides consistent results when applied to different groups or individuals under similar conditions."
- Citation: Dörnyei, Z. (2007). Research methods in applied linguistics: Quantitative, qualitative, and mixed methodologies. Oxford University Press.

6. "Reliability in ESL research refers to the degree of consistency and dependability of research procedures and findings, minimizing the influence of measurement errors or fluctuations in data collection."
- Citation: Mackey, A., & Gass, S. M. (2012). Research methods in second language acquisition: A practical guide. Wiley-Blackwell.
Generally, reliability can be understood as the consistency or repeatability of observations of behaviours, performance and/or psychological attributes.

CONSISTENCY
TRUE SCORE THEORY (Psychometrics)

True score theory (Classical Test Theory) is a theory about measurement.
1. Every measurement has an error component.
2. True score theory is the foundation of reliability.
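The core relation of true score theory can be written out explicitly (a standard Classical Test Theory formulation; the symbols are the conventional ones, not taken from the slides):

```latex
X = T + E, \qquad
\rho_{XX'} = \frac{\sigma_T^2}{\sigma_X^2} = \frac{\sigma_T^2}{\sigma_T^2 + \sigma_E^2}
```

Here X is the observed score, T the true score, and E the error component; reliability is the proportion of observed-score variance attributable to true scores.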
Measurement error
Random Error is caused
by any factors that
randomly affect
measurement of the
variable across the sample.

Systematic Error is
caused by any factors that
systematically affect
measurement of the
variable across the sample.
Random error
• Variability in Participant Responses
• Measurement Instrument Fluctuations
• Sampling Variability
Systematic error
• Bias in Test Items
• Rater Bias
• Measurement Instrument Biases
Reliability Coefficient

The reliability of a test or research instrument is commonly expressed as a value between 0 and 1.

A reliability coefficient of 0 indicates that the test or instrument does not measure the target construct consistently (i.e., it is 0% reliable).

A reliability coefficient of 1 means that the test or research instrument is perfectly precise, with no measurement error (i.e., it is 100% reliable or consistent).
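The 0-to-1 coefficient can be made concrete with a small simulation under true score theory (an illustrative sketch; the score distributions are assumptions, not from the slides):

```python
import random

random.seed(42)

# Simulate 1,000 test-takers: each observed score = true score + random error.
true_scores = [random.gauss(50, 10) for _ in range(1000)]  # true ability
errors = [random.gauss(0, 5) for _ in range(1000)]         # random measurement error
observed = [t + e for t, e in zip(true_scores, errors)]

def variance(xs):
    m = sum(xs) / len(xs)
    return sum((x - m) ** 2 for x in xs) / len(xs)

# Reliability = true-score variance / observed-score variance.
reliability = variance(true_scores) / variance(observed)
print(round(reliability, 2))  # close to 10**2 / (10**2 + 5**2) = 0.8
```

With an error standard deviation half the true-score standard deviation, the variance ratio lands near 0.8, i.e., roughly 80% of the observed variance reflects true differences between test-takers.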
RELIABILITY VS VALIDITY

Reliability: Consistency. Validity: Accuracy.

What does it tell you?
- Reliability: the extent to which the results can be reproduced when the research is repeated under the same conditions.
- Validity: the extent to which the results really measure what they are supposed to measure.

How is it assessed?
- Reliability: by checking the consistency of results across time, across different observers, and across parts of the test itself.
- Validity: by checking how well the results correspond to established theories and other measures of the same concept.

How do they relate?
- A reliable measurement is not always valid: the results might be reproducible, but they're not necessarily correct.
- A valid measurement is generally reliable: if a test produces accurate results, they should be reproducible.
Source: https://conjointly.com/kb/reliability-and-validity/
Types of reliability

Test-retest reliability
- What it assesses: the consistency of a measure across time: do you get the same results when you repeat the measurement?
- Example: A group of participants complete a questionnaire designed to measure personality traits. If they repeat the questionnaire days, weeks or months apart and give the same answers, this indicates high test-retest reliability.

Interrater reliability
- What it assesses: the consistency of a measure across raters or observers: do you get the same results when different people conduct the same measurement?
- Example: Based on an assessment criteria checklist, five examiners submit substantially different results for the same student project. This indicates that the assessment checklist has low inter-rater reliability (for example, because the criteria are too subjective).

Internal consistency
- What it assesses: the consistency of the measurement itself: do you get the same results from different parts of a test that are designed to measure the same thing?
- Example: You design a questionnaire to measure self-esteem. If you randomly split the results into two halves, there should be a strong correlation between the two sets of results. If the two results are very different, this indicates low internal consistency.
Types of reliability: statistics

Test-retest reliability (the consistency of a measure across time)
Researchers administer the same instrument to the same group of participants on two separate occasions. The scores or measurements obtained from both administrations are then compared using a correlation coefficient (e.g., Pearson's correlation) to determine the stability or reliability of the instrument. A higher correlation indicates greater test-retest reliability.

Interrater reliability (the consistency of a measure across raters or observers)
Researchers typically provide a set of guidelines or criteria to the raters to ensure uniformity. The agreement between raters is calculated using statistical measures such as Cohen's kappa or the intraclass correlation coefficient (ICC). A higher value indicates greater inter-rater reliability.

Internal consistency (the consistency of the measurement itself)
Researchers administer a single instrument to a group of participants and analyze the responses to calculate measures such as Cronbach's alpha or split-half reliability. These measures assess the degree of correlation or agreement among the individual items within the instrument. A higher value indicates greater internal consistency.
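The test-retest procedure described above can be sketched in a few lines: administer twice, then correlate the two sets of scores (hypothetical scores; Pearson's r computed from its definition):

```python
# Test-retest reliability sketch: Pearson's correlation between two
# administrations of the same instrument (made-up scores, for illustration).
time1 = [12, 15, 9, 20, 17, 14, 11, 18]   # scores at first administration
time2 = [13, 14, 10, 19, 18, 13, 12, 17]  # scores at second administration

def pearson_r(x, y):
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    return cov / (sx * sy)

r = pearson_r(time1, time2)
print(round(r, 2))  # an r close to 1 indicates high test-retest reliability
```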
Types of reliability: common statistics

Test-retest reliability (across time)
- Quantitative: Intraclass Correlation Coefficient (ICC)

Interrater reliability (across raters or observers)
- Quantitative: Intraclass Correlation Coefficient (ICC); Bland and Altman method (fidelity between two raters); Spearman-Brown prophecy
- Qualitative (inter-coder): Cohen's kappa or percentage agreement

Internal consistency (of the measurement itself)
- Quantitative: Cronbach's alpha; Spearman-Brown prophecy
- Qualitative (participants): member checking; triangulation; peer debriefing
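As a concrete illustration of the internal-consistency statistics listed above, Cronbach's alpha can be computed from an item-by-respondent score matrix (made-up Likert-scale data; a sketch, not a production implementation):

```python
# Cronbach's alpha sketch: rows = respondents, columns = questionnaire items
# (made-up 1-5 Likert scores, for illustration only).
scores = [
    [4, 5, 4, 4],
    [3, 3, 2, 3],
    [5, 5, 5, 4],
    [2, 2, 3, 2],
    [4, 4, 4, 5],
]

def variance(xs):
    m = sum(xs) / len(xs)
    return sum((x - m) ** 2 for x in xs) / (len(xs) - 1)  # sample variance

k = len(scores[0])                                        # number of items
item_vars = [variance([row[i] for row in scores]) for i in range(k)]
total_var = variance([sum(row) for row in scores])        # variance of total scores
alpha = (k / (k - 1)) * (1 - sum(item_vars) / total_var)
print(round(alpha, 2))  # alpha above ~0.7 is conventionally taken as acceptable
```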
Ensuring reliability

Improving test-retest reliability
01 Minimize Practice Effects: randomize test orders to reduce familiarity bias.
02 Control Environmental Variables: ensure consistent testing conditions across sessions.
03 Address Participant Variability: consider individual differences that may impact performance.
Enhancing inter-rater reliability
01 Standardized Training: ensure all raters understand scoring criteria and procedures.
02 Pilot Testing: test instruments with a sample to identify and address ambiguities.
03 Calibration Sessions: regular meetings to discuss discrepancies and refine scoring.
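Agreement after calibration can be quantified with Cohen's kappa, mentioned earlier as an inter-coder statistic (a minimal sketch with hypothetical pass/fail ratings, not from the slides):

```python
# Cohen's kappa sketch: two raters score the same ten student projects
# as "pass" or "fail" (made-up ratings for illustration).
rater_a = ["pass", "pass", "fail", "pass", "fail", "pass", "pass", "fail", "pass", "fail"]
rater_b = ["pass", "fail", "fail", "pass", "fail", "pass", "pass", "fail", "pass", "pass"]

n = len(rater_a)
observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n  # raw agreement

# Chance agreement: probability both raters pick the same label at random,
# based on each rater's own label proportions.
labels = set(rater_a) | set(rater_b)
expected = sum((rater_a.count(l) / n) * (rater_b.count(l) / n) for l in labels)

# Kappa corrects raw agreement for the agreement expected by chance.
kappa = (observed - expected) / (1 - expected)
print(round(kappa, 2))
```

Kappa is preferred over raw percentage agreement because it discounts the agreement two raters would reach by guessing alone.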
Ensuring internal consistency reliability
01 Use Reliable Instruments: select validated measures with established reliability coefficients.
02 Conduct Factor Analysis: assess the underlying structure of measurement tools.
03 Consider Item Analysis: evaluate individual items for consistency and coherence.
Proposal - dissertation

Section: what to discuss
- Literature review: What have other researchers done to devise and improve methods that are reliable and valid?
- Methodology: How did you plan your research to ensure the reliability and validity of the measures used? This includes the chosen sample set and size, sample preparation, external conditions and measuring techniques.
- Results: If you calculate reliability and validity, state these values alongside your main results.
- Discussion: This is the moment to talk about how reliable and valid your results actually were. Were they consistent, and did they reflect true values? If not, why not?
- Conclusion: If reliability and validity were a big problem for your findings, it might be helpful to mention this here.
An example
Adams et al. (2011) discussed their scoring and
coding procedure as follows: ‘The oral tests were
scored by two of the researchers; the few
discrepancies were discussed until 100 percent
agreement was reached. The written tests were
scored by an independent rater and then the scores
were reviewed by two of the researchers. Interrater
reliability was calculated to be 98 percent.’
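The 98 percent figure in this example is a percentage-agreement statistic: the share of items on which the raters assign the same score (a sketch with hypothetical ratings):

```python
# Percentage agreement sketch (hypothetical ratings): the proportion of
# items on which two raters assign the same score.
rater_1 = [3, 4, 4, 2, 5, 3, 4, 1, 2, 5]
rater_2 = [3, 4, 3, 2, 5, 3, 4, 1, 2, 5]

agreement = sum(a == b for a, b in zip(rater_1, rater_2)) / len(rater_1)
print(f"{agreement:.0%}")  # prints "90%"
```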
Challenges in ESL research
- Language Variability: diverse linguistic backgrounds among participants.
- Cultural Differences: varied cultural interpretations of language constructs.
- Contextual Factors: influence of contextual factors on language usage and understanding.