LU 4 Methods of Reliability Testing Concepts
1. Test-Retest Reliability
This type of reliability assesses the consistency of scores
obtained from the same participants when they are tested on two
separate occasions with the same instrument. For example, if a test
yields similar scores for the same individuals when administered at two
different time points, it indicates good test-retest reliability. However,
factors such as practice effects or changes in participants' conditions
between the two administrations can affect the reliability of this
method.
Strengths:
• Provides a straightforward assessment of stability over time.
• Easy to administer and interpret.
• Useful for measuring relatively stable constructs.
Weaknesses:
• Susceptible to practice effects, where participants may remember previous responses.
• Not suitable for measuring constructs prone to change over short periods.
• External factors (e.g., environmental changes) may influence results between administrations.
Example:
A researcher is studying the effectiveness of a mindfulness-based stress
reduction program. To assess participants' stress levels, the researcher
administers a stress questionnaire twice, with a two-week interval
between administrations. By correlating participants' scores from the
two administrations, the researcher can determine the test-retest
reliability of the questionnaire.
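A minimal sketch of how that correlation could be computed, assuming the two sets of questionnaire scores are held in the hypothetical lists time1 and time2 (Python with SciPy; the data are purely illustrative):

# Test-retest reliability: correlate scores from the two administrations
# of the same stress questionnaire, given two weeks apart.
from scipy.stats import pearsonr

time1 = [22, 35, 28, 40, 31, 26, 38, 30]   # scores at first administration
time2 = [24, 33, 27, 41, 29, 28, 36, 31]   # scores two weeks later

r, p = pearsonr(time1, time2)
print(f"Test-retest reliability (Pearson r) = {r:.2f}")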
2. Alternate-Forms Reliability
This method assesses the consistency of scores across two equivalent versions of the same instrument administered to the same participants.
Weaknesses:
• Creating truly equivalent alternate forms can be challenging.
• Requires additional time and resources to develop and validate alternate forms.
• Differences in content or difficulty between forms may affect reliability estimates.
Example:
A teacher wants to ensure consistency in grading across multiple versions of a midterm exam. The teacher creates two equivalent versions of the exam and administers both to the same group of students. Correlating students' scores on the two versions provides an estimate of alternate-forms reliability.
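A minimal sketch of how the teacher might check the two versions, assuming each student's scores are held in the hypothetical lists form_a and form_b (illustrative data; the paired t-test is an added check on whether the forms differ in difficulty):

# Alternate-forms reliability: correlate scores on the two exam versions,
# then check whether the versions differ in average difficulty.
from scipy.stats import pearsonr, ttest_rel

form_a = [78, 85, 62, 90, 73, 81, 69, 88]   # scores on version A
form_b = [75, 87, 60, 92, 70, 83, 72, 85]   # same students on version B

r, _ = pearsonr(form_a, form_b)
t, p = ttest_rel(form_a, form_b)
print(f"Alternate-forms reliability (Pearson r) = {r:.2f}")
print(f"Difference in difficulty between forms: t = {t:.2f}, p = {p:.3f}")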
3. Internal Consistency Reliability
This method assesses the extent to which the items within a single instrument measure the same underlying construct; it is most commonly estimated with Cronbach's alpha.
Weaknesses:
• Assumes that all items are measuring the same underlying construct.
• Cronbach's alpha may be influenced by the number of items in the scale.
• Doesn't account for systematic errors that may affect individual items.
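Because this method is usually summarized with Cronbach's alpha, a minimal sketch of the calculation may help; the respondent-by-item matrix below is purely illustrative:

# Cronbach's alpha for a k-item scale:
# alpha = k/(k-1) * (1 - sum(item variances) / variance of total scores)
import numpy as np

scores = np.array([        # rows = respondents, columns = items
    [4, 5, 4, 3],
    [3, 4, 3, 3],
    [5, 5, 4, 5],
    [2, 3, 2, 2],
    [4, 4, 5, 4],
])

k = scores.shape[1]
item_vars = scores.var(axis=0, ddof=1)        # variance of each item
total_var = scores.sum(axis=1).var(ddof=1)    # variance of the total score
alpha = (k / (k - 1)) * (1 - item_vars.sum() / total_var)
print(f"Cronbach's alpha = {alpha:.2f}")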
4. Inter-Rater Reliability
This method assesses the consistency of ratings or judgments
made by different raters or observers. It is particularly relevant in
research involving observational data or subjective judgments.
Strengths:
• Provides a measure of consistency across different observers or raters.
• Useful for observational studies or studies involving subjective judgments.
• Helps ensure the reliability and validity of data collected through observation.
Weaknesses:
• Reliability may be influenced by differences in observer training or judgment criteria.
• More time-consuming and resource-intensive compared to other reliability methods.
• Requires careful definition and operationalization of the behaviors or criteria being observed.
Example:
A team of researchers is conducting a study on nonverbal communication in job interviews. They
have developed a coding scheme to analyze specific nonverbal behaviors displayed by job
applicants, such as eye contact, posture, and facial expressions. To assess inter-rater reliability, the
researchers recruit three trained observers who will independently code videos of job interviews
according to the established coding scheme. Each observer watches the same set of video
recordings of job interviews and records the frequency and duration of specific nonverbal
behaviors displayed by the job applicants. They then enter their coding data into a spreadsheet or
coding software. After coding all the videos, the researchers calculate inter-rater reliability using
appropriate statistical measures such as Cohen's kappa coefficient or intraclass correlation
coefficient (ICC). These measures quantify the degree of agreement among the observers in their
coding of nonverbal behaviors.
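A minimal sketch of the agreement calculation for two of the observers, assuming their categorical codes for the same interview segments are held in the hypothetical lists rater_1 and rater_2 (Python with scikit-learn; the codes are illustrative):

# Inter-rater reliability: Cohen's kappa for two observers coding the same
# interview segments ("ec" = eye contact, "sm" = smile, "nod" = head nod).
from sklearn.metrics import cohen_kappa_score

rater_1 = ["ec", "sm", "ec", "nod", "ec", "sm", "nod", "ec"]
rater_2 = ["ec", "sm", "nod", "nod", "ec", "sm", "nod", "sm"]

kappa = cohen_kappa_score(rater_1, rater_2)
print(f"Cohen's kappa = {kappa:.2f}")

With three observers, as in this study, agreement is usually summarized pairwise or with a statistic designed for multiple raters (e.g., Fleiss' kappa, or an ICC for continuous ratings such as durations).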
Reliability Coefficient
The strength of agreement indicated by a coefficient such as Cohen's kappa is commonly interpreted against the benchmarks of Landis & Koch (1977); rules of thumb for interpreting Cronbach's alpha are given by George & Mallery (2003).
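As a quick reference, a small helper that labels a kappa value using those benchmarks (the cut-offs follow Landis & Koch, 1977):

# Interpret a kappa coefficient using the Landis & Koch (1977) benchmarks.
def interpret_kappa(kappa: float) -> str:
    if kappa < 0.00:
        return "poor"
    if kappa <= 0.20:
        return "slight"
    if kappa <= 0.40:
        return "fair"
    if kappa <= 0.60:
        return "moderate"
    if kappa <= 0.80:
        return "substantial"
    return "almost perfect"

print(interpret_kappa(0.72))   # -> substantial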
References
Landis, J. R., & Koch, G. G. (1977). The measurement of observer agreement for categorical data. Biometrics, 33(1), 159–174.
George, D., & Mallery, P. (2003). SPSS for Windows step by step: A simple guide and reference, 11.0 update (4th ed.). Boston, MA: Allyn & Bacon.