
Module 3 – Measurement

Learning Objectives:
- Understand/identify different types of outcome measures and their limits
- Define different measurement properties
- Describe methods to evaluate different types of validity, reliability, sensitivity to change
and responsiveness
- Design a study to evaluate the measurement properties of an outcome measure

Measuring Health

What is health?
- According to the WHO, it is a state of complete physical, mental, and social well-being, and not merely the absence of disease or infirmity
- It is a multi-faceted concept influenced by a person’s experiences, beliefs, expectations,
and perceptions
- It means different things to different people

ICF Model of Health


- ICF = International Classification of Functioning, Disability, and Health
- Model meant to standardize communication about health
- Health outcomes are classified according to the effect on body function and structure (impairment; includes mental health items), limitations in activities (disability), and restrictions in participation (handicap)
- Modifiers of these outcomes are: age, coping strategies, social attitudes, education,
experience

Measuring Health in Research


- Using the ICF as a guide, choose several outcome measures that can speak to the specific aspect(s) of health being affected by the intervention
- Use a QoL questionnaire whose items include specific aspects of health that patients have deemed important and relevant to their disease
o Not all questions are equally valued; some are more important than others
- Good outcome measures for a study have good measurement properties AND are well-known/commonly used (ease of interpretation by others)
- Issue: too many independent outcomes = potential for a multiple-comparisons error (see the sketch below)
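
To make the multiple-comparisons point concrete, here is a minimal sketch (hypothetical numbers, not from the module) of how the familywise error rate grows with the number of independent outcomes tested at alpha = 0.05, and the Bonferroni-adjusted per-test threshold that compensates:

```python
# Sketch: familywise error rate vs. number of independent outcomes,
# and the Bonferroni-corrected per-test alpha. Hypothetical numbers.

alpha = 0.05  # nominal significance level per test

for n_outcomes in (1, 3, 5, 10):
    # P(at least one false positive) if all tests are independent and null
    familywise = 1 - (1 - alpha) ** n_outcomes
    bonferroni = alpha / n_outcomes  # corrected per-test threshold
    print(f"{n_outcomes:2d} outcomes: "
          f"familywise error = {familywise:.2f}, "
          f"Bonferroni alpha = {bonferroni:.4f}")
```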

Types of Outcome Measures

Predictive Outcome Measure


- An instrument/device/method that predicts a future outcome
o E.g. MCAT predicts who is likely to perform well on licensing exam
o E.g. following an acute injury, predict who is likely to become chronic
- Design a predictive instrument using prognosis design
- Evaluate predictive validity using diagnosis design

Discriminative Outcome Measure


- An instrument/device/method that sorts individuals into groups
o E.g. x-ray (fracture present or absent)
- Evaluate validity using diagnosis design
Evaluative Outcome Measure
- An instrument/device/method that provides data on the quantity/quality of the result of
the experiment
- It is a basis for measuring the effects of the independent variable or change in dependent
variable
o E.g. pain measured pre- and post-intervention
- Evaluate using longitudinal construct validity and sensitivity to change

Types of Evaluative Measures


- Surrogate Outcomes
- Patient Important Outcomes

Surrogate Outcomes
- Outcome measures that are not of direct practical importance to patients but are believed
to reflect outcomes that are important.
- Validity depends on the magnitude of the association b/w the surrogate and the patient-important outcome (i.e. its predictive validity)
o E.g. reduction in cholesterol as a surrogate for reduction in mortality
o E.g. increased bone density as a surrogate for reduction in fracture incidence
- We use these outcomes b/c of their efficiency; changes can be measured in all patients over a shorter time interval

Patient Important Outcomes


- Outcome measures that are of direct importance to patients
o E.g. death/survival, success/failure, patient-reported QoL
- Advantage: validity
- Disadvantage: long time interval needed to measure

Types of QoL

Measurement Properties

Validity and Reliability


- Validity (accuracy) is a measure of how close a measurement comes to the true score for
a variable
- Reliability (precision) is a measure of the extent to which repeated measurements come
up with the same value
- All outcome measures need to demonstrate validity and reliability
- Exception: evaluative measures only need to show responsiveness

Validity vs. Reliability


- Improve the precision of an estimate by increasing the number of measurements taken and averaging them
o Reduces the level of random error and narrows the CI about the value being estimated
- Increasing precision when the experiment contains systematic error is not the solution
o Solution: calibration of the instrument

Validity: the extent to which an instrument measures what it is intended to measure

Types:
- Face: the extent to which a measurement instrument appears to measure what it is
intended to measure.
- Content: the extent to which a measurement instrument represents all facets of a given
social construct.
- Criterion: examines the extent to which a measure provides results that are consistent
with a gold standard.
o Predictive: compares the measure in question with an outcome assessed at a later
time.
o Concurrent: comparison between the measure in question and an outcome assessed at
the same time.
- Construct: forming theories about the attribute of interest and then assessing the extent to
which the measure under investigation provides results that are consistent with the
theories.
o Convergent: tests the degree to which two measures of constructs that theoretically should be related are, in fact, related
o Divergent: tests whether concepts or measurements that are supposed to be unrelated are, in fact, unrelated (see the sketch below)
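
A minimal sketch of how convergent and divergent validity might be checked in practice, using hypothetical scores (the instruments and numbers below are illustrative only): correlate the new measure with a theoretically related scale and with a theoretically unrelated one.

```python
import numpy as np

# Hypothetical scores from 8 patients on three instruments.
new_measure   = np.array([12, 15, 9, 20, 17, 11, 14, 18], dtype=float)
related_scale = np.array([30, 34, 25, 45, 40, 28, 33, 42], dtype=float)  # same construct
unrelated     = np.array([ 3,  1,  4,  2,  5,  3,  1,  4], dtype=float)  # different construct

# Pearson correlation is appropriate here: construct validity is about
# association, unlike reliability, which is about agreement.
r_convergent = np.corrcoef(new_measure, related_scale)[0, 1]
r_divergent  = np.corrcoef(new_measure, unrelated)[0, 1]

print(f"convergent r = {r_convergent:.2f} (expect high)")
print(f"divergent  r = {r_divergent:.2f} (expect near zero)")
```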

Study Designs: Validity


- In a known-groups design, one group is known to have the disease and the other is known not to; a valid measure should differentiate b/w them

Reliability: the extent to which an instrument yields the same results in repeated
administrations in a stable population

Study Designs: Reliability


- All require the disease to be in a stable state; measurements are repeated at least twice
- Test-retest: assumes the rater and disease are consistent and evaluates the reproducibility of the test (the patient performs the test more than once)
- Inter-rater: the extent to which 2 or more raters are able to consistently differentiate subjects with higher and lower values on an underlying trait
o Assumes the test and disease are consistent and evaluates the reproducibility b/w different raters (2+ raters observe or rate the same clients)
- Intra-rater: the extent to which a rater is able to consistently differentiate participants with higher and lower values of an underlying trait on repeated ratings over time
o Assumes the test and disease are consistent and evaluates the reproducibility of one rater over time (the same rater rates the same clients on repeated occasions)

Statistics to Communicate Reliability

Relative Reliability:
- Reliability = measuring agreement, NOT association
- Cannot use Pearson/Spearman correlations to demonstrate reliability b/c they are measures of association and do not account for systematic differences b/w measures
- However, both the Intra-class Correlation Coefficient (ICC) and Kappa do account for this (measures of agreement)
- Ideal value = 1
- Measures that are highly associated but systematically different will have a correlation coefficient that is larger than the agreement statistic (see the sketch below)
- Measures that are highly associated without a systematic difference will have similar values for the correlation coefficient and the agreement statistic
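
The association-vs-agreement distinction can be demonstrated with a toy example, assuming a constant 5-point offset b/w two hypothetical raters: the Pearson correlation stays at 1.0 even though the raters never give the same score.

```python
import numpy as np

rater_a = np.array([10., 12., 14., 16., 18.])
rater_b = rater_a + 5.0  # rater B scores systematically 5 points higher

r = np.corrcoef(rater_a, rater_b)[0, 1]
mean_diff = np.mean(rater_b - rater_a)

# Perfect association (r = 1.0) despite zero exact agreement:
print(f"Pearson r = {r:.2f}, mean systematic difference = {mean_diff:.1f}")
```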

ICC:
- Is a measure of reproducibility that compares the variance b/w patients to the total variance (b/w-patient plus within-patient variance)
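
A minimal sketch of a one-way random-effects ICC (ICC(1,1) in the Shrout–Fleiss notation) computed from these variance components, assuming hypothetical test-retest data with two measurements per patient:

```python
import numpy as np

# Rows = patients, columns = repeated measurements (hypothetical data).
scores = np.array([
    [42., 44.],
    [55., 53.],
    [60., 62.],
    [38., 37.],
    [50., 49.],
])
n, k = scores.shape

grand_mean = scores.mean()
subj_means = scores.mean(axis=1)

# One-way ANOVA decomposition.
ms_between = k * np.sum((subj_means - grand_mean) ** 2) / (n - 1)
ms_within = np.sum((scores - subj_means[:, None]) ** 2) / (n * (k - 1))

# ICC(1,1): b/w-patient variance relative to total variance.
icc = (ms_between - ms_within) / (ms_between + (k - 1) * ms_within)
print(f"ICC(1,1) = {icc:.3f}")  # 1.0 = perfect reproducibility
```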

Kappa
- Is a measure of the extent to which observers achieve agreement beyond the level expected to occur by chance alone
- Used for categorical (e.g. binary) outcome variables; ranges from 0 (agreement no better than chance) to 1 (perfect agreement)
- The more discordant the raters are, the lower the value of Kappa
- The weighted Kappa is for ordered categories
o Discordant ratings that are further apart lower the weighted Kappa more
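
A minimal sketch of Cohen's Kappa for two raters and a binary outcome, using hypothetical ratings: observed agreement is corrected for the agreement expected by chance alone.

```python
import numpy as np

# Hypothetical binary ratings (1 = disease present) from two raters.
rater_a = np.array([1, 1, 0, 1, 0, 0, 1, 0, 1, 0])
rater_b = np.array([1, 0, 0, 1, 0, 1, 1, 0, 1, 0])

p_observed = np.mean(rater_a == rater_b)  # proportion of exact agreement

# Chance agreement: probability both say 1 plus probability both say 0.
p_a1, p_b1 = rater_a.mean(), rater_b.mean()
p_expected = p_a1 * p_b1 + (1 - p_a1) * (1 - p_b1)

kappa = (p_observed - p_expected) / (1 - p_expected)
print(f"observed = {p_observed:.2f}, chance = {p_expected:.2f}, kappa = {kappa:.2f}")
```
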
Absolute Reliability: Precision – Individual Score
- Standard Error of Measurement (SEM) is a statistic for absolute reliability and is calculated from a test-retest reliability study design
- SEM allows us to determine how certain we can be about a particular individual's score at a particular point in time
- SEM = √(within-client variance)
- Ideally 0
- A clinician can be x% confident (x defined by the confidence level chosen) that the true score lies within the reported interval
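
A minimal sketch of the SEM from a test-retest design, assuming hypothetical paired trials: with two trials per client, the within-client variance can be estimated from the trial-to-trial differences, and a confidence interval is placed around an individual's observed score.

```python
import numpy as np

# Hypothetical test-retest scores (two trials per client).
trial_1 = np.array([42., 55., 60., 38., 50.])
trial_2 = np.array([44., 53., 62., 37., 49.])

# With two trials, within-client variance = sum(d^2) / (2n).
diffs = trial_1 - trial_2
within_var = np.sum(diffs ** 2) / (2 * len(diffs))
sem = np.sqrt(within_var)  # SEM = sqrt(within-client variance)

# 95% CI around one individual's observed score.
score = 48.0
z = 1.96
print(f"SEM = {sem:.2f}; true score 95% CI: "
      f"{score - z * sem:.1f} to {score + z * sem:.1f}")
```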

Absolute Reliability: Real Change or Error?


- We can use the SEM to determine if there has been a real change in score over time
- We can be x% confident that a true change has occurred (as opposed to random error within the measurement) if the change exceeds the reported interval, known as the Minimal Detectable Change/Difference (MDC/D)
- MDC(x) = SEM × z(x) × √2 (e.g. MDC95 = 1.96 × √2 × SEM)
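
Continuing the SEM example, a minimal sketch of the MDC formula above (the SEM value is hypothetical; √2 appears b/c a change score carries measurement error from two occasions):

```python
import math

sem = 1.18   # SEM from a test-retest study (hypothetical value)
z_95 = 1.96  # z-score for 95% confidence

# MDC95 = SEM * z * sqrt(2): error from two measurement occasions.
mdc_95 = sem * z_95 * math.sqrt(2)
print(f"MDC95 = {mdc_95:.2f}; smaller changes may just be random error")
```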

Sensitivity to Change
- Is the ability to detect change that is not necessarily meaningful change
- Many statistics exist for expressing this; the Standardized Response Mean (SRM) is the most common
- Study design: in a population expected to change, administer the new test pre- and post-change
- SRM = (mean change) / (SD of change)
- If SRM > 1, the 'signal' (change) can be detected over the 'noise' (variability)
- Signal = change that occurred from pre- to post-treatment
- Noise = all systematic and random errors
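
A minimal sketch of the SRM under the study design just described, assuming hypothetical pre/post scores in a population expected to change:

```python
import numpy as np

# Hypothetical scores before and after an intervention expected to help.
pre  = np.array([30., 42., 35., 28., 39., 33.])
post = np.array([38., 47., 44., 33., 49., 40.])

change = post - pre
srm = change.mean() / change.std(ddof=1)  # SRM = mean change / SD of change

# SRM > 1: the change 'signal' exceeds the 'noise' of its variability.
print(f"SRM = {srm:.2f}")
```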

Responsiveness

Responsiveness: the instrument's ability to detect a clinically meaningful change


- Statistic: Minimal Clinically Important Difference (MCID)
- Sensitivity to change is a necessary but insufficient condition for responsiveness
- NOTE: using the wrong MCID has important implications for sample size
o Within-group: within a treatment group, every patient changes from pre- to post-treatment
o B/w-group: the difference we want to detect in a study evaluating two different treatments; the more similar the treatments, the smaller the expected difference b/w the groups
o A b/w-group MCID is approx. 20% of a within-group MCID

Anchor-Based Approach
- A way to establish the interpretability of measures of patient-reported outcomes
- All patients are measured at Time 1 and Time 2
- B/w these times, provide an intervention that usually produces some improvement
- At Time 2, the anchor is included = a Global Rating of Change (GRC) questionnaire
o The patient indicates how much better/worse they feel compared to Time 1
o Calculate the average change score of all patients who indicated a small but important change on the GRC (score of 2 or 3); this represents the within-group MCID for that instrument (see the sketch below)
- If the magnitudes of change in the 'better' and 'worse' groups are different, then averaging the scores is not valid
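
A minimal sketch of the anchor-based calculation, assuming hypothetical change scores paired with GRC ratings: the within-group MCID is the mean change among patients who reported a small but important improvement (GRC of 2 or 3).

```python
import numpy as np

# Hypothetical Time 2 - Time 1 change scores and GRC ratings per patient
# (GRC: 0 = no change, higher = more improvement).
change = np.array([1.0, 4.5, 3.8, 0.5, 6.0, 4.1, 2.2, 9.0])
grc    = np.array([0,   2,   3,   1,   5,   2,   3,   6  ])

# Patients reporting a small but important change (GRC of 2 or 3).
small_important = (grc == 2) | (grc == 3)
mcid_within = change[small_important].mean()

print(f"within-group MCID = {mcid_within:.2f}")
```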

Distribution-Based Approach

- Approach 1
o Measure the outcome at two time points in individuals not expected to change
o Calculate change scores for every participant and plot their distribution
o Choose a threshold (MCID) for classifying an individual as not having changed by an important amount

- Approach 2
o Measure the outcome at two time points in individuals expected to change by an important amount
o Calculate change scores for every participant and plot their distribution
o Choose a threshold (MCID) for classifying an individual as having changed by an important amount

- The score at the cut-off is the within-group MCID for that instrument
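
A minimal sketch of Approach 1, assuming hypothetical change scores in a stable group and one common convention (not specified in these notes) of taking the 95th percentile of the stable group's absolute change scores as the cut-off:

```python
import numpy as np

# Hypothetical change scores in individuals NOT expected to change (Approach 1).
stable_change = np.array([0.5, -1.0, 1.2, 0.0, -0.8, 1.5, 0.3, -0.4, 0.9, -1.1])

# One common convention: a change beyond the 95th percentile of the stable
# group's distribution is unlikely to be measurement noise alone.
cutoff = np.percentile(np.abs(stable_change), 95)
print(f"threshold (within-group MCID estimate) = {cutoff:.2f}")
```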

Self-Assessment
