
Why using the curriculum as your progression model is incompatible with ‘measuring progress’
David Didau September 19, 2021

Our capacity to misunderstand complex ideas leads, inexorably, to the lethal mutation of those ideas. In my last post I set out why the apparently simple and obvious notion of ‘using the curriculum as a progression model’ often goes wrong, but I underplayed some key points about the use of numbers. Tucked away in that post are two ideas that need some amplification and explanation.

Firstly, in relation to the way in which summative assessments are scored:

I should note that the key assumption underpinning this assessment model is not that tests should discriminate between students so we can place them in rank orders and assign summative statements of progression. Instead, in order to ensure progression through the curriculum, these tests should be primarily seen as statements of competence, that students have mastered the content sufficiently well to progress.

If we are using the curriculum as our progression model, all we need to know is how well students have learned this aspect of the curriculum. Whilst the purpose of a GCSE exam is to say how well students have performed relative to one another, a test attempting to assess how much of the curriculum has been learned should not be concerned with discriminating between students.
Ideally, if the curriculum is perfectly specified and taught, all students would get close to 100%. Clearly, we’ll never come near this state of perfection, so if we achieve average scores of about 80% we should be well satisfied that we are specifying and teaching incredibly well.

Over the long term, if students fail to meet a threshold of competence, the assumption should be that there is a fault either with the design of the curriculum or in its teaching. In the short term, if a minority of students fail to reach the threshold, this leaves us with the 2 sigma problem Benjamin Bloom failed to solve: how can we scale the time and resources required for all students to be successful? The best I’m currently able to suggest is that we need to have specified and sequenced our curriculum to focus heavily on the most crucial concepts within subject disciplines and to ensure these at least are mastered. If our guiding assumption were that any test score below 80% highlighted some fault in specification or instruction, this could transform the educational experiences of our most disadvantaged students.
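As a minimal sketch of how that guiding assumption might be operationalised (the names, scores and the exact decision rule below are invented for illustration, not taken from the post):

# Illustrative sketch only: hypothetical scores and an 80% threshold.
# If most of a class falls short, treat it as a curriculum/teaching problem;
# if only a few fall short, intervene with those students.
THRESHOLD = 0.80

scores = {"Student A": 0.92, "Student B": 0.61, "Student C": 0.85,
          "Student D": 0.78, "Student E": 0.88}  # invented data

below = [name for name, score in scores.items() if score < THRESHOLD]
average = sum(scores.values()) / len(scores)

if average < THRESHOLD:
    print(f"Class average {average:.0%} is below {THRESHOLD:.0%}: "
          "review how the curriculum was specified and taught.")
elif below:
    print("Intervene with: " + ", ".join(below))
else:
    print("All students met the threshold.")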

We can’t measure progress, only performance

The second point made in the previous post that I want to return to
was on what we should avoid doing with numbers:

What we can’t do is compare the percentage a student gets in Term 1 with one achieved in Term 6 and attempt to draw a line between them indicating progress. This would assume that the second test was objectively more difficult and that if the numbers go up, then progress is being made. We may believe this to be true but it’s very rare to find schools that have put the effort into calculating the difficulty of test questions required to make this a defensible claim.
The key to using numbers sensibly in this model is only to compare across space (laterally) and not across time (longitudinally). What I mean by this is that it makes perfect sense to compare how different students, classes or cohorts have performed on the same assessment, but not to compare the results of different assessments.

In order to explain what I mean, I need to take you back to Becky Allen’s 2018 blog, What if we cannot measure pupil progress? In it she said,

When we use … tests to measure relative progress, we often look to see whether a student has moved up (good) or down (bad) the bell curve. On the face of it this looks like they’ve made good progress, and learnt more than similar students over the course of the year. However [the test] is a noisy measure of what they knew at the start [and] the end of the year. Neither test is reliable enough to say if this individual pupil’s progress is actually better or worse than should be expected, given their starting point.

Tests are very useful for assessing how well students have learned particular curriculum content but cannot be used to measure the rate at which students are progressing towards better future test performance. Just because a student has learned a lot about, say, the Norman Invasion or Buddhism, we cannot claim that they will do equally well (or better) on a test of the Tudors or Hinduism. And if they do less well on a subsequent test we cannot claim that students are making negative progress. What we can – and should – perceive is that they don’t know some aspects of the curriculum as well as others, and intervene accordingly.

This is what I meant about lateral rather than longitudinal comparison. It’s not only possible but desirable to compare how different students perform on the same assessment. If my class performs much worse than yours on the same assessment, we could infer either that students in your class are, on average, cleverer than those in mine, or that you have taught the curriculum better. While the first inference may be true, it is not useful. ‘Top sets’ are routinely filled with students from more affluent backgrounds who are often successful despite rather than because of the choices we make. Rather than worrying about ability, we might do better to track indices of social disadvantage: if our more disadvantaged students are doing well we can be reasonably sure this is due to how well we’ve specified and taught the curriculum. It might be useful to only use these pupils when analysing test data.

Longitudinal comparison, or attempting to measure progress, is fraught with error. Even if we don’t make the lamentably common mistake of assessing students’ ability to do things we haven’t actually taught them, the test students sit will only sample from the domain of what they were taught. How students perform in that test gives us some sense of how well an individual student has learned the curriculum relative to their peers, but it’s only by establishing a scalogram of student performance vs item difficulty that we will get a sense of what an individual test score might mean. As I said in this post,

Typically, we just see a test as a mechanism for measuring students’ ability, or level of development, and fail to understand that getting 50% in a harder test might actually be better than getting 70% in an easier test. But we should also understand that if one student gets 70% and another gets 35% on the same test, that does not mean the first student has done twice as well as the second student. It should be obvious that getting less than 35% is far easier than getting more than 35% and, if a test is well designed, items will get progressively more difficult so as to better measure individual students’ performance.
Here’s an example of a scalogram that compares students’ performance against item difficulty:

[Scalogram table: students A to G in rows, ordered by total score, against test items in columns, ordered from easiest to hardest]

I should explain that item difficulty is established by working out which questions students answer correctly; the more students answer an item correctly, the easier that item is, and the fewer students answer an item correctly, the harder it is. So, we can see in the table above that although students A, D, and G all achieved the same test score, Student D was able to answer more difficult questions than Students A and G.* One explanation could be that Student D had not been present in class when some of the content most students found easy to answer was taught. Or, similarly, it could be that Student G missed some of the later curriculum content. Either way, the test score tells us relatively little about these students’ ability to make future progress but quite a lot about what they have learned to date. It ought to be clear that we should not intervene with these three students in the same ways. And, I hope, it ought to be equally obvious why drawing a line between performance in two different tests is likely to tell us little of any use about students’ progress.
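What follows is a minimal Python sketch of how a scalogram like the one described might be assembled from raw item responses. The response matrix, student labels and item labels are invented purely for illustration, arranged so that hypothetical Students A, D and G share a total score while answering items of different difficulty:

# A minimal sketch (invented data, not Didau's actual table) of building a scalogram:
# item facility is the proportion of students answering an item correctly,
# items are ordered from easiest to hardest and students by total score.
responses = {  # 1 = correct, 0 = incorrect; students and items are hypothetical
    "A": {"Q1": 1, "Q2": 1, "Q3": 1, "Q4": 0, "Q5": 0, "Q6": 0},
    "B": {"Q1": 1, "Q2": 1, "Q3": 1, "Q4": 1, "Q5": 1, "Q6": 0},
    "C": {"Q1": 1, "Q2": 1, "Q3": 0, "Q4": 0, "Q5": 0, "Q6": 0},
    "D": {"Q1": 1, "Q2": 0, "Q3": 0, "Q4": 1, "Q5": 0, "Q6": 1},
    "E": {"Q1": 1, "Q2": 1, "Q3": 1, "Q4": 1, "Q5": 0, "Q6": 0},
    "F": {"Q1": 1, "Q2": 1, "Q3": 1, "Q4": 1, "Q5": 1, "Q6": 1},
    "G": {"Q1": 1, "Q2": 1, "Q3": 0, "Q4": 1, "Q5": 0, "Q6": 0},
}

# Facility (ease) of each item: the share of students answering it correctly.
items = list(next(iter(responses.values())))
facility = {q: sum(r[q] for r in responses.values()) / len(responses) for q in items}

# Order items from easiest (highest facility) to hardest, and students by total score.
items_easy_to_hard = sorted(items, key=facility.get, reverse=True)
students_by_score = sorted(responses, key=lambda s: sum(responses[s].values()), reverse=True)

# Print the scalogram: the same total score can hide very different patterns.
print("Student  " + "  ".join(items_easy_to_hard) + "  total")
for s in students_by_score:
    row = "   ".join(str(responses[s][q]) for q in items_easy_to_hard)
    print(f"{s:<8} {row}   {sum(responses[s].values())}")

In this invented matrix, Students A, D and G all score 3, but D’s correct answers fall on harder items, which is exactly the kind of pattern a raw total score obscures.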

I hope this additional explanation makes sense and proves useful. If you have further questions or would like to point out errors in my thinking I’d be most grateful to hear from you.

* There’s a discussion to be had here about whether the marks awarded to an item should only be decided after a test has been taken and item difficulty is established. By marking each question equally we are likely to obscure students’ actual performance.
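As a rough illustration of that footnote, here is a small Python sketch in which each item is re-weighted after the test by the proportion of students who answered it incorrectly; the data and the weighting scheme are assumptions made for the example, not a method from the post. Two students with the same raw score can end up with quite different difficulty-weighted scores:

# Illustrative sketch of post-hoc difficulty weighting (an assumed scheme, not a
# recommendation): each item is worth (1 - facility), i.e. the proportion of
# students who got it wrong, so harder items attract more credit.
responses = {  # 1 = correct, 0 = incorrect; students and items are hypothetical
    "A": {"Q1": 1, "Q2": 1, "Q3": 0, "Q4": 0},
    "B": {"Q1": 1, "Q2": 0, "Q3": 1, "Q4": 0},
    "C": {"Q1": 1, "Q2": 1, "Q3": 1, "Q4": 0},
    "D": {"Q1": 0, "Q2": 1, "Q3": 1, "Q4": 1},
}

items = list(next(iter(responses.values())))
facility = {q: sum(r[q] for r in responses.values()) / len(responses) for q in items}
weight = {q: 1 - facility[q] for q in items}  # harder item -> larger weight

for student, answers in responses.items():
    raw = sum(answers.values())                             # equal marks per item
    weighted = sum(weight[q] for q in items if answers[q])  # difficulty-weighted
    print(f"{student}: raw {raw}/{len(items)}, "
          f"weighted {weighted:.2f}/{sum(weight.values()):.2f}")

Here Students C and D both get a raw score of 3, but D answered the one item most students got wrong, so D’s difficulty-weighted score is higher; equal marks per question would hide that difference.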