Session 1 Principles and design of assessment
Session 1 Principles and design of assessment
• Please check that you have selected the ‘microphone’ option when 1 2 3
joining the session. If you have joined as ‘listen only’, you will
not be able to unmute yourself and fully participate in the
interactive activities in the workshop.
• If you have joined as ‘listen only’, please click on ‘leave audio’
and then ‘join audio’. You can then select ‘microphone’, and
will be able to mute and unmute yourself as needed.
If you are having problems after trying these options, please let us know in the chat box on the left of your screen and our
moderator will be able to provide additional assistance.
Question writing for effective assessment
• purpose
• validity
• reliability
• manageability
• impact
8
What is validity?
Traditional definition:
‘whether a test really measures what it purports to measure’
(Kelley, 1927)
Contemporary definition:
‘the extent to which certain inferences can be made from an
assessment’s outcomes’
(Isaacs et al, 2013)
10
the students’ minds are doing the things we want them to show us they
can do;
CU (construct under-representation)
or
CIV (construct irrelevant variance)?
CU (construct under-representation)
or
CIV (construct irrelevant variance)?
2. A science exam with two papers of equal length. The first paper
assesses 75% of Section A of the syllabus. Paper 2 assesses
40% of Section B and 25% of Section C.
CU (construct under-representation)
or
CIV (construct irrelevant variance)?
CU (construct under-representation)
or
CIV (construct irrelevant variance)?
Fairness
https://qualificationswalesˌorg/media/avuda1dk/fair-access-by-designˌpdf
20
Use the chat box to identify aspects of these two questions that might make them
unfair:
Discuss.
21
Reliability
• ‘If the same person had taken the test for the first time on another
occasion, then they would have got the same results.’ (NFER, 2001)
Threats to reliability
• the occasion – the weather might be very hot, or the invigilator might give unclear
instructions
• the questions – the choice of topics might be less familiar, the wording of a question
might be ambiguous
• the mark scheme – the mark scheme might be very rigid / vague / not reward atypical but
correct answers
• the marker – is a human being who might be generous / severe / inconsistent / subjective
• the level-setting panel – the decisions might be swayed by one or two people with a
particular view / a different group of people might have made different decisions
• the candidate – is a human being / dynamic / subject to change from day to day
26
Manageability: Impact:
Practicalities of assessment
Details of:
https://www.aqa.org.uk/subjects/history/gcse/history-8145/specification/scheme-of-
assessment
35
Scheme of assessment
Content
AOs Demand Assessed
Total Mark for paper AO1 AO2 AO3 Challenging Regular Basic G10 only
40 40 Specified Marks 30 10 0 8 16 16 40
Any questions?
Use the chat box to ask any questions you have at this point.
Takeaway resources
A reminder:
Title Date
Session 2 The process of question writing 21st November 2024
Session 3 Writing and reviewing a range of question types 28th November 2024
Session 4 Designing reliable mark schemes 5th December 2024
cambridgeassessment.org.uk/the-network © Cambridge University Press & Assessment 2024
Join The Assessment Network
@AssessNetwork
Sign-up for our weekly news round-up and regular email updates about training courses
and events.
cambridgeassessment.org.uk/the-network