How To Design Examiantion Test Item That Are Credible
How To Design Examiantion Test Item That Are Credible
Pre-Amble.
This document has been prepared so that we can share success stories on our learners. In
addition, we will synchronise our working system so that we resonate from the same hymn book
and mould our own culture in setting of examination test items that meet the exact requirements
from the National Examining Body An attempt has been made to reflect on the good and poorly
designed test items. l hope you are going to enjoy reading the work and gain some important
information that will help you to do the right thing.
The most important factor in item writing is critical thinking.
What is critical thinking?
This is the process of analyzing and understanding how and why we reached certain conclusions.
In order to be a critical thinker, One must be able to:
•More Clear
•More Accurate
•More Defensible
Bloom’s taxonomy is key when we set lest items.
Understanding: Constructing meaning from oral, & written, and graphic messages
interpreting, exemplifying, classifying summarizing, inferring, comparing, and explaining
Analyzing Breaking material into constituent parts, determining how the parts relate to
one another and 10 an overall structure or purpose through differentiating, organizing,
and attributing.
Evaluating: Making judgments based on criteria and standards although checking and
critiquing
However, in item design, the syllabus has objectives which have the key words that the learners
should observe so that they will meet the objective set and the teacher can safely monitor
performance lagging gaps and plug loopholes before final examination.
•Remembering: Can the student recall or remember the information? define, duplicate, list,
memorize, recall, repeat, reproduce, state
•Understanding: Can the student explain ideas or concepts? classify, describe, discuss, explain,
identify, locate, recognize, report, select, translate, paraphrase
•Applying: Can the student use the information in a new way? choose, demonstrate, dramatize,
employ, illustrate, interpret, operate, schedule, sketch, solve, use. write.
•Analyzing: Can the student distinguish between the different parts? appraise, compare, contrast,
criticize, differentiate, discriminate, distinguish, examine, experiment, question, test
•Evaluating: Can the student justify a stand or decision? appraise, argue, defend, judge, select,
support, value, evaluate
•Creating: Can the student create new product or point of view? assemble, construct, create,
design, develop, formulate. write
TESTING COMMITTE
School Testing committee to design and revise the school's testing policies
•Examination Development Administration
•Determine acceptable statistical parameters for Test Items and Examinations.
If a test is not reliable and valid, then the student scores are not reliable and valid and not indicative
of the students understanding of the topic
•WRONG—they would have previous experience with the exam and such experience would
influence their scores on the second examination experience.
•Statistical measures that describe the internal consistency (reliability) of examination scores using
the result of only one test
Validity
Does the test measure what it claims to measure?
The presence of even a few poor items on an exam reduces reliability, sometimes to a great extent
Item Analysis
The more items you have, the greater the chance of poor test items.
D Length of examination
50 items
NB. Now that you have read and understood the background behind making or setting the Item
Writing Guidelines
General Guidelines
l) Do Not Use Trick Items
• Items are tricky either intentionally or accidentally.
• Trick items are unfair to examinees and threaten the validity of the test.
• Trick items measure test-taking skills more than the intended construct.
• Trick items heighten test anxiety and cause examinees to mistrust the intent of all other items.
• Causes of unintentional trickiness.
• Trivial content o Discrimination between options is too fine o Overlapping options o Irrelevant
content o Single answer allowed, but multiple correct answers possible o Ambiguity in either
the stem or options
2) Measure a Single Construct
• If an examinee incorrectly answers an item that has multiple constructs, it is impossible to
know which construct is not mastered.
• Items are generally scored dichotomously, so the only inference that can be made is that the
examinee knows the entire item or none of the item.
• Compound items heighten test anxiety and can lower the perceived validity of the exam.
3) Avoid Opinion-Based Items
• Never ask "What would you ... do" use" try", etc. The examinee's answer can never be wrong.
• Use caution when asking for the "best" thing, or the "best" way of doing something, unless it is
clearly the best amongst the options.
• If differences exist between any experts' opinion about what the "best" is, then avoid using it.
• Qualify the standard for "best" (i.e., according to ... )
4) A void Absolute Modifiers such as always, never, only and none.
• The use of absolute modifiers in options makes it easy to eliminate options, increasing the
guessing probability.
5) Avoiding Excessive Verbiage
• "Verbosity is an enemy to clarity. " (Haladyna, 2004)
• Wordy items take longer to read and answer, meaning fewer items can be presented in a fixed
amount of time, reducing reliability.
• Write items as briefly as possible without compromising the construct and cognitive demand
required.
• Get to the point in the stem and present clean, clear options for the examinee to choose.
• Avoid unnecessary background information
6) Avoid over-specific or over-general content
• Over-specific content tends to be trivial
• Over-general content tends to be ambiguous.
7) Use Novel Content
• Do not repeat exact wording from training materials.
• Repeated wording tends to test recall and recognition, rather than learning.
B 50
c. 0,50
NB: This test item has not test a clear objective it intends to test. lt can be interpreted
as five sets of 10 which brings the answer 50, or it can be reduced to lowest terms and
a half is regarded the answer. Moreso, the same half is represented as 0,5.The other
learner may put Roman numeral as he might have grasped concept on Roman
Numerals. ln a nutshell the test item is tricky, or vague. lt is not a good thing to develop
unclear test items.
The above has just been a sample and more examples shall be displayed during
the training development session.
The document has outlined the strength and weakness that we seek to highlight so that
we can do the right thing setting of papers which are fool proof and item to measure an
objective dimension it purports to measure and bring about good results that are
credible, reliable and valid.