Lecture 3 handout Testing - Assessment
Lecture 3 handout Testing - Assessment
Teaching vs. Assessment. Mistakes and Errors in Language Learning. Feedback and
Alternative Assessment. Testing and Evaluation.
Testing:
1. Definition: A test is a formal or informal method of measuring students’ knowledge and/or
abilities in a given area. A test inevitably samples performance but infers certain general
competence on the basis of the sampled performance. (Brown, 2001)
2. Negative connotations associated with tests and testing: Stress, fear of failure, one-off
possibility to show what you know, not designed the way you expected the test to look like,
unpleasant surprises, too difficult questions, seems more like punishment than a fair chance to prove
that you made progress, etc…
3.1. Proficiency tests measure language proficiency regardless of any language courses that
candidates may have followed.
3.2. Achievement tests are intended to establish how successfully individual students, groups of
students, or the courses themselves have been in achieving objectives.
3.3.Diagnostic tests are used to identify students’ strength and weaknesses, to identify what they
know and what they do not know.
3.4. Placement tests are intended to provide information which will help to place students at the
stage or in the part of the teaching programme most appropriate to their abilities.
4. What makes a test/examination standardised, i.e. different from a classroom test prepared by
the teacher?
gives valid and reliable information on students’ knowledge on a larger scale (national,
international)
developed by testing specialists
follows a standardised procedure of development
follows a standardised practice of administration
results are evaluated by statistical methods based on testing theories
4.1.Validity is the extent to which a test measures what it is intended to measure: it is
related to test scores and the ways in which test scores are interpreted, and is therefore
always relative to test purpose.
4.2. Reliability is the extent to which the test scores are consistent: if the candidates took
the same test again tomorrow after taking it today, would they get the same result
(assuming no change in their ability)?
6. Why do we need standardised tests? - The educational context (European tendencies &
Hungary)
shift from centrally controlled input towards a more flexible decentralised system
introduction of the National Core Curriculum (identifying ‘what’ to teach giving freedom
to the ‘how’- content control),
introduction of “Frame curricula” (between NCC and the local curricula)
need to control the output more emphasis on evaluation procedures, standardisation
Sampling: the test constructor must sample what has been taught.
What?
o the course content
o the course objective
o student errors
How?
o principled choice
o mechanical choice
Tests as compromises
They are compromises by sample: we cannot - should not - include everything in
our tests.
They are compromises by method: there might be something in our sampled list
of desirables which we realise we are unable to test.
Direct/task-based tests vs. indirect/discrete point tests
The driving test is based on a task and is direct. The learner driver has to do a
task, which is very similar to the real thing.
A multiple choice grammar test is based on a number of separate discrete points
of grammar, and is indirect because what the student has to do is rather remotely
like the real thing.