Language Assessment
NIM : 204200043
Class : TBI B
Chapter 1
ASSESSMENT CONCEPTS AND ISSUES
Tests, a subset of assessment, are one genre of assessment techniques. They are prepared
administrative procedures that occur at identifiable times in a curriculum, when learners
muster all their faculties to offer peak performance, knowing that their responses are being
measured and evaluated. In scientific terms, a test is a method of measuring a person's
ability, knowledge, or performance in a given domain.
Evaluation does not necessarily entail testing; rather, evaluation is involved when the results
of a test (or other assessment procedure) are used to make decisions (Bachman, 1990, pp. 22-
23). Evaluation involves the interpretation of information. Simply recording numbers or
making check marks on a chart does not constitute evaluation.
Assessment and Learning. Although tests can be useful devices, they are only one among many
procedures and tasks that teachers can ultimately use to assess (and measure) students. For
optimal learning to take place, students in the classroom must have the freedom to experiment,
to try out their own hypotheses about language without feeling that their overall competence
is being judged in terms of those trials and errors. In the same way, tournament tennis players
must, before a tournament, have the freedom to practice their skills with no implications for
their final placement on that day of days.
Informal assessment can take a number of forms, starting with incidental, unplanned
comments and responses, along with coaching and other impromptu feedback to the student.
Examples include putting a smiley face on homework or saying “Nice job!” or “Good work!”
Informal assessment is virtually always nonjudgmental, in that you as a teacher are not making
ultimate decisions about the student’s performance.
Formal assessments are exercises or procedures specifically designed to tap into a storehouse
of skills and knowledge. They are systematic, planned sampling techniques constructed to give
teacher and student an appraisal of student achievement. To extend the tennis analogy, formal
assessments are the tournament games that occur periodically in the course of a regimen of
practice.
Formative assessment evaluates students in the process of “forming” their competencies
and skills, with the goal of helping them to continue that growth process. The key to such
formation is the delivery (by the teacher) and internalization (by the student) of appropriate
feedback on performance, with an eye toward the future continuation (or formation) of
learning.
Summative assessment aims to measure, or summarize, what a student has grasped and
typically occurs at the end of a course or unit of instruction. A summation of what a student
has learned implies looking back and taking stock of how well that student has accomplished
objectives, but it does not necessarily point to future progress. Final exams in a course and
general proficiency exams are examples of summative assessment. Summative assessment
often, but not always, involves evaluation (decision making).
Criterion-referenced tests are designed to give test-takers feedback, usually in the form of
grades, on specific course or lesson objectives. Classroom tests involving students in only
one course and connected to a particular curriculum are typical of criterion-referenced
testing.
A diagnostic test is designed to identify aspects of a language that a student needs to develop
or that a course should include. A test of pronunciation, for example, might diagnose the
phonological features of English that are difficult for learners and should therefore become
part of a curriculum.
Placement tests serve to place a student into a particular level or section of a language
curriculum or school. A placement test usually, but not always, includes a sampling of the
material to be covered in the various courses in a curriculum; a student’s performance on the
test should indicate the point at which the student will find material neither too easy nor too
difficult but appropriately challenging.
A proficiency test is not limited to any one course, curriculum, or single skill in the language;
rather, it tests overall ability. Proficiency tests have traditionally consisted of standardized
multiple-choice items on grammar, vocabulary, reading comprehension, and aural
comprehension. Many commercially produced proficiency tests (the TOEFL, for example)
include a sample of writing as well as oral production performance.
An aptitude test is designed to measure capacity or general ability to learn a foreign language a
priori (before taking a course) and ultimate predicted success in that undertaking. Language
aptitude tests were ostensibly designed to apply to the classroom learning of any language.
Traditional and “Alternative” Assessment. Research and practice during the 1990s provided
compelling arguments against the notion that all people and all skills could be measured by
traditional tests. The result was the emergence of what came to be labeled alternative
assessment.
Tests of pragmatics have primarily been informed by research in interlanguage and cross-
cultural pragmatics (Bardovi-Harlig & Hartford, 2016; Blum-Kulka, House, & Kasper, 1989;
Kasper & Rose, 2002; Stadler, 2013). Much of pragmatics research has focused on speech acts
(e.g., requests, apologies, refusals, compliments, advice, complaints, agreements, and
disagreements).
Chapter 2
PRINCIPLES OF LANGUAGE ASSESSMENT
A reliable test is consistent and dependable. If you give the same test to the same student or
matched students on two different occasions, the test should yield similar results.
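As a rough illustration of this consistency, the minimal Python sketch below estimates test-retest reliability as the Pearson correlation between two sittings of the same test. The correlation statistic and the student scores are assumptions for illustration only; the chapter itself does not prescribe a formula.

```python
# Test-retest reliability sketch: ten students take the same test twice,
# and reliability is estimated as the Pearson correlation between the two
# sets of scores. All scores here are invented for illustration.

def pearson(xs, ys):
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sum((x - mx) ** 2 for x in xs) ** 0.5
    sy = sum((y - my) ** 2 for y in ys) ** 0.5
    return cov / (sx * sy)

first_sitting = [72, 85, 90, 64, 78, 88, 55, 69, 81, 93]
second_sitting = [75, 83, 92, 61, 80, 85, 58, 72, 79, 95]

# A coefficient close to 1.0 suggests the test yields similar results.
print(f"test-retest reliability estimate: r = {pearson(first_sitting, second_sitting):.2f}")
```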
Rater Reliability. Human error, subjectivity, and bias may enter into the scoring process.
Inter-rater reliability is achieved when two or more scorers yield consistent scores on the same test.
Failure to achieve inter-rater reliability could stem from lack of adherence to scoring criteria,
inexperience, inattention, or even preconceived biases. Lumley (2002) provided some helpful
hints to ensure inter-rater reliability.
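One common way to quantify inter-rater agreement is Cohen's kappa, which corrects raw agreement for the agreement expected by chance. The sketch below is illustrative only: the kappa statistic and the band scores from two hypothetical raters are assumptions, not procedures prescribed by Lumley (2002) or the chapter.

```python
# Inter-rater reliability sketch using Cohen's kappa: two raters assign
# each essay a band score from 1 to 5, and kappa measures their agreement
# beyond chance. The ratings are invented for illustration.

from collections import Counter

def cohens_kappa(rater_a, rater_b):
    n = len(rater_a)
    # Proportion of essays on which the two raters agree exactly.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Agreement expected by chance, from each rater's score distribution.
    counts_a, counts_b = Counter(rater_a), Counter(rater_b)
    expected = sum(counts_a[c] * counts_b[c] for c in counts_a) / n ** 2
    return (observed - expected) / (1 - expected)

rater_1 = [4, 3, 5, 2, 4, 3, 5, 4]
rater_2 = [4, 3, 4, 2, 4, 2, 5, 4]

# Values near 1.0 indicate strong agreement; values near 0 indicate
# agreement no better than chance.
print(f"Cohen's kappa: {cohens_kappa(rater_1, rater_2):.2f}")
```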
Test Administration Reliability. Unreliability may also result from the conditions in which
the test is administered.
Validity. By far the most complex criterion of an effective test and arguably the most important
principle is validity, “the extent to which inferences made from assessment results are
appropriate, meaningful, and useful in terms of the purpose of the assessment” (Gronlund,
1998, p. 226).
Content-Related Evidence. If a test actually samples the subject matter about which
conclusions are to be drawn, and if it requires the test-taker to perform the behavior measured,
it can claim content-related evidence of validity, often popularly referred to as content-related
validity (e.g., Hughes, 2003; Mousavi, 2009).
Criterion-Related Evidence. A second form of evidence of the validity of a test may be found
in what is called criterion-related evidence, also referred to as criterion-related validity, or the
extent to which the “criterion” of the test has actually been reached.
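As one illustration of how such evidence might be gathered, the sketch below correlates hypothetical placement-test scores with a later criterion measure, here end-of-course grades. The use of a correlation coefficient and all of the data are assumptions for illustration; the chapter does not specify this procedure.

```python
# Criterion-related (predictive) evidence sketch: if the placement test
# reaches its criterion, scores should correlate strongly with the later
# criterion measure. Data are invented for illustration.

from statistics import correlation  # Pearson r; Python 3.10+

placement_scores = [52, 67, 74, 81, 60, 88, 45, 70]
course_grades = [58, 70, 72, 85, 63, 90, 50, 68]

print(f"predictive validity estimate: r = {correlation(placement_scores, course_grades):.2f}")
```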
Construct-Related Evidence. A third kind of evidence that can support validity, but one that
does not play as large a role for classroom teachers, is construct-related validity, commonly
referred to as construct validity.
Face validity refers to “the degree to which a test looks right, and appears to measure the
knowledge or abilities it claims to measure, based on the subjective judgment of the examinees
who take it, the administrative personnel who decide on its use, and other psychometrically
unsophisticated observers” (Mousavi, 2009, p. 247).
Authenticity. A fourth major principle of language testing is authenticity, a concept that is
difficult to define, especially within the art and science of evaluating and designing tests.
Bachman and Palmer (1996) defined authenticity as “the degree of correspondence of the
characteristics of a given language test task to the features of a target language task” (p. 23)
and then suggested an agenda for identifying those target language tasks and for transforming
them into valid test items.
Washback. A facet of consequential validity is “the effect of testing on teaching and learning”
(Hughes, 2003, p. 1), otherwise known in the language assessment field as washback. Messick
(1996, p. 241) reminded us that the washback effect may refer to both the promotion and the
inhibition of learning, thus emphasizing what may be referred to as beneficial versus harmful
(or negative) washback. Alderson and Wall (1993) considered washback an important enough
concept to define a washback hypothesis that essentially elaborated on how tests influence both
teaching and learning. Cheng, Watanabe, and Curtis (2004) devoted an entire anthology to the
issue of washback, and Spratt (2005) challenged teachers to become agents of beneficial
washback in their language classrooms. (See Cheng, 2014, for a more recent discussion of this
topic.)