Test
A test is a device or technique used to measure the performance, skill level, or
knowledge of a learner in a specific subject matter.
Or
A test is a systematic procedure for observing persons and describing them with
either a numerical scale or a category system. Thus a test may give either
qualitative or quantitative information.
(Anthony J. Nitko)
Or
A test commonly refers to a set of items or questions administered under specific conditions.
Or
A test is a series of questions, problems, or physical responses designed to determine
knowledge, intelligence, or ability.
Purposes of tests
Tests are conducted to find out whether the set objectives for a particular course,
lesson, or topic have been achieved.
Tests help the teacher determine the progress made by students in the class.
Tests help teachers determine what students have and have not learned in the
class.
Tests are used to place students or candidates into a particular class, school, level, or
employment. Such tests are called placement tests.
Tests can reveal a learner's problems or areas of difficulty.
Tests can be used to predict outcomes.
• Face validity ascertains that the measure appears to be assessing the intended
construct under study. An expert panel can easily assess face validity. This is not
a very ‘scientific’ type of validity.
• Construct validity is used to ensure that the measure is actually measuring what it is
intended to measure (i.e., the construct), and not other variables. Using a panel of
experts familiar with the construct is one way to assess this type of validity.
The experts can examine the items and decide what each specific item is
intended to measure. Students can be involved in this process to obtain their
feedback.
• Content validity ensures that the measure covers the broad range of areas within
the concept under study. Not everything can be covered, so items need to be sampled
from all of the domains. This may need to be done using a panel of ‘experts’ to ensure
that the content area is adequately sampled. Additionally, a panel can help limit ‘expert’
bias (i.e., a test reflecting what an individual personally feels are the most important or
relevant areas).
Reliability of a Test
Reliability is the degree to which an assessment tool produces stable and consistent results.
Types of reliability:
• Test-retest reliability is a measure of reliability obtained by administering the same test
twice over a period of time to a group of individuals. The scores from Time 1 and Time 2 can
then be correlated in order to evaluate the test for stability over time.
Example: A test designed to assess student learning in psychology could be given to a group
of students twice, with the second administration perhaps coming a week after the first. The
obtained correlation coefficient would indicate the stability of the scores.
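The correlation step in this example can be sketched in a few lines of Python. This is a minimal illustration, not a prescribed procedure: it assumes the Pearson correlation coefficient is the statistic used (the text above only says the scores are "correlated"), and the score lists and the function name pearson_r are hypothetical.

```python
from math import sqrt

def pearson_r(x, y):
    """Pearson correlation between two equal-length lists of scores."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two equal-length lists with at least 2 scores")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    # Sum of cross-products and sums of squared deviations from the means.
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("scores must vary for a correlation to be defined")
    return cov / sqrt(var_x * var_y)

# Hypothetical scores for the same five students, one week apart.
time1 = [72, 85, 60, 90, 78]
time2 = [70, 88, 62, 91, 75]

test_retest_r = pearson_r(time1, time2)
print(round(test_retest_r, 2))
```

A coefficient close to 1 would suggest that students kept roughly the same relative standing across the two administrations; a value near 0 would suggest unstable scores.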
• Parallel forms reliability is a measure of reliability obtained by administering different
versions of an assessment tool (both versions must contain items that probe the same
construct, skill, knowledge base, etc.) to the same group of individuals. The scores from the
two versions can then be correlated in order to evaluate the consistency of results across
alternate versions.
Example: If you wanted to evaluate the reliability of a critical thinking assessment, you
might create a large set of items that all pertain to critical thinking and then randomly split
the questions up into two sets, which would represent the parallel forms.
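The random split described in this example might look like the following sketch. The item labels, the pool size of 20, and the function name split_into_parallel_forms are all hypothetical; a real item pool would hold the actual questions.

```python
import random

def split_into_parallel_forms(items, seed=None):
    """Randomly divide an item pool into two forms of (nearly) equal size."""
    rng = random.Random(seed)   # a seed makes the split reproducible
    shuffled = list(items)      # copy so the original pool is left unchanged
    rng.shuffle(shuffled)
    half = len(shuffled) // 2
    return shuffled[:half], shuffled[half:]

# Hypothetical pool of 20 critical thinking items.
item_pool = [f"item_{i}" for i in range(1, 21)]
form_a, form_b = split_into_parallel_forms(item_pool, seed=42)

print(len(form_a), len(form_b))
```

Both forms would then be given to the same group, and the two sets of scores correlated to estimate consistency across the alternate versions.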
• Inter-rater reliability is a measure of reliability used to assess the degree to which different
judges or raters agree in their assessment decisions. Inter-rater reliability is useful
because human observers will not necessarily interpret answers the same way; raters may
disagree as to how well certain responses or materials demonstrate knowledge of the
construct or skill being assessed.
Example: Inter-rater reliability might be employed when different judges are evaluating the
degree to which art portfolios meet certain standards. Inter-rater reliability is especially
useful when judgments can be considered relatively subjective. Thus, the use of this type of
reliability would probably be more likely when evaluating artwork as opposed to math
problems.
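Agreement between two judges can be quantified in more than one way. The sketch below shows two common options that are not named in the text above and are offered only as assumptions: simple percent agreement, and Cohen's kappa, which adjusts agreement for what would be expected by chance. The ratings and function names are hypothetical.

```python
from collections import Counter

def percent_agreement(rater1, rater2):
    """Proportion of cases on which two raters gave the same rating."""
    if len(rater1) != len(rater2) or not rater1:
        raise ValueError("need two non-empty, equal-length rating lists")
    return sum(a == b for a, b in zip(rater1, rater2)) / len(rater1)

def cohens_kappa(rater1, rater2):
    """Cohen's kappa: observed agreement corrected for chance agreement."""
    n = len(rater1)
    p_observed = percent_agreement(rater1, rater2)
    counts1, counts2 = Counter(rater1), Counter(rater2)
    # Chance agreement: probability both raters pick the same category
    # if each rated independently at their own observed rates.
    p_expected = sum(counts1[c] * counts2[c] for c in counts1) / (n * n)
    if p_expected == 1:
        return 1.0  # both raters used one identical category throughout
    return (p_observed - p_expected) / (1 - p_expected)

# Hypothetical ratings of six art portfolios by two judges.
judge_a = ["meets", "meets", "below", "exceeds", "meets", "below"]
judge_b = ["meets", "below", "below", "exceeds", "meets", "meets"]

agreement = percent_agreement(judge_a, judge_b)
kappa = cohens_kappa(judge_a, judge_b)
print(round(agreement, 2), round(kappa, 2))
```

Percent agreement is easier to explain, while kappa is usually lower because it discounts matches that could occur by chance.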
Tests can be
Standardized or non-standardized
Standardized tests are tests that ensure uniformity and equality in administration, scoring,
and interpretation of results, e.g., any examination in which the same test is given in
the same manner to all students.
A non-standardized test is one that allows for an assessment of an individual's abilities or
performance, but does not allow for a fair comparison of one student to another.
Achievement test
Psychological test
Achievement test: a systematic procedure for determining the amount a student has
learned through instruction. (Gronlund)
An achievement test is designed or used as a sampling of skills or abilities in a specified area of
knowledge.
A standardized achievement test may assess any or all of reading, math, and written
language, as well as subject areas such as science and social studies,
e.g., reading tests
mathematics tests
social studies tests
Types of test items (from a diagram):
Supply type: extended response, restricted response (short essay, short answer, very short answer), completion type.
Selective type: true-false, multiple choice, matching, multiple response, assertion-reason, interpretive item.
Psychological test
A psychological test is an instrument designed to describe and measure a sample of certain aspects of
human behavior.
Psychological tests yield objective and standardized descriptions of behavior, quantified by
numerical scores.
Intelligence test
Intelligence is the global capacity of an individual to think rationally, to act purposefully, and to deal
effectively with the environment.
Intelligence tests are psychological tests that are designed to measure functions such as
reasoning, comprehension, and judgment.
Aptitude test
According to Warren, “aptitude is a set of characteristics symptomatic of an individual’s ability to
acquire, with training, some specific field of knowledge, skill, or set of responses.”
Numerical reasoning
Personality tests
Personality is the sum of activities that can be discovered by actual observation over a period of time
long enough to give reliable information.
Example :