English 9 Q3 3 - Validity
INTRODUCTION
1. Content validity
2. Criterion-related validity
3. Other forms of evidence for construct validity
4. Validity in scoring
5. Face validity
6. How to make tests more valid
Content validity
It refers to how accurately an
assessment or measurement tool
taps into various aspects of the
specific construct in question. In
other words, do the questions
really assess the construct in
question?
A test needs to be related to the
content of the class (relevant
content).
HOW TO JUDGE IT? We need
a specification of the skills or
structures that the test is meant
to cover.
Not all the course content
needs to appear in the test.
The importance of content validity
Language specifications provide the test
constructor with a basis
for making a principled selection of
elements to include in the test.
A comparison between test
specification and test content is the basis
for judgments related to content validity.
The greater a test’s content validity, the
more likely it is to be an
accurate measure of what it is supposed
to measure.
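The comparison between test specification and test content described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the skill labels, the item names and the function `compare_to_specification` are hypothetical and are not taken from the CEFR or from any real test.

```python
def compare_to_specification(specification, test_items):
    """Compare the skills a test covers with the skills in its specification.

    specification: set of skill labels the test is meant to cover.
    test_items: dict mapping each item name to the set of skills it assesses.
    """
    tested = set()
    for skills in test_items.values():
        tested.update(skills)
    return {
        "covered": sorted(specification & tested),
        "not_tested": sorted(specification - tested),
        "outside_specification": sorted(tested - specification),
    }


# Hypothetical specification and test content (illustrative labels only).
specification = {"greetings", "numbers", "present simple", "personal information"}
test_items = {
    "item 1": {"greetings"},
    "item 2": {"present simple", "numbers"},
    "item 3": {"past perfect"},
}

report = compare_to_specification(specification, test_items)
for label, skills in report.items():
    print(label, skills)
```

Since not all the course content needs to appear in the test, a skill listed under `not_tested` is not necessarily a fault. An item listed under `outside_specification` is the clearer warning sign, because it assesses something the test was not meant to cover.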
ACTIVITY 1
1. According to the CEFR, which
specification of the language
skills would you need to take into account
to test an A1 student?
PLEASE USE YOUR GADGETS TO ACCESS
THE CEFR
LANGUAGE SPECIFICATION.
2. Do you think teachers care about those
specifications while
testing a student?
Criterion-related validity
It is the degree to which test results agree
with those provided by some independent
and highly dependable assessment of the
candidate’s ability.
The independent assessment is the
criterion measure against which the test is
validated.
TWO TYPES:
CONCURRENT VALIDITY
PREDICTIVE VALIDITY
CONCURRENT VALIDITY
It refers to the extent to which the
results of a particular test or
measurement correspond to those of a
previously established measurement for
the same construct.
Is it possible to test everything you need
to test in a short time?
This will always depend on how many
functions are tested in the
component, and how representative they
are of the complete
set of functions included in the objectives.
How is the level of agreement
measured?
Using the “correlation coefficient”.
This is a mathematical measure of
similarity.
Perfect agreement = 1
Total lack of agreement = 0
Whether the level of agreement is regarded as
satisfactory depends on
the purpose of the test and the
decisions that are made based on it.
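As a minimal sketch of how such a coefficient could be computed, the Python below uses Pearson's correlation coefficient. The slides do not say which coefficient is meant, so Pearson's r is an assumption here, and the scores are invented purely for illustration.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length score lists."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two score lists of the same length (at least 2)")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    # Sum of products of deviations from each mean.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("scores must vary for a correlation to be defined")
    return cov / sqrt(var_x * var_y)


# Hypothetical scores for the same five candidates on two measures.
new_test = [12, 15, 9, 18, 14]    # the test being validated
criterion = [55, 68, 40, 80, 62]  # the independent criterion measure

r = pearson_r(new_test, criterion)
print(round(r, 2))
```

A value close to 1 means the two sets of scores rank and space the candidates in nearly the same way. Pearson's r can also be negative, down to -1, when high scores on one measure go with low scores on the other.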
PREDICTIVE VALIDITY
This topic concerns the degree to which a
test can predict a candidate’s future
performance.
How helpful is it to use final outcomes as
the criterion measure when so many factors
other than ability in English (such as subject
knowledge, intelligence, motivation, health
and happiness) will have contributed to
every outcome?
Example: placement tests
Other forms of evidence for construct
validity
We cannot be sure that the items of the
test are measuring what we expect them to
measure.
Construct validity: “construct” refers to
any underlying ability that is hypothesized
in a theory of language ability.
It is important to establish if distinct
abilities exist, if they can be measured and
if they are measured in a test.
Research is needed for evidence.
Another way of obtaining evidence
about the construct validity of a
test is to investigate what test takers
actually do when they respond
to an item.
TWO PRINCIPAL METHODS:
THINK ALOUD
Test takers voice their thoughts as
they respond to the item.
Problem: The very voicing of thoughts
may interfere with what would be the
natural response to the item.
RETROSPECTION
Test takers try to recollect what
their thinking was as they
responded.