Professional Documents
Culture Documents
Validity and Reliabillity in Language Assessment and Testing
Validity and Reliabillity in Language Assessment and Testing
Validity and Reliabillity in Language Assessment and Testing
reliability
Dr. D. Spiteri, Faculty of Education,
University of Malta
VALIDITY
CONSTRUCT
VALIDITY CONTENT
FACE
CONTENT VALIDITY
Does it reflect the teaching programme?
Does the test test what it is supposed to test?
Does it reflect the syllabus?
Does it include a representative sample of what has
been learnt?
•The lower the Forms, the
What has younger the learners, the
more careful we have to be
been taught that our test has content
validity.
Test
It is easy to sample haphazardly and end up being
unfair to some learners.
RECOGNISE
15
% OF SYLLABUS
statements, questions,
negatives, short
All persons,
answers
WHAT EXACTLY ARE WE
TEACHING?
Present simple
Grammar:
SYLLABUS ITEM
P
% RE R
NUMBE
OF CO O
SYLLABUS WHAT EXACTLY ARE WE R OF
SYL GN D
ITEM TEACHING? ITEMS
LAB IS U
IN TEST
US E C
E
Reading Scanning / skimming / 25 25/100
skills working out meaning from
context
Speaking Apologizing / giving reasons 25 25/100
skills / offering solutions
A letter of apology / a letter
Writing explaining reasons 25/100
25
skills
Conversation – row,
25 25/100
Listening expressing regret / giving
skills
Construct validity
Does the test test what it is supposed to - and nothing else?
How happy would you be if you found out that the pilot
flying your plane got his license after studying lots of
books?
How valid is a driving test in which the learner did not
drive a car?
How valid is a test of physical stamina if a young person is
asked to walk around the University ring road?
How valid is a speaking test if students only answer questions
but never ask one?
How valid is a speaking test if the questions are on general
knowledge?
How valid is a reading test which requires me to write long
answers?
How valid is a reading test where the teacher removes marks
for my spelling and grammar mistakes?
..... Construct validity
Don’t stick to texts only – film listings,
recipes, timetables, instructions, directions,
are also texts meant for reading;
Use a recording;
VALID
A GOOD
TEST
GOOD
RELIABLE BACKWASH
Reliability
Scorer reliability
Test reliability Intra-scorer reliability
If it was possible to If the same person marked
give the same person the same test twice, would
the same test at the they give the same mark?
same time, would the Inter-scorer reliability
result be the same? If two people marked the
same test, would they give
the same score?
Test reliability - how can we improve it
4. Include an example.
VALID
A GOOD
TEST
GOOD
RELIABLE BACKWASH
Backwash
Backwash is a term that describes the effect that a test has on the
teaching programme that leads to it.
When we design a test we need to consider what effect the test will
have on people.