Professional Documents
Culture Documents
An Introduction Into (English) Language Testing
An Introduction Into (English) Language Testing
What is assessment?
A bit of history
Language testing history [Until about 1980], language was basically seen to be grammar: that eventually came to be regarded as too distant, too abstract.
(Davies 2008)
Language testing history [In the 1980s], language was reckoned to be a set of real life encounters and experiences and tasks, a view which took real life testing so seriously that it lost both objectivity and generality.
(Davies 2008)
Language testing history [From the 1990s] there has been a compromise between these two positions, where language is viewed as being about communication but in order to make contact with that communication it is considered necessary to employ some kind of distancing from the mush of general goings on that make up our daily life in language.
(Davies 2008)
History Concepts
Test Evaluation Assessment Reliability Validity Face validity Authenticity
Construction
Criteria Teaching Closing
History Concepts
Test Evaluation Assessment Reliability Validity Face validity Authenticity
Evaluation
The judgement made about a test takers ability, knowledge or intelligence, based on his/her test performance.
(Cf. Douglas 2000, Lynch 2003)
Construction
Criteria Teaching Closing
History Concepts
Test Evaluation Assessment Reliability Validity Face validity Authenticity
Evaluation
Assessment
Judging the ability of a learner based on a test or otherwise and using this judgement as a constructive element in learning over time.
(Cf. Gipps 1994, Lynch 2005)
Construction
Criteria Teaching Closing
History Concepts
Test Evaluation Assessment Reliability Validity Face validity Authenticity
Construction
Criteria Teaching Closing
History Concepts
Test Evaluation Assessment Reliability Validity Face validity Authenticity
Construction
Criteria Teaching Closing
History Concepts
Test Evaluation Assessment Reliability Validity Face validity Authenticity
Construction
Criteria Teaching Closing
History Concepts
Test Evaluation Assessment Reliability Validity Face validity Authenticity
Construction
Criteria Teaching Closing
History Concepts
Test Evaluation Assessment Reliability Validity Face validity Authenticity
Construction
Criteria Teaching Closing
History Concepts
Test Evaluation Assessment Reliability Validity Face validity Authenticity
Construction
Criteria Teaching Closing
History Concepts
Test Evaluation Assessment Reliability Validity Face validity Authenticity
Construction
Criteria Teaching Closing
History Concepts
Test Evaluation Assessment Reliability Validity Face validity Authenticity
Construction
Criteria Teaching Closing
History Concepts
Test Evaluation Assessment Reliability Validity Face validity Authenticity
Construction
Criteria Teaching Closing
Make sure you and your students know what you want to test!
History Concepts
Test Evaluation Assessment Reliability Validity Face validity Authenticity
Construction
Criteria Teaching Closing
History Concepts
Test Evaluation Assessment Reliability Validity Face validity Authenticity
Construction
Criteria Teaching Closing
History Concepts
Test Evaluation Assessment Reliability Validity Face validity Authenticity
Construction
Criteria Teaching Closing
History Concepts
Test Evaluation Assessment Reliability Validity Face validity Authenticity
Construction
Criteria Teaching Closing
History Concepts
Test Evaluation Assessment Reliability Validity Face validity Authenticity
Construction
Criteria Teaching Closing
History Concepts
Test Evaluation Assessment Reliability Validity Face validity Authenticity
Construction
Criteria Teaching Closing
We dont use if-clauses. If you know something, you write about it. If you dont know for sure, you dont mention it.
History Concepts
Test Evaluation Assessment Reliability Validity Face validity Authenticity
Construction
Criteria Teaching Closing
WHAT HOW
Test specifications:
-
HOW
Test specifications
Task types
discrete point / integrated / non authentic / simulated authentic / genuine authentic / multiple choice / ranking / hotspot / true-false / matching / structuring / fill in the gaps / cloze / C-cloze / semi-open / open answer / diary / portfolio / syllabus task / problem-based task / product assessment / process assessment / oral / written / computer-based / paper-based / self assessment / peer assessment/ co assessment/ tutor assessment / in-class observation / fixed-point testing / norm referencing / criterion referencing /
Teaching
Closing
The CEF
History Concepts Construction Criteria
CEF Rubric
2001 / Council of Europe Goals: - Encouraging reflection - Fuelling discussion - Creating common language
One of the aims of the Framework is to help partners to describe the levels of proficiency required by existing standards, tests and examinations in order to facilitate comparisons between different systems of qualifications.
Teaching
Closing
CEF: system
History Concepts Construction Criteria
CEF Rubric
Skilled User
C2 C1 B2 B1 A2 A1
Teaching
Independent user
Closing
Basic user
CEF: influence
History Concepts Construction Criteria
CEF Rubric
DIALANG
Teaching
Closing
Handboeken
CEF
Didactiek
EUROPASS
Talenscholen
CEF: influence
History Concepts Construction Criteria
CEF Rubric
DIALANG
Teaching
Closing
Handboeken
CEF
Didactiek
EUROPASS
Talenscholen
CEF: influence
History Concepts Construction Criteria
CEF Rubric
DIALANG
Teaching
Closing
Handboeken
CEF
Didactiek
EUROPASS
Talenscholen
CEF: influence
History Concepts Construction Criteria
CEF Rubric
DIALANG
Teaching
Closing
Handboeken
CEF
Didactiek
EUROPASS
Talenscholen
CEF: influence
History Concepts Construction Criteria
CEF Rubric
DIALANG
Teaching
Closing
Handboeken
CEF
Didactiek
EUROPASS
Talenscholen
CEF: influence
History Concepts Construction Criteria
CEF Rubric
DIALANG
Teaching
Closing
Handboeken
CEF
Didactiek
EUROPASS
Talenscholen
Teaching
Closing
CEF: give it a go
History Concepts Construction Criteria
CEF Rubric
www.ceftrain.net
Teaching
Closing
Step 1 Specification
Internal validity
Teaching
Linking benchmarked items to the CEF Linking test answers to the CEF
Implementation
Confirmation
Holistic rubrics
Teaching
Closing
Dichotomous rubrics
Well-paced flow Yes 1 No 0
Teaching
Closing
Message is clear Acceptable pronunciation Effective use of grammar Effective use of vocabulary .
1 1 1 1
0 0 0 0
Band rubrics
Teaching
Closing
Descriptive statistics
How difficult is my test? Average
Descriptive statistics
How difficult is my test? Average Standard Deviation
Descriptive statistics
How difficult is my test? Average Standard Deviation
Voorbeeld: max = 100 | Ave = 50 | StDev = 10 68,3 % = 40 60 95,4 % = 30 70 99, 7% = 20 - 80 68,3% max 1 x SD 95,4% max 2 x SD 99,7% max 3 x SD
Descriptive statistics
Average or median?
Descriptive statistics
How difficult is my test? Average or median?
Descriptive statistics
Descriptive statistics
Descriptive statistics
Descriptive statistics
Correlations Is this test as difficult as last years test? Is group A as proficient as group B?
Correlations
Test 1 1 2 3 4 5 6 7 8 Test 2 1 2
8 12 10
3 4 5 6 7 8
2 6 Series1 Series2
0 0 2 4 6 8 10 12
9
10
9
10
Corr = + 1
Correlations
Test 1 1 Test 2 10
2
3 4 5 6 7 8 9 10
9
8 7 6 5 4 3 2 1
Corr= - 1
Correlations
1,00 2,00 3,00 4,00 5,00 6,00 7,00 8,00 6,00 3,00 5,00 1,00 6,00 8,00 2,00 4,00
Corr= + .05
Correlations
Correlations
Split-half reliability
Is the level of difficulty consistent within the test?
Set 1 #1 #3 #5
Set 2 #2 #4 #6
TOT_100
80 ,0 0
70 ,0 0
60 ,0 0
11 ,0 0
12 ,0 0
13 ,0 0
14 ,0 0
15 ,0 0
16 ,0 0
Corr = + 1
TOT_20
Split-half reliability
Set 1 #1 #3 #5
Set 2 #2 #4 #6
16 ,0 0
SCORE_ADMITTED
14 ,0 0
12 ,0 0
10 ,0 0 2,00 3,00
4,00
5,00
6,00
Corr = + .66
EX_4
Discriminating potential Does this question separate high achievers from weaker students?
Cronbachs Alpha
Cronbachs Alpha
Cronbachs Alpha
Task 2: Test construction For one of your classes, create a test which is motivating, valid and reliable.