Policy Brief 7A
How the use of item analysis can benefit learners and improve test quality
February 2018
Prepared by NEEDU
In April 2017, the Minister of Basic Education commissioned the National Education Evaluation and Development Unit (NEEDU) to conduct the Schools that Work II study. This study sought to examine the characteristics of top-performing schools in South Africa. The best practices discussed in this advocacy brief are based on the findings of that study. The full report is available on the Department of Basic Education website: www.education.gov.za. NEEDU can be reached at (012) 357 4231.

Teachers consistently assess learners to establish if the instruction, “the dose,” has been successful; that is, if all learners have mastered the concepts and skills taught in class. However, unlike in the medical field,
conduct an item analysis to improve the quality of the items in a test or an exam.

TEST ITEM ANALYSIS

Item analysis is the process of examining class-wide performance on the individual items of a test, rather than on the test as a whole. The following are three common types of item analysis:

• Item Difficulty Index
• Item Discrimination Index
• Analysis of Response Options

Described next are procedures that teachers in top-performing schools use to calculate the Difficulty Index and the Discrimination Index and to analyse response options.

ITEM DIFFICULTY INDEX

What is an Item Difficulty Index?

• It is the proportion of learners taking a test who answered a test item correctly
• It addresses the question: How difficult is the item?

How is the Difficulty Index calculated?

• ITEM DIFFICULTY FORMULA:
① Count the number of learners who got the correct answer
② Divide by the total number of learners who took the test

Table 1: Computing the difficulty of an item

LEARNER    QUESTIONS
NAME       1  2  3  4  5  6  7  8  9  10
ZAMI       +  +  +  +  +  +  +  +  +  +
MAPHORI    +  x  +  x  x  +  x  +  x  x
ROSE       +  +  x  +  +  +  +  +  x  x
BHEKI      +  +  +  +  +  +  +  x  +  +
SAHEDA     +  +  x  +  x  +  x  +  x  x
BONGI      +  +  x  +  x  +  +  x  +  x
GUGU       +  x  x  +  x  +  x  x  +  x
BRUCE      +  x  x  x  +  x  x  +  x  x
RONALD     +  +  +  +  +  x  +  x  +  +
BARBARA    x  x  x  x  x  +  x  +  x  x

Applying the Item Difficulty Index formula described above, the difficulty indices for Questions 1 and 10 can be calculated as follows:

EXAMPLE 1: HOW AN ITEM DIFFICULTY INDEX IS CALCULATED:
• The difficulty of Question No. 1: the number of learners who answered this question correctly is 9 out of 10 learners, or 9/10, or 90%, or 0.90. The p-value for Question 1 is therefore 0.90
• The difficulty of Question No. 10: the number of learners who answered this question correctly is 3 out of 10 learners, or 3/10, or 30%, or 0.30. The p-value for Question 10 is therefore 0.30

How are the results of the Difficulty Index analysis interpreted?

• The higher the p-value, the easier the item. That is, the higher the difficulty index, the easier the item is understood to be
• A rough "rule-of-thumb" is that:
① Easy Item = 70% and above or a p-value of 0.70 and above
② Moderate Item = 30-69% or a p-value of 0.30-0.69
③ Difficult Item = 29% and below or a p-value of 0.29 and below
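
The two-step difficulty formula and the rule of thumb above can be sketched in a few lines of Python. This is only an illustrative sketch, not part of the NEEDU procedure: the names `TABLE_1`, `difficulty_index` and `classify_difficulty` are hypothetical, and the responses are typed in from Table 1 as strings of "+" (correct) and "x" (incorrect).

```python
# Responses from Table 1, one string per learner covering Questions 1-10.
# "+" = correct, "x" = incorrect. The variable name is illustrative only.
TABLE_1 = {
    "ZAMI":    "++++++++++",
    "MAPHORI": "+x+xx+x+xx",
    "ROSE":    "++x+++++xx",
    "BHEKI":   "+++++++x++",
    "SAHEDA":  "++x+x+x+xx",
    "BONGI":   "++x+x++x+x",
    "GUGU":    "+xx+x+xx+x",
    "BRUCE":   "+xxx+xx+xx",
    "RONALD":  "+++++x+x++",
    "BARBARA": "xxxxx+x+xx",
}


def difficulty_index(responses, question):
    """Return the p-value for a question (numbered from 1).

    Step 1: count the learners who got the item correct.
    Step 2: divide by the number of learners who took the test.
    """
    correct = sum(1 for row in responses.values() if row[question - 1] == "+")
    return correct / len(responses)


def classify_difficulty(p_value):
    """Apply the brief's rough rule of thumb to a p-value.

    Assumption: values that fall between the published bands
    (for example 0.695) are assigned to the lower band.
    """
    if p_value >= 0.70:
        return "easy"
    if p_value >= 0.30:
        return "moderate"
    return "difficult"


if __name__ == "__main__":
    for q in range(1, 11):
        p = difficulty_index(TABLE_1, q)
        print(f"Question {q}: p-value {p:.2f} ({classify_difficulty(p)})")
```

For the Table 1 data this gives a p-value of 0.90 for Question 1 and 0.30 for Question 10, the same figures as in Example 1.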
N-07A/2018
ITEM DISCRIMINATION INDEX (DI)

Why calculate a Discrimination Index?

• To enable teachers to identify items that are not able to differentiate between learners who have learned the content and those who have not
• To improve the quality of the items in a test, thereby improving both reliability and validity of the test

What is an Item Discrimination Index?

• It is a measure of an item's ability to discriminate or differentiate between those who scored high on the total test and those who scored low
• It is the point biserial correlation between getting the item right and the total score on all other items
• It seeks to establish whether getting an item correct or not is due to learners' level of knowledge or ability and not due to something else such as chance or a test bias
• It addresses the question: Does the item discriminate between high and low achievers?

How is Discrimination Index calculated?

• ITEM DISCRIMINATION FORMULA:
• After marking the test:
① Arrange or sort the test by total marks from the highest marks to the lowest marks so that the highest total marks appear at the top (see Table 2)
② Create two groupings of tests: (a) the upper group with high marks and (b) the lower group with low marks (see Table 2)
③ Count the number of learners who got each item correct in each group, i.e. upper and lower
④ Calculate the Difficulty Index for each item in the test and get a p-value for each group
⑤ Subtract the Difficulty Index for the lower group from the Difficulty Index for the upper group to get the DI (i.e. Discrimination Index).
• PLEASE NOTE:
• Values are calculated using the Pearson Correlation Coefficient.
• Values can range as follows:
  Range = (+1) --- 0 --- (-1)
  (+1) = Maximum size; 0 = Zero; (-1) = Minimum size
• A positive discrimination index (between 0 and 1) indicates that learners who received a high total mark chose the correct answer for a specific item more often than the learners who had a lower overall mark.
• If, on the other hand, more of the low-performing learners got a specific item correct, then the item has a negative discrimination index (between -1 and 0).

Table 2 displays the same results as in Table 1 but here results are arranged with the top overall performers at the top of the table (shaded in yellow) and the lower group at the bottom of the table (shaded in grey).

Table 2: Computing the DI of an item

LEARNER    QUESTIONS                         TOTAL      GROUP
NAME       1  2  3  4  5  6  7  8  9  10     MARK (%)
ZAMI       +  +  +  +  +  +  +  +  +  +      100        Upper
BHEKI      +  +  +  +  +  +  +  x  +  +      90         Upper
RONALD     +  +  +  +  +  x  +  x  +  +      80         Upper
ROSE       +  +  x  +  +  +  +  +  x  x      70         Upper
BONGI      +  +  x  +  x  +  +  x  +  x      60         Upper
SAHEDA     +  +  x  +  x  +  x  +  x  x      50         Lower
GUGU       +  x  x  +  x  +  x  x  +  x      40         Lower
MAPHORI    +  x  +  x  x  +  x  +  x  x      40         Lower
BRUCE      +  x  x  x  +  x  x  +  x  x      30         Lower
BARBARA    x  x  x  x  x  +  x  +  x  x      20         Lower
"+" indicates the answer was correct; "x" indicates it was incorrect.

Applying the DI formula described above, the discrimination indices for Questions 1 and 10 in Table 2 can be calculated as follows:

EXAMPLE 3: HOW DISCRIMINATION INDEX (DI) IS CALCULATED:

① Sort by total marks from highest to lowest
   Question 1: As done in Table 2
   Question 10: As done in Table 2
② Create upper and lower groups
   Question 1: As done in Table 2
   Question 10: As done in Table 2
③ Calculate difficulty indices for each group
   Question 1: Upper Group 5/5 or 100% or p-value = 1; Lower Group 4/5 or 80% or p-value = 0.80
   Question 10: Upper Group 3/5 or 60% or p-value = 0.60; Lower Group 0/5 or 0% or p-value = 0
④ Subtract the Difficulty Index for the lower group from the Difficulty Index for the upper group
   Question 1: p-value 1 minus p-value 0.80 = 0.20. DI/discrimination value for Question 1 = 0.20
   Question 10: p-value 0.60 minus p-value 0 = 0.60. DI/discrimination value for Question 10 = 0.60

How are the results of the Discrimination Index analysis interpreted?

• A question is a good discriminator when learners who answer the question or an assessment item correctly also do well on the test as a whole
• The higher the DI, the better the test item discriminates between the learners with higher marks and those with lower marks
• Items with low discrimination indices are often ambiguously worded and should be examined
• The more difficult or easy the item, the lower its discriminating power
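
The upper-group and lower-group steps for the Discrimination Index can be sketched in Python as follows. This is a hedged illustration rather than the brief's own tooling: it uses the group-difference method from the numbered steps (not a correlation coefficient), the names `TABLE_2` and `discrimination_index` are hypothetical, and it assumes the class is split into two equal halves as in Table 2, with the middle learner left out when the class size is odd.

```python
# Responses from Table 2 as (learner, answers) pairs for Questions 1-10.
# "+" = correct, "x" = incorrect. Names here are illustrative only.
TABLE_2 = [
    ("ZAMI",    "++++++++++"),
    ("BHEKI",   "+++++++x++"),
    ("RONALD",  "+++++x+x++"),
    ("ROSE",    "++x+++++xx"),
    ("BONGI",   "++x+x++x+x"),
    ("SAHEDA",  "++x+x+x+xx"),
    ("GUGU",    "+xx+x+xx+x"),
    ("MAPHORI", "+x+xx+x+xx"),
    ("BRUCE",   "+xxx+xx+xx"),
    ("BARBARA", "xxxxx+x+xx"),
]


def group_p_value(group, question):
    """Difficulty Index (p-value) of one question within one group."""
    correct = sum(1 for _, answers in group if answers[question - 1] == "+")
    return correct / len(group)


def discrimination_index(rows, question):
    """Upper-group p-value minus lower-group p-value for one question."""
    # Step 1: sort by total marks, highest first.
    ranked = sorted(rows, key=lambda row: row[1].count("+"), reverse=True)
    # Step 2: create the upper and lower groups (assumed: equal halves).
    half = len(ranked) // 2
    upper, lower = ranked[:half], ranked[-half:]
    # Steps 3-4: p-value of the question in each group.
    p_upper = group_p_value(upper, question)
    p_lower = group_p_value(lower, question)
    # Step 5: subtract lower from upper; round to hide floating-point noise.
    return round(p_upper - p_lower, 2)


if __name__ == "__main__":
    for q in (1, 8, 10):
        print(f"Question {q}: DI = {discrimination_index(TABLE_2, q):.2f}")
```

With the Table 2 data this gives a DI of 0.20 for Question 1 and 0.60 for Question 10, in line with Example 3, and -0.40 for Question 8.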
How are the results of the Discrimination Index analysis interpreted?

• The greater the positive value, i.e. the closer it is to 1.0, the stronger the relationship is between overall test performance and performance on that item
• A rough "rule-of-thumb" is that:
① If DI is 0.40 and above, then the item is functioning satisfactorily, i.e. it is differentiating between learners with higher and lower levels of knowledge
② If DI is 0.30 to 0.39, then little or no revision is required. The item is differentiating between learners with higher and lower levels of knowledge
③ If DI is 0.20 to 0.29, then the item is marginal and needs revision. The item may be of low quality or marked incorrectly
④ If DI is 0.19 and below, then the item should be eliminated or completely revised
• A Good Achievement Test should have:
  • 50% of Items = DI of 0.40 and above
  • 40% of Items = DI of 0.39 to 0.20
  • 10% of Items = DI of 0.19 to 0.00

These interpretation guidelines can be applied to the analysis of results in Table 2 as follows:

EXAMPLE 4: HOW THE DI RESULTS FOR QUESTIONS 1, 8 AND 10 IN TABLES 1 AND 2 CAN BE INTERPRETED

QUESTION 1: Difficulty Index 0.90, DI 0.20
• The difficulty index of 0.90 means this item was quite easy
• Low discriminatory power (0.20) suggests that this item does not discriminate well between top and low performers

QUESTION 8: Difficulty Index 0.60, DI -0.40
• The difficulty index of 0.60 means this item was moderately easy
• Negative DI of -0.40 means that the low-performing learners were more likely to get this item correct

QUESTION 10: Difficulty Index 0.30, DI 0.60
• The difficulty index of 0.30 means this item was moderately difficult
• DI of 0.60 means that this item is a good discriminator. This is the "best" overall question

The test analysed in Tables 1 and 2 is not a good test in that while 50% of the items had a DI of 0.40 and above (good), 30% had a DI of <0.19

ANALYSIS OF RESPONSE OPTIONS

Why conduct Distractor Analysis?

How is Distractor Analysis conducted?

① Count the number of learners choosing each response option in the MCQ test
② Divide the number of learners who chose each option as an answer by the number of learners taking the test. The proportion for the correct option is the Difficulty Index for the item

Table 3 shows the number of learners in a class of 30 who selected each answer choice for Questions 1 and 2 in a multiple choice test with four answer choices (A, B, C and D).

Table 3: Conducting Distractor Analysis

QUESTION   A    B    C    D
No. 1      0    8    19*  3
No. 2      10   6*   8    6
* Denotes the correct answer

EXAMPLE 5: HOW THE RESULTS OF DISTRACTOR ANALYSIS FOR QUESTIONS 1 AND 2 IN TABLE 3 CAN BE INTERPRETED
• The p-value for Question 1 (Option C) is 0.63, i.e. 19 out of 30 learners chose the correct option. This indicates this test item was moderately easy
• The p-value for Question 2 (Option B) is 0.20, i.e. 6 out of 30 learners chose the correct option. This suggests that this test item was much more difficult
• No learner chose Option A in Question 1. This means that Option A does not act as a good distractor
• In Question 2, more learners selected an incorrect option (A) than the correct option (B). In addition, learners chose between all four options almost evenly. This makes guessing correctly more likely, which affects the validity of this item

USING THE RESULTS OF THE ITEM ANALYSIS

Teachers in schools that work use the results of item analysis to:
• Adjust the instruction, including modifications based on learner needs, pace of instruction and coverage of subject material, e.g. when most learners get an item wrong as in Question 10 in Tables 1 and 2.
• Moderate items with zero, low positive and negative discrimination indices, e.g. Question 8 in Table 2 is problematic in that only two out of five high-