Final Language Testing

You might also like

Download as docx, pdf, or txt
Download as docx, pdf, or txt
You are on page 1of 25

INTERPRETING TEST SCORE

A. VALIDITY
In terms of test validity, we can show the tests to the colleagues for face validity, compare
the course objective and the test items for validity, check whether the students respond in the
way they are expected in doing the test for response validity, and calculate the point bi-serial
correlation for item validity using the following formula:

𝑴𝒑 − 𝑴𝒕 𝒑
𝒓𝒑𝒃𝒊 = √
𝑺𝑫 𝒒

rpbi = Point bi-serial Correlation Coefficient, i.e. item validity coefficient.


Mp = Mean score of testees correctly answering the analyzed item.
Mt = Mean score of the total score.
SD = Standard deviation of the total score.
p = Proportion of testees correctly answering the analyzed item.
q = Proportion of testees incorrectly answering the analyzed item.
N Nomor Butir Instrumen
Nama X X2
O 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15
1 10
1 Asril 1 0 0 0 1 1 1 0 1 1 1 1 1 1 0
0 0
1 14
2 Yusuf Topan 1 1 1 0 1 1 1 0 1 1 1 1 1 1 0
2 4
1 10
3 Ahmad Radian 1 1 1 1 1 0 1 0 0 0 1 1 1 1 1
0 0
Novia
4 1 1 1 1 0 1 1 0 1 0 1 0 1 0 0 9 81
Ramadhani
1 10
5 Daffa 1 0 1 1 0 1 1 0 1 1 1 1 1 0 0
0 0
6 Saprianto 1 0 0 0 1 0 1 0 0 0 1 1 0 0 0 5 25
1 16
7 Aldi Muliadi 1 1 1 0 1 1 1 0 1 1 1 1 1 1 1
3 9
1 14
8 Muh. Ashar 1 1 1 0 1 1 1 0 0 1 1 1 1 1 1
2 4
1 19
9 Fadhil 1 1 1 1 0 1 1 1 1 1 1 1 1 1 1
4 6
Yusuf Putra
10 1 0 1 0 1 0 1 0 1 1 1 1 0 0 1 9 81
Baharuddin
1 16
11 William 1 1 1 0 1 1 1 0 1 1 1 1 1 1 1
3 9
12 Harya 1 1 0 0 0 0 1 0 0 0 1 0 0 0 0 4 16
13 Muh. Al Fatir 1 1 0 0 1 1 1 0 0 0 1 0 0 1 1 8 64
14 Rayhan 1 0 1 1 0 1 1 0 1 0 1 0 0 0 0 7 49
1 12
15 Aswandi 1 1 1 0 1 1 1 0 1 1 0 1 1 0 1
1 1
1
N= 15
∑x 15 10 11 5 10 11 15 1 10 9 14 11 10 8 8 4
15 59
7
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
p 1 1
67 73 33 67 73 07 67 6 93 73 67 53 53
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
q 0 0
33 27 67 33 27 93 33 4 07 27 33 47 47
9, 9, 9, 9, 9, 9, 9, 9, 9, 9, 9, 9, 9, 9, 9,
Mt
8 8 8 8 8 8 8 8 8 8 8 8 8 8 8
9, 10 10 10 10 9, 0. 10 11 9, 10 11 11 11
Mp 10
8 ,6 ,9 ,3 ,8 8 44 ,8 ,5 7 ,8 ,4 ,5 ,2
2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2,
SD
81 81 81 81 81 81 81 81 81 81 81 81 81 81 81
0, 1, 0, 0, 1, 12 0, 1, 0, 1, 1, 0, 0,
rpbi 0 0
69 23 06 44 13 ,5 88 17 49 12 41 93 77
The following steps are recomended for calculation, taking item number 1 as the sample of
calculation.
1. Determining the proportion of testees correctly answering the analyzed items:
∑𝑥 15 ∑𝑥 11 ∑𝑥 14
𝑝1 = = =1 𝑝6 = = = 0.73 𝑝11 = = = 0.93
𝑁 15 𝑁 15 𝑁 15
∑𝑥 10 ∑𝑥 15 ∑𝑥 11
𝑝2 = = = 0,67 𝑝7 = = =1 𝑝12 = = = 0.73
𝑁 15 𝑁 15 𝑁 15
∑𝑥 11 ∑𝑥 1 ∑𝑥 10
𝑝3 = 𝑁
= 15
= 0.73 𝑝8 = 𝑁
= 15
= 0.07 𝑝13 = 𝑁
= 15
= 0.67
∑𝑥 5 ∑𝑥 10 ∑𝑥 8
𝑝4 = 𝑁
= 15
= 0.33 𝑝9 = 𝑁
= 15
= 0.67 𝑝14 = 𝑁
= 15
= 0,53
∑𝑥 10 ∑𝑥 9 ∑𝑥 8
𝑝5 = 𝑁
= 15
= 0.67 𝑝10 = 𝑁
= 15
= 0.6 𝑝15 = 𝑁
= 15
= 0.53

2. Determining the proportion of testees incorrectly answer the analyzed items:


q1 = 1 – p1 = 1 – 1 = 0 q6 = 1 – p1 = 1 – 0.73 = 0,27 q11 = 1 – p1 = 1 – 0.93 = 0.07
q2 = 1 – p2 = 1 – 0.67 = 0.33 q7 = 1 – p1 = 1 – 1 = 0 q12 = 1 – p1 = 1 – 0.73 = 0.27
q3 = 1 – p3 = 1 – 0.73 = 0.27 q8 = 1 – p1 = 1 – 0.07 = 0.93 q13 = 1 – p1 = 1 – 0.67 = 0.33
q4 = 1 – p4 = 1 – 0.33 = 0.67 q9 = 1 – p1 = 1 – 0.67 = 0.33 q14 = 1 – p1 = 1 – 0.53 = 0.47
q5 = 1 – p5 = 1 – 0.67 = 0.33 q10 = 1 – p1 = 1 – 0.6 = 0.4 q15 = 1 – p1 = 1 – 0.53 = 0.47

3. Calculating the mean score of the total scores:


∑𝑥 147
𝑀𝑡 = = = 9.8
𝑁 15

4. Calculating the mean score of testees correctly answering the analyzed items.
147 119 136
𝑀𝑝1 = 15
= 9.8 𝑀𝑝6 = 11
= 10.8 𝑀𝑝11 = 14
= 9.71
106 147 119
𝑀𝑝2 = = 10.6 𝑀𝑝7 = = 9.8 𝑀𝑝12 = = 10.8
10 15 11
120 14 114
𝑀𝑝3 = = 10.9 𝑀𝑝8 = = 14 𝑀𝑝13 = = 11.4
11 1 10
50 108 92
𝑀𝑝4 = 5
= 10 𝑀𝑝9 = 10
= 10.8 𝑀𝑝14 = 8
= 11.5
103 104 90
𝑀𝑝5 = = 10.3 𝑀𝑝10 = = 11.5 𝑀𝑝15 = = 11.2
10 9 8

5. Calculating the standard deviation of the total score:


2
∑ 𝑥2 ∑𝑥
𝑆𝐷 = √ −( )
𝑁 𝑁

1559 147 2
=√ −( )
15 15

1559 21609
=√ −
15 225
= √103,93 − 96,04

= √7,89
= 2,81

6. Calculating the item validity coefficient:


 Test item 1

𝑀𝑝 − 𝑀𝑡 𝑝 9.8 − 9.8 1
𝑟𝑝𝑏𝑖 = √ = √ =0
𝑆𝐷 𝑞 2.81 0

 Test item 2

𝑀𝑝 −𝑀𝑡 𝑝 10.6−9.8 0.64


𝑟𝑝𝑏𝑖 = √𝑞 = √ = 0.69
𝑆𝐷 2.81 0.33

 Test item 3
𝑀𝑝 −𝑀𝑡 𝑝 10.9−9.8 0.73
𝑟𝑝𝑏𝑖 = √𝑞 = √ = 1.23
𝑆𝐷 2.81 0.27

 Test item 4
𝑀𝑝 −𝑀𝑡 𝑝 10−9.8 0.33
𝑟𝑝𝑏𝑖 = √𝑞 = √ = 0.06
𝑆𝐷 2.81 0.67

 Test item 5

𝑀𝑝 − 𝑀𝑡 𝑝 10.3 − 9.8 0.67


𝑟𝑝𝑏𝑖 = √ = √ = 0.44
𝑆𝐷 𝑞 2.81 0.33

 Test item 6
𝑀𝑝 −𝑀𝑡 𝑝 10.8−9.8 0.73
𝑟𝑝𝑏𝑖 = √𝑞 = √ = 1.13
𝑆𝐷 2.81 0.27

 Test item 7

𝑀𝑝 − 𝑀𝑡 𝑝 9.8 − 9.8 1
𝑟𝑝𝑏𝑖 = √ = √ =0
𝑆𝐷 𝑞 2.81 0

 Test item 8
𝑴𝒑 −𝑴𝒕 𝒑 𝟏𝟒−𝟗.𝟖 𝟎.𝟎𝟕
𝒓𝒑𝒃𝒊 = √𝒒 = √ = 0,42
𝑺𝑫 𝟐.𝟖𝟏 𝟎.𝟗𝟑

 Test item 9
𝑀𝑝 −𝑀𝑡 𝑝 10.8−9.8 0.67
𝑟𝑝𝑏𝑖 = √𝑞 = √ = 0.88
𝑆𝐷 2.81 0.33

 Test item 10

𝑀𝑝 − 𝑀𝑡 𝑝 11,5 − 9.8 0.6


𝑟𝑝𝑏𝑖 = √ = √ = 1.17
𝑆𝐷 𝑞 2.81 0.4

 Test item 11
𝑀𝑝 −𝑀𝑡 𝑝 9.7−9.8 0.93
𝑟𝑝𝑏𝑖 = √𝑞 = √ = 0.49
𝑆𝐷 2.81 0.07

 Test item 12
𝑀𝑝 −𝑀𝑡 𝑝 10.8−9.8 0.73
𝑟𝑝𝑏𝑖 = √𝑞 = √ = 1.12
𝑆𝐷 2.81 0.27

 Test item 13
𝑀𝑝 −𝑀𝑡 𝑝 11.4−9.8 0.67
𝑟𝑝𝑏𝑖 = √𝑞 = √ = 1.41
𝑆𝐷 2.81 0.33

 Test item 14
𝑀𝑝 −𝑀𝑡 𝑝 11.5−9.8 0.53
𝑟𝑝𝑏𝑖 = √𝑞 = √ = 0.93
𝑆𝐷 2.81 0.47

 Test item 15
𝑀𝑝 −𝑀𝑡 𝑝 11.2−9.8 0.53
𝑟𝑝𝑏𝑖 = √𝑞 = √ = 0.77
𝑆𝐷 2.81 0.47

B. RELIABILITY
In terms of test reliability, we can use single-test single trial method with split-half
reliability, applying Pearson product moment correlation and Spearman-Brown odd even modal
correlation this calculation may be processed through SPSS program, based on the level of
significance of 5. The formula of Pearson product moment correlation is as follows:

𝑵 ∑ 𝒙𝒚 (∑ 𝒙)(∑ 𝒚)
𝒓𝒙𝒚 =
√[𝑵𝒙𝟐 − (𝒙)𝟐 ][𝑵𝒚𝟐 − (𝒚)𝟐 ]

rxy = Pearson product moment correlation between variable x and y


N = Number of students taking the test
∑x = sum of variable x
∑y = sum of variable y
∑xy = sum of multiplication of variable x and variable y
∑x2 = sum of square x
∑y2 = sum of square y

- Test Item 1

No. X Y X2 Y2 XY 𝑁 ∑ 𝑥𝑦 (∑ 𝑥)(∑ 𝑦)
𝑟𝑥𝑦 =
1 1 10 1 100 10 √[𝑁𝑥 2 − (𝑥)2 ][𝑁𝑦 2 − (𝑦)2 ]
2 1 12 1 144 12
3 1 10 1 100 10 (15 x 147) − (15 x 147)
4 1 9 1 81 9 =
√[(15 𝑥 15) − (15)2 ][(15 𝑥 1559) − (147)2 ]
5 1 10 1 100 10
6 1 5 1 25 5 (2205) − (2205) 0
7 1 13 1 169 13
= = =0
√[0][1776] 0
8 1 12 1 144 12
9 1 14 1 196 14
10 1 9 1 81 9
11 1 13 1 169 13
12 1 4 1 16 4
13 1 8 1 64 8
14 1 7 1 49 7
15 1 11 1 121 11
15 147 15 1559 147

The result of this calculation is then analyzed using Spearman-Brown odd ven model correlation
to see the realibility of the test.
2𝑟𝑡𝑡
𝑟11 =
1 + 𝑟ℎℎ

rtt = Total test coefficient reliability (tt = total test)


rhh = Product moment Correlation Coefficient between the first half and the second
half of the test (hh = half – half)
1&2 = constant numbers
2𝑥0 0
𝑟11 = = =0
1+0 1

To interpret the test reliability (r11) Sudjono (2003.209) provides criteria. If the resulted
calculation (r11) is the same or greater than 0.70, the evaluated test is highly reliable.
Conversely, if the resulted calculation (r11) is smaller than 0.70, the evaluated test is not highly
reliable. Therefore, the result of calculation is not reliable (r11 = 0)

- Test Item 2
No. X Y X2 Y2 XY 𝑁 ∑ 𝑥𝑦 (∑ 𝑥)(∑ 𝑦)
1 0 10 0 100 0 𝑟𝑥𝑦 =
√[𝑁𝑥 2 − (𝑥)2 ][𝑁𝑦 2 − (𝑦)2 ]
2 1 12 1 144 12
3 1 10 1 100 10
(15 x 106) − (10 x 147)
4 1 9 1 81 9 =
5 1 10 1 100 10 √[(15 𝑥 10) − (10)2 ][(15 𝑥 1559) − (147)2 ]
6 0 5 0 25 0
(1590)−(1470) 120
7 1 13 1 169 13 = = 297.9 = 0,40
√[50][1776]
8 1 12 1 144 12
9 1 14 1 196 14
10 0 9 0 81 0
11 1 13 1 169 13
12 1 4 1 16 4
13 1 8 1 64 8
14 0 7 0 49 0
15 1 11 1 121 11
10 147 10 1559 106

The result of this calculation is then analyzed using Spearman-Brown odd even model
correlation to see the reliability of the test.
2𝑟𝑡𝑡
𝑟11 =
1 + 𝑟ℎℎ

rtt = Total test coefficient reliability (tt = total test)


rhh = Product moment Correlation Coefficient between the first half and the second
half of the test (hh = half – half)
1&2 = constant numbers
2 𝑥 0.40 0.8
𝑟11 = = = 0.57
1 + 0.40 1,4

To interpret the test reliability (r11) Sudjono (2003.209) provides criteria. If the resulted
calculation ((r11) is the same or greater than 0.70, the evaluated test is highly reliable.
Conversely, if the resulted calculation (r11) is smaller than 0.70, the evaluated test is not highly
reliable. Therefore, the result of calculation is not reliable (r11 = 0.57)

- Test item 3
No. X Y X2 Y2 XY 𝑁 ∑ 𝑥𝑦 (∑ 𝑥)(∑ 𝑦)
1 0 10 0 100 0 𝑟𝑥𝑦 =
√[𝑁𝑥 2 − (𝑥)2 ][𝑁𝑦 2 − (𝑦)2 ]
2 1 12 1 144 12
3 1 10 1 100 10
(15 x 120) − (11 x 147)
4 1 9 1 81 9 =
5 1 10 1 100 10 √[(15 𝑥 11) − (11)2 ][(15 𝑥 1559) − (147)2 ]
6 0 5 0 25 0
(1800)−(1617) 183
7 1 13 1 169 13 = = 279.5 = 0,65
√[44][1776]
8 1 12 1 144 12
9 1 14 1 196 14
10 1 9 1 81 9
11 1 13 1 169 13
12 0 4 0 16 0
13 0 8 0 64 0
14 1 7 1 49 7
15 1 11 1 121 11
11 147 11 1559 120

The result of this calculation is then analyzed using Spearman-Brown odd even model
correlation to see the reliability of the test.
2𝑟𝑡𝑡
𝑟11 =
1 + 𝑟ℎℎ
rtt = Total test coefficient reliability (tt = total test)
rhh = Product moment Correlation Coefficient between the first half and the second
half of the test (hh = half – half)
1&2 = constant numbers
2 𝑥 0.65 1.3
𝑟11 = = = 0.78
1 + 0.65 1.65

To interpret the test reliability (r11) Sudjono (2003.209) provides criteria. If the resulted
calculation ((r11) is the same or greater than 0.70, the evaluated test is highly reliable.
Conversely, if the resulted calculation (r11) is smaller than 0.70, the evaluated test is not highly
reliable. Therefore, the result of calculation is reliable (r11 = 0.78)

- Test item 4
no. X Y X2 Y2 XY 𝑁 ∑ 𝑥𝑦 (∑ 𝑥)(∑ 𝑦)
1 0 10 0 100 0 𝑟𝑥𝑦 =
√[𝑁𝑥 2 − (𝑥)2 ][𝑁𝑦 2 − (𝑦)2 ]
2 0 12 0 144 0
3 1 10 1 100 10
(15 x 50) − (5 x 147)
4 1 9 1 81 9 =
5 1 10 1 100 10 √[(15 𝑥 5) − (5)2 ][(15 𝑥 1559) − (147)2 ]
6 0 5 0 25 0
(750)−(735) 15
7 0 13 0 169 0 = = 297,9 = 0,05
√[50][1776]
8 0 12 0 144 0
9 1 14 1 196 14
10 0 9 0 81 0
11 0 13 0 169 0
12 0 4 0 16 0
13 0 8 0 64 0
14 1 7 1 49 7
15 0 11 0 121 0
5 147 5 1559 50

The result of this calculation is then analyzed using Spearman-Brown odd even model
correlation to see the reliability of the test.
2𝑟𝑡𝑡
𝑟11 =
1 + 𝑟ℎℎ

rtt = Total test coefficient reliability (tt = total test)


rhh = Product moment Correlation Coefficient between the first half and the second
half of the test (hh = half – half)
1&2 = constant numbers
2 𝑥 − 0.05 0.1
𝑟11 = = = −0.09
1 + −0.05 1.05

To interpret the test reliability (r11) Sudjono (2003.209) provides criteria. If the resulted
calculation ((r11) is the same or greater than 0.70, the evaluated test is highly reliable.
Conversely, if the resulted calculation (r11) is smaller than 0.70, the evaluated test is not highly
reliable. Therefore, the result of calculation is not reliable (r11 = - 0.09)

- Test item 5
no. X Y X2 Y2 XY 𝑁 ∑ 𝑥𝑦 (∑ 𝑥)(∑ 𝑦)
1 1 10 1 100 10 𝑟𝑥𝑦 =
√[𝑁𝑥 2 − (𝑥)2 ][𝑁𝑦 2 − (𝑦)2 ]
2 1 12 1 144 12
3 1 10 1 100 10
(15 x 103) − (10 x 147)
4 0 9 0 81 0 =
5 0 10 0 100 0 √[(15 𝑥 10) − (10)2 ][(15 𝑥 1559) − (147)2 ]
6 1 5 1 25 5
7 1 13 1 169 13 (1545)−(1470) 75
= = 297,9 = 0.25
√[50][1776]
8 1 12 1 144 12
9 0 14 0 196 0
10 1 9 1 81 9
11 1 13 1 169 13
12 0 4 0 16 0
13 1 8 1 64 8
14 0 7 0 49 0
15 1 11 1 121 11
10 147 10 1559 103

The result of this calculation is then analyzed using Spearman-Brown odd even model
correlation to see the reliability of the test.
2𝑟𝑡𝑡
𝑟11 =
1 + 𝑟ℎℎ
rtt = Total test coefficient reliability (tt = total test)
rhh = Product moment Correlation Coefficient between the first half and the second
half of the test (hh = half – half)
1&2 = constant numbers
2 𝑥 0.25 0.5
𝑟11 = = = 0.4
1 + 0.25 1.25

To interpret the test reliability (r11) Sudjono (2003.209) provides criteria. If the resulted
calculation ((r11) is the same or greater than 0.70, the evaluated test is highly reliable.
Conversely, if the resulted calculation (r11) is smaller than 0.70, the evaluated test is not highly
reliable. Therefore, the result of calculation is not reliable (r11 = 0.4)

- Test item 6
no. X Y X2 Y2 XY 𝑁 ∑ 𝑥𝑦 (∑ 𝑥)(∑ 𝑦)
1 1 10 1 100 10 𝑟𝑥𝑦 =
√[𝑁𝑥 2 − (𝑥)2 ][𝑁𝑦 2 − (𝑦)2 ]
2 1 12 1 144 12
3 0 10 0 100 0
4 1 9 1 81 9 (15 x 119) − (11 x 147)
=
5 1 10 1 100 10 √[(15 𝑥 11) − (11)2 ][(15 𝑥 1559) − (147)2 ]
6 0 5 0 25 0
7 1 13 1 169 13 (1785)−(1617) 168
= = 279.5 = 0.60
8 1 12 1 144 12 √[44][1776]

9 1 14 1 196 14
10 0 9 0 81 0
11 1 13 1 169 13
12 0 4 0 16 0
13 1 8 1 64 8
14 1 7 1 49 7
15 1 11 1 121 11
11 147 11 1559 119

The result of this calculation is then analyzed using Spearman-Brown odd even model
correlation to see the reliability of the test.
2𝑟𝑡𝑡
𝑟11 =
1 + 𝑟ℎℎ

rtt = Total test coefficient reliability (tt = total test)


rhh = Product moment Correlation Coefficient between the first half and the second
half of the test (hh = half – half)
1&2 = constant numbers
2 𝑥 0.60 1,2
𝑟11 = = = 0.75
1 + 0.60 1.60

To interpret the test reliability (r11) Sudjono (2003.209) provides criteria. If the resulted
calculation ((r11) is the same or greater than 0.70, the evaluated test is highly reliable.
Conversely, if the resulted calculation (r11) is smaller than 0.70, the evaluated test is not highly
reliable. Therefore, the result of calculation is reliable (r11 = 0.75)

- Test item 7
no. X Y X2 Y2 XY 𝑁 ∑ 𝑥𝑦 (∑ 𝑥)(∑ 𝑦)
1 1 10 1 100 10 𝑟𝑥𝑦 =
2 1 12 1 144 12 √[𝑁𝑥 2 − (𝑥)2 ][𝑁𝑦 2 − (𝑦)2 ]
3 1 10 1 100 10
4 1 9 1 81 9 (15 x 147) − (15 x 147)
5 1 10 1 100 10 =
√[(15 𝑥 15) − (15)2 ][(15 𝑥 1559) − (147)2 ]
6 1 5 1 25 5
7 1 13 1 169 13
8 1 12 1 144 12 (2205) − (2205) 0
= = =0
9 1 14 1 196 14 √[0][1776] 0
10 1 9 1 81 9
11 1 13 1 169 13
12 1 4 1 16 4
13 1 8 1 64 8
14 1 7 1 49 7
15 1 11 1 121 11
15 147 15 1559 147

The result of this calculation is then analyzed using Spearman-Brown odd even model
correlation to see the reliability of the test.
2𝑟𝑡𝑡
𝑟11 =
1 + 𝑟ℎℎ

rtt = Total test coefficient reliability (tt = total test)


rhh = Product moment Correlation Coefficient between the first half and the second
half of the test (hh = half – half)
1&2 = constant numbers
2𝑥0 0
𝑟11 = = =0
1+0 1

To interpret the test reliability (r11) Sudjono (2003.209) provides criteria. If the resulted
calculation ((r11) is the same or greater than 0.70, the evaluated test is highly reliable.
Conversely, if the resulted calculation (r11) is smaller than 0.70, the evaluated test is not highly
reliable. Therefore, the result of calculation is not reliable (r11 = 0)

- Test item 8
no. X Y X2 Y2 XY
𝑁 ∑ 𝑥𝑦 (∑ 𝑥)(∑ 𝑦)
1 0 10 0 100 0 𝑟𝑥𝑦 =
2 0 12 0 144 0 √[𝑁𝑥 2 − (𝑥)2 ][𝑁𝑦 2 − (𝑦)2 ]
3 0 10 0 100 0
4 0 9 0 81 0 (15 x 147) − (1 x 147)
=
5 0 10 0 100 0 √[(15 𝑥 1) − (1)2 ][(15 𝑥 1559) − (147)2 ]
6 0 5 0 25 0
7 0 13 0 169 0 (2205) − (147) 2058
8 0 12 0 144 0 = = = 13.1
√[14][1776] 157.6
9 1 14 1 196 14
10 0 9 0 81 0
11 0 13 0 169 0
12 0 4 0 16 0
13 0 8 0 64 0
14 0 7 0 49 0
15 0 11 0 121 0
1 147 1 1559 14

The result of this calculation is then analyzed using Spearman-Brown odd even model
correlation to see the realibility of the test.
2𝑟𝑡𝑡
𝑟11 =
1 + 𝑟ℎℎ

rtt = Total test coefficient reliability (tt = total test)


rhh = Product moment Correlation Coefficient between the first half and the second
half of the test (hh = half – half)
1&2 = constant numbers
2 𝑥 13.1 26.2
𝑟11 = = = 1.8
1 + 13.1 14.1

To interpret the test reliability (r11) Sudjono (2003.209) provides criteria. If the resulted
calculation ((r11) is the same or greater than 0.70, the evaluated test is highly reliable.
Conversely, if the resulted calculation (r11) is smaller than 0.70, the evaluated test is not highly
reliable. Therefore, the result of calculation is reliable (r11 = 1.8)

- Test item 9
no. X Y X2 Y2 XY 𝑁 ∑ 𝑥𝑦 (∑ 𝑥)(∑ 𝑦)
1 1 10 1 100 10 𝑟𝑥𝑦 =
√[𝑁𝑥 2 − (𝑥)2 ][𝑁𝑦 2 − (𝑦)2 ]
2 1 12 1 144 12
3 0 10 0 100 0
4 0 9 0 81 0 (15 x 108) − (10 x 147)
=
5 1 10 1 100 10 √[(15 𝑥 10) − (10)2 ][(15 𝑥 1559) − (147)2 ]
6 0 5 0 25 0
7 1 13 1 169 13 (1620)−(1470) 150
= = 297,9 = 0.50
8 1 12 1 144 12 √[50][1776]

9 1 14 1 196 14
10 1 9 1 81 9
11 1 13 1 169 13
12 0 4 0 16 0
13 0 8 0 64 0
14 1 7 1 49 7
15 1 11 1 121 11
10 147 10 1559 108

The result of this calculation is then analyzed using Spearman-Brown odd ven model correlation
to see the reliability of the test.
2𝑟𝑡𝑡
𝑟11 =
1 + 𝑟ℎℎ

rtt = Total test coefficient reliability (tt = total test)


rhh = Product moment Correlation Coefficient between the first half and the second
half of the test (hh = half – half)
1&2 = constant numbers
2 𝑥 0.50 1
𝑟11 = = = 0.66
1 + 0.50 1.50

To interpret the test reliability (r11) Sudjono (2003.209) provides criteria. If the resulted
calculation ((r11) is the same or greater than 0.70, the evaluated test is highly reliable.
Conversely, if the resulted calculation (r11) is smaller than 0.70, the evaluated test is not highly
reliable. Therefore, the result of calculation is not reliable (r11 = 0.66)

- Test item 10
no. X Y X2 Y2 XY 𝑁 ∑ 𝑥𝑦 (∑ 𝑥)(∑ 𝑦)
1 1 10 1 100 10 𝑟𝑥𝑦 =
√[𝑁𝑥 2 − (𝑥)2 ][𝑁𝑦 2 − (𝑦)2 ]
2 1 12 1 144 12
3 0 10 0 100 0
(15 x 104) − (9 x 147)
4 0 9 0 81 0 =
5 1 10 1 100 10 √[(15 𝑥 9) − (9)2 ][(15 𝑥 1559) − (147)2 ]
6 0 5 0 25 0
(1560)−(1323) 237
7 1 13 1 169 13 = = 309.6 = 0.76
√[54][1776]
8 1 12 1 144 12
9 1 14 1 196 14
10 1 9 1 81 9
11 1 13 1 169 13
12 0 4 0 16 0
13 0 8 0 64 0
14 0 7 0 49 0
15 1 11 1 121 11
9 147 9 1559 104

The result of this calculation is then analyzed using Spearman-Brown odd even model
correlation to see the reliability of the test.
2𝑟𝑡𝑡
𝑟11 =
1 + 𝑟ℎℎ

rtt = Total test coefficient reliability (tt = total test)


rhh = Product moment Correlation Coefficient between the first half and the second
half of the test (hh = half – half)
1&2 = constant numbers
2 𝑥 − 0.76 1.52
𝑟11 = = = 0.86
1 + − 0.76 1.76

To interpret the test reliability (r11) Sudjono (2003.209) provides criteria. If the resulted
calculation ((r11) is the same or greater than 0.70, the evaluated test is highly reliable.
Conversely, if the resulted calculation (r11) is smaller than 0.70, the evaluated test is not highly
reliable. Therefore, the result of calculation is reliable (r11 = 0.86)

- Test item 11
no. X Y X2 Y2 XY 𝑁 ∑ 𝑥𝑦 (∑ 𝑥)(∑ 𝑦)
1 1 10 1 100 10 𝑟𝑥𝑦 =
√[𝑁𝑥 2 − (𝑥)2 ][𝑁𝑦 2 − (𝑦)2 ]
2 1 12 1 144 12
3 1 10 1 100 10
4 1 9 1 81 9 (15 x 136) − (14 x 147)
=
5 1 10 1 100 10 √[(15 𝑥 14) − (14)2 ][(15 𝑥 1559) − (147)2 ]
6 1 5 1 25 5
7 1 13 1 169 13 (2040)−(2058) −18
= = 157.6 = -0.11
√[14][1776]
8 1 12 1 144 12
9 1 14 1 196 14
10 1 9 1 81 9
11 1 13 1 169 13
12 1 4 1 16 4
13 1 8 1 64 8
14 1 7 1 49 7
15 0 11 0 121 0
14 147 14 1559 136

The result of this calculation is then analyzed using Spearman-Brown odd even model
correlation to see the reliability of the test.
2𝑟𝑡𝑡
𝑟11 =
1 + 𝑟ℎℎ

rtt = Total test coefficient reliability (tt = total test)


rhh = Product moment Correlation Coefficient between the first half and the second
half of the test (hh = half – half)
1&2 = constant numbers
2 𝑥 − 0.11 −0.22
𝑟11 = = = −0.24
1 + (−0.11) 0.89

To interpret the test reliability (r11) Sudjono (2003.209) provides criteria. If the resulted
calculation ((r11) is the same or greater than 0.70, the evaluated test is highly reliable.
Conversely, if the resulted calculation (r11) is smaller than 0.70, the evaluated test is not highly
reliable. Therefore, the result of calculation is not reliable (r11 = -0.24)

- Test item 12
no. X Y X2 Y2 XY 𝑁 ∑ 𝑥𝑦 (∑ 𝑥)(∑ 𝑦)
𝑟𝑥𝑦 =
1 1 10 1 100 10 √[𝑁𝑥 2 − (𝑥)2 ][𝑁𝑦 2 − (𝑦)2 ]
2 1 12 1 144 12
3 1 10 1 100 10
(15 x 119) − (11 x 147)
4 0 9 0 81 0 =
5 1 10 1 100 10 √[(15 𝑥 11) − (11)2 ][(15 𝑥 1559) − (147)2 ]
6 1 5 1 25 5
(1785)−(1617) 168
7 1 13 1 169 13 = = 279.5 = 0.60
√[44][1776]
8 1 12 1 144 12
9 1 14 1 196 14
10 1 9 1 81 9
11 1 13 1 169 13
12 0 4 0 16 0
13 0 8 0 64 0
14 0 7 0 49 0
15 1 11 1 121 11
11 147 11 1559 119

The result of this calculation is then analyzed using Spearman-Brown odd even model
correlation to see the reliability of the test.
2𝑟𝑡𝑡
𝑟11 =
1 + 𝑟ℎℎ

rtt = Total test coefficient reliability (tt = total test)


rhh = Product moment Correlation Coefficient between the first half and the second
half of the test (hh = half – half)
1&2 = constant numbers
2 𝑥 − 0.60 1.2
𝑟11 = = = 0.75
1 + − 0.60 1.60

To interpret the test reliability (r11) Sudjono (2003.209) provides criteria. If the resulted
calculation ((r11) is the same or greater than 0.70, the evaluated test is highly reliable.
Conversely, if the resulted calculation (r11) is smaller than 0.70, the evaluated test is not highly
reliable. Therefore, the result of calculation is reliable (r11 =- 0.75)

- Test item 13
no. X Y X2 Y2 XY 𝑁 ∑ 𝑥𝑦 (∑ 𝑥)(∑ 𝑦)
1 1 10 1 100 10 𝑟𝑥𝑦 =
√[𝑁𝑥 2 − (𝑥)2 ][𝑁𝑦 2 − (𝑦)2 ]
2 1 12 1 144 12
3 1 10 1 100 10
(15 x 114) − (10 x 147)
4 1 9 1 81 9 =
5 1 10 1 100 10 √[(15 𝑥 10) − (10)2 ][(15 𝑥 1559) − (147)2 ]
6 0 5 0 25 0
7 1 13 1 169 13 (1710)−(1470) 240
= = 297,9 = 0.80
√[50][1776]
8 1 12 1 144 12
9 1 14 1 196 14
10 0 9 0 81 0
11 1 13 1 169 13
12 0 4 0 16 0
13 0 8 0 64 0
14 0 7 0 49 0
15 1 11 1 121 11
10 147 10 1559 114

The result of this calculation is then analyzed using Spearman-Brown odd even model
correlation to see the reliability of the test.
2𝑟𝑡𝑡
𝑟11 =
1 + 𝑟ℎℎ

rtt = Total test coefficient reliability (tt = total test)


rhh = Product moment Correlation Coefficient between the first half and the second
half of the test (hh = half – half)
1&2 = constant numbers
2 𝑥 − 0.80 1.60
𝑟11 = = = −0.88
1 + −0.80 1.80

To interpret the test reliability (r11) Sudjono (2003.209) provides criteria. If the resulted
calculation ((r11) is the same or greater than 0.70, the evaluated test is highly reliable.
Conversely, if the resulted calculation (r11) is smaller than 0.70, the evaluated test is not highly
reliable. Therefore, the result of calculation is reliable (r11 = 0.88)

- Test item 14
no. X Y X2 Y2 XY 𝑁 ∑ 𝑥𝑦 (∑ 𝑥)(∑ 𝑦)
1 1 10 1 100 10
𝑟𝑥𝑦 =
√[𝑁𝑥 2 − (𝑥)2 ][𝑁𝑦 2 − (𝑦)2 ]
2 1 12 1 144 12
3 1 10 1 100 10
(15 x 92) − (8 x 147)
4 0 9 0 81 0 =
5 0 10 0 100 0 √[(15 𝑥 8) − (8)2 ][(15 𝑥 1559) − (147)2 ]
6 0 5 0 25 0
(1380)−(1176) 204
7 1 13 1 169 13 = = 315.3 = 0.64
√[56][1776]
8 1 12 1 144 12
9 1 14 1 196 14
10 0 9 0 81 0
11 1 13 1 169 13
12 0 4 0 16 0
13 1 8 1 64 8
14 0 7 0 49 0
15 0 11 0 121 0
8 147 8 1559 92

The result of this calculation is then analyzed using Spearman-Brown odd ven model correlation
to see the realibility of the test.
2𝑟𝑡𝑡
𝑟11 =
1 + 𝑟ℎℎ

rtt = Total test coefficient reliability (tt = total test)


rhh = Product moment Correlation Coefficient between the first half and the second
half of the test (hh = half – half)
1&2 = constant numbers
2 𝑥 0.64 1.28
𝑟11 = = = 0.78
1 + 0.64 1.64

To interpret the test reliability (r11) Sudjono (2003.209) provides criteria. If the resulted
calculation ((r11) is the same or greater than 0.70, the evaluated test is highly reliable.
Conversely, if the resulted calculation (r11) is smaller than 0.70, the evaluated test is not highly
reliable. Therefore, the result of calculation is reliable (r11 = 0.78)

- Test item 15
no. X Y X2 Y2 XY 𝑁 ∑ 𝑥𝑦 (∑ 𝑥)(∑ 𝑦)
1 0 10 0 100 0 𝑟𝑥𝑦 =
√[𝑁𝑥 2 − (𝑥)2 ][𝑁𝑦 2 − (𝑦)2 ]
2 0 12 0 144 0
3 1 10 1 100 10
4 0 9 0 81 0 (15 x 90) − (8 x 147)
=
5 0 10 0 100 0 √[(15 𝑥 8) − (8)2 ][(15 𝑥 1559) − (147)2 ]
6 0 5 0 25 0
7 1 13 1 169 13 (1350)−(1176) 174
= = 315.3 = 0.55
8 1 12 1 144 12 √[56][1776]

9 1 14 1 196 14
10 1 9 1 81 9
11 1 13 1 169 13
12 0 4 0 16 0
13 1 8 1 64 8
14 0 7 0 49 0
15 1 11 1 121 11
8 147 8 1559 90

The result of this calculation is then analyzed using Spearman-Brown odd even model
correlation to see the reliability of the test.
2𝑟𝑡𝑡
𝑟11 =
1 + 𝑟ℎℎ

rtt = Total test coefficient reliability (tt = total test)


rhh = Product moment Correlation Coefficient between the first half and the second
half of the test (hh = half – half)
1&2 = constant numbers
2 𝑥 0.55 1.1
𝑟11 = = = 0.70
1 + 0.55 1.55

To interpret the test reliability (r11) Sudjono (2003.209) provides criteria. If the resulted
calculation ((r11) is the same or greater than 0.70, the evaluated test is highly reliable.
Conversely, if the resulted calculation (r11) is smaller than 0.70, the evaluated test is not highly
reliable. Therefore, the result of calculation is reliable (r11 = 0.70)

Nomor Butir Instrumen


NO Nama x
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15
1 AGUSNADI 1 1 1 1 1 1 1 1 1 1 1 0 1 1 1 14
2 ANDI ANDHIKA 1 1 1 1 1 1 1 1 0 1 1 0 1 1 1 13
3 ARYA AHMAD 1 1 1 0 1 1 0 1 1 1 1 0 1 1 1 12
4 AZZAHRA 0 1 1 1 0 1 0 1 1 1 1 1 1 1 1 12
5 ELSA FITRI RAMADHAN 0 1 1 1 1 1 1 1 1 1 1 1 0 1 0 12
6 FADIL FIRANSYAH
1 1 1 1 1 1 1 1 1 0 0 0 1 1 1 12

7 MUH. FAHRUL 0 1 1 1 0 1 1 1 1 1 1 1 1 1 0 12
8 MUHAMMAD ABID 0 1 1 1 1 1 1 1 0 1 1 0 1 1 0 11
MUHAMMAD ADHAN
11
9 ASWAR 1 1 0 1 0 1 1 1 1 1 0 1 1 1 0
10 MUHAMMAD AL KAHFI 1 1 1 1 0 1 0 1 1 1 0 1 1 1 0 11
MUHAMMAD RAIHAN AZIS 11
11 0 1 1 1 1 1 0 1 1 1 1 0 0 1 1
MUHAMMAD RESKY
11
12 YASIN 0 1 1 0 0 0 1 1 1 1 1 1 1 1 1
13 MUHAMMAD RIZHKY 1 1 0 1 0 1 0 1 1 1 0 1 1 1 0 10 Lower
14 MUMTAZ IBNU 1 1 0 1 0 1 0 1 1 1 0 1 1 1 0 10 Group
15 NUR ANNISA ISLAMIAH 0 1 0 0 0 1 1 1 0 1 1 0 1 1 1 9 (UG)
UG 6 12 11 10 7 11 8 12 10 11 9 6 10 12 7 142
LG 2 3 0 2 0 3 1 3 2 3 1 2 3 3 1 29

C. ITEM DIFFICULTY
𝑼𝑮 + 𝑳𝑮
𝑰𝑭 =
𝑵

IF = Index of facility
UG = the number of correct answers by the upper group
LG = the number of correct answer by the lower group
N = the number students taking the test

𝑈𝐺+𝐿𝐺 6+2 8 𝑈𝐺+𝐿𝐺 10+2 12


𝐼𝐹1 = = = 15 = 0.53 𝐼𝐹9 = = = 15 = 0.8
𝑁 15 𝑁 15

𝑈𝐺+𝐿𝐺 12+3 15 𝑈𝐺+𝐿𝐺 11+3 14


𝐼𝐹2 = = = 15 = 1 𝐼𝐹10 = = = 15 = 0.93
𝑁 15 𝑁 15

𝑈𝐺+𝐿𝐺 11+0 11 𝑈𝐺+𝐿𝐺 9+1 10


𝐼𝐹3 = = = 15 = 0.73 𝐼𝐹11 = = = 15 = 0.66
𝑁 15 𝑁 15

𝑈𝐺+𝐿𝐺 10+2 12 𝑈𝐺+𝐿𝐺 6+2 9


𝐼𝐹4 = = = 15 = 0.8 𝐼𝐹12 = = = 15 = 0.55
𝑁 15 𝑁 15

𝑈𝐺+𝐿𝐺 7+0 7 𝑈𝐺+𝐿𝐺 10+3 13


𝐼𝐹5 = = = 15 = 0.46 𝐼𝐹13 = = = 15 = 0.86
𝑁 15 𝑁 15

𝑈𝐺+𝐿𝐺 11+3 14 𝑈𝐺+𝐿𝐺 12+3 15


𝐼𝐹6 = = = 15 = 0.93 𝐼𝐹14 = = = 15 = 1
𝑁 15 𝑁 15

𝑈𝐺+𝐿𝐺 8+1 9 𝑈𝐺+𝐿𝐺 7+1 8


𝐼𝐹7 = = = 15 = 0.6 𝐼𝐹15 = = = 15 = 0.53
𝑁 15 𝑁 15

𝑈𝐺+𝐿𝐺 12+3 15
𝐼𝐹8 = = = 15 = 1
𝑁 15
The conclusion of items covering a wide range of difficulty levels may promote

motivation. The inclusion of very easy items will encourage and motivate the poor student. On

the other hand, the more difficult items may be necessary in order to motivate the good students.

D. ITEM DISCRIMINATION
𝑼𝑮 − 𝑳𝑮
𝑰𝑫 =
𝒏
ID = index discrimination
N = number of students in one group (1/2N)
UG = frequency of score by upper group (upper half)
LG = frequency of score by lower group (lower half

𝑈𝐺−𝐿𝐺 6−2 4 𝑈𝐺−𝐿𝐺 10−2 8


𝐼𝐷1 = = = 7 = 0.57 𝐼𝐷9 = = = 7 = 1.14
𝑛 7 𝑛 7

𝑈𝐺−𝐿𝐺 12−3 9 𝑈𝐺−𝐿𝐺 11−3 8


𝐼𝐷2 = = = = 1.2 𝐼𝐷10 = = = = 1.14
𝑛 7 7 𝑛 7 7

𝑈𝐺−𝐿𝐺 11−0 11 𝑈𝐺−𝐿𝐺 9−1 8


𝐼𝐷3 = = = = 1.5 𝐼𝐷11 = = = 7 = 1.14
𝑛 7 7 𝑛 7

𝑈𝐺−𝐿𝐺 10−2 8 𝑈𝐺−𝐿𝐺 6−2 4


𝐼𝐷4 = = = 7 = 1.14 𝐼𝐷12 = = = 7 = 0.57
𝑛 7 𝑛 7

𝑈𝐺−𝐿𝐺 7−0 7 𝑈𝐺−𝐿𝐺 10−3 7


𝐼𝐷5 = = =7=1 𝐼𝐷13 = = =7=1
𝑛 7 𝑛 7

𝑈𝐺−𝐿𝐺 11−3 8 𝑈𝐺−𝐿𝐺 12−3 9


𝐼𝐷6 = = = 1.14 𝐼𝐷14 = = = = 1.28
𝑛 7 7 𝑛 7 7

𝑈𝐺−𝐿𝐺 8−1 7 𝑈𝐺−𝐿𝐺 7−1 6


𝐼𝐷7 = = =7=1 𝐼𝐷15 = = = 7 = 0.85
𝑛 7 𝑛 7

𝑈𝐺−𝐿𝐺 12−3 9
𝐼𝐷8 = = = 7 = 1.28
𝑛 7
Since the indices of facility and discrimination can be calculated in similar procedures,
they are usually recorded together in tabular form. The following table shows how these measures
are recorded.
Item UG LG IF ID Remark
1 6 2 0.53 0.57 Improper
2 12 3 1.00 1.2 Proper
3 11 0 0.73 1.5 Proper
4 10 2 0.80 1.14 Proper
5 7 0 0.46 1.00 Improper
6 11 3 0.93 1.14 Proper
7 8 1 0.60 1.00 Proper
8 12 3 1.00 1.28 Proper
9 10 2 0.80 1.14 Proper
10 11 3 0.93 1.14 Proper
11 9 1 0.66 1.14 Proper
12 6 2 0.55 0.57 Improper
13 10 3 0.86 1.00 Proper
14 12 3 1.00 1.28 Proper
15 7 1 0.53 0.85 Improper

On the result listed in table above, only items 2,3,4,6,7,8,9,10,11,13 and 14 could be
safely used in future tests without being rewritten.

You might also like