Professional Documents
Culture Documents
Language Assessment Report
Language Assessment Report
Language Assessment Report
Written by:
Kevin Reynald
41032122151012
ENGLISH DEPARTMENT
2018
CHAPTER I
Introduction
English as lingua franca has been uniting civilization around the world in
one language. It spreads information rapidly, revolutionize technology
sustainably, developing human ability, also what makes it all; the way people
communicate. Every country has uniqueness of English accent and its language
role (first, second, or foreign) from one to another. Nevertheless, English has
actual standards to determine someone’s proficiency in the end, which is
implemented to most of country, in spite of what role it takes on.
1|Language Assessment
In accordance to this subject, assessment is important to show how far a
learning process could achieve. Supported by McNamara and Roever in Hamied
(2010: 99) that “language testing has a real impact on real people’s lives, and we
cannot cease our theoretical analyses at the point where the test score is
computed”. Moreover, Norris in Hamied (2010: 113) notes that language
assessment is way more than simply giving a language test.
2|Language Assessment
CHAPTER II
Test is something that could make students scared, especially at the end of
academic calendar. The reason can be traced from both internal aspect such as
students’ anxiety and proficiency; to external aspect such as classroom condition.
Furthermore, many people still have a stigma that test determines the grades of a
students. In fact, test is the smallest part of learning. To make it comprehensible,
these explanations hopefully help to define what test and assessment is.
Brown (2004: 3) define that the simplest term of test is a way to measure
someone’s ability, knowledge, or performance in particular subject. In education
practice, a test can be conducted usually at middle or end of a semester. Whenever
students answer a test, the teacher subconsciously appraise students’ performance
/ effort.
To put it simply, test is a part of assessment that assessors have to take for
measuring how far learning process could achieve, and test-takers have to take to
measure their achievement or proficiency in time.
3|Language Assessment
even peer-students. Meanwhile, reading and listening needs some special
requirement to judge by the teachers (Brown, 2004: 4).
In this research, the writer was using formative assessment that aims to
measure what students have grasped and typically occurs at the middle course or
unit of instruction, also the writer used the achievement test for observation. The
achievement tests are limited to specific subject addressed in curriculum within a
particular time allocation.
1
https://www.merriam-webster.com/dictionary/pretest.
4|Language Assessment
pre-test indicate how the students learn the subject yet help teacher to evaluate
teaching method that might to change in time.
Multiple – choice items, which may appear to be the Simplest kind of item
to construct, are extremely difficult to design correctly (Brown, 2004: 55). This
method offers time-saving scoring procedures, but designing the items may cost
teachers’ time to tempt the possibility of items difficulty.
1) Multiple – choice items are all receptive, or selective response; items in that
the test – taker chooses from a set of responses rather that creating a response.
For example, true / false questions and matching lists.
2) Every multiple – choice item has a stem and several options or alternatives to
choose from.
3) One of those options is the correct response; whereas the others serve as
distractors.
2.3. Reading
Brown (2004: 186) also mentioned several genres of reading, they are:
5|Language Assessment
In, addition, Brown (2004: 189) categorize several types of reading,
including:
a. Perceptive reading
b. Selective reading
c. Interactive reading
d. Extensive reading
2.4. Writing
Brown (2004: 219), categorize several genres of writing form, they are:
a. Imitative writing
b. Intense writing
b. Responsive writing
c. Extensive writing
This research uses selective reading and responsive writing for test.
Selective reading is category that is largely an artifact of lexical, grammatical, or
2
https://www.merriam-webster.com/dictionary/essay.
6|Language Assessment
discourse features of language. Whereas responsive writing requires learners to
perform connecting sentences into a paragraph, yet creating a logically connected
sequence of two paragraphs may be used.
2.6. Essay
3
https://www.ef.co.id/englishfirst/englishstudy/descriptive-text-dalam-bahasa-inggris.aspx
4
ibid.
5
ibid.
6
https://www.merriam-webster.com/dictionary/essay
7|Language Assessment
2.7. Dialogue
7
https://www.ef.co.id/englishfirst/englishstudy/dialog-bahasa-inggris/jenis-conversation-dalam-
bahasa-inggris.aspx
8
ibid.
8|Language Assessment
CHAPTER III
Method
In order to find out the data of students’ score in English formative test, this
research used qualitative descriptive. It is a one of many methods for conducting
research in many disciplines, psychology, social sciences, and education indeed.
This type of method has begun to popularize in 2nd language teaching and
learning. An interest has motivated on this method due to the complexity of the
recognition of 2nd language teaching and learning. To simplify its complexity,
depth analysis and understanding individuals’ behaviors and experience, instead
only examine learning tasks in general and what factors that affecting (Nassaji:
2015).
Bogdan & Bilken (2007) state that qualitative research is purposed want
to know where, when, how and under what circumstances behavior comes into
someone. In addition, the main objective of descriptive research is to describe a
phenomenon and its characteristics in form of data. To gather the data, the
researcher used a survey in form of question sheet, as supported by Gall, Gall, &
Borg (2007) mention that the most common tools in qualitative research are
observation and survey. Both of tools are able to describe participants’
characteristics (skills, performance, etc.) of individual or group and physical
environment (school) in specific occasion.
3.2. Participants
9|Language Assessment
To collect the research data, these common approaches can be applied to
qualitative research9:
Hall’s study in Khuder & Harwood (2015), divided writing process into
test and non-test type. In this case, researcher chose the test type as an instrument.
The test itself consists of 20 multiple choices and 3 essays explicitly, which the
best score could be up to 100 and the lowest is potentially zero.
The test had given to tenth grade science-one students in Mekar Arum High
School. There are 36 students in total, unfortunately 3 students were absent, so
there were 33 students participated on this study. Each participant had 60 minutes
to answer the test.
Data analysis is the process of evaluating data using analytical and logical
reasoning to examine each component of the data available.
Savenye and Robinson in Begum and Ahmed (2015) state that qualitative
research data analysis includes statistical procedure, analysis often becomes an
ongoing process where the data is collected and analyzed almost synchronously.
Indeed, researcher generally analyze for patterns in observations through the
entire data collection phase.
9
https://www.le.ac.uk/oerresources/lill/fdmvco/module9/page_54.htm
10 | L a n g u a g e A s s e s s m e n t
3.4. Procedure
b. As the internship began in Mekar Arum High School, writer was assigned
to teach tenth grade science-one students by tutor teacher.
c. The following five weeks, writer had taught about introducing yourself,
congratulating others, and expressing intention.
d. Before the sixth meeting, writer asked permission to hold a test to tutor
teacher informally.
e. After the permission granted, the test could finally hold at tenth grade
science-one class.
11 | L a n g u a g e A s s e s s m e n t
Chapter IV
4.1. Discussion
Table 4.1. down below is explaining about the Table of Specfication (T.O.S.)
including: material, cognitive level, and indicator.
Cognitive level
No. Material Total
C1 C2 C3 C4 C5 C6
1. Fact identification 1 1
2. Explaining information 1 1
3. Category of character 1 1
4. Fact identification 1 1
5. Additional fact 1 1
6. Determining message 1 1
7. Word reference 1 1
8. Jumbled sentences 1 1
9. Correct statement 1 1
12 | L a n g u a g e A s s e s s m e n t
18. Statement by data 1
19 Completing sentence 1
Total 5 5 6 3 1 3 23
Indicator:
13 | L a n g u a g e A s s e s s m e n t
22. Students are able to create a simple dialogue about congratulating and
complimenting others
23. Students are able to create a simple dialogue about expressing intention
Khan, Ishrat & Khan (2015: 179) explain that Facility Value (FV) or Index
of Difficulty (ID) and is a measure of how easy or how difficult a question which
is given to students, the higher the FV / DI, the easier is the question is. Arikunto
in Fajar Furqon (2013) classifies a value around 0.500 is measured to be ideal, 0.3
to 0.7 would be considered as acceptable Furthermore, Arikunto in Fajar Furqon
(2013) also use this following formula to calculate the index of difficulty of an
item. In accordance to this study, this calculation measure multiple choice items
only.
To measure essays, writer used this modified method to match the value
above:
Khan, Ishrat & Khan (2015: 179) explain that Item Discrimination (ID) or
Discrimination Index (DI) determines the significance of a question to
discriminate between a higher and a lower ability student.
14 | L a n g u a g e A s s e s s m e n t
This study is able to find out the DI in multiple choice items by conducting
this calculation:
1. Arranging students’ total score and dividing the score into three groups in by
equal size (top thirds, middle thirds, low thirds).
2. Discarding middle thirds to find out the most powerful test items in
discriminating between high and low ability.
3. Counting the number of students in the upper group who answer each item
correctly, and counting the same with lower group.
4. Subtracting the number of correct answers in the upper group with the lower
group.
5. Dividing the total correct answer by half of total participants in upper and
lower group combined.
ℎ𝑖𝑔ℎ𝑔𝑟𝑜𝑢𝑝#𝑐𝑜𝑟𝑟𝑒𝑐𝑡–𝑙𝑜𝑤𝑔𝑟𝑜𝑢𝑝#𝑐𝑜𝑟𝑟𝑒𝑐𝑡
DI =
1/2𝑋𝑡𝑜𝑡𝑎𝑙𝑜𝑓𝑦𝑜𝑢𝑟𝑡𝑤𝑜𝑐𝑜𝑚𝑝𝑎𝑟𝑖𝑠𝑜𝑛𝑔𝑟𝑜𝑢𝑝𝑠
DI =
(ℎ𝑖𝑔ℎ𝑔𝑟𝑜𝑢𝑝#𝑠𝑐𝑜𝑟𝑒÷𝑚𝑎𝑥𝑖𝑚𝑢𝑚𝑠𝑐𝑜𝑟𝑒)–(𝑙𝑜𝑤𝑔𝑟𝑜𝑢𝑝#𝑠𝑐𝑜𝑟𝑒÷𝑚𝑎𝑥𝑖𝑚𝑢𝑚𝑠𝑐𝑜𝑟𝑒)
1/2𝑋𝑡𝑜𝑡𝑎𝑙𝑜𝑓𝑦𝑜𝑢𝑟𝑡𝑤𝑜𝑐𝑜𝑚𝑝𝑎𝑟𝑖𝑠𝑜𝑛𝑔𝑟𝑜𝑢𝑝𝑠
within the minimum criteria of mastery learning for tenth grade is 75.
Table 4.3.1 and table 4.3.2 on next page shows the result of the test given:
15 | L a n g u a g e A s s e s s m e n t
Table 4.3.1 Result of the test (1 – 10)
Question no.
No. Name Ttl
1 2 3 4 5 6 7 8 9 10
1 Adam Pratama 8
2 Adhitya Pratama 6
3 Adriawan Rustama 5
4 Anggi 7
5 Arya Eka Putra 8
6 Desi Cahyati 8
7 Destiana Eka 10
8 Egi Liana Ruslam
9 Enrico Haqqiawan 10
10 Falak Agung
11 Farhan Taufik 8
12 Fariza Agnia 7
13 Faulina 4
14 Feni Fauziah 5
15 Fikri Muhamad
16 Indri Yani 7
17 Jihan Rafifah 7
18 Linda Herawati 4
19 Moch. Tegar 6
20 Mochamad Faiz 8
21 Muhammad Afif 7
22 Muhammad Syihab 7
23 Putri Kensila Alifaf 5
24 Raihan Rafli M. 8
25 Randi Putra 5
26 Risma Laisya 7
27 Rosa Agustina 7
28 Shalum Adhila 7
29 Shelva Noupirra 6
30 Sindi Gandari 6
31 Siska Hadiyanti 7
32 Shifa Dwi Febriani 7
33 Vana Jelita 8
34 Wanda Husna 10
35 Wulan Julfiani 5
36 Zahra Hasna 8
Total correctness 33 28 31 18 25 30 21 33 11 25
16 | L a n g u a g e A s s e s s m e n t
Table 4.3.2 Result of the test (11 – 20)
Question no.
No. Name Ttl
11 12 13 14 15 16 17 18 19 20
1 Adam Pratama 7
2 Adhitya Pratama 5
3 Adriawan Rustama 4
4 Anggi 3
5 Arya Eka Putra 7
6 Desi Cahyati 5
7 Destiana Eka 7
8 Egi Liana Ruslam
9 Enrico Haqqiawan 7
10 Falak Agung
11 Farhan Taufik 4
12 Fariza Agnia 7
13 Faulina 4
14 Feni Fauziah 4
15 Fikri Muhamad
16 Indri Yani 4
17 Jihan Rafifah 3
18 Linda Herawati 2
19 Moch. Tegar 5
20 Mochamad Faiz 7
21 Muhammad Afif 7
22 Muhammad Syihab 5
23 Putri Kensila Alifaf 5
24 Raihan Rafli M. 7
25 Randi Putra 2
26 Risma Laisya 7
27 Rosa Agustina 6
28 Shalum Adhila 7
29 Shelva Noupirra 3
30 Sindi Gandari 7
31 Siska Hadiyanti 3
32 Shifa Dwi Febriani 3
33 Vana Jelita 3
34 Wanda Husna 7
35 Wulan Julfiani 2
36 Zahra Hasna 5
Total correctness 29 11 18 19 20 17 29 13 12 17
17 | L a n g u a g e A s s e s s m e n t
Table 4.3.2 Result of the test (21 – 23)
Question no.
No. Name Ttl
21 22 23
1 Adam Pratama 8 4 12
2 Adhitya Pratama
3 Adriawan Rustama 3 3 6
4 Anggi
5 Arya Eka Putra 9 7 16
6 Desi Cahyati 8 8
7 Destiana Eka 9 9 10 28
8 Egi Liana Ruslam 0
9 Enrico Haqqiawan 5 7 12
10 Falak Agung 0
11 Farhan Taufik 9 9 18
12 Fariza Agnia 9 9
13 Faulina 5 3 6 14
14 Feni Fauziah 8 4 12
15 Fikri Muhamad 0
16 Indri Yani 0
17 Jihan Rafifah 0
18 Linda Herawati 0
19 Moch. Tegar 0
20 Mochamad Faiz 7 4 11
21 Muhammad Afif 7 3 10
22 Muhammad Syihab 3 3 6
23 Putri Kensila Alifaf 0
24 Raihan Rafli M. 5 3 1 9
25 Randi Putra 3 3
26 Risma Laisya 8 3 11
27 Rosa Agustina 8 1 9
28 Shalum Adhila 8 2 10
29 Shelva Noupirra 3 3
30 Sindi Gandari 2 2
31 Siska Hadiyanti 8 2 10
32 Shifa Dwi Febriani 3 3
33 Vana Jelita 10 3 13
34 Wanda Husna 8 9 9 26
35 Wulan Julfiani 0
36 Zahra Hasna 7 7
Total score 163 75 30
18 | L a n g u a g e A s s e s s m e n t
Score
No. Name
MC Essay Accumulation
Inf.
PG x3 Essay x4 /3 Score
1 Adam Pratama 15 12 45.00 16.00 61.00 TL
2 Adhitya Pratama 11 0 33.00 0.00 33.00 TL
3 Adriawan Rustama 9 6 27.00 8.00 35.00 TL
4 Anggi 10 0 30.00 0.00 30.00 TL
5 Arya Eka Putra 15 16 45.00 21.33 66.33 TL
6 Desi Cahyati 13 8 39.00 10.67 49.67 TL
7 Destiana Eka 17 28 51.00 37.33 88.33 L
8 Egi Liana Ruslam 0 0.00 0.00 0.00 TL
9 Enrico Haqqiawan 17 12 51.00 16.00 67.00 TL
10 Falak Agung 0 0.00 0.00 0.00 TL
11 Farhan Taufik 12 18 36.00 24.00 60.00 TL
12 Fariza Agnia 14 9 42.00 12.00 54.00 TL
13 Faulina 7 14 21.00 18.67 39.67 TL
14 Feni Fauziah 9 12 27.00 16.00 43.00 TL
15 Fikri Muhamad 0 0.00 0.00 0.00 TL
16 Indri Yani 11 0 33.00 0.00 33.00 TL
17 Jihan Rafifah 11 0 33.00 0.00 33.00 TL
18 Linda Herawati 6 0 18.00 0.00 18.00 TL
19 Moch. Tegar 11 0 33.00 0.00 33.00 TL
20 Mochamad Faiz 15 11 45.00 14.67 59.67 TL
21 Muhammad Afif 14 10 42.00 13.33 55.33 TL
22 Muhammad Syihab 12 6 36.00 8.00 44.00 TL
23 Putri Kensila Alifaf 10 0 30.00 0.00 30.00 TL
24 Raihan Rafli 15 9 45.00 12.00 57.00 TL
25 Randi Putra Pratama 7 3 21.00 4.00 25.00 TL
26 Risma Laisya 14 11 42.00 14.67 56.67 TL
27 Rosa Agustina 13 9 39.00 12.00 51.00 TL
28 Shalum Adhila 14 10 42.00 13.33 55.33 TL
29 Shelva Noupirra 9 3 27.00 4.00 31.00 TL
30 Sindi Gandari 13 2 39.00 2.67 41.67 TL
31 Siska Hadiyanti 10 10 30.00 13.33 43.33 TL
32 Shifa Dwi Febriani 10 3 30.00 4.00 34.00 TL
33 Vana Jelita 11 13 33.00 17.33 50.33 TL
34 Wanda Husna 17 26 51.00 34.67 85.67 L
35 Wulan Julfiani 7 0 21.00 0.00 21.00 TL
36 Zahra Hasna 13 7 39.00 14.00 53.00 TL
19 | L a n g u a g e A s s e s s m e n t
Table 4.3.2 Classification of the test
20 | L a n g u a g e A s s e s s m e n t
CHAPTER V
Conclusion
Based on the result that the writer found only 2 students passed this test,
while 31 students failed to reach the minimum criteria of mastery learning (75).
There must be any reasons that can be assumed following the result: 1. English
material that they should learn is not very comprehended, basically they have to
learn more to do right so, 2. Time allocation was too short to answer essay due to
lack of the items answered, 3. Logical thinking that may be seemed
underdeveloped because incorrect answer come from rational thinking questions,
4. English reading and writing proficiency that could be still poor due to
incapability to understand questions statement, 5. The questions were too hard to
completed by tenth grade students.
Other than that, there are couple of suggestion that all parties should pay
attention to. The teacher has to increase reading and writing proficiency for
students, while English vocabularies surely will support it. Students must learn
way harder than this and “stay hungry” for knowledge. Otherwise, writer should
consider students’ English proficiency and time allocation in making test in the
future. Hopefully, this research has benefit for all sides.
21 | L a n g u a g e A s s e s s m e n t
REFERENCES
22 | L a n g u a g e A s s e s s m e n t