Professional Documents
Culture Documents
AssessingMCQandSSAQ PDF
AssessingMCQandSSAQ PDF
net/publication/327015373
CITATION READS
1 6,590
3 authors:
SEE PROFILE
Some of the authors of this publication are also working on these related projects:
[1] Malnutrition among under-5 children and associated factors- a community based cross-sectional study in Tanahun District of Nepal View project
All content following this page was uploaded by Govinda Prasad Dhungana on 20 August 2018.
ORIGINAL RESEARCH
ABSTRACT
Objectives: To evaluate multiple-choice questions (MCQs) and structured short-answer questions (SSAQs) items in human
anatomy by determining their discrimination and difficulty index, and performing ROC (receiver operating characteristics)
analysis to signify their ability to predict and discriminate poor and good performance of students. Methods: First and
second-year medical students of 2016 were followed through 1 year to register their internal examination performance (SSAQs
Downloaded From IP - 163.47.151.210 on dated 18-Aug-2018
and MCQs) of various items in anatomy. Responses were analysed and evaluated for performance. Results: A total of 30
SSAQs and 100 MCQs of five items were analysed. The difficulty index of MCQ and SSAQ was 0.36 and 0.38, respectively,
Members Copy, Not for Commercial Sale
which is considered optimal for both. SSAQ showed higher discrimination index (0.46 – considered very good item) than MCQ
www.IndianJournals.com
(0.29 – marginal item and subject to improvement), suggesting SSAQ a better tool to discriminate poor and good students than
MCQ. Conclusion: This study suggests that though difficulty index of SSAQ and MCQ are similar, and in terms of discriminating
good and poor students SSAQ was a better tool, ROC analysis revealed both the tools to be adequate in terms of discriminating
students with a relatively better value to SSAQ. As this study was conducted in a university where SSAQs are the only form
of evaluation of theoretical knowledge in anatomy, conducting similar study where both the tools are followed is highly
recommended.
IndianJournals.com 127
Arun Dhakal, Shekhar Kumar Yadav and Govinda Prasad Dhungana
than choosing a set of preformed answers and yet of students who gets the item right, where the
include sufficient precision for answers to be easily percentage refers to the percentage of students from
marked[8]. Both the aforementioned tools of evaluation the total of the lower and upper groups[10]. In this study,
need to be carefully constructed with provision of model the difficulty index of MCQ was determined by adding
answers and training of the evaluators to avoid inter- the number of students who gave correct answer in the
rater variability[3,9]. Furthermore assessment tools should upper and lower 27 per cent of students and dividing it
be valid, reliable and feasible and be able to measure by total number of upper and lower 27 per cent of
different facets of professional competencies. students. Difficulty index for SSAQs was evaluated
similarly.
The present article aims to evaluate the basis of validity
evidence of MCQs and SSAQs on human anatomy The discrimination index of an item is defined as the
employing difficult index, discrimination index, sensitivity, degree to which it discriminates between students of
specificity and areas under ROC curve. high and low achievement[10]. It can be used for weeding
out both too easy and too difficult items by maintaining
MATERIALS AND METHODS a target range of 20–80% [11]. Index of 0.40 and above
is regarded as very good items, 0.30 to 0.39 is reasonably
This study was done on 1st and 2nd-year medical good, 0.20 to 0.29 needs improvement and 0.19 or below
Downloaded From IP - 163.47.151.210 on dated 18-Aug-2018
study was obtained from institutional review board of gave correct answer in the upper and lower 27 per cent
Chitwan Medical College. of students and dividing it by total number of upper and
A total of 137 medical students of 1st year (n = 57) and lower 27 per cent of students. With similar approach,
2nd year (n = 80) who were interested to volunteer for the discrimination index of SSAQs is evaluated.
the study were enrolled. SSAQs were selected from The area under ROC (receiver operating characteristics)
past years’ final professional exam questions mixed with curve, sensitivity and specificity of SSAQ and MCQ
newly evaluated questions. MCQs (A type) were were achieved by ROC analysis. The curve values, more
designed and discussed in the panel of anatomy faculties than 0.70, were regarded as having a satisfactory
for consistency. discriminative and predictive value, whereas more than
Enrolled students appeared for 20 MCQs immediately 0.8 was considered having very good value[14].
a day after sitting for their SSAQ (6 questions)-type RESULTS
internal exams. As MCQ exhibited negative marking
A total of 57 1st year and 80 2nd year medical students’
for wrong answers, students were advised to respond
SSAQ and MCQ marks were acquired from the
to confirmed known answers only. This avoided students
department of anatomy. Consent from department was
from guessing the answers without being penalised as
taken before collecting the data.
well as helped overcome bias in relation to SSAQ where
there is no liberty for presumptions. The difficulty index of MCQ and SSAQ was 0.36 and
0.38, respectively, which is considered optimal level of
As there are two internal assessments in 1st year and
difficulty for both [15,16] . SSAQ showed higher
three internal assessments in 2nd year, we got 137
discrimination index (0.46 – considered very good item)
students sitting for a total of 100 MCQs and 30 SSAQs
than MCQ (0.29 – marginal item and subject to
containing various systems of human anatomy.
improvement) [10, 13], suggesting SSAQs a better tool to
Statistical analysis was performed using SPSS software discriminate poor and good students in relation to MCQs
version 20. Difficulty index is defined as the percentage (Table 1).
Table 1: The difficulty and discrimination index of SSAQ reliability, feasibility and more [8,15,17–19]. This study
and MCQ in anatomy examination of medical students
examined MCQ and SSAQ for their effectiveness to
Index Assessment Tool
predict students’ performance.
MCQ SSAQ
Difficulty index 0.36 0.38 Our research outcome demonstrates that MCQ and
Discrimination index 0.29 0.46 SSAQ are efficient tools in terms of their difficulty and
ability to discriminate performance of poor and good
Table 2: The values of area under ROC curve, sensitivity and students. However, in relation to MCQs, SSAQs were
specificity of SSAQ and MCQ in anatomy examination of better able to discriminate students’ performance. There
medical students
may be various reasons implied behind the latter finding
Assessment ROC Sensitivity Specificity
Tool (CI 95%) (%) (%) in our case which is discussed. Assessment subjectivity
MCQ 0.85 (0.78–0.91) 76.3 84.2 is shown to influence the rating given by the
SSAQ 0.93 (0.88–0.96) 85.0 88.2 examiners[16,20]. As the issue of subjectivity is more
pertinent to SSAQs in respect to MCQs, to avoid it,
ROC analysis is shown in Table 2 and illustrated in Figure SSAQs were designed by three faculty members for
1. SSAQ with ROC value more than 0.9 and MCQ consistency in terms of rating the assessments. The
Downloaded From IP - 163.47.151.210 on dated 18-Aug-2018
with 0.8 reveals that MCQ is a good tool, whereas SSAQ university under which the institution (place of this
is an excellent tool of discriminating ability, research) functions, written form of assessment in basic
Members Copy, Not for Commercial Sale
respectively [14]. Also represented in Table 2 are the medical science uses structured essay-type questions
www.IndianJournals.com
sensitivity and specificity values of both the tools as the only form of evaluation. Also it must be noted
indicating their good level to predict good and poor that these students of 1st and 2nd year have never faced
students, though value of SSAQ is much better than nor have any idea of MCQs in regards to basic medical
that of MCQ[14]. sciences. Students’ performance with MCQ also
showed low sensitivity. It is possible that university exams
DISCUSSION acted as a motivating factor focusing students’ attention
on essay-type question where sudden exposure to MCQ
There are various studies interpreting tools of could have contributed to our present result though study
assessment, their comparison, their source of validity, suggests that for most students, there is a strong
correlation between MCQ and Structured Essay
Questions (SEQ) indicating student performance
independent of testing format[21,22]. To elaborate the
reason further, locally question bank (SAQ type with
their answers) collected from previous years is rampant
in the market. As it is easier and more effective in terms
of scoring marks, students are drawn to those and are
reluctant to go through their recommended text books.
This approach has barred them from acquiring a broader
perspective of the subject and may also have reflected
in the result of our study.
With all the aforementioned potential explanations of
SSAQ being excellent to MCQ in our case we do not in
Figure 1: Sensitivity and specificity of SSAQ and MCQ to any way undermine MCQ as an effective tool in
predict pass–fail outcomes discriminating performance of students. A well-
constructed MCQ is superior in testing the higher where SSAQs are the only form of evaluation of
cognitive skills of medical students [19]. There are theoretical knowledge in anatomy, conducting similar
evidences to support MCQ’s validity to better study where both the tools are followed is highly
discriminate performance of poor and good students[23]. recommended. Their own performance assessment with
Moreover, when assessment tools are combined, formats any singular evaluation tool is not without any flaw. It is
including MCQs were the best examination tools to indispensable that medical colleges take initiative and
distinguish poor from moderate and excellent students[24]. devote time and attempt to assess their present
Despite their advantages MCQs as assessment tool also evaluation methods and develop guidelines on setting
needs to be chosen particularly as test demands [25]. up questions in alignment to learning needs.
Considering aggregate performance MCQ papers
revealed to be less sensitive predictors than the essay REFERENCES
papers[26]. In addition, when comparable reliabilities of [1] Drew S. Student perceptions of what helps them learn and
a question are to be attained the effort of writing case develop in higher education. Teaching in Higher Education
vignettes with SAQ items is less than writing MCQ 2001;6(3):309–31.
items[27]. [2] Trigwell K, Prosser M. Improving the quality of student
learning: the influence of learning context and student
Questions like SSAQs and modified essay questions
Downloaded From IP - 163.47.151.210 on dated 18-Aug-2018
own answer rather than selecting from a list of answers assessment results and approaches to learning and studying
as that of an MCQ and yet support enough accuracy in year two medical students. Medical Education
for answers to be easily marked[8]. Also study evaluating 2007;41:754–62.
the results of examination using SAQ or MEQ [4] Scouller K. The influence of assessment method on students’
demonstrated significant correlation of both the questions learning approaches: multiple-choice question examination
and concluded to minimise effort to construct MEQs by versus assignment essay. Higher Education 1998;35:453–
introducing written examination consisting of both SAQ 72.
and MEQ[18]. The main drawback of SAQ-type question [5] Farley JK. The multiple choice test: writing the questions.
is that it is not easily computer marked and hence not Nurse Educator 1989;14:10–2.
practical as an instrument for evaluating a large number [6] Case S, Swanson D. Constructing written test questions for
of students[17]. the basic and clinical sciences. 3rd ed. Philadelphia: National
Board of Medical Examiners; 2003.
Few limitations of this study needs to be mentioned for [7] Tarrant M, Ware JA. Framework for improving the quality
proper analysis and research prospect. As the study was of multiple-choice Assessments. Nurse Educator
carried out on one institution, the results may not be 2012;37(3):98–104.
generalised and even seem out of context on other [8] Webber RH. Structured short-answer questions: an
setting. Variables like individual performance of students alternative examination method. Medical Education
and overall health of the students during the period of 1992;26(1):58–62.
taking test were not controlled. [9] Biggs J. Teaching for quality learning at university.
Philadelphia: The Society for Research into Higher
This study suggests that though difficulty index of SSAQ Education & Open University Press; 2003.
and MCQ are similar, and in terms of discriminating
[10] Linn RL, Gronlund, NE. Measurement and assessment in
good and poor students, SSAQ was a better tool, ROC teaching. 7th ed. New Jersey: Prentice-Hall; 1995.
analysis revealed both the tools to be adequate in terms
[11] Dixon RA. Evaluating and improving multiple-choice papers:
of discriminating students with a relatively better value
true-false questions in public health medicine. Medical
to SSAQ. As this study was conducted in a university Education 1994;28(5):400–8.
[12] Rahim AFbA. What those number mean? 1st ed. Kubang [21] Pepple DJ, Laurine YE, Carroll RG. A comparison of student
Kerian: KKMED; 2010. Available from: http:// performance in multiple-choice and long essay questions in
www.medic.usm.my/dme/images/stories/staff/KKMED/ the MBBS stage I physiology examination at the University
2010/item_analysis_guide.pdf. of the West Indies (Mona Campus). Advances in Physiology
Education 2010;34(2):86–9.
[13] Ebel RL, Frisbie DA. Essentials of educational measurement.
5th ed. Englewood Cliffs. New Jersey: Prentice-Hall Inc; [22] Mujeeb AM, Pardeshi ML, Ghongane BB. Comparative
1991. assessment of multiple choice questions versus short essay
questions in pharmacology examinations. Indian Journal of
[14] Thomas GT. Interpreting diagnostic test; 2009. Available
Medical Sciences 2010;64(3):118–24.
from: http://gim.unmc.edu/dxtests/Default.htm
[23] Taib F, Yusoff MSB. Difficulty index, discrimination index,
[15] Edward JP, Peter GD. Assessment of higher order cognitive
sensitivity and specificity of long case and multiple choice
skills in undergraduate education: modified essay or
questions to predict medical students’ examination
multiple choice questions? BMC Medical Education
performance. Journal of Taibah University Medical Sciences
2007;7:49:10.
2014;9(2):110–14.
[16] Albanese MA. Challenges in using rater judgments in medical
[24] Eijsvogels TMH, van den Brand TL, Hopman MTE.
education. Journal of Evaluation in Clinical Practice
Multiple choice questions are superior to extended matching
2000;6(3):305–19.
questions to identify medicine and biomedical sciences
[17] Sam AH, Hameed S, Harris J, Meeranr K. Validity of very students who perform poorly. Perspectives on Medical
Downloaded From IP - 163.47.151.210 on dated 18-Aug-2018
short answer versus single best answer questions for Education 2013;2(Nov(5–6)):252–63.
undergraduate assessment. BMC Medical Education
[25] Dascalua CG, Enachea AM, Mavrua RB, Zegan G.
Members Copy, Not for Commercial Sale
2016;16:266.
www.IndianJournals.com