EDUC343 Module+1
Module 1: Overview of Assessment of Learning 1: Utilization of Assessment Data
Time Table: 10 hours
Deepen!
Read and watch the video “Anywhere Math Introduction to Statistics” by Jeffrey Jacobsen on
YouTube.com.
This module is a review of the important tools needed in describing, analyzing and
interpreting assessment results. The topics discussed in this module are measures of central
tendency, measures of variation, skewness, correlation and different types of converted scores.
Every education student and teacher should master this part, because it helps in describing and
analyzing test results accurately, so that the teacher can make appropriate decisions about the
performance of the learners.
What is Statistics?
Statistics plays a very important role in assessing the performance of students, most
especially in describing and analyzing their scores through assessment activities. Teachers should
know how to utilize these data, particularly in decision-making.
Definition of Statistics
Statistics is the branch of science that deals with the collection, presentation, analysis and
interpretation of quantitative data.
Branches of Statistics
There are two branches of statistics: descriptive statistics and inferential statistics.
Descriptive Statistics deals with collecting, describing and analyzing a set of data without drawing
conclusions (or inferences) about a large group of data. Inferential Statistics, on the other hand,
is concerned with the analysis of a subset of data leading to predictions or inferences about the
Assessment of Learning 2 | 2
entire set of data, without dealing with each individual in the population. This means that inferences
about the population can be derived using only a sample, or part, of the population.
Deepen!
Measure of central tendency provides a very convenient way of describing a set of scores
with a single number that describes the performance of a group. It is also defined as a single value
that is used to describe the “center” of the data. It is thought of as a typical value in a given
distribution. There are three commonly used measures of central tendency. These are the mean,
median and mode. In this section, we shall discuss how to compute the value and some of the
properties of the mean, median and mode as applied in the classroom setting.
1. Mean
The mean is the most commonly used measure of the center of data and is also referred to as
the “arithmetic average.”
2. $\bar{x} = \dfrac{\sum fx}{n}$
X (scores)
25
20
18
18
17
15
14
13
12
10
Σx = 162
n = 10
$\bar{x} = \dfrac{\sum x}{n} = \dfrac{162}{10} = 16.2$
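The computation above can be checked with a short Python sketch. The function name `mean` is only illustrative; the scores are the ten scores listed in the example.

```python
# Ungrouped scores from the example above
scores = [25, 20, 18, 18, 17, 15, 14, 13, 12, 10]


def mean(values):
    """Arithmetic mean: the sum of the scores divided by the number of scores."""
    return sum(values) / len(values)


print(sum(scores))   # the sum of the scores
print(len(scores))   # n, the number of scores
print(mean(scores))  # the mean
```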
Analysis:
Example 2: Find the Grade Point Average (GPA) of Timmy for the first semester of the
school year 2018-2019. Use the table below:
Subjects    Grade (Xi)    Units (Wi)    (Wi)(Xi)
BM 112      1.25          3             3.75
BM 101      1.00          3             3.00
AC 103N     1.25          6             7.50
BEC 111     1.00          3             3.00
MGE 101     1.50          3             4.50
MKM 101     1.25          3             3.75
FM 111      1.50          3             4.50
PEN 2       1.00          2             2.00
Total                     Σ(Wi) = 26    Σ(Wi)(Xi) = 32.00
$\bar{x} = \dfrac{\sum W_i X_i}{\sum W_i} = \dfrac{32}{26} = 1.23$
The Grade Point Average of Timmy for the first semester SY 2018-2019 is 1.23.
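As a rough check of the weighted mean, the same table can be computed in Python. The helper name `weighted_mean` is an assumption made for this sketch; the grades and units come from the table.

```python
# Grades (Xi) and units (Wi) taken from Timmy's table
grades = [1.25, 1.00, 1.25, 1.00, 1.50, 1.25, 1.50, 1.00]
units = [3, 3, 6, 3, 3, 3, 3, 2]


def weighted_mean(values, weights):
    """Sum of (weight x value) divided by the sum of the weights."""
    weighted_total = sum(w * x for x, w in zip(values, weights))
    return weighted_total / sum(weights)


gpa = weighted_mean(grades, units)
print(round(gpa, 2))
```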
Grouped data are the data or scores that are arranged in a frequency distribution.
Frequency distribution is the arrangement of scores according to category of classes including
the frequency. Frequency is the number of observations falling in a category.
For this particular lesson we shall discuss only one formula in solving the mean for
grouped data which is called midpoint method. The formula is:
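$\bar{x} = \dfrac{\sum f X_m}{n}$
where,
x̅ = mean value
f = frequency of each class or category
Xm = midpoint or class mark of each class or category
n = number of cases
1. Find the midpoint or class mark (Xm) of each class or category using the formula
$X_m = \dfrac{LL + UL}{2}$, where LL is the lower limit and UL is the upper limit of the class.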
Example 3: The scores of 40 students in a 60-item Science test are tabulated below.
X          f     Xm    fXm
10 – 14    5     12    60
15 – 19    2     17    34
20 – 24    3     22    66
25 – 29    5     27    135
30 – 34    2     32    64
35 – 39    9     37    333
40 – 44    6     42    252
45 – 49    3     47    141
50 – 54    5     52    260
n = 40                 ΣfXm = 1 345
$\bar{x} = \dfrac{\sum f X_m}{n} = \dfrac{1345}{40} = 33.63$
Analysis:
The mean performance of 40 students in Science quiz is 33.63. Those students who got
scores below 33.63 did not perform well in the said examination while those students who got
scores above 33.63 performed well.
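The midpoint method can be sketched in Python as follows. The tuple layout (lower limit, upper limit, frequency) and the name `grouped_mean` are choices made for this illustration only.

```python
# Each class is (lower limit, upper limit, frequency), as in the table above
classes = [
    (10, 14, 5), (15, 19, 2), (20, 24, 3), (25, 29, 5), (30, 34, 2),
    (35, 39, 9), (40, 44, 6), (45, 49, 3), (50, 54, 5),
]


def grouped_mean(class_table):
    """Midpoint method: sum of f * Xm divided by n."""
    n = sum(f for _, _, f in class_table)
    total = sum(f * (ll + ul) / 2 for ll, ul, f in class_table)
    return total / n


# The unrounded value is 33.625, which the module reports as 33.63
print(grouped_mean(classes))
```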
• It measures stability. Mean is the most stable among other measures of central tendency
because every score contributes to the value of the mean.
• The sum of each score’s distance from the mean is zero.
• It is easily affected by the extreme scores.
• It may not be an actual score in the distribution.
• It can be applied to interval level of measurement.
• It is very easy to compute.
2. Median
Median is the second type of measures of central tendency. It refers to the centermost
score when the scores in the distribution are arranged according to magnitude (from highest score
to lowest score or from lowest score to highest score).
x (score)
19
17
16
15
10
5
2
Analysis:
The median score is 15. Fifty percent (50%) or three of the scores are above 15 (19, 17,
16) and 50% or three scores are below 15 (10, 5, 2).
x (score)
30
19
17
16
15
10
5
2
$\tilde{x} = \dfrac{16 + 15}{2} = 15.5$
Analysis:
The median score is 15.5, which means that 50% of the scores are lower than 15.5 (15, 10, 5
and 2) and 50% are greater than 15.5 (30, 19, 17 and 16). That is, four (4) scores are below 15.5
and four (4) scores are above 15.5.
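Both ungrouped cases (odd and even number of scores) can be illustrated with a small Python sketch. The function name `median` is illustrative; the first list uses the seven scores named in the first analysis and the second list uses the eight scores of the second example.

```python
def median(scores):
    """Centermost score; for an even count, the average of the two middle scores."""
    ordered = sorted(scores)
    n = len(ordered)
    middle = n // 2
    if n % 2 == 1:
        return ordered[middle]
    return (ordered[middle - 1] + ordered[middle]) / 2


odd_scores = [19, 17, 16, 15, 10, 5, 2]        # n = 7
even_scores = [30, 19, 17, 16, 15, 10, 5, 2]   # n = 8

print(median(odd_scores))   # 15
print(median(even_scores))  # 15.5
```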
x̃ = median value
MC = median class, the category containing $\frac{n}{2}$
Example 3: The scores of 40 students in a 60-item Science test are tabulated below. The
highest score is 54 and the lowest score is 10.
X          f         cf<
10 – 14    5         5
15 – 19    2         7
20 – 24    3         10
25 – 29    5         15
30 – 34    2         17 (cfp)
35 – 39    9 (fm)    26
40 – 44    6         32
45 – 49    3         35
50 – 54    5         40
n = 40
Solution:
$\dfrac{n}{2} = \dfrac{40}{2} = 20$
The category containing $\frac{n}{2}$ is 35 – 39.
MC = 35 – 39
LL of the MC = 35
LB = 34.5
cfp = 17
fm = 9
c.i = 5
$\tilde{x} = LB + \left(\dfrac{\frac{n}{2} - cfp}{fm}\right) c.i$
$= 34.5 + \left(\dfrac{20 - 17}{9}\right) 5$
$= 34.5 + \left(\dfrac{3}{9}\right) 5$
$= 34.5 + \dfrac{15}{9}$
$= 34.5 + 1.67$
$\tilde{x} = 36.17$
Analysis:
The median value is 36.17, which means that 50% or 20 scores are less than 36.17.
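The solution above can be traced with the following Python sketch. The name `grouped_median` and the (lower limit, frequency) layout are assumptions of this example; the lower boundary is taken as the lower limit minus 0.5, as in the solution.

```python
# (lower limit, frequency) for each class, from lowest to highest
classes = [(10, 5), (15, 2), (20, 3), (25, 5), (30, 2),
           (35, 9), (40, 6), (45, 3), (50, 5)]


def grouped_median(class_table, class_interval):
    """Median of grouped data: LB + ((n/2 - cfp) / fm) * c.i"""
    n = sum(f for _, f in class_table)
    half = n / 2
    cfp = 0  # cumulative frequency before the current class
    for lower_limit, fm in class_table:
        if cfp + fm >= half:           # this is the median class
            lb = lower_limit - 0.5     # lower boundary
            return lb + (half - cfp) / fm * class_interval
        cfp += fm
    raise ValueError("empty frequency table")


print(round(grouped_median(classes, 5), 2))
```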
3. Mode
Mode is the third measure of central tendency. It refers to the score or scores that occur most
frequently in the score distribution.
Types of Mode
Section A    Section B    Section C
24           24           25
20           20           22
20           18           21
20           18           21
16           17           21
12           10           18
10           9            18
7            7            18
Analysis:
The score that appeared most often in section A is 20; hence the mode of section A is 20. There
is only one mode; therefore, the score distribution is called unimodal. The modes of section B are 18
and 24, since both 18 and 24 appeared twice. There are two modes in section B; hence, the
distribution is a bimodal distribution. The modes for section C are 18, 21 and 25. There are three
modes for section C; therefore, it is called a trimodal or multimodal distribution.
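Counting how often each score occurs can be sketched in Python as shown below. The function name `modes` is illustrative. The first list is the section A column as printed in the table; the second list is a hypothetical set of scores, made up only to show a bimodal result.

```python
from collections import Counter


def modes(scores):
    """Return every score that occurs with the highest frequency."""
    counts = Counter(scores)
    highest = max(counts.values())
    return sorted(score for score, count in counts.items() if count == highest)


section_a = [24, 20, 20, 20, 16, 12, 10, 7]   # section A column from the table
hypothetical = [24, 24, 20, 18, 18, 10]       # made-up scores, not from the module

print(modes(section_a))      # one mode: unimodal
print(modes(hypothetical))   # two modes: bimodal
```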
In solving the mode value using grouped data, use the formula:
$\hat{x} = LB + \left(\dfrac{d_1}{d_1 + d_2}\right) c.i$
d1 = difference between the frequency of the modal class and the frequency of the class
listed just above it (the next lower class), when the scores are arranged from lowest to highest.
d2 = difference between the frequency of the modal class and the frequency of the class
listed just below it (the next higher class), when the scores are arranged from lowest to highest.
Example 2: The scores of 40 students in a 60-item Science test are tabulated below.
X          f
10 – 14    5
15 – 19    2
20 – 24    3
25 – 29    5
30 – 34    2
35 – 39    9
40 – 44    6
45 – 49    3
50 – 54    5
n = 40
Modal Class = 35 – 39
LL of MC = 35
LB = 34.5
d1 = 9 – 2 = 7
d2 = 9 – 6 = 3
c.i = 5
$\hat{x} = LB + \left(\dfrac{d_1}{d_1 + d_2}\right) c.i$
$= 34.5 + \left(\dfrac{7}{7 + 3}\right) 5$
$= 34.5 + \dfrac{35}{10}$
$= 34.5 + 3.5$
$\hat{x} = 38$
The mode of the score distribution of the 40 students is 38. Because the data are grouped, 38 is
an estimate of the score around which the scores occur most frequently.
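A minimal Python sketch of the grouped-data mode is given below. The name `grouped_mode` is illustrative, and treating a missing neighboring class as frequency 0 is an assumption of this sketch, not something stated in the module.

```python
# (lower limit, frequency) for each class, from lowest to highest
classes = [(10, 5), (15, 2), (20, 3), (25, 5), (30, 2),
           (35, 9), (40, 6), (45, 3), (50, 5)]


def grouped_mode(class_table, class_interval):
    """Mode of grouped data: LB + (d1 / (d1 + d2)) * c.i"""
    freqs = [f for _, f in class_table]
    i = freqs.index(max(freqs))                        # modal class
    lower_f = freqs[i - 1] if i > 0 else 0             # next lower class
    higher_f = freqs[i + 1] if i < len(freqs) - 1 else 0  # next higher class
    d1 = freqs[i] - lower_f
    d2 = freqs[i] - higher_f
    lb = class_table[i][0] - 0.5                       # lower boundary
    return lb + d1 * class_interval / (d1 + d2)


print(grouped_mode(classes, 5))
```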
Practice!
You will need paper and pen to complete the following exercises on finding the mean,
median and mode. Just click the link:
https://www.riosalado.edu/web/oer/wrkdev100-20011_inter_0000_v1/m5/pdf/m5_l1_mean_median_mode_practice_probs.pdf
Quantiles
A quantile is a score point that divides the scores in a distribution into equal parts.
There are three kinds of quantiles. A quartile is a score point that divides the scores in the
distribution into four (4) equal parts. A decile is a score point that divides the scores in the
distribution into ten (10) equal parts. A percentile is a score point that divides the scores in the
distribution into one hundred (100) equal parts.
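$Q_k = \left[\dfrac{k}{4} n + \left(1 - \dfrac{k}{4}\right)\right]^{\text{th}}$ score
$Q_1 = \left[\dfrac{1}{4} n + \left(1 - \dfrac{1}{4}\right)\right]^{\text{th}}$ score
$Q_2 = \left[\dfrac{2}{4} n + \left(1 - \dfrac{2}{4}\right)\right]^{\text{th}}$ score
$Q_3 = \left[\dfrac{3}{4} n + \left(1 - \dfrac{3}{4}\right)\right]^{\text{th}}$ score
where,
Qk = the indicated quartile
k = 1, 2, 3
n = number of cases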
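$D_k = \left[\dfrac{k}{10} n + \left(1 - \dfrac{k}{10}\right)\right]^{\text{th}}$ score
$D_1 = \left[\dfrac{1}{10} n + \left(1 - \dfrac{1}{10}\right)\right]^{\text{th}}$ score
$D_9 = \left[\dfrac{9}{10} n + \left(1 - \dfrac{9}{10}\right)\right]^{\text{th}}$ score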
where,
k = 1, 2, 3, 4, 5, 6, 7, 8, 9
n = number of cases
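$P_k = \left[\dfrac{k}{100} n + \left(1 - \dfrac{k}{100}\right)\right]^{\text{th}}$ score
$P_1 = \left[\dfrac{1}{100} n + \left(1 - \dfrac{1}{100}\right)\right]^{\text{th}}$ score
$P_{99} = \left[\dfrac{99}{100} n + \left(1 - \dfrac{99}{100}\right)\right]^{\text{th}}$ score
where,
k = 1, 2, 3, ..., 97, 98, 99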
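The three position formulas share one pattern, which the following Python sketch makes explicit. The function `quantile_position` and its parameter `parts` (4 for quartiles, 10 for deciles, 100 for percentiles) are names invented for this illustration; n = 9 is used only as a sample number of cases.

```python
from fractions import Fraction


def quantile_position(k, parts, n):
    """Position of the k-th quantile: (k/parts) * n + (1 - k/parts)."""
    share = Fraction(k, parts)   # exact fraction, avoids rounding error
    return share * n + (1 - share)


n = 9  # sample number of cases

print(quantile_position(1, 4, n))     # position of Q1
print(quantile_position(3, 4, n))     # position of Q3
print(quantile_position(5, 10, n))    # position of D5
print(quantile_position(50, 100, n))  # position of P50
```

With n = 9, Q2, D5 and P50 all point to the same position, the 5th score, which is the median.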
Deepen!
Measure of Variation is a single value that is used to describe the spread of the scores in
a distribution. The term variation is also known as variability or dispersion. There are several ways
of describing the variation of scores: absolute measures of variation and relative measures of
variation.
1. Range
Range (R) is the difference between the highest score and the lowest score in a
distribution. Range is the simplest and the crudest measure of variation, simplest because we shall
only consider the highest score and the lowest score.
R = HS – LS
where,
R = range value
HS = highest score
LS = lowest score
Group A      Group B
10 (LS)      15 (LS)
12           16
15           16
17           17
25           17
26           23
28           25
30           26
35 (HS)      30 (HS)
$R_A = HS - LS = 35 - 10 = 25$
$R_B = HS - LS = 30 - 15 = 15$
Analysis:
The range of Group A = 25 is greater than the range of Group B = 15. The implication of
this is that the scores in group A are more spread out than the scores in group B or the scores in
group B are less scattered than the scores in group A.
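The comparison of the two groups can be reproduced with a few lines of Python. The function name `score_range` is illustrative; the scores are those of Group A and Group B above.

```python
group_a = [10, 12, 15, 17, 25, 26, 28, 30, 35]
group_b = [15, 16, 16, 17, 17, 23, 25, 26, 30]


def score_range(scores):
    """Range: highest score minus lowest score."""
    return max(scores) - min(scores)


print(score_range(group_a))  # 25
print(score_range(group_b))  # 15
```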
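$R = HS_{UB} - LS_{LB}$
where,
R = range value
$HS_{UB}$ = upper boundary of the highest score
$LS_{LB}$ = lower boundary of the lowest score
LL of the LS = 25
$LS_{LB}$ = 24.5
UL of the HS = 97
$HS_{UB}$ = 97.5
$R = HS_{UB} - LS_{LB}$
$R = 97.5 - 24.5$
$R = 73$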
If the range is large, the scores are more dispersed, widespread or heterogeneous. On the
other hand, if the range is small, the scores are less dispersed, less scattered or homogeneous.
Properties of Range:
Inter-quartile range is the difference between the third quartile and the first quartile.
$IQR = Q_3 - Q_1$
Quartile deviation indicates the distance we need to go above and below the median to
include the middle 50% of the scores. It is based on the range of the middle 50% of the scores,
instead of the range of the entire set.
The formula for computing the quartile deviation is $QD = \dfrac{Q_3 - Q_1}{2}$, where QD is
the quartile deviation value, Q1 is the value of the first quartile and Q3 is the value of the third
quartile.
Example: Using the given data 6, 8, 10, 12, 12, 14, 15, 16, 20, find the quartile deviation.
x (score)
6
8
10
12
12
14
15
16
20
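Solve for Q1
n = 9
$Q_k = \left[\dfrac{k}{4} n + \left(1 - \dfrac{k}{4}\right)\right]^{\text{th}}$ score
$Q_1 = \left[\dfrac{1}{4} n + \left(1 - \dfrac{1}{4}\right)\right]^{\text{th}}$ score
$= \left[\dfrac{1}{4}(9) + \left(1 - \dfrac{1}{4}\right)\right]^{\text{th}}$ score
$= \left[\dfrac{9}{4} + \dfrac{3}{4}\right]^{\text{th}}$ score
$= \left[\dfrac{12}{4}\right]^{\text{th}}$ score
Q1 = 3rd score
Q1 = 10
Solve for Q3
$Q_3 = \left[\dfrac{3}{4} n + \left(1 - \dfrac{3}{4}\right)\right]^{\text{th}}$ score
$= \left[\dfrac{3}{4}(9) + \left(1 - \dfrac{3}{4}\right)\right]^{\text{th}}$ score
$= \left[\dfrac{27}{4} + \dfrac{1}{4}\right]^{\text{th}}$ score
$= \left[\dfrac{28}{4}\right]^{\text{th}}$ score
Q3 = 7th score
Q3 = 15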
$IQR = Q_3 - Q_1 = 15 - 10 = 5$
$QD = \dfrac{Q_3 - Q_1}{2} = \dfrac{15 - 10}{2} = \dfrac{5}{2} = 2.5$
Analysis:
The larger the value of the IQR or QD, the more dispersed the scores at the middle 50% of
the distribution. On the other hand, if the IQR or QD is small, the scores are less dispersed at the
middle 50% of the distribution. The point of dispersion is the median value.
When the value of IQR and QD is small, the scores are clustered within the middle 50% of
the score distribution. On the other hand, the scores are dispersed in the middle 50% of the
distribution when the value of IQR and QD is large. To determine which distribution is more
clustered or dispersed, compare it with another distribution, since there is no
standard value of a small or large value of IQR and QD.
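The quartile deviation example can be sketched in Python as follows. The function name `quartile` is illustrative. When the computed position is not a whole number, this sketch interpolates between the two neighboring scores; that interpolation step is an assumption of the example and is not described in the module.

```python
def quartile(scores, k):
    """k-th quartile using the position (k/4) * n + (1 - k/4)."""
    ordered = sorted(scores)
    n = len(ordered)
    position = k / 4 * n + (1 - k / 4)   # 1-based position in the ordered list
    whole = int(position)
    fraction = position - whole
    if fraction == 0 or whole >= n:
        return ordered[whole - 1]
    # Assumed rule: interpolate between the two neighboring scores
    return ordered[whole - 1] + fraction * (ordered[whole] - ordered[whole - 1])


scores = [6, 8, 10, 12, 12, 14, 15, 16, 20]

q1 = quartile(scores, 1)
q3 = quartile(scores, 3)
iqr = q3 - q1
qd = iqr / 2

print(q1, q3, iqr, qd)
```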
Variance is one of the most important measures of variation. It shows variation about the
mean.
Population Variance
$\sigma^2 = \dfrac{\sum (X - \mu)^2}{N}$
Sample Variance
$s^2 = \dfrac{\sum (x - \bar{x})^2}{n - 1}$
Note: If the variance is already solved, take the square root of the variance to get the value of the
standard deviation.
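The two variance formulas differ only in the divisor, which the Python sketch below illustrates. The function names are illustrative, and the data are the ten scores from the first mean example, reused here only as sample input.

```python
import math

scores = [25, 20, 18, 18, 17, 15, 14, 13, 12, 10]


def sum_of_squared_deviations(values):
    """Sum of (x - mean)^2 over all scores."""
    center = sum(values) / len(values)
    return sum((x - center) ** 2 for x in values)


def population_variance(values):
    return sum_of_squared_deviations(values) / len(values)


def sample_variance(values):
    return sum_of_squared_deviations(values) / (len(values) - 1)


pop_var = population_variance(scores)
samp_var = sample_variance(scores)

# The standard deviation is the square root of the variance
print(pop_var, math.sqrt(pop_var))
print(samp_var, math.sqrt(samp_var))
```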
Coefficient of variation shows variation relative to the mean. It is used to compare two
or more distributions of scores. It is usually expressed in percent: the smaller the value of the
coefficient of variation, the more homogeneous the scores are. On the other hand, the higher the
value of the coefficient of variation, the more dispersed the scores are in that particular distribution.
$CV = \dfrac{s}{\bar{x}} \times 100\%$
where,
s = standard deviation
x̅ = mean value
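A minimal sketch of the comparison, assuming the coefficient of variation is the sample standard deviation divided by the mean, times 100. The data are Group A and Group B from the range example, reused here only as sample input.

```python
from statistics import mean, stdev


def coefficient_of_variation(scores):
    """Sample standard deviation relative to the mean, in percent."""
    return stdev(scores) / mean(scores) * 100


group_a = [10, 12, 15, 17, 25, 26, 28, 30, 35]
group_b = [15, 16, 16, 17, 17, 23, 25, 26, 30]

cv_a = coefficient_of_variation(group_a)
cv_b = coefficient_of_variation(group_b)

print(round(cv_a, 2), round(cv_b, 2))
# The group with the smaller value is the more homogeneous one
print("Group B" if cv_b < cv_a else "Group A")
```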
Deepen!
Measures of Skewness
Measure of skewness describes the degree of departure of the scores from symmetry.
The skewness coefficient Sk can be solved using the formula:
$Sk = \dfrac{3(\bar{x} - \tilde{x})}{s}$
where x̅ = mean value, x̃ = median value and s = standard deviation.
Positively skewed or skewed to the right is a distribution where the thin end tail of the
graph goes to the right part of the curve. This happens when most of the scores of the students are
below the mean.
Negatively skewed or skewed to the left is a distribution where the thin end tail of the graph
goes to the left part of the curve. This happens when most of the students' scores are above
the mean.
[Figure: Graphical Representation of Positively Skewed Distribution (Sk > 0); from left to right along the horizontal axis: mode (x̂), median (x̃), mean (x̅)]
Positively Skewed Distribution means that the students who took the examination performed
very poorly. Most of the scores are low; hence most of the students got scores below the mean value.
Mean value is greater than the median and the mode values. Example: Mean = 50, Median = 47,
and Mode = 43.
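[Figure: negatively skewed distribution; from left to right along the horizontal axis: mean (x̅), median (x̃), mode (x̂)]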
Negatively Skewed Distribution means that the students who took the examination
performed well. Most of the scores are high; hence, most of the students got scores above the
mean value. Mean value is less than the median and the mode values. Example: Mean = 43,
Median = 47, and Mode = 50.
[Figure: symmetrical distribution, where x̅ = x̃ = x̂]
Given:
x̅ = 38.50
x̃ = 35.25
s = 2.50
$Sk = \dfrac{3(\bar{x} - \tilde{x})}{s}$
$= \dfrac{3(38.50 - 35.25)}{2.50}$
$= \dfrac{3(3.25)}{2.50}$
$= \dfrac{9.75}{2.50}$
$Sk = 3.9$
Interpretation:
Sk = 3.90, so the value is positive. The score distribution is positively skewed. Most of the
scores are low; thus the students performed poorly in the said examination.
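The computation and its interpretation can be sketched in Python. The function names are illustrative; the values are the mean, median and standard deviation given in the example.

```python
def skewness(mean_value, median_value, standard_deviation):
    """Skewness coefficient: Sk = 3 * (mean - median) / s"""
    return 3 * (mean_value - median_value) / standard_deviation


def describe(sk):
    """Interpret the sign of the skewness coefficient."""
    if sk > 0:
        return "positively skewed"
    if sk < 0:
        return "negatively skewed"
    return "symmetrical"


sk = skewness(38.50, 35.25, 2.50)
print(sk, describe(sk))
```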
Standard Scores
In this section, we shall discuss the different kinds of converted scores. There are four (4)
types of standard score: z-scores, t-scores, standard nine (stanines) and percentile ranks.
Scores directly obtained from the test are known as actual scores or raw scores. On their
own, such scores cannot be interpreted as low, average or high. Scores must be
converted or transformed so that they become meaningful and allow some kind of interpretation
and direct comparison of two scores.
1. z-scores
The z-score is used to convert a raw score to standard score to determine how far a raw
score lies from the mean in standard deviation units. From this we can also determine whether an
individual student performs well in the examination compared to the performance of the whole
class.
The z-score value indicates the distance between the given raw score and the mean value
in units of the standard deviation. The z-value is positive when the raw score is above the mean
while the z is negative when the raw score is below the mean. The formula of z-score is:
$z = \dfrac{x - \mu}{\sigma}$ or $z = \dfrac{x - \bar{x}}{s}$
where z = z-value
x = raw score
x̅ = sample mean
μ = population mean
The z-score formula is essential when we compare the performance of a student in
his or her subjects, or the performance of two students who belong to different groups. It can determine
the exact location of a score, whether above or below the mean, and how many standard
deviation units it is from the mean.
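A short sketch of such a comparison is shown below. The function name `z_score` is illustrative, and the raw scores, means and standard deviations are hypothetical values invented for this example, not data from the module.

```python
def z_score(raw_score, mean_value, standard_deviation):
    """Distance of a raw score from the mean, in standard deviation units."""
    return (raw_score - mean_value) / standard_deviation


# Hypothetical student: two subjects with different class means and spreads
z_math = z_score(85, 75, 5)      # raw 85, class mean 75, s = 5
z_english = z_score(60, 70, 8)   # raw 60, class mean 70, s = 8

print(z_math)     # positive: above the mean
print(z_english)  # negative: below the mean
```

In this made-up case the student performed better in Math than in English relative to each class, because the Math z-score is higher.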
2. T-scores
A z-score can take two kinds of values: positive if the raw score is above the mean and
negative if the raw score is below the mean. To avoid confusion between negative and positive
values, use the T-score to convert raw scores. The T-score is another type of standard score, where the mean
is 50 and the standard deviation is 10; in the z-score, the mean is 0 and the standard deviation is one
(1). To convert a raw score to a T-score, first find the z-score equivalent of the raw score and then use the
formula T-score = 10z + 50.
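The two-step conversion can be sketched as follows. The function names are illustrative, and the raw scores, means and standard deviations are hypothetical values used only to show the arithmetic.

```python
def z_score(raw_score, mean_value, standard_deviation):
    """Step 1: convert the raw score to a z-score."""
    return (raw_score - mean_value) / standard_deviation


def t_score(z):
    """Step 2: T-score = 10z + 50."""
    return 10 * z + 50


# Hypothetical raw scores, means and standard deviations
t_above = t_score(z_score(85, 75, 5))   # z = 2.0
t_below = t_score(z_score(60, 70, 8))   # z = -1.25

print(t_above, t_below)
```

A score below the mean gives a T-score below 50 rather than a negative number.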
3. Standard Nine
The third type of standard score is the standard nine-point scale, also known as
stanine, from sta(ndard) + nine. A stanine is a nine-point grading scale ranging from 1
to 9, 1 being the lowest and 9 the highest. Stanine grading is easier to understand than the other
standard score models. Stanines 1, 2 and 3 are interpreted as below average, stanines
4, 5 and 6 as average, and stanines 7, 8 and 9 as above average.
Stanines are used to compare two or more distributions of data, particularly test scores;
to estimate or compute probabilities of events involving normal distributions; and to facilitate using words
rather than numbers in presenting statistical data.
The given figure below indicates the percentage of scores in each stanine and the
corresponding descriptions.
4. Percentile Rank
Another way of converting a raw score to a standard score is the percentile rank. A
percentile rank indicates the percentage of scores that lie below a given score. For example, a test
score which is greater than 95% of the scores of the examinees is said to be at the 95th percentile. If the
scores are normally distributed, the percentile rank can be inferred from the standard score. In solving
for the percentile rank, use the formula:
$PR = \left(\dfrac{CF_b + 0.5F_g}{n}\right) \times 100$
where,
PR = percentile rank
Solving for the percentile rank by hand is tedious and requires a long process. We can shorten the
solution using the SPSS program or the Excel program, which are easier to use and
cheaper than other software.
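The percentile rank formula can also be sketched in Python. This sketch assumes that CFb is the number of scores below the given score and Fg is the number of scores equal to it; the module does not define these symbols in the surviving text, so treat both readings as assumptions. The scores are reused from the quartile deviation example.

```python
def percentile_rank(scores, given_score):
    """PR = ((CFb + 0.5 * Fg) / n) * 100"""
    cf_below = sum(1 for s in scores if s < given_score)   # assumed meaning of CFb
    f_given = sum(1 for s in scores if s == given_score)   # assumed meaning of Fg
    return (cf_below + 0.5 * f_given) / len(scores) * 100


scores = [6, 8, 10, 12, 12, 14, 15, 16, 20]

print(round(percentile_rank(scores, 12), 2))
print(round(percentile_rank(scores, 20), 2))
```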
DESCRIBING RELATIONSHIPS
Correlation refers to the extent to which two variables (distributions) are linearly related or
associated. The extent of correlation is indicated numerically by the coefficient of
correlation (rxy). The correlation coefficient (rxy) is also known as the Pearson Product Moment
Correlation Coefficient, in honor of Karl Pearson, who developed the said formula. The correlation
coefficient ranges from -1 to +1. There are three kinds of correlation based on the correlation
coefficient: (1) positive correlation; (2) negative correlation and (3) zero correlation. There are two
ways of identifying the correlation between the two variables: (1) using the formula and (2) using
a scatter plot or scattergram.
Kinds of Correlation
1. Positive Correlation
High scores in distribution x are associated with high scores in distribution y. Low scores in
distribution x are associated with low scores in distribution y. This means that as the value of x
increases the value of y increases too or as the value of x decreases, the y values will also
decrease.
2. Negative Correlation
High scores in distribution x are associated with low scores in distribution y. Low scores in
distribution x are associated with high scores in distribution y. This means that as the values of x
increase, the values of y decrease or when the values of x decrease, the values of y increase.
3. Zero Correlation
The formula in computing the correlation coefficient using the Pearson Product Moment
Correlation is:
$r_{xy} = \dfrac{n\sum xy - (\sum x)(\sum y)}{\sqrt{\left[n\sum x^2 - (\sum x)^2\right]\left[n\sum y^2 - (\sum y)^2\right]}}$
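The formula can be translated term by term into Python, as in the sketch below. The function name `pearson_r` is illustrative, and the x and y lists are hypothetical paired scores invented for this example.

```python
import math


def pearson_r(x, y):
    """Pearson r using the raw-score formula with sums of x, y, xy, x^2 and y^2."""
    n = len(x)
    sum_x = sum(x)
    sum_y = sum(y)
    sum_xy = sum(a * b for a, b in zip(x, y))
    sum_x2 = sum(a * a for a in x)
    sum_y2 = sum(b * b for b in y)
    numerator = n * sum_xy - sum_x * sum_y
    denominator = math.sqrt((n * sum_x2 - sum_x ** 2) * (n * sum_y2 - sum_y ** 2))
    return numerator / denominator


# Hypothetical paired scores of five students in two tests
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]

r = pearson_r(x, y)
print(round(r, 4))  # a positive value indicates positive correlation
```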
The given figure below indicates the interpretation of the size of a correlation coefficient