Professional Documents
Culture Documents
EDUC 202 PER Module SGS
EDUC 202 PER Module SGS
EDUC 202 PER Module SGS
T= P–Q
N(N – 1)/2
In handling ties, first take the x distribution and for each test set of ties
determine x (x – 1), where x is the number tied for a rank. These are summed and
divided by 2.
2 (2 – 1) + 3 (3 – 1) = 2 + 6 = _8_ = 4
2 2 2
Repeat the process for the y distribution:
2 ( 2 – 1) = 1
2
Next calculate:
N (N – 1)/2 = 10 (9)/2 = 45
45 – 4 = 41
45 – 1 = 44
Multiplying the two terms (41) (44) = 1804
Activity 8:
1. Using Kendall’s Tau coefficient, determine the correlation of Test A
and Test B and interpret the result.
Test A Test B
36 18
34 26
33 24
29 18
29 15
28 12
28 14
28 13
24 12
22 10
19 8
12 9
Suppose the five judges (m) rank the projects of ten individuals (N) in a judging
contest, and we wish to determine the overall relationship among the ratings of the
five judges. The rankings of these judges have been set up in a given table. First
the rankings by the five judges of each of the projects are summed. The sums
appear in column 3. Then column 3 is summed to give the total sum of the ranks.
This can be checked for the total sum of the ranks as follows:
Total sum of ranks = m (N) (N + 1)
2
Individual Judges’ Ranks Sum of Difference of
Project 1 2 3 4 5 Ranks the mean and D2
the ranks (D)
1 2 1 2 3 4 12 15.5 240.25
2 1 3 1 2 2 9 18.5 342.25
3 3 4 4 1 3 15 12.5 156.25
4 5 5 5 5 1 21 6.5 42.25
5 4 2 6 7 6 25 2.5 6.25
6 7 8 3 4 7 29 1.5 2.25
7 6 6 8 6 5 31 3.5 12.25
8 8 7 7 8 9 39 11.5 132.25
9 9 10 10 9 8 46 18.5 342.25
10 10 9 9 10 10 48 20.5 420.25
∑ = 275 ∑D2 =
1696.5
To check:
Total sum of ranks = m (N) (N + 1)
2
Total sum of ranks = 5 (10) (10 + 1) = 275
2
Mean: x = 275 ÷ 10 = 27.5
If there were no relationship among the ranks, we should expect the sum of the
ranks for each row to be equal. For this case the sum of each would be the average
sum of ranks or 275/10, which equals 27.5. Then obtain the difference from this
mean of the sum of the ranks of each row and square these differences. Then these
squares are summed.
To compute W, using the formula:
W= 12 ∑D2 = ___12 (1696.5)___ = 0.82
m2 (N) (N2 – 1) 25 (10) (110 – 1)
Interpretation of W:
The size of this coefficient of concordance indicates that there is high
agreement among these five judges in the ranking of the ten projects. Perfect
agreement is indicated by W = 1 and lack of agreement by W = 0.
Activity 9:
1. Three judges rated eight essays with the following results. Calculate
the coefficient of concordance for these data.
Essay Judges
1 2 3
1 8 7 8
2 6 5 6
3 4 6 5
4 1 2 1
5 3 3 2
6 2 1 3
7 5 4 4
8 7 8 7
Essay Judges
1 2 3 4
1 1 1 1 1
2 2 4 3 2
3 3 3 2 4
4 4 2 4 3
5 5 6 5 5
6 6 5 6 7
7 7 7 8 6
8 8 8 7 8
Students Before the Demo During the Demo After the Demo
Solution:
Ho: The probability of “in favor” responses is the same for all three conditions
(before, during, and after the demonstrations), that there are no differences in the
student’s attitudes towards the banning of pornographic movies and reading
materials before, during, and after the series of student’s demonstrations.
Ha: The probabilities of “in favor” responses differ according to the condition
under which the interview takes place; that is, there are differences in the student’s
attitudes towards the banning proposal before, during, and after the series of
demonstrations.
Level of Significance is 0.05
Table value: refer to x2 table: df = k – 1 = 3 – 1 = 2; t value = 5.991
∑G = 4 + 9 + 7 = 20
Q = (k – 1) [ k ∑G2 – (∑G)2 ]
K ∑T - ∑T2
Interpretation: The tabular value (5.991) is less than the computed value (6.33),
therefore the null hypothesis is rejected and the alternative hypothesis is accepted. It
means that the response of the students differs according to the condition under
which the interviewed takes place.
Activity 10:
1. The data shows the response of the drug experienced individuals before, during and
after imprisoned for at least 1 year. Determine if difference exists on their behavior and
attitudes during these periods were interviewed takes place. Level of significance set at
0.05.
X2r = [ 12 ∑C2j ] – 3N (k + 1)
N k (k + 1)
Example: One may consider the following hypothetical case. Suppose it is desired
to analyze the achievement of male students taught by one of the three methods of
instruction, namely: (1) programmed text, (2) lecture, and (3) lecture with discussion.
Suppose further, 12 sets of k = 3 male students per set are chosen in such a way that they
are matched according to age and IQ and that the three subjects in each set are each
randomly assigned to one of the three methods of instruction. The end results in terms of
scores are presented.
Solution:
Ho: The three methods of instruction do not have different effects
Ha: The three methods of instruction have differential effects.
Level of Significance = 0.05
Refer to X2 table: df = k – 1 = 2
Tabular value = 5.991
Students Rank – Prog. Test Rank - Lecture Rank – Lec with Dis.
1 3 1 2
2 2 1 3
3 3 1 2
4 2 3 1
5 2 1 3
6 3 2 1
7 2 1 3
8 3 1 2
9 1 2 3
10 3 1 2
11 2 1 3
12 3 2 1
C1 = 29 C2 = 17 C3 = 26
X2r = [ 12 ∑C2j ] – 3N (k + 1)
N k (k + 1)
Interpretation: The Computed X2r is greater that the tabular value of 5.991, which
means that the null hypothesis is rejected and the alternative hypothesis is accepted.
This means that the three methods of instruction have differential effects.
Activity 11:
H= 12 [∑Ri2] – 3(N + 1)
N (N + 1) ni
To illustrate the test a given table with the set of data is presented:
Example:
Variable Rank x Variable Rank y Variable Rank z
x y z
12 13.5 13 16 13 16
16 21 18 22 14 19
14 19 14 19 7 8.5
2 1.5 13 16 8 10.5
12 13.5 8 10.5 4 4.5
7 8.5 3 3
6 7 2 1.5
4 4.5 5 6
9 12
Nx = 5 ∑Rx = 68.5 Ny = 8 ∑Ry = 103.5 Nz = 9 ∑Rz = 81
The scores of the three sets are combined and ranked with the lowest score
receiving a rank value of 1. Ties are treated in the usual fashion for ranking data.
Then the ranks of the three columns are summed. If the sample proportions are
similar, the total sum of the ranks would be divided proportionately among the
various samples on the basis of sample size. The sums of the ranks of the various
samples are then tested against these proportions of the total sum of ranks.
H= 12 [∑Ri2] – 3(N + 1)
N (N + 1) ni
H = 2.30
When the samples contain five or more cases, H is interpreted as chi-square with
the number of samples minus 1 degree of freedom. For this problem df is equal to 2
and this is not significant.
Activity 12:
1. The following data from Borislow show the changed response scores
of three groups on the Edwards Personal Preference Schedule. Test the differences
among these three groups at 0.01 and 0.05 level of significance.
A B C
8 2 12
7 5 4
15 8 11
10 17 5
9 10 14
7 9 11
8 15
2 8
When the sample sizes are very small-that is, when both N1 and N2 are made up
of eight or less measures – the reader is referred to Siegel (1956) for a method and
tables that apply to such sample sizes. When the larger of the two samples is 9 or
more, the procedure describe below is used.
Example 1. In the table there are two samples, Nx with seven cases and Ny with
nine cases. In this case we are testing the hypothesis that both samples come from
the same population. First rank all the scores in one composite distribution in an
increasing order size. The score of 3 in column y is the lowest one; hence give it a
rank of 1. The score of 4 in column x is the second lowest, give it a rank of 2.
There is a score of 6 in both x and y distribution, handle the ties by giving them the
average of the next two ranks, which for this distribution is 3.5. Continue the
ranking until all scores receive a rank.
At this point make an arithmetic check on the work. The sum of the two
columns of ranks must be equal to N (N + 1)/2.
Table:
X Y Rank (x) Rank (y)
14 18 11.5 15
12 16 9 14
13 15 10 13
10 14 8 11.5
7 19 5.5 16
6 7 3.5 5.5
4 8 2 7
6 3.5
3 1
Nx = 7 Ny = 9 ∑Rx = 49.5 ∑Ry = 86.5
z= U1 – (N1N2/2)
√[N1N2(N1 + N2 + 1)]/12
To check: U2 = N1N2 – U1
= 7(9) – 41.5
= 21.50
Refer to table:
Level of significance = 0.01
Two tailed test - refer critical values of U:
Ni = 7 and N2 = 9, U must be 9 or less to be significant.
Example 2. (For larger Set of data), Find out if difference exists between the
variable x and y at 0.01 level of significance.
Given:
Variable x Variable y
7 18
10 14
12 15
6 16
8 12
4 10
6 8
14 13
2 13
4 3
6
7
3
5
2
1
13
14
11
10
9
N1 = 21 N2 = 10
Solution:
Ho: There is no significant difference between the scores of variable x and y.
Ha: Significant difference exists between variable x and y.
Level of significance: 0.01
Non-directional: two-tailed test
Table value of z = 2.58
Solve for Z:
z= U1 – (N1N2/2)
√[N1N2(N1 + N2 + 1)]/12
Z= 168.5 – 21(10)/2
√ [21(10) (21 + 10 + 1]/12
Z = 2.68
Interpretation: The computed z (2.68) is greater than tabular value (2.58), it means
that the null hypothesis is rejected and the alternative hypothesis is accepted. This
means that difference exists between the scores of variable x and variable y.
Activity 13:
Normal Psychotics
6 7
6 2
14 12
13 13
15 8
6 6
8 4
7 2
10 2
14 12
10
14
Group I Group II
10 14
18 18
36 22
22 16
28 38
29 26
32 28
15 12
18 11
36 19
21 16
27
33
36
26
24
31
13
19
23
25
27
32
SD = 40 (522,781) – (4,561)2
√ 40 (39)
SD = 8.34
p = n1/n q = n2/n
= 10/40 = 30/40
= 0.25 = 0.75
rb = 0.91
Equation: rpb = y1 – y2 √ pq
SD
Where y1 is the mean of the measurements in the first category, y2 is the mean
of the measurements in the second category, SD is the standard deviation of the
measurements, p is the proportion of measurements in the first category, and q is
the proportion of measurements in the second category.
Example 1. Determine the extent of relationship between expenditures and sex, the
average weekly expenditures of 28 males and 22 female respondents were
computed over a one-month period. The data are shown in the table. Unit is in
thousand pesos.
SD = 50 (1,529.71) – (243.45)2
√ 50 (49)
SD = 2.65
p = n1/n q = n2/n
= 28/50 = 22/50
= 0.56 = 0.44
√ pq = 0.4964
rpb = y1 – y2 √ pq
SD
Activity 14:
You are a principal of a particular High School. You conducted a follow-up study to
see the correlation of external productivity of your school. See if the following variables
are significantly correlated.
A B C D E F
Successful Grade English Mathematic Involvement Exposure to
School in Extra Community
Graduates Grade s Grade
Average Curricular Programs
Activities
+ 85 74 73 + -
- 83 85 88 - -
+ 78 86 87 + +
+ 84 71 86 - -
+ 86 89 85 - +
- 79 74 90 + -
- 85 86 91 + +
- 84 85 93 - +
+ 93 92 89 - +
+ 91 90 87 + -
- 87 91 89 + -
+ 86 97 88 - -
- 85 98 87 - -
+ 94 73 88 + +
+ 95 85 80 + +
+ 98 88 84 - +
- 96 87 86 - -
+ 90 88 85 - -
- 89 91 84 + +
- 87 93 83 - +
+ 89 90 82 + +
+ 83 92 80 + -
- 84 90 79 - -
+ 85 89 88 - +
+ 87 88 85 + +
- 80 87 80 - +
- 80 86 82 + +
With the test-retest method a test is administered and then at a later date the
same test is re-administered to the same individuals. A Pearson product-moment
correlation coefficient is computed between the two sets of scores. The length of
the time between the two administrations of the test is important in determining the
size of the reliability coefficient. In general, the longer the time between the two
administrations of the test, the lower the correlation. If the period between the
administrations is very short, such as the second test immediately following the
first, individuals may remember their answers and put the same responses down the
second time without making a new effort to react to the test item. Such behavior
tends to make reliability coefficients artificially high. When the period between
testing is short, memory may be an important factor affecting the results. As the
period increases in length, learning, maturation, senescence, and many other
variables may enter the situation to lower the correlation coefficient. At the present
time, the test-retest method is frequently used when determining the reliability of
paper-and -pencil tests. Coefficients computed by this method are frequently called
coefficients of stability.
This technique is also referred to as the method of equivalent forms. With this
technique, we administer form A to a group of individuals and follow this
immediately or fairly soon with form B on the same test. These two forms of the
same test are said to be parallel or equivalent, because they are made up of the same
types of item covering the same materials; they have the same means and variances;
and if one form correlates to a certain extent with some other measure, then the
other form correlates to the same degree.
An advantage of this method is that only one test is needed for the computation
of the reliability coefficient. The test papers are scored so that from every single
paper we have two scores. This is usually done by counting the number of odd-
numbered items answered correctly and the number of even-numbered items
answered correctly. Sometimes other splits are made, such as items 1 and 2 go into
the first score, 3 and 4 into the second score and 5 and 6 into the first score and so
on. Almost any split will be acceptable, except one taking the first half of the items
against the second half, because tests are usually made with the easier items first
and because students tend to complete the first half of a test and not the second, a
division of this sort will result in two tests of a different nature.
Each paper now has an even and an odd score on it, or two other types of
scores, depending on the type of split made. Again a Pearson product-moment
correlation coefficient is computed between the two sets of scores. A reliability
coefficient of this type is often called a coefficient of internal consistency. It so
happens that the reliability of a test is directly related to the length of the test.
When we scored our test on an odd-even basis, we actually cut the length of our
original test in half. The reliability coefficient we have computed is then the
equivalent of one for a test of half of the size of our original test. We make a
correction for this effect by using what is known as the Spearman-Brown formula
as follows:
rtt = __2roe___
1 + roe
Where:
rtt = the reliability of the original test
roe = the reliability coefficient obtained by correlating the scores on the
odd items with the scores of the even items
rtt = k [1 - ∑pq ]
k–1 s2
where:
k = number of items on the test
s2= the variance of the test
pq = the quantity obtained from the worksheet
Questions/Activity 15:
1. In your own understanding, discuss the techniques used in the computation
of reliability coefficients.
References:
: Webster, Allen L., (2005); Applied Statistics for Business and Economics; 2nd
Edition; Von Hoffman Press Inc. USA.
: Downie, N.M. and Robert W. Heath; Basic Statistical Methods; Harper and Row
Publishers, New York, USA
: Freund, John E. and Frank J. Williams; Business Statistics: The Modern Approach;
Sixth Edition; Prentice-Hall, Englewood Cliffs, New Jersey, USA
: Walpole, Ronald E. and Raymond H. Myers, (2003); Probability and Statistics; Fifth
Edition; Macmillian Publishing Company; New York