Professional Documents
Culture Documents
Unit 4 Chi Square Test WR
Unit 4 Chi Square Test WR
Unit 4 Chi Square Test WR
Chi-square distribution
Let x1,x2,….xnbe a random sample form a normal polulation with variance 2 set
n
xi x ns 2
the null hypothesis H 0 : 2 02 .Then The Test Statistic is 2 ( )2 where
i 1 0 n2
ns 2
s2 is the variance of the sample .Then 2 = defined above follows a 2 -distribution with
n2
n-1 degree of freedom.
(O E ) 2
2 -test statistic of goodness of fit is defined by 2 =
E
1. The exact shape of the distribution depends upon the no of degree of freedom n. In general
where „n‟ is small the shapes of the curve is skewed to the right and γ gets larger, the
distribution becomes more and more symmetrical.
2. To test the “goodness of the fit”. It is used to determine whether an actual sample
distribution
1
Dr. S. VIMAL KUMAR
Department of Mathematics KGiSL Institute of Technology SFM / NOTES
2. The total individual frequencies (of no of observations in the sample must reasonably
3. No individual frequency should be less than 5,if any frequency is less than 5,then it is
poled
(that it is fairly small area) with the preceding (or) succeeding frequency so that the polled
frequency is more than 5.Finally adjust for the d.f lost in pooling.
Independence of attributes:
Problems
1. Five coins are tossed 256 times. the number of heads observed is given below.examine
if the coins are unbiased by employing 2 goods of fit.
No.of heads: 0 1 2 3 4 5
Frequence : 5 35 75 84 45 12
Sol:
(O E ) 2
2=
E
On the assumption H0,the expected frequencies are given the terms of N(q+p)n=256( )5
2
Dr. S. VIMAL KUMAR
Department of Mathematics KGiSL Institute of Technology SFM / NOTES
256
[5C 0 5C 1 5C 2 5C 3 5C 4 5C 5 ]
32
=8[1+5+10+10+5+1]
2. The demand for a particular spare part in a factory was found to vary from day-to-
day .In a sample study the following information was obtained.
Test the hypothesis that the number of parts demanded doesn’t depend on the day of
the week.
Sol:
H0: the no.of parts demanded doesnot depend on the day of week.
1
Expected no of frequency = [1124 1125 1110 1126 1120 1115] 1120
6
3
Dr. S. VIMAL KUMAR
Department of Mathematics KGiSL Institute of Technology SFM / NOTES
Hence the no. of parts demanded doesnot depend on the day of week.
3. Two sample polls of votes for two candidates A and B for a public office are taken one
from among the residents of rural areas. The results are given in the adjoining table
.examine whether the nature of te area is related to voting preference in this election.
Under the null hypothesis that the nature of the area is independent the votimgpreferenece in
the election.we get the expected frequencies as follows.
1170 1000
E(620) = 585,
2000
830 1000
E(380) = 415
2000
1170 1000
E(550) = 585
2000
830 1000
E(450) = 415
2000
Thus we conclude that the nature of area is related voting preference in the election .
4. Two researches A and B adopted different techniques while rating the student’s level.
Can you say that the techniques adopted by them are significant?
4
Dr. S. VIMAL KUMAR
Department of Mathematics KGiSL Institute of Technology SFM / NOTES
Sol :
H0: There is no. significant difference between te sampling techniques used by the 2
researches for collecting te required data.
H1: There is significant difference between te sampling techniques used by the 2 researches
for collecting the required data.
We have
126 200
E(86)= 84,
300
93 200
E (60) 62
300
69 200
E (44) 46
300
Since we cannot apply the χ2 test straight way here as the last frequency is less than 5.we
should use the technique of pooling in this case as given below.
5
Dr. S. VIMAL KUMAR
Department of Mathematics KGiSL Institute of Technology SFM / NOTES
| 2 | , so we accept H0.
2
Digits: 0 1 2 3 4 5 6 7 8 9 total
Frequency:6 16 15 10 12 12 3 2 9 5 90
2. The following figures show the distribution of digits in numbers chosen at random from a
directory.
Digits 0 1 2 3 4 5 6 7 8 9 Total
Frequency 1026 1107 997 966 1075 933 1107 972 964 853 10000
Test whether the digits may be taken occur equally frequently in the directory.
3. A die was thrown 498 times denoting x to be the number appearing on the top face of it. The
observed frequency of x is given below.
x 1 2 3 4 5 6
F 69 78 85 82 86 98
What opinion you would form for the occuracy of the die.
4. The table below gives the number of aircraft accident that occurred during various days of the
week. Test whether the accidents are uniformly distributed over the week.
Days Mon Tue Wed thurs Fri Sat
No.of 14 18 12 11 15 14
accidents
Difficult problems
1. A survey of 320 family with 5 children each revealed the following information.
6
Dr. S. VIMAL KUMAR
Department of Mathematics KGiSL Institute of Technology SFM / NOTES
No.of boys 5 4 3 2 1 0
No. of 0 1 2 3 4 5
girls
No.of 14 56 110 88 40 12
families
Is this result consistent with the hypothesis that male and female birth are equally probable?
2. Four coins were tossed 160 times and the following results were obtained.
No of heads : 0 1 2 3 4
Observed frequencies: 17 52 54 31 6
Under the assumption on that coins are balanced, find the expected frequencies of getting
0, 1, 2,
3, 4 heads and test the goodness of fit.
6. Evaluation Strategy:
1. Find if there is any association between extravagance in fathers and extravagance in sons
from
the following data.
Extravagant father Miserly father
Extravagant son 327 741
Miserly son 545 234
Determine the coefficient of association also.
2. On the basis of information noted below, find out whether the new treatment is
comparatively
Superior to the conventional one.
Favourable Non favourable Total
Conventional 40 70 90
New 60 30 110
Total 100 100 N=200
3. In an investigation into the health and nutrition of two groups of children of different social
status, the following results are got.
7. Flowchart:
Χ2-test
7
Dr. S. VIMAL KUMAR
Department of Mathematics KGiSL Institute of Technology SFM / NOTES
Find observed
Find observed frequencies
frequencies (Oi) &
(Oi) & expected
expected frequencies
frequencies (Ei)
(Ei)
If the calculated value is less than (<) table value ,accept the null hypothesis
If the calculated value is greater than (>) table value reject the null hypothesis
8
Dr. S. VIMAL KUMAR