Download as pdf or txt
Download as pdf or txt
You are on page 1of 19

ANIL SC

JNN COLLEGE OF ENGINEERING, SHIVAMOGGA

𝐉𝐎𝐈𝐍𝐓 𝐏𝐑𝐎𝐁𝐀𝐁𝐈𝐋𝐈𝐓𝐘 𝐀𝐍𝐃 𝐉𝐎𝐈𝐍𝐓 𝐃𝐈𝐒𝐓𝐑𝐈𝐁𝐔𝐓𝐈𝐎𝐍


If X and Y are two discrte random variables, we define joint probability function of X and Y
by P(X = x, Y = y) = f(x, y) where f(x, y) ≥ 0, ∑ ∑ f(x, y) = 1
x y

Y y1 y2 ................................ yn sum
X
x1 J11 J12 J1n f(x1 )
x2 J21 J22 J2n f(x2 )
.
.
.
xm Jm1 Jm2 Jmn f(xm )
sum g(y1 ) g(y2 ) g(yn ) 1

Two random variables X and Y are said to be independent if f(xi )g(yj ) = Jij

 μx = E(X) = ∑ xf(x)

 μy = E(Y) = ∑ yg(y)

 E(XY) = ∑ xi yj Jij

 Variance σx 2 = E(X 2 ) − μx 2 , σy 2 = E(Y 2 ) − μy 2

 Cov(X, Y) = E(XY) − μx μy

cov(X,Y)
 Correlation ρ(X, Y) =
σx σy
ANIL SC
JNN COLLEGE OF ENGINEERING, SHIVAMOGGA

1) The joint distribution of two random variables X and Y is as follows

Y −4 2 7
X
1 1/8 1/4 1/8
5 1/4 1/8 1/8

Compute i) E(X), E(Y), E(XY) ii)σx , σy d) Cov(X, Y)

Distribution of X,

xi 1 5
f(xi ) 1/2 1/2

Distribution of Y,

yj -4 2 7
g(yj ) 3/8 3/8 1/4

1 1
E(X) = ∑ xf(x) = 1 (2) + 5 (2) = 3 = μx

3 3 1
E(Y) = ∑ yg(y) = −4 (8) + 2 (8) + 7 (4) = 1 = μy

1 1 1 1 1
 E(XY) = ∑ xi yj Jij = 1(−4) (8) + 1(2) (4) + 1(7) (8) + 5(−4) (4) + 5(2) (8) +
1
5(7) (8) = 3/2

 σx 2 = E(X 2 ) − μx 2
1 1
E(X 2 ) = ∑ x 2 f(x) = 12 ( ) + 52 ( ) = 13
2 2

σx 2 = E(X 2 ) − μx 2 = 13 − 9 = 4
 σx = 2

 σy 2 = E(Y2 ) − μy 2
3 3 1 79
E(Y 2 ) = ∑ y 2 g(y) = (−4)2 ( ) + 22 ( ) + 72 ( ) =
8 8 4 4

79 75
σy 2 = E(Y2 ) − μy 2 = −1=
4 4
σy = 4.33
ANIL SC
JNN COLLEGE OF ENGINEERING, SHIVAMOGGA
3 3
 Cov(X, Y) = E(XY) − μx μy = (2) − (3)(1) = − 2
3
cov(X, Y) (− )
ρ(X, Y) = = 2 = −0.1732
σx σ y 2(4.33)

2) The joint probability distribution table for two random variables X and Y is as follows

Y -2 -1 4 5
X
1 0.1 0.2 0 0.3
2 0.2 0.1 0.1 0
Detemine marginal probability distribution of X and Y.

Also compute a) Expectation of X, Y and XY b) SD of X, Y c) cov(x, y), correlation of X, Y


Furthur verify X and Y are dependent random variables. Also find P(X + Y > 0).

Distribution of X,
xi 1 2
f(xi ) 0.6 0.4

Distribution of Y,
yj -2 -1 4 5
g(yj ) 0.3 0.3 0.1 0.3

E(X) = ∑ xf(x) = 1.4 = μx


E(Y) = ∑ yg(y) = 1 = μy

 E(XY) = ∑ xi yj Jij = 0.9

 σx 2 = E(X 2 ) − μx 2 = 0.24
σx = 0.49

 σy 2 = E(Y2 ) − μy 2 = 9.6
σy = 3.1

 Cov(X, Y) = E(XY) − μx μy = −0.5


cov(X, Y)
ρ(X, Y) = = −0.3
σx σy

If X and Y are independent random variables then f(x1 )g(y1 ) = J11


f(x1 )g(y1 ) = 0.6 × 0.3 = 0.18 ≠ (J11 )ie 0.1

 X and Y are dependent random variables.


ANIL SC
JNN COLLEGE OF ENGINEERING, SHIVAMOGGA

P(X + Y > 0) = J13 + J14 + J22 + J23 + J24 = 0 + 0.3 + 0.1 + 0.1 + 0 = 0.5

3) Random variables X and Y have the following joint distribution

Y -3 2 4
X
1 0.1 0.2 0.2
2 0.3 0.1 0.1
i) Detemine marginal probability distribution of X and Y ii)Cov(XY )iii) correlation of X, Y
Are the variables X and Y independent random variables?

xi 1 3
f(xi ) 0.5 0.5

yj -3 2 4
g(yj ) 0.4 0.3 0.3

Cov(X, Y) = −1.2 ρ(X, Y) = −0.4, dependent

4) The joint distribution of two random variables X and Y is given by

Y 2 3 4
X
1 0.06 0.15 0.09
2 0.14 0.35 0.21
Determine the marginal distribution of X and Y. Also verify X and Y are independent.

xi 1 3
f(xi ) 0.3 0.7

yj 2 3 4
g(yj ) 0.2 0.5 0.3

If X and Y are independent random variables f(xi )g(yj ) = Jij

(0.3)(0.2) = 0.06 (0.3)(0.5) = 0.15 (0.3)(0.3) = 0.09

(0.7)(0.2) = 0.14 (0.7)(0.5) = 0.35 (0.7)(0.3) = 0.21 X and Y are independent


ANIL SC
JNN COLLEGE OF ENGINEERING, SHIVAMOGGA

5) The joint distribution of two random variables X and Y is given by

Y -3 2 4
X
1 0.1 0.2 0.2
3 0.3 0.1 0.1
Find i) marginal distribution of X and Y ii) covariance of X and Y.

xi 1 3
f(xi ) 0.5 0.5

yj -3 2 4
g(yj ) 0.4 0.3 0.3

E(X) = ∑ xi f(xi ) = 2, E(Y) = ∑ yj g(yj ) = 0.6 , E(XY) = ∑ xi yj Jij = 0


Cov(X, Y) = E(XY) − E(X)E(Y) = −1.2

6) The joint probability distribution of two discrete random variables X and Y is given
by the table. Determine the marginal distributions os X and Y. Also find whether X and Y
are independent.
(Ans ∶ independent)
Y 1 3 6
X
1 1/9 1/6 1/18
3 1/6 1/4 1/12
6 1/18 1/12 1/36

7) Suppose X and Y are independent random varibles with the following respective distribution
of X and Y. Also verify that Cov(X, Y) = 0

xi 1 2 yj -2 5 8
f(xi ) 0.7 0.3 g(yj ) 0.3 0.5 0.2

Since X and Y are independent

Y -2 5 8 f(xi )
X
1 J11 J12 J13 0.7
2 J21 J22 J23 0.3
g(yj ) 0.3 0.5 0.2

J11 = f(x1 )g(y1 ) = 0.7 × 0.3 = 0.21

J12 = f(x1 )g(y2 ) = 0.7 × 0.5 = 0.35


ANIL SC
JNN COLLEGE OF ENGINEERING, SHIVAMOGGA

J13 = f(x1 )g(y3 ) = 0.7 × 0.2 = 0.14

J21 = f(x2 )g(y1 ) = 0.3 × 0.3 = 0.09

J11 = f(x2 )g(y2 ) = 0.3 × 0.5 = 0.15

J11 = f(x2 )g(y3 ) = 0.3 × 0.2 = 0.06

Y -2 5 8 f(xi )
X
1 0.21 0.35 0.14 0.7
2 0.09 0.15 0.06 0.3
g(yj ) 0.3 0.5 0.2 1

Cov(X, Y) = E(XY) − E(X)E(Y)

E(X) = ∑ xi f(xi ) = 1.3, E(Y) = ∑ yj g(yj ) = 3.5, E(XY) = ∑ xi yj Jij = 4.55

Cov(X, Y) = 0
1 1 1
8) X and Y are independent random variables, X takes values 2,5,7 with probability 2 , 4 , 4
1 1 1
respectively. Y takes values 3,4,5 with probability , ,
3 3 3
a) Find the joint probability distribution of X and Y
b) Show that the covariance of X and Y is equal to zero
c) Find the probability distribution of Z = X + Y
Given that

xi 2 5 7 yj 3 4 5
f(xi ) 1/2 1/4 1/4 g(yj ) 1/3 1/3 1/3

a) The joint distrubution table is


Y 3 4 5 f(xi )
X
2 1/6 1/6 1/6 ½
5 1/12 1/12 1/12 ¼
7 1/12 1/12 1/12 ¼
g(yj ) 1/3 1/3 1/3

b) Cov(X, Y) = E(XY) − E(X)E(Y) = 16 − (4.4) = 0


c) Z = X + Y
Zi = Xi + Yi {Zi } = {5,6,7,8,9,10,11,12}
1 1 1 1 1 1 1 1 1
The corresponding probabilities are , , , , , , , ,
6 6 6 12 12 12 6 12 12
ANIL SC
JNN COLLEGE OF ENGINEERING, SHIVAMOGGA

Z 5 6 7 8 9 10 11 12
P(z) 1/6 1/6 1/6 1/12 1/12 1/6 1/12 1/12

9) The joint probability distribution of two discrete random variables X and Y is given by
by f(x, y) = k(2x + y)where x and y are such that 0 ≤ x ≤ 2, 0≤y≤3
a) find the value of k
b) find the marginal probability distribution of X and Y
c) Show that the random variables X and Y are dependent.

X = {xi } = {0,1,2}, Y = {yj } = {0,1,2,3}, f(x, y) = k(2x + y)

Y 0 1 2 3 sum
X
0 0 k 2k 3k 6k
1 2k 3k 4k 5k 14k
2 4k 5k 6k 7k 22k
sum 6k 9k 12k 15k 42k

a) 42k = 1 hence k = 1/42


b)
xi 0 1 2 yj 0 1 2 3
f(xi ) 6/42 14/42 22/42 g(yj ) 6/42 9/42 12/42 15/42
c) f(xi )g(yj ) ≠ Jij , the random variables are dependent.

10) Given the following joint distribution of the random variable X and Y, find the corresponding
marginal distribution. Also compute the covariance and the correlation of the random variables
X and Y.

Y 1 3 9
X
2 1/8 1/24 1/12
4 1/4 1/4 0
6 1/8 1/24 1/12

xi 2 4 6 yj 1 3 9
f(xi ) 1/4 1/2 1/4 g(yj ) 1/2 1/3 1/6

E(X) = ∑ xi f(xi ) = 4, E(Y) = ∑ yj g(yj ) = 3 , E(XY) = ∑ xi yj Jij = 12


Cov(X, Y) = E(XY) − E(X)E(Y) = 0

cov(X, Y)
ρ(X, Y) = =0
σx σy
ANIL SC
JNN COLLEGE OF ENGINEERING, SHIVAMOGGA

𝐒𝐀𝐌𝐏𝐋𝐈𝐍𝐆 𝐓𝐇𝐄𝐎𝐑𝐘
A large collection of individuals or attributes or numerical data is referred as population or
Universe.

A finite subset of the population or universe is called a sample. The number of individuals in
a sample is called sample size. The process of selecting the sample from the population is
called Sampling.

𝐒𝐀𝐌𝐏𝐋𝐈𝐍𝐆 𝐃𝐈𝐒𝐓𝐑𝐈𝐁𝐔𝐓𝐈𝐎𝐍: Suppose we have different samples of size n. Their mean, SD etc
will be different. If these charectersics are grouped according to their frequency , the
frequence distribution so obtained are called Sampling distributions.
The sampling distribution is approximately a normal distribution.

𝐓𝐄𝐒𝐓𝐈𝐍𝐆 𝐇𝐘𝐏𝐎𝐓𝐇𝐄𝐒𝐈𝐒: Hypothesis is a dicision making statement, which is true or false

There are 2 types of hypothesis

1) Null hypothesis 2) Alternate hypothesis

1) The hypothesis formulated for the purpose of regection, under the assumption that it is
true is called null hypothesis. It is denoted by H0

2) Any hypothesis which is complimentary to the null hypothesis is called alternate hypothesis,
denoted by H1

𝐄𝐑𝐑𝐎𝐑𝐒: If a hypothesis is rejected while it shold have been acceted, we say that type I error
has been made.

If a hypothesis is accepted while it should have been rejected, we say that a type II error has
been made.

𝐂𝐑𝐈𝐓𝐈𝐂𝐀𝐋 𝐑𝐄𝐆𝐈𝐎𝐍: A region which amounts to the rejection of null hypothesis is called
critical region or region of rejection.

𝐋𝐄𝐕𝐄𝐋 𝐎𝐅 𝐒𝐈𝐆𝐍𝐈𝐅𝐈𝐂𝐀𝐍𝐂𝐄: The probability level below which we reject the hypothesis is
known as level of significance. The procedure which enables us to decide wheather to accept
or reject the hypothesis is called test of significance.
ANIL SC
JNN COLLEGE OF ENGINEERING, SHIVAMOGGA

𝐎𝐍𝐄 𝐓𝐀𝐈𝐋𝐄𝐃 𝐀𝐍𝐃 𝐓𝐖𝐎 𝐓𝐀𝐈𝐋𝐄𝐃 𝐓𝐄𝐒𝐓𝐒: If the extreme values to both side of the mean are
considered, the region of significance will be on both sides of the distrubution. Such a test is
called two tailed test.

If the extreme values to only one side of the mean is considered, the region of significance
will be on one side of the distrubution. Such a test is called one tailed test.

Test Critical values of z


5% level 1% level
One tailed test -1.645 and 1.645 −2.33 and 2.33
Two tailed test 1.16 and 1.96 −2.58 and 2.58

x̅ − μ
𝐓𝐄𝐒𝐓𝐈𝐍𝐆 𝐎𝐅 𝐏𝐎𝐏𝐔𝐋𝐀𝐓𝐈𝐎𝐍 𝐌𝐄𝐀𝐍(𝛍): z =
σ/√n

Where x̅ → sample mean, μ → population mean, σ → S. D

1) The mean life time of sample of 100 bulbs produced by a company is computed to be
1570 hours with SD 120 hours. If μ is the life time of all bulbs produced by the company, test
the hypothesis that μ = 1600 hours against the alternate hypothesis μ ≠ 1600 hours
using LOS as 0.01

LOS is 0.01 ie 1% (2.58)

n = 100, x̅ = 1570, π = 1600, σ = 120

x̅ − μ
z= = −2.5
σ/√n

|z| = 2.5 < 2.58, Ho is accepted

2) It has been found from experience that the mean breaking strength of a particular brand
of thread is 275.6 gms wigh SD of 39.7 gms. Recently a sample of 36 pieces of thread showed
a mean breking strength of 253.2 gms. Can one conclude at a significance level of a) 0.05
b) 0.01 that the thread has become inferior.

H0 : μ = 275.6 gms, mean breaking strength

H1 : μ < 275.6 gms, inferior in breaking strength

We choose one tailed test.

n = 36, μ = 275.6, x̅ = 253.2, σ = 39.7,

The difference x̅ − μ = 22.4

x̅ − μ
z= = 3.38
σ/√n
ANIL SC
JNN COLLEGE OF ENGINEERING, SHIVAMOGGA

The value of z is greater than the critical value of z = 1.645 at 5% level and 2.33 at

1% level of significance.

H0 is rejected, H1 that the thread has become inferior is accepted at both 0.05 and 0.01 levels

in accordence with one tailed test.

3) In an examination given to students at a large number of different schools , the mean


grade was 74.5 and SD grade was 8. At one particular scool where 200 students took the
examination , the mean grade was 75.9. Discuss the significance of this result from the
view point of a) one tailed test b) two tailed test at both 5% and 1% LOS

H0 : μ = 74.5 and there is no change in the mean grade.

H1 : μ ≠ 74.5

We choose one tailed test.

n = 200, μ = 74.5, x̅ = 75.9, σ = 8,

The difference x̅ − μ = 1.4

x̅ − μ
z= = 2.475
σ/√n

Test z0.05 z0.01


one tailed ±1.645 ±2.33
two tailed ±1.96 ±2.58

The value of z is greater than the z0.05 , z0.01 in one tailed test as well as z0.05 in two tailed test.

We conclude that the difference in the mean grade is significant in these tests but the same is

not significant in the two tailed test at 1% LOS


ANIL SC
JNN COLLEGE OF ENGINEERING, SHIVAMOGGA

𝐒𝐓𝐔𝐃𝐄𝐍𝐓 ′ 𝐒 𝐭 𝐃𝐈𝐒𝐓𝐑𝐈𝐁𝐔𝐓𝐈𝐎𝐍

Sir William Gosset(Pen name Student)derived a distribution to test the significance of a


sample mean where the small sample is drawn from a normal population

Let xi be a small sample of size n, the static t is defined as follows:

x̅ − μ 1 1
1) t = where x̅ = ∑ xi and s 2 = ∑(xi − x̅)2
s n n − 1
( )
√n

 Number of degrees of freedom is the number of values in a set which may be assigned
arbitrarily. If n is the given sample size , then df = n − 1
 If |t| > t 0.05 , the difference between x̅and μ is said to be significant at 5% LOS
If |t| > t 0.01 , the difference between x̅and μ is said to be significant at 1% LOS
 If |t| is less than the table value at a certain LOS, the data is said to be conformal or
consistent with the hypothesis that μ is the mean proportion

x̅1 − x̅2 n1 s 21 + n2 s 2 2
2) t = where s 2 = or
1 1 n1 + n2 − 2
s√(n + n )
1 2

1 2
s2 = [∑(xi − x̅)2 + (yj − y̅) ]
n1 + n2 − 2

 If |t| > t 0.05 the difference between the sample means is said to be significant
at 5% LOS

 If |t| > t 0.01 the difference between the sample means is said to be significant
at 1% LOS

 Confidence limits for the population mean μ:


s
95% CI for μ: x̅ ± t 0.05
√n

s
99% CI for μ: x̅ ± t 0.01
√n

1) Ten individuals are chosen at random from a population and their heights in inches are
found to be 63,63, 66,67,68,69,70,70,71,71. Test the hypothesis that the mean height of the
universe is 66 inches. (t 0.05 = 2.262 for 9 df)
∑x 678
μ = 66, n = 10, x̅ = = = 67.8
n 10
ANIL SC
JNN COLLEGE OF ENGINEERING, SHIVAMOGGA
1 1
s 2 = n−1 ∑(xi − x̅)2 = 9 [(63 − 67.8)2 + ⋯ … (71 − 67.8)2 ] = 9.067, s = 3.011

x̅ − μ x̅ − μ 67.8 − 66
t= = √ n= √10 = 1.89 < 2.262
s s 3.011
( )
√n
Hypothesis is acceted at 5% LOS

2) Ten individuals are chosen at random from a population and their heights in inches are
found to be 63,63,64,65,66,69,69,70,70,71. Discuss the suggestion that the mean height of the
population is 66 inches. (t 0.05 = 2.262 for 9 df)

(Ans: t = 2.02; Hypothesis is accepted at 5% LOS)

3) A certain stimulus administered to each of the 12 patients resulted in the following


change in blood pressure 5,2,8, −1,3,0,6, −2,1,5,0,4. Can it be concluded that the stimulus
will increase the blood pressure? (t 0.05 = 2.201 for 11 df)

Let us suppose that the stimulus administered is not accompanied with increase in blood
pressure, we can take μ = 0

∑ x 31
x̅ = = = 2.5833,
n 12
1 1
s2 = ∑(xi − x̅)2 = [(5 − 2.5833)2 + ⋯ … (4 − 2.5833)2 ] = 9.538, s = 3.088
n−1 11
x̅ − μ x̅ − μ
t= = √n = 2.8979 > 2.201
s s
( )
√n

The hypothesis is rejected at 5%LOS. We conclude with 95% confidence that the stimulus in

general is accompanied with increase in blood pressure.

4) A machine is expected to produce nails of length 3 inches. A random sample of 25 nails


gave an average length of 3.1 inch with standard deviation 0.3. Can it be said that the
machine is producing nails as per specifications? (t 0.05 for 24 df is 2.064)

μ = 3, n = 25, x̅ = 3.1 s = 0.3

x̅ − μ x̅ − μ
t= = √n = 1.67 < 2.064
s s
( )
√n

Hypothesis is acceted at 5% LOS, the machine is producing nails as per specifications


ANIL SC
JNN COLLEGE OF ENGINEERING, SHIVAMOGGA

5) A sample of 10 measurements of the diameter of a sphere gave a mean of 12cm and a


SD 0.15 cm. Find the 95% confidence limits for the actual diameter.

n = 10 , x̅ = 12, s = 0.15

t 0.05 for 9 df = 2.262

s 0.15
95% CI for μ: x̅ ± t 0.05 = 12 ± 2.262 ( ) = 11.893 and 12.107
√n √10

Thus 11.893 cm to 12.107 cm is the confidence limits for the actual diameter.

6) A sample of 11 rats from a central population had an average blood viscocity of 3.92 with
a SD of 0.61. On the basis of this sample , establish 95% fiducial limits for μ the mean blood
viscocity of the central population . (t 0.05 for 10 df = 2.228)

n = 11, x̅ = 3.92 , s = 0.61

s 0.61
95% CI for μ: x̅ ± t 0.05 = 3.92 ± 2.228 ( ) = 3.51 and 4.33
√n √11
Thus 95% confidence limits for μ are 3.51 and 4.33

7) Two type of batteries are tested for their length of life and the following results were
obtained.

Battery A: n1 = 10 , x̅1 = 500, σ1 2 = 100

Battery A: n2 = 10 , x̅2 = 440, σ2 2 = 121

Compute student ′ s t and test whether there is a significant difference in the two means

2
n1 s21 + n2 s2 2 (10 × 100) + (10 × 121)
s = = = 122.78, s = 11.0805
n1 + n2 − 2 18

x̅1 − x̅2 60
t= = = 12.1081, greater than table value of t for
1 1 11.0805√(0.1 + 0.1)
s√(n + n )
1 2

18 df at all levels of significance.

The null hypothesis that there is no significant difference in the two means is rejected at all
LOS
ANIL SC
JNN COLLEGE OF ENGINEERING, SHIVAMOGGA

8)A group of boys and girls were given an intelligent test. The mean score and SD and
numbers in each group are as follows.
Boys Girls
Mean 74 70
SD 8 10
n 12 10

Is the difference between the means of the two groups significant at 5% LOS?
. (t 0.05 for 20 df = 2.086)

n1 = 12 , x̅1 = 74, s1 = 8

n2 = 10 , x̅2 = 70, s2 = 10

n1 s21 + n2 s2 2 (12 × 64) + (10 × 100)


s2 = = = 88.4, s = 9.402 ≈ 9.4
n1 + n2 − 2 20

x̅1 − x̅2 74 − 70
t= = = 0.994 < 2.086
1 1 1 1
s√(n + n ) 9.4√(12 + 10)
1 2

Thus the hypothesis that there is a difference between the means of the two groups is

accepted at 5% LOS

9) A group of 10 boys fed on a diet A and another group of 8 boys fed on a different diet B
for a period of 6 months recorded the folowing increase in weights(lbs)

Diet A: 5 6 8 1 12 4 3 9 6 10

Diet B: 2 3 6 8 10 1 2 8

Test whether diets A and B differ significantly regarding their effect on increase in weight.

∑ x1 64
n1 = 10 , x̅1 = = = 6.4,
n1 10

∑ x2 40
n2 = 8 , x̅2 = = =5
n2 8

1 2
s2 = [∑(xi − x̅)2 + (yj − y̅) ] = 11.525, s = 3.395
n1 + n2 − 2

x̅1 − x̅2 1.4


t= = = 0.86935 ≈ 0.87
1 1 1 1
s√(n + n ) 3.395√(10 + 8)
1 2

But t .05 for 16 d. f = 212 from the tables, t = 0.87 is less than the table value at 5%
ANIL SC
JNN COLLEGE OF ENGINEERING, SHIVAMOGGA

LOS.

We conclude that the two diets do not differ significantly regarding their effect on increase
in weight.

10) Two horses A and B were tested according to the time (in seconds) to run a particular race
with the following results.

Horse A: 28 30 32 33 33 29 34

Horse B: 29 30 30 24 27 29

Test whether you can discriminate between the two horses.

∑ x1 219
n1 = 7 , x̅1 = = = 31.3,
n1 7

∑ x2 169
n2 = 6 , x̅2 = = = 28.2
n2 6

1 2
s2 = [∑(xi − x̅)2 + (yj − y̅) ] = 5.2973, s = 2.3016
n1 + n2 − 2

x̅1 − x̅2
t= = 2.42
1 1
s√(n + n )
1 2

But t .05 = 2.2 and t .02 = 2.72

> t .05 = 2.2


t = 2.42 {
< t .02 = 2.72

The discrimination between the horses is significant at 5% level but not at 2% LOS.
ANIL SC
JNN COLLEGE OF ENGINEERING, SHIVAMOGGA

𝐂𝐡𝐢 − 𝐬𝐪𝐮𝐚𝐫𝐞 𝐝𝐢𝐬𝐭𝐫𝐢𝐛𝐮𝐭𝐢𝐨𝐧


It provides a measure of correspondence between the theoretical and observed frequencies.
n
2
(Oi − Ei )2
χ =∑
Ei
i=1

 If the expected frequencies are less than 10, we group them suitably
1) A die is thrown 264 times and the number appearing on the face(x)follows the following
frequency distribution.

x 1 2 3 4 5 6
f 40 32 28 58 54 60
2
Calculate χ

264
The expected number of frequency for each face is = 44
6

Oi 40 32 28 58 54 60
Ei 44 44 44 44 44 44

n
2
(Oi − Ei )2 (40 − 44)2 (32 − 44)2 (60 − 44)2
χ =∑ = + + ⋯… = 22
Ei 44 44 44
i=1

2) Five dice were thrown 96 times and the numbers 1,2 or 3 appearing on the face of the dice
follows the frequence distribution as follows.
No of dice showing 1,2 or 3 5 4 3 2 1 0
frequency 7 19 35 24 8 3
Test the hypothesis that the data follows a binomial distribution.
(χ2 0.05 = 11.07 for 5 d. f)

3
Probability of a single dice throwing 1,2 or 3 = = 0.5, q = 0.5 and n = 5
6

1 x 1 5−x 1 5 5Cx
p(x) = 5Cx ( ) ( ) = 5Cx ( ) = , these 5 dice thrown 96 times
2 2 2 32
96
f(x) = 96p(x) = 5C = 3 × 5Cx
32 x

Oi 7 19 35 24 8 3
Ei 3 15 30 30 15 3
ANIL SC
JNN COLLEGE OF ENGINEERING, SHIVAMOGGA
n
2
(Oi − Ei )2
χ =∑ = 11.7 > χ2 0.05 = 11.07
Ei
i=1

Thus the hypothesis that the data follows a binomial distribution is rejected.

3) A sample analysis of examination results of 500 students was made. It was found that 220
students had failed, 170 had secured third class , 90 had secured third class and 20 had
secured first class. Do these figures support the general examination result which is in the
ratio 4: 3: 2: 1 for the respective catagories. (χ2 0.05 = 7.81 for 3 d. f)

Let us take the hypothesis that these figures support to the general result in the ratio

4:3:2:1

 The expected frequencies in the respective catagories are 200, 150, 100, 50
4 3 2 1
( × 500 = 200, × 500 = 150, × 500 = 100, × 500 = 50)
10 10 10 10

Oi 220 170 90 20
Ei 200 150 100 50

n
2
(Oi − Ei )2
χ =∑ = 23.67 > χ2 0.05 = 7.81, the hypothesis is rejected.
Ei
i=1

4) 4 coins are tossed 100 times and the following results were obtained. Fit a binomial

distribution for the data and test the goodness of fit. (χ2 0.05 = 9.49 for 4 d. f)

Number of heads 0 1 2 3 4
frequence 5 29 36 25 5

n = 4 and μ = np for binomial distribution,

∑ fx 0 + 29 + 72 + 75 + 20
μ= = = 1.96, μ = np implies p = 0.49 and hence q = 0.51
∑f 100

p(x) = 4Cx (0.49)x (0.51)4−x , these 4 coins tossed 100 times

F(x) = 100 × 4Cx (0.49)x (0.51)4−x

F(0) = 7, F(1) = 25.999 ≈ 26 F(2) = 37.47 ≈ 37, F(3) = 24, F(4) = 5.765 ≈ 6

Oi 5 29 36 25 5
Ei 7 27 37 24 6
ANIL SC
JNN COLLEGE OF ENGINEERING, SHIVAMOGGA
n
2
(Oi − Ei )2
χ =∑ = 1.15 < χ2 0.05 = 9.49 ,
Ei
i=1

The hypothesis that the fitness is good is accepted.

5) Fit a poisson distrubution for the following data and test the goodness of fit given that
χ2 0.05 = 7.815 for 3 d. f

x 0 1 2 3 4
f 122 60 15 2 1

∑ fx 0 + 60 + 30 + 6 + 4
μ= = = 0.5, μ = m
∑f 200

mx e−m
p(x) = , the expt is conducted 200 times
x!
mx e−m
F(x) = 200
x!
F(0) = 121, F(1) = 61, F(2) = 15, F(3) = 3, F(4) = 0

Oi 122 60 15 2+1=3
Ei 121 61 15 3+0=3

n
2
(Oi − Ei )2
χ =∑ = 0.025 < χ2 0.05 = 7.815 .
Ei
i=1

The hypothesis that the fitness is good is accepted.

6)The number of accidents per day (x)as recorded in a textile industry over a period of 400
days is given below. Test the goodness of fit in respect of Poisson distribution of fit to the
given data (χ2 0.05 = 9.49 for 4 d. f)
x 0 1 2 3 4 5
f 173 168 37 18 3 1

∑ fx 0 + 168 + 74 + 54 + 12 + 5
μ= = = 0.7825, μ = m
∑f 400

mx e−m
p(x) = , the data collected over 400 days
x!
mx e−m
F(x) = 400
x!
ANIL SC
JNN COLLEGE OF ENGINEERING, SHIVAMOGGA

F(0) = 183, F(1) = 143, F(2) = 56, F(3) = 15, F(4) = 3, F(5) = 0

Oi 173 168 37 18 3+1=4


Ei 183 143 56 15 3+0=3

n
2
(Oi − Ei )2
χ =∑ = 12.297 > χ2 0.05 = 9.49 .
Ei
i=1

The hypothesis that the fitness is good is rejected.

You might also like