Professional Documents
Culture Documents
Chi-Square Test: Advance Statistics
Chi-Square Test: Advance Statistics
CHI-SQUARE
TEST
N O N PA R A M E T R I C T E S T S
OUT L INE OF TO PICS
Introduction to Chi-Square
Terms
NO N PAR AM E TR IC TE ST
A test in which no constant of a population is used. Data do not
follow any specific distribution and no assumption are made in
these test. E.g. to classify good, better, and best we just allocate
arbitrary numbers or marks to each category.
HY POT HE SIS
DE GR E E OF FR EE D OM
It denotes the extent of independence (freedom) enjoyed by a
given set of observed frequencies. Suppose we are given a set
of n observed frequencies which are subjected to k independent
constraints (restrictions) then,
Terms To
Take Note
C hi -S quar e ( X2) S ta ti st i cs
A chi-square (X2) statistic is a test that measure how a model compares to actual
“Chi-Square Statistics”
observed data.
The data used in calculating statistic must be random, raw, mutually exclusive,
drawn from independent variables, and drawn from a large enough sample. For
Introduction to
example, the result of tossing a fair coin meet these criteria.
Chi-square test are often used in hypothesis testing. This statistics compares the
size of any discrepancies between the expected results and the actual results,
given the size of sample and the number of variables in the relationship.
For these tests, degrees of freedom are utilized to determine if a certain null
hypothesis can be rejected based on the total number of variables and samples
within the experiment. As with any statistic, the larger the sample size, the more
reliable the result.
C hi -S quar e ( X2) S ta ti st i cs
Chi-Square Test
Application of
Two Main Kinds
of
Chi-Square Test
Chi-Square Test
that we are interested in, then we would not want to use this sample
to draw conclusions about the larger population.
Main Kind of
Example
Chi-Square Test
The chi-square test formula for goodness of fit is:
Main Kind of
Chi-Square Test
square test is useful.
Main Kind of
In such a situation, we proceed with the null hypothesis that
t h e t w o a t t r i b u t e s ( Vi z . , n e w m e d i c i n e a n d c o n t r o l o f f e v e r )
are independent which means that new medicine is not
e f f e c t i v e i n c o n t r o l l i n g f e v e r.
Somer’s Delta
Somer’s Delta:
Computation
Based on the computation. There are
Yate’s
Correction of
Continuity:
Computation
Fisher-Ir wi n E xa ct Te st
It is a test for independence in a 2 X 2 table. It
is most useful when the total number size and
the expected value are small. The test holds the
marginal total fixed and computes the
hypergeometric probability that N11 is at least
as large as the observed value
M1 = a + b, M2 = c + d, N1 = a + c, N2 = b +
d, for convenience assume N1 < N2, M1 < M2.
possible value of a are: 0, 1, …min (M1, N1).
Example
N = a + b + c + d = N1 + N2 = M1 + M2
Fisher-Irwin
Exact Test:
Formula
Fisher-Ir wi n E xa ct Te st
Is HIV Infection related to Hx of STDs in Sub
Saharan African Countries? Test at 5% level.
Fisher-Irwin
Probability of observing this specific table
given fixed marginal totals is
Exact Test:
Pr (3, 7, 5, 10) = 10!15!8!17!/[25!3!7!5!10!]
Computation
= 0.3332
Note the above is not the p-value. Why?
Not the accumulative probability, or not the
tail probability.
Tail probability = sum of all value (a = 3, 2, 1,
Exercises
An instructor makes out his final grades for 200 A die is thrown 132 times with following results. Is
students in his subject Introduction to Statistics. the die unbiased?
He is curious to see if his grade distribution
No. Observed Computed (E-O) (E-O) 2 (E-O) 2
resembles the “normal curve” and notes from the Turned Up Frequency Frequency E
3 1
1 3