Professional Documents
Culture Documents
4 CHAPTER F
4 CHAPTER F
CHI-SQUARE
DISTRIBUTIONS
07/13/2024 by zerihun T. 1
Introduction
• Chi-Square analysis is used to testing hypothesis involving nominal data.
• Nominal data are also known by two names-categorical data and attribute data.
07/13/2024 by zerihun T. 2
Cont.….
• As the sample size and therefore the d.f. increases, the 2 distributions
becomes a symmetrical distribution approaching normality.
• The general shape of the 2 distributions for smaller d.f. is given in the below
graph.
07/13/2024 by zerihun T. 3
Cont.….
• Conditions for Using Chi-Square Test
• The sample observations drawn from a population must be independent and
random
• The data must be in frequency (counting) form. If the original data are in
percentages, they must be converted into frequency.
• No frequency in any cell/category must be less than 5. If the frequency is
less than 5 for a category, you have to do some regrouping
07/13/2024 by zerihun T. 4
Areas of application
The major area of application of the chi-square distribution are
the following
• Tests for independence between two variables
• Tests for the equality of several proportions
• Goodness- of fit tests
07/13/2024 by zerihun T. 5
Tests for independence between two variables
• The chi-square independence test can be used to test the independence of two variables.
07/13/2024 by zerihun T. 6
Example
• A researcher wishes to see if there is a relationship between the hospital and
the number of patient infections. A sample of 3 hospitals was selected, and the
number of infections for a specific year has been reported. The data are shown
next.
07/13/2024 by zerihun T. 8
Cont.….
• The completed table is shown.
07/13/2024 by zerihun T. 9
Cont.….
.
07/13/2024 by zerihun T. 10
Cont.….
• Step 4 Make the decision. The decision is to reject the null hypothesis since
30.518 > 9.488. See the next Figure
• Step 5 Summarize the results. There is enough evidence to support the claim
that the number of infections is related to the hospital where they occurred.
07/13/2024 by zerihun T. 11
Tests for the equality of several proportions
• In this situation, samples are selected from several different populations, and
that have a common characteristic are the same for each population.
07/13/2024 by zerihun T. 12
Example;
• Money and Happiness
A psychologist selected 100 people from each of four income groups and asked
them if they were “very happy.” The percent for each group who responded yes
and the number from the survey are shown in the table. At α = 0.05 test the
claim that there is no difference in the proportions.
Household Less than $30,000– $74,999 $75,000– $100,000 or total
income $30,000 (24%) (33%) $99,999 (38%) more (49%)
Yes 24 33 38 49 144
no 76 67 62 51 256
Total 100 100 100 100 400
07/13/2024 by zerihun T. 13
Solution
• Step 1 State the hypotheses and identify the claim.
• H0: p1 = p2 = p3 = p4 (claim)
• H1: At least one proportion differs from the others.
• Step 2 Find the critical value. The formula for the degrees of freedom is the same as before:
(R - 1)(C -1) (2 -1)(4 -1) =1(3) = 3. The critical value is 7.815.
• Step 3 Compute the test value. Since we want to test the claim that the proportions are
equal, we use the expected value as and the formula
07/13/2024 by zerihun T. 14
Cont.….
07/13/2024 by zerihun T. 15
Cont.….
•V
07/13/2024 by zerihun T. 16
Cont.….
• Step 4 Make the decision. Reject the null hypothesis since 14.149 > 7.815.
See the next Figure
• Step 5 Summarize the results. There is enough evidence to reject the claim
that there is no difference in the proportions. Hence the incomes seem to make
a difference in the proportions.
07/13/2024 by zerihun T. 17
Goodness- of fit tests
• When you are testing to see whether a frequency distribution fits a specific
32 28 16 14 10
If there were no preference, you would expect each flavor to be selected with equal
frequency. In this case, the equal frequency is 100/5 = 20. That is, approximately 20
people would select each flavor.
07/13/2024 by zerihun T. 19
Cont.….
Since the frequencies for each flavor were obtained from a sample, these actual frequencies are
called the observed frequencies. The frequencies obtained by calculation (as if there were no
preference) are called the expected frequencies. A completed table for the test is shown.
Observed 32 28 16 14 10
Expected 20 20 20 20 20
Is there enough evidence to reject the claim that there is no preference in the selection of fruit
soda flavors, using the data shown previously? Let α = 0.05.
07/13/2024 by zerihun T. 20
Solution
• Step 2 Find the critical value. The degrees of freedom are 5 -1 = 4, and α = 0.05.
Hence, the critical value from x2- distribution Table is 9.488.
• Step 3 Compute the test value by subtracting the expected value from the
corresponding observed value, squaring the result and dividing by the expected
value, and finding the sum. The expected value for each category is 20, as
shown previously.
07/13/2024 by zerihun T. 21
Cont.….
• Step 4 Make the decision. The decision is to reject the null hypothesis, since
18.0 > 9.488, as shown in net Figure
07/13/2024 by zerihun T. 22
Cont.….
• Step 5 Summarize the results. There is enough evidence to reject the claim
that consumers show no preference for the flavors.
07/13/2024 by zerihun T. 23
THE END
THANK YOU
07/13/2024 by zerihun T. 24