Professional Documents
Culture Documents
SUMSEM2022-23 BMT5113 TH VL2022230700419 2023-06-24 Reference-Material-I
SUMSEM2022-23 BMT5113 TH VL2022230700419 2023-06-24 Reference-Material-I
BMT5113
INSTITUTE-
VIT Business School
Module:4 Test of Hypothesis & Non Faculty Name: Dr. Bijay Kushwaha
Parametric Test Dr. Bijay Kushwaha, VIT BS, Vellore Institute of
Technology, Vellore, India
STUDENT LEARNING OUTCOMES (SLO)
➢ Reach a statistical conclusion in hypothesis testing problems about a
population mean with a known population standard deviation using
the z statistic.
Dr. Bijay Kushwaha, VIT BS, Vellore Institute of Technology, Vellore, India
Test of Hypothesis
Dr. Bijay Kushwaha, VIT BS, Vellore Institute of Technology, Vellore, India
Test of Hypothesis
Dr. Bijay Kushwaha, VIT BS, Vellore Institute of Technology, Vellore, India
Test of Hypothesis
• The null hypothesis states that the “null” condition exists; that is,
there is nothing new happening, the old theory is still true, the
old standard is correct, and the system is in control.
Dr. Bijay Kushwaha, VIT BS, Vellore Institute of Technology, Vellore, India
Test of Hypothesis
Dr. Bijay Kushwaha, VIT BS, Vellore Institute of Technology, Vellore, India
Test of Hypothesis
State the hypotheses and select an α level. The null hypothesis, H0,
always states that the treatment has no effect (no change, no
difference). According to the null hypothesis, the population mean after
treatment is the same is it was before treatment. The α level
establishes a criterion, or "cut-off", for making a decision about the null
hypothesis. The alpha level also determines the risk of a Type I error.
Dr. Bijay Kushwaha, VIT BS, Vellore Institute of Technology, Vellore, India
Test of Hypothesis
Locate the critical region. The critical region consists of outcomes that are
very unlikely to occur if the null hypothesis is true. That is, the critical
region is defined by sample means that are almost impossible to obtain if
the treatment has no effect. The phrase “almost impossible” means that
these samples have a probability (p) that is less than the alpha level.
Dr. Bijay Kushwaha, VIT BS, Vellore Institute of Technology, Vellore, India
Test of Hypothesis
Compute the test statistic. The test statistic forms a ratio comparing the
obtained difference between the sample mean and the hypothesized
population mean versus the amount of difference we would expect without
any treatment effect (the standard error).
A large value for the test statistic shows that the obtained mean difference
is more than would be expected if there is no treatment effect. If it is large
enough to be in the critical region, we conclude that the difference is
significant or that the treatment has a significant effect. In this case we
reject the null hypothesis. If the mean difference is relatively small, then
the test statistic will have a low value. In this case, we conclude that the
evidence from the sample is not sufficient, and the decision is fail to reject
the null hypothesis.
Dr. Bijay Kushwaha, VIT BS, Vellore Institute of Technology, Vellore, India
Errors in Hypothesis Tests
Dr. Bijay Kushwaha, VIT BS, Vellore Institute of Technology, Vellore, India
Errors in Hypothesis Tests
• A Type I error occurs when the sample data appear to show a treatment
effect when, in fact, there is none.
• In this case the researcher will reject the null hypothesis and falsely
conclude that the treatment has an effect.
• Type I errors are caused by unusual, unrepresentative samples. Just by
chance the researcher selects an extreme sample with the result that the
sample falls in the critical region even though the treatment has no
effect.
• The hypothesis test is structured so that Type I errors are very unlikely;
specifically, the probability of a Type I error is equal to the alpha level.
Dr. Bijay Kushwaha, VIT BS, Vellore Institute of Technology, Vellore, India
Errors in Hypothesis Tests
• A Type II error occurs when the sample does not appear to have been
affected by the treatment when, in fact, the treatment does have an
effect.
• In this case, the researcher will fail to reject the null hypothesis and
falsely conclude that the treatment does not have an effect.
• Type II errors are commonly the result of a very small treatment effect.
Although the treatment does have an effect, it is not large enough to
show up in the research study.
Dr. Bijay Kushwaha, VIT BS, Vellore Institute of Technology, Vellore, India
Test of Hypothesis: z Statistic ( Known)
One of the most basic hypothesis tests is a test about a population mean. A
business researcher might be interested in testing to determine whether an
established or accepted mean value for an industry is still true or in testing a
hypothesized mean value for a new theory or product.
= population mean
= population standard deviation
x̄ = Sample mean
Dr. Bijay Kushwaha, VIT BS, Vellore Institute of Technology, Vellore, India
Test of Hypothesis: z Statistic ( Known)
A survey of CPAs across the United States found that the average net income for
sole proprietor CPAs is $74,914.* Because this survey is now more than ten
years old, an accounting researcher wants to test this figure by taking a random
sample of 112 sole proprietor accountants in the United States to determine
whether the net income figure changed. Assume the population standard
deviation of net incomes for sole proprietor CPAs is $14,530. Suppose the 112
CPAs who respond produce a sample
mean of $78,695.
Dr. Bijay Kushwaha, VIT BS, Vellore Institute of Technology, Vellore, India
Test of Hypothesis: z Statistic ( Known)
= $74,914, = $14,530, x̄ = $78,695, n= 112, α=0.05
this test statistic, z = 2.75, is greater than the critical value of z in the upper tail of
the distribution, z = +1.96, the statistical conclusion reached is to reject the null
hypothesis. The calculated test statistic is often referred to as the observed value.
Thus, the observed value of z for this problem is 2.75 and the critical value of z for
this problem is 1.96.
Dr. Bijay Kushwaha, VIT BS, Vellore Institute of Technology, Vellore, India
Test of Hypothesis: z Statistic ( Known)
and Finite Population
= $74,914, = $14,530, x̄ = $78,695, n= 112, N= 600, α=0.05
Use of the finite correction factor increased the observed z value from 2.75 to 3.05.
The decision to reject the null hypothesis does not change with this new
information.
Dr. Bijay Kushwaha, VIT BS, Vellore Institute of Technology, Vellore, India
Test of Hypothesis: z Statistic ( Known)
In an attempt to determine why customer service is important to managers in
the United Kingdom, researchers surveyed managing directors of
manufacturing plants in Scotland.* One of the reasons proposed was that
customer service is a means of retaining customers. On a scale from 1 to 5,
with 1 being low and 5 being high, the survey respondents rated this reason
more highly than any of the others, with a mean response of 4.30. Suppose
U.S. researchers believe American manufacturing managers would not rate this
reason as highly and conduct a hypothesis test to prove their theory.
Alpha is set at .05. Data are gathered and the following results are obtained.
Use these data and the eight steps of hypothesis testing to determine whether
U.S. managers rate this reason significantly lower than the 4.30 mean
ascertained in the United Kingdom. Assume from previous studies that the
population standard deviation is 0.574.
The alternative hypothesis is that the population mean is lower than 4.30.
The null hypothesis states the equality case.
The critical value of the test statistic is z.05 = -1.645. An observed test statistic must
be less than -1.645 to reject the null hypothesis. Because the observed test statistic
is not less than the critical value and is not in the rejection region, the statistical
conclusion is that the null hypothesis cannot be rejected.
Dr. Bijay Kushwaha, VIT BS, Vellore Institute of Technology, Vellore, India
Testing Hypotheses: A Population Mean
Using The T Statistic ( Unknown)
When a business researcher is gathering data to test hypotheses about a
single population mean, the value of the population standard deviation is
unknown and the researcher must use the sample standard deviation as an
estimate of it. In such cases, the z test cannot be used.
df = degree of freedom
= population mean
= population standard deviation
x̄ = Sample mean
Dr. Bijay Kushwaha, VIT BS, Vellore Institute of Technology, Vellore, India
Testing Hypotheses: A Population Mean
Using The T Statistic ( Unknown)
The U.S. Farmers’ Production Company builds large harvesters. For a harvester to be
properly balanced when operating, a 25-pound plate is installed on its side. The
machine that produces these plates is set to yield plates that average 25 pounds. The
distribution of plates produced from the machine is normal. However, the shop
supervisor is worried that the machine is out of adjustment and is producing plates that
do not average 25 pounds. To test this concern, he randomly selects 20 of the plates
produced the day before and weighs them. Table 9.1 shows the weights obtained,
along with the computed sample mean and sample standard deviation.
The test is to determine whether the machine is out of control, and the shop supervisor
has not specified whether he believes the machine is producing plates that are too
heavy or too light. Thus a two-tailed test is appropriate. The following hypotheses are
tested.
Dr. Bijay Kushwaha, VIT BS, Vellore Institute of Technology, Vellore, India
Testing Hypotheses: A Population Mean
Using The T Statistic ( Unknown)
Dr. Bijay Kushwaha, VIT BS, Vellore Institute of Technology, Vellore, India
Testing Hypotheses: A Population Mean
Using The T Statistic ( Unknown)
Because the observed t value is +1.04, the null hypothesis is not rejected. Not
enough evidence is found in this sample to reject the hypothesis that the
population mean is 25 pounds.
Dr. Bijay Kushwaha, VIT BS, Vellore Institute of Technology, Vellore, India
Testing Hypotheses: A Population Mean
Using The T Statistic ( Unknown)
The researcher’s hypothesis is that the average size of a U.S. farm is more than 471 acres.
Because this theory is unproven, it is the alternate hypothesis. The null hypothesis is that
the mean is still 471 acres.
Dr. Bijay Kushwaha, VIT BS, Vellore Institute of Technology, Vellore, India
Testing Hypotheses: A Population Mean
Using The T Statistic ( Unknown)
With 23 data points, df = n - 1 = 23 - 1 = 22. This test is one tailed, and
the critical table t value is t.05,22 = 1.717. The decision rule is to reject the
null hypothesis if the observed test statistic is greater than 1.717.
The sample mean is 498.78 and the sample standard deviation is
46.94. The observed t value is
The observed t value of 2.84 is greater than the table t value of 1.717,
so the business researcher rejects the null hypothesis. She accepts the
alternative hypothesis and concludes that the average size of a U.S.
farm is now more than 471 acres.
Dr. Bijay Kushwaha, VIT BS, Vellore Institute of Technology, Vellore, India
Testing Hypotheses: A Population Mean
Using The T Statistic ( Unknown)
Dr. Bijay Kushwaha, VIT BS, Vellore Institute of Technology, Vellore, India
Analysis of variance, or ANOVA
The null hypothesis states that the population means for all treatment levels
are equal. Because of the way the alternative hypothesis is stated, if even one
of the population means is different from the others, the null hypothesis is
rejected.
Dr. Bijay Kushwaha, VIT BS, Vellore Institute of Technology, Vellore, India
Analysis of variance, or ANOVA
Dr. Bijay Kushwaha, VIT BS, Vellore Institute of Technology, Vellore, India
Analysis of variance, or ANOVA
H0: 1 = 2 = 3 = 4
H1: At least one of the means is
different from the others.
Dr. Bijay Kushwaha, VIT BS, Vellore Institute of Technology, Vellore, India
Analysis of variance, or ANOVA
Dr. Bijay Kushwaha, VIT BS, Vellore Institute of Technology, Vellore, India
Analysis of variance, or ANOVA
In the one-way ANOVA, the dfC values are the treatment (column) degrees of
freedom, C - 1. The dfE values are the error degrees of freedom, N - C. Table 11.4
contains an abbreviated F distribution table for = .05. For the machine operator
example, dfC = 3 and dfE = 20, F.05,3,20 from Table 11.4 is 3.10. This value is the critical
value of the F test. Analysis of variance tests are always one-tailed tests with the
rejection region in the upper tail. The decision rule is to reject the null hypothesis if
the observed F value is greater than the critical F value (F.05,3,20 = 3.10).
For the machine operator problem, the observed F value of 10.18 is larger than the
table F value of 3.10. The null hypothesis is rejected. Not all means are equal, so
there is a significant difference in the mean valve openings by machine operator.
Dr. Bijay Kushwaha, VIT BS, Vellore Institute of Technology, Vellore, India
Analysis of variance, or ANOVA
For the machine operator problem, the observed F value of 10.18 is larger than the
table F value of 3.10. The null hypothesis is rejected. Not all means are equal, so
there is a significant difference in the mean valve openings by machine operator.
Dr. Bijay Kushwaha, VIT BS, Vellore Institute of Technology, Vellore, India
Analysis of variance, or ANOVA
A company has three manufacturing plants, and company officials want to
determine whether there is a difference in the average age of workers at the three
locations. The following data are the ages of five randomly selected workers at
each plant. Perform a one-way ANOVA to determine whether there is a significant
difference in the mean ages of the workers at the three plants. Use α= .01 and note
that the sample sizes are equal.
H0: 1 = 2 = 3
H1: At least one of the means is different from
the others.
Dr. Bijay Kushwaha, VIT BS, Vellore Institute of Technology, Vellore, India
Analysis of variance, or ANOVA
64.87
1.63
The decision is to reject the null hypothesis because the observed F value of
39.80 is greater than the critical table F value of 6.93. There is a significant
difference in the mean ages of workers at the three plants.
Dr. Bijay Kushwaha, VIT BS, Vellore Institute of Technology, Vellore, India
CHI-SQUARE TEST
The chi-square techniques presented here for analyzing categorical data, the
chi-square goodness-of-fit test and the chi-square test of independence, are
an outgrowth of the binomial distribution and the inferential techniques for
analyzing population proportions.
Dr. Bijay Kushwaha, VIT BS, Vellore Institute of Technology, Vellore, India
Chi-square Goodness-of-fit Test
Dr. Bijay Kushwaha, VIT BS, Vellore Institute of Technology, Vellore, India
Chi-square Goodness-of-fit Test
Dr. Bijay Kushwaha, VIT BS, Vellore Institute of Technology, Vellore, India
Chi-square Goodness-of-fit Test
One survey of U.S. consumers conducted by The Wall Street Journal and NBC News
asked the question: “In general, how would you rate the level of service that
American businesses provide?” The distribution of responses to this question was as
follows:
Suppose a store manager wants to find out whether the results of this consumer
survey apply to customers of supermarkets in her city. To do so, she interviews 207
randomly selected consumers as they leave supermarkets in various parts of the city.
She asks the customers how they would rate the level of service at the supermarket
from which they had just exited. The response categories are excellent, pretty good,
only fair, and poor. The observed responses from this study are given in Table 16.1.
Now the manager can use a chi-square goodness-of-fit test to determine whether
the observed frequencies of responses from this survey are the same as the
frequencies that would be expected on the basis of the national survey.
Dr. Bijay Kushwaha, VIT BS, Vellore Institute of Technology, Vellore, India
Chi-square Goodness-of-fit Test
Dr. Bijay Kushwaha, VIT BS, Vellore Institute of Technology, Vellore, India
Chi-square Goodness-of-fit Test
n = 207, The expected proportions are given, but the expected frequencies must
be calculated by multiplying the expected proportions by the sample total of the
observed frequencies.
Thus the data gathered in the sample of 207 supermarket shoppers indicate that
the distribution of responses of supermarket shoppers in the manager’s city is
not significantly different from the distribution of responses to the national
survey.
The store manager may conclude that her customers do not appear to have
attitudes different from those people who took the survey. Figure depicts the
chi-square distribution produced by using Minitab for this example, along with
the observed and critical values.
Dr. Bijay Kushwaha, VIT BS, Vellore Institute of Technology, Vellore, India
Chi-square: Test Of Independence
The chi-square test of independence can be used to analyze any level of data
measurement, but it is particularly useful in analyzing nominal data. Suppose a
business researcher is interested in determining whether geographic region is
independent of type of financial investment.
Dr. Bijay Kushwaha, VIT BS, Vellore Institute of Technology, Vellore, India
Chi-square: Test Of Independence
The business researcher would tally the frequencies of responses to these two
questions into a two-way table called a contingency table. Because the chi-square
test of independence uses a contingency table, this test is sometimes referred to as
contingency analysis.
The null hypothesis for a chi-square test of independence is that the two variables
are independent. The alternative hypothesis is that the variables are not
independent. This test is one-tailed. The degrees of freedom are (r – 1)(c – 1).
Dr. Bijay Kushwaha, VIT BS, Vellore Institute of Technology, Vellore, India
Chi-square: Test Of Independence
Dr. Bijay Kushwaha, VIT BS, Vellore Institute of Technology, Vellore, India
Chi-square: Test Of Independence
To determine the observed value of chi-square, the researcher must compute the
expected frequencies. The expected values for this example are calculated as
follows, with the first term in the subscript (and numerator) representing the row
and the second term in the subscript (and numerator) representing the column
Dr. Bijay Kushwaha, VIT BS, Vellore Institute of Technology, Vellore, India
Chi-square: Test Of Independence
Dr. Bijay Kushwaha, VIT BS, Vellore Institute of Technology, Vellore, India
Chi-square: Test Of Independence
The observed value of chi-square, 70.78, is greater than the critical value of chi-
square, 16.8119 obtained from Table A.8. The business researcher’s decision is to
reject the null hypothesis; that is, type of gasoline preferred is not independent of
income.
Having established that conclusion, the business researcher can then examine the
outcome to determine which people, by income brackets, tend to purchase which
type of gasoline and use this information in market decisions.
Dr. Bijay Kushwaha, VIT BS, Vellore Institute of Technology, Vellore, India
THANK YOU
Dr. Bijay Kushwaha, VIT BS, Vellore Institute of Technology, Vellore, India