Professional Documents
Culture Documents
Module 5 Experimental Designs and Significance Testing PDF
Module 5 Experimental Designs and Significance Testing PDF
Module 5 Experimental Designs and Significance Testing PDF
Hospitality Management
Module 5:
Experimental Designs
& Significance Testing
2
What is Significance Testing?
3
Substantive and Statistical Significance
Becky’s Bakery Executive Auto Blue Lagoon Burt’s Fitness
• Becky sells cookies • Executive Auto’s
Cafe Centre
• Her research tells her research uncovers a • Blue Lagoon Cafe has • Burt did a survey to
that overall, customers statistically customer service find out if his new
will pay an average of significant difference forms on each table centre layout was
$0.75/cookie, but that based on age of enough of a draw to
• Results have shown charge a higher price
men will pay up to customer that customer
$0.77/cookie. • Policy owners over for membership
dissatisfaction has
• While there is a the age of 55 tend to increased by a small • He was hoping the new
statistically significant buy more expensive percentage overall format would allow him
difference between insurance packages to raise his prices
• The difference is not without losing
men and women and for luxury autos great enough to be
how much they will • Therefore, Executive members, however, the
statistically results were not
pay, she chooses to Auto decides to use significant, so
ignore it, because the age as a market statistically significant.
management chose This means that he
price difference is segmentation tool to ignore it.
minor, and she doesn’t could not expect
• In this case, the • In this case, the
want to segment her customers to see added
results are both results are not
market based on value in the new format
statistically statistically
gender. and it could be risky to
significant and significant, so they
• In this case, the results raise prices.
substantive also lack substantial
are statistically • In this case, the results
significance would have been
significant, but not
substantially substantially significant
significance to the company, but
they lacked statistical
significance.
4
THMT 3312: Research Methods for Tourism &
Hospitality Management
Hypothesis Testing
▪ It is important to state the hypotheses before attempting to test for whether or not there are
statistically significant differences in the data that would support the hypothesis
▪ Analyzing data from the sample enables researchers to calculate statistical measures to either
accept, or reject, the Null hypothesis. In essence, the statistical measure used provides evidence
which may be strong enough to reject the Null (similar to a finding of guilty in a court of law)
▪ If the data is not strong enough to show a statistically significant difference it means that the
differences observed could be due simply to chance, or to sampling error, and we cannot
conclude that a relationship exists (the default position of “innocent”)!
6
Stating Hypotheses
7
Steps in Testing Hypotheses
Always do the following
steps: The norm in business research
ranges from .01 (99% level of
confidence) to .10 (90% level of
1) State the Null and Alternate confidence), with .05 (95% level
hypotheses
of confidence) being a
“happy medium”.
2) Determine the level of
statistical significance desired for Sample: p-value (probability)
the test
9
The Decision to Accept or Reject is Based on
Test Results
In business research
the p-values used to
determine statistical P-Value in SPSS Level of
significance typically (p) Statistical
range between .01 Significance in
and .05, but they can Business
go as low as .10. Research
When analyzing data .01 .99
using statistical
software the .05 .95
significance level will .10 .90
go beyond .01 to .000
and higher levels of
statistical
significance.
10
The Driver’s Ed Example
The insurance agent is asserting that there
is a relationship between gender and Other Factors:
driving ability A=Number of km/year
B=Purchase of high
First, we should consider if this seems to be Driver’s performance sports car
reasonable. What is it about males that
would make them worse drivers that
gender
females? Generally, if the relationship (X)
sounds illogical, it is usually not a valid one.
11
THMT 3312: Research Methods for Tourism &
Hospitality Management
Experimental Design
14
The Advertising Example
Other Factors:
We can map out the relationship A=Competitors’ stock outs
between advertising and sales. B=Rise in competitors’
We should consider all of the prices
other things that could Ads
contribute to sales that have no
bearing on our advertising at all (X)
Are there other things that could
cause our sales to increase that
have nothing at all to do with our
advertising?
These other factors can We can’t be sure that
confound our analysis of the X>Y unless we are Sales
relationship between advertising controlling for other
and sales (Y)
factors that could
How do we know that advertising impact Y, like A & B
caused sales to increase and not
one of these other factors? shown above.
15
Elements of an Experimental Design
Independent Variable (IV or X): The variable whose impact you are
trying to measure (ag: Radio Ad)
An experiment is
defined as
Dependent Variable (DV or Y): The effect that you are trying to
manipulating an
measure (eg: Sales)
independent
variable to see Extraneous Variables: Influences you are trying to “weed out” that
could confound your analysis
how it affects a
dependent Experimental Group: The group that gets the IV (eg: hears the radio)
variable while
also controlling
Control Group: If used, the group that doesn’t get the IV
the effects of
additional (or Pre-Test: If used, a measure of the DV before the IV is administered
extraneous) (eg: pretest for sales)
variables
Post-Test: A measure of the DV after the IV is administered (eg: sales
measure after ad shown).
16
Two Common Experimental Designs
- Technically not a real - Often called the Classic Experimental
After-Only Design
Group
Before-After with Control
experiment because it doesn’t Design.
manipulate anything. Often called - Two interchangeable samples are
a quasi-experimental design. drawn from the population of interest.
- DV is measured after the IV - The DV is measured to get a baseline
variable has been administered reading in both groups (Pre-Test or
- Any difference in the DV is Before measure).
attributed to the IV. Sometimes, - The IV is administered to one group
extraneous factors are controlled only (the Experimental Group). The
using statistical procedures at the remaining group (the Control Group)
point of data analysis. does not experience the IV.
- This is a loose design without - The DV is measured again in both
any controls: no before measure groups (Post-Test or After measure).
(how do we know that the DV
actually changed?) -Any difference between the
Before/After measures in the
- This is what happens when Experimental Group are then adjusted
survey data are used to try to based on the differences observed in
measure causation. It can’t be the Control Group (because those
done reliably. All that you see is could not be due to the IV). Any
association, not causation. remaining difference should be due to
the IV.
17
THMT 3312: Research Methods for Tourism &
Hospitality Management
Face Validity:
Criterion Validity:
it was designed to measure to correlate they measure the
measure with another either underlying construct
The measure/scale must concurrently, or effectively
be sensitive enough to predictively, when
testing a research model A weak way to
accurately measure determine validity
subtle variances Concurrently: Those who
Example: Studies slept got a better mark
showing purchase on the exam than those
intention should be able who didn’t
to discriminate between Predictive: Those who
future purchasers and wrote the admissions
non-purchasers test and did well, also did
well in their degree
programs
19
Reliability in Data Collection
Does it produce consistent results time after time?
The test items/scale items Ensures that results are
Internally Consistent:
Overall stability:
consistently measure concepts consistent across those taking
Can do a split-half test to the survey
determine internal reliability if May require test/retest to see if
you have a battery of questions same individuals give
that you can split up and then consistent results (if possible
check how consistently both and if the first test doesn’t bias
test halves compare the follow-up test)
This is not possible for surveys Equivalent/parallel forms tests
without batteries of scaled are useful if one has two tests
questions that provide similar measures.
Can test with one, then the
next, then see if the results are
consistent across both tests.
20
Validity in Experimental Design
Validity validity
• Eg: Did the sales increase come from the radio ad,
or something else?
21
Things that Impact Validity in Experiments
External influences that happen Internal & external
History between pre/post measures that Affects validity
affect DV
Change in interviewers or
Instrumentation methodology between pre/post Affects Internal validity
DV measures
Differences in the samples selected
Selection for two-group studies (eg: Control Affects Internal validity
vs. Experimental groups)
Changes in research subjects due Internal & external
Mortality to attrition (drop outs) or deaths Affects validity
during the research project
22
Methods to Control for Extraneous Influences
in Experimental Design
Debrief
• A discussion/
Consistency Blind Method process at the
Randomization • Using a double blind end of the
• Applying
research design • Randomizing approach where both research that
and methods treatment groups the researcher and provides full
consistently (counterbalancing) the subject, or disclosure to
across all disguised approach subjects and the
• Employing methods
research (unknown to opportunity to
to prevent order bias
subjects and respondent only) do discuss the
(hard to do)
groups not know what is research
being measured experience
• May be required
for ethical
reasons
23
THMT 3312: Research Methods for Tourism &
Hospitality Management
Bivariate Analysis:
Univariate Analysis:
One Variable
Two Variables
analyzing data one variable at a analyzing two variables (IV and DV)
time at at time to better understand the
- What percent of tourists visited differences between them, or to
Peggy’s Cove? examine association (survey data),
or causation (experimental data)
- What is the average household
expenditure on electricity? - Who prefers a particular brand of
shoes: men or women? (Shoes =
- This kind of analysis involves DV, Gender = IV)
producing counts (frequencies),
percentages, and means for the - Are there significant differences in
questions asked in a survey and expenditure on vitamins for adults
examining the data over 55 and adults aged 55 or
younger? (Vitamin expenditure =
- This kind of analysis also involves DV, Age = IV)
creating confidence intervals
around the sample statistics to - This kind of analysis involves
determine what the population testing hypotheses using test
parameter would be as part of statistics suitable for the level of
statistical inference data measurement
25
Applying the Level of Data Measurement and
Statistical Significance
Significance tests differ depending the level of data measurement. Data
must be classified as categorical or continuous. The classification, which
is based on the four levels of data measurement, appears below.
26
Significance Tests for Doing Bivariate Analysis
What does your
hypothesis measure?
Does one variable influence another? How variables are related, or how they differ, but not
(association/causation) without association/causation?
(IV & DV) (no IV/DV)
27
Module Review:
1. What is the difference between association and
causation?