Professional Documents
Culture Documents
NCU Best Practices Quantitative Research
NCU Best Practices Quantitative Research
First Edition.
Published by the Center for Teaching and Learning, Northcentral University, 2020
Contributors:
John Bennett, Mary Dereshiwsky, Robert Dodd, David Fogarty, John Frame, Raymie
Grundhoefer, Larry Hughes, Sharon Kimmel, Vicki Lindsay, Edward Maggio, Gordon
McClung, NCU Library Team, Susan Petroshius, Lonnie K. Stevans, Gergana Velkova,
Steve Ziemba
In addition to the collaborative process that engendered this guide, it was also informed by
the quantitative methods and statistics courses in the School of Business.
For comments or suggestions for the next edition, please contact John Frame: jframe@ncu.edu
2 Back to Table of Contents
TABLE OF CONTENTS
Foreword
Introduction
Research Ethics and the IRB
Research Questions
Four Main Designs
Population and Sample
Sampling Method, Sample
Design, and Sample Size
Surveys and Questionnaire Design
Pilot Study
Datasets
Analyzing Secondary Data
Observational Research
Multivariate vs. Univariate Analysis
Measurement of Variables
Descriptive Statistics and Exploratory Data Analysis (EDA)
Inferential Statistics
Alpha Level (level of significance, or p-value)
Hypotheses
Hypothesis Diagrams
Hypothesis Testing
T-Test
Analysis of Variance (ANOVA)
ANOVA Examples
Correlation
Regression Analysis
Factor Analysis
Power (Statistical Power)
Power Analysis
Measuring Validity and Reliability
Internal/External Validity
Selection of Parametric vs. Nonparametric Techniques
Presentation of Statistical Results and Explaining Quantitative Findings in a Narrative Report
Back to Table of Contents 3
Foreword
Dear School of Business Community,
Welcome to the Best Practice Guide for Quantitative Research Design and Methods in Dissertations!
With well over 700 doctoral students in the School of Business working on their dissertation this year, this
guide serves as an important resource in helping us shape and implement quality doctoral-level research.
Its primary purpose is to offer direction on quantitative research in School of Business dissertations, serving
students as they craft and implement their research plans, and serving faculty as they mentor students and
evaluate research design and methods in dissertations.
We encourage you to explore this guide. It is filled with details on important topics that will help ensure
quality and consistency in quantitative research in the School of Business.
Thank you to the faculty and staff of the School of Business and wider NCU community that worked to cre-
ate this guide. It is a great contribution to our School, and each of these individuals played an important
role in its development.
The SAGE handbook of social research meth- A sample in a dissertation needs to be a substantial
ods (2008). London, United Kingdom: SAGE number (see “Power Analysis” in this guide), and
Publications, Ltd. https://doi-org.proxy1.ncu. should be determined based on best practices in
edu/10.4135/9781446212165 quantitative research. Students should be aware
that quantitative research demands a suitable
Sage Methods Map Online Tool: http://methods.
amount of data, and that the response rate from
sagepub.com.proxy1.ncu.edu/methods-map
samples (such as the response rate for surveys),
will typically be very low. Thus, a large number of
persons will need to be surveyed in order to obtain
Population and Sample an adequate amount of data.
The population represents the totality of units under
study, or to whom we wish to generalize or project After sampling, it is possible to generalize the
sample results to the population from which it was
Back to Table of Contents 6
selected. Here are some commonly applied ways to fraction. The items in the list should be scrambled
select a sample in quantitative research: in random order before beginning to cycle through
that list.
Simple random sample:
Every element of the target population has an equal Sampling Method, Sample
chance of being selected for the sample. This is es-
Design, and Sample Size
pecially valuable when doing experimental studies.
In most cases, a researcher will not contact every-
Stratified sample: one in the population (a census), but rather take a
subset of the population, a sample. While the selec-
In this sampling method, it is recognized that there
tion of a sample can involve a non-random (non-
is not one overall homogeneous population, but,
probability) procedure, in most quantitative studies
instead, subpopulations where the subgroups differ
researchers strive to use a probability procedure
from one another. For example, a researcher may
in which every unit in the population has a known
want to see if there is a significant difference in an
chance of being included in the sample. There are
average number of units of Product X purchased by
a number of alternative ways to generate a ran-
men and women. The researcher would subdivide,
dom sample that may vary over time, with respect
or ‘stratify,’ the overall population into men vs.
to cost, and the amount of information needed to
women (strata or subgroups), and randomly select
draw the sample. The researcher often needs a list
a sample from each gender to ensure it is ade-
of the population in order to select the sampling
quately represented in the overall sample.
elements (a sampling frame) and has to determine
Cluster sample: the size of the sample as well.
Sample Size
There are four factors involved in calculating sample size:
Statistical test: the sample size is partly a function of the statistical test used. Some
tests (e.g., Chi-squared) require larger samples to detect a difference than others
(e.g., ANCOVA).
Expected/estimated effect size: the effect size is potency of the strength of the rela-
tionship being investigated. In the language of statistics, an effect size is a difference
between the mean scores of two groups divided by the pooled standard deviation.
This is called ‘Cohen’s d’. A researcher will calculate an effect size as part of the
analysis of the data in order to determine that something meaningful has been found
(not merely statistically significant). However, in advance of doing a study, a
researcher must estimate the effect size in the study.
Alpha: the alpha level is the probability of a Type I error—of rejecting the null
α hypothesis when it is true. By convention, this is set at . It is best to use the literature,
as well as good judgment, to justify an alpha level that makes sense for a study. This
justification will involve looking at the danger of a Type I error versus the cost in
resources of avoiding it. Given this, the most common used α levels are .01, .05,
and .10.
Beta: the beta level is the probability of a Type II error of accepting the null
β
hypothesis when it is false. In other words, of failing to detect a difference when
one exists. As with alpha, a researcher sets beta based upon a judgment. The
convention is .2, which yields a power of .8 (1– β) acceptable level (see “Power
Analysis” in this guide).
* The wording of each question should be exclusive Finally, remember that actual behavior cannot be
and exhaustive. This means that if it is a closed-end- measured via a survey; a survey only measures
ed question, all of the possible answers must reports of behavior.
appear in the answers. In addition, the answers
should not overlap each other. For example, if there References and/or Suggested Reading:
is a question that asks someone’s age, and the
Bryman, A. (2006). Integrating quantitative and
X
participants are from 18 through 90, the student
qualitative research: How is it done? Qualitative
should not offer these as answers:
Research, 6(1), 97-113.
• Quandl - “Time-series” numerical only data for Another factor to consider is how the data was
economics, finance, markets & energy; Features categorized or coded in the dataset. This may
13 Back to Table of Contents
influence how data analysis will take place. For 2015). Three types of observational research are
example, the data may have been modified in discussed below.
some manner, or the full range of data needed for
a study may be spread across different categories
of the dataset. Measurement error in the dataset,
whether or not that bias was intentional (i.e., a
dataset put out by a political party) may also be
present, and needs to be considered (see “Data- Naturalistic Observation
sets” in this guide for additional information about This form of research occurs in the everyday setting
datasets and secondary data). of participants (as they live life normally). Thus,
there is no intervention by the researcher to influ-
Descriptive statistics (See “Descriptive Statistics and ence the environment (McLeod, 2015).
Exploratory Data Analysis (EDA)” in this guide), as
the term suggests, will involve the summarization of
data. For example, this may include the range of
data, number of data points, mean, median, and
mode, standard deviation, and 95% confidence
interval. As explained in the “Descriptive Statistics Participant Observation
and Exploratory Data Analysis (EDA)” section of In participant observation, the experimenter in-
this guide, this type of analysis, while important, is volves him/herself in the environment of the partic-
not sufficient for a doctoral-level dissertation. ipant, for example, as a member of a group. The
purpose of this is to focus on observing participant
Inferential statistics involve subjecting the data to behaviors that may not otherwise be discoverable
statistical tests, such as for significance. A common by the researcher. Such participant observations
inferential test involves the detection of a statistical- can either be covert or overt in terms of the knowl-
ly significant correlation between two sets of data edge or awareness made to the other participants.
from a dataset. Other types of statistical techniques The advantage to this form of observational re-
that are often applied to datasets include analysis search is that it results in a greater insight into the
of variance, regression analysis, logistic regression, participants (McLeod, 2015).
etc. The type of statistical test applied depends on
the research question and the nature of the study
(see “Inferential Statistics” in this guide).
References and/or Suggested Reading:
https://www.managementstudyguide.com/second-
ary_data.htm Controlled Observation
This form of observational research is often used by
Observational Research universities or labs and is carried out under spe-
cifically designed conditions. Such conditions are
Observational research is a form of non-experimen-
discussed by the researcher in detail, and designed
tal research in which a researcher observes ongo-
with an attention to detail. Participants experience
ing behavior in their chosen surroundings (Sauro,
the same situation so that their reactions can be
Back to Table of Contents 14
monitored. A key advantage of this form of obser- The following figure depicts this:
vational method is that the study is reproducible
(McLeod, 2015). CEO POPULATION: CEO
Experience ALL FIRMS WITH Gender
NUMBER OF
References and/or Suggested Reading:
EMPLOYEES > 100
CIRT (Center for Innovation in Research and Teach-
ing) Grand Canyon University (n.d.). Observation-
al method. Retrieved from https://cirt.gcu.edu/
research/developmentresources/research_ready/
descriptive/observational
Corporate CEO
Sauro, Jeff (2015). 4 types of observational research. Assets $ Compensation $
MeasuringU. Retrieved from
https://measuringu.com/observation-role/
The variance is not easily interpretable, since its for- this is to plot a histogram and examine its shape.
mula consists of sums of squares. A more interpreta- This link can be helpful for how to create a histo-
ble measure of dispersion is the standard deviation, gram in SPSS: https://www.youtube.com/results?-
which is calculated by taking the squared root of search_query=SPSS+25+descriptive+statistics+his-
the variance. Simply put, the standard deviation togram.
tells one how far, on average, each data value lies
from the average (mean). In Finance and Econom- If the histogram looks “bell-shaped,” then the data
ics, standard deviation is used as a measure of probably came from a Normal Distribution. How-
error or “risk.” A larger standard deviation is as- ever, sometimes looks are deceiving, so it is more
sociated with more error or higher risk (e.g., stock precise to test whether or not the data comes from
market volatility). (or looks like) a Normal Distribution. Different
tutorials consisting of how to test for the presence or
It is also important to know what the overall pop- absence of a Normal Distribution are available at
ulation (or sample) looks like (e.g., is it Normally this link: https://www.youtube.com/results?search_
Distributed or something else?). The best way to see query=SPSS+25+normal+distribution+testing.
Hypothesis Diagrams
References and/or Suggested Reading:
One of the overall goals of quantitative research is
Cramer, D., & Howitt, D. (2004). The SAGE dictio- to seek theories that focus on possible relationships
nary of statistics. London: SAGE Publications, Ltd. among variables. A diagram is an effective method
to demonstrate the hypothetical pathways (relation-
ships) involved in a research project. It can assist
Hypotheses both the author/researcher and reader to follow the
intended suppositions.
Hypotheses are analytical statements (using popula-
tion parameters) about the relationships outlined in Variables can have numerous different types of
the research question. Hypotheses should be close- relationships. For example, variable relationships
ly related to, and well aligned with, the research can be casual, conditional, reciprocal, symmetrical,
questions guiding the study. spurious, or controlling. When presenting a visual
representation of these relationships, pathways are
typically diagrammed, such as moderating, mediat-
Each research question, with the exception of de-
ing, and confounding variable connections.
scriptive research questions, contains a minimum of
two hypotheses: the null hypothesis and the alterna- Example of diagramming a mediating relationship:
tive hypothesis (sometimes referred to as a research
hypothesis).
MEDIATING
The null hypothesis is the hypothesis that the re- VARIABLE
searcher would like to disprove. This is the hypoth-
eses to be rejected, or nullified. Null hypotheses
for comparative research questions typically state INDEPENDENT DEPENDENT
that the population means of two or more groups VARIABLE VARIABLE
are the same. Null hypotheses for correlational
Hypothesis Testing
Hypothesis testing is the primary means for making
T-Test
decisions based on statistical testing. Hypotheses The T-test is used to analyze whether or not there is
are declarative statements that outline relationships a significant difference between the means of two
or comparisons to be tested in a research study. groups. T-tests are hypothesis testing tools which
The null hypothesis is the core idea in hypothesis give the researcher the ability to test an assumption
testing. The null hypothesis is the hypothesis to be about a population, and are used to determine if
rejected, or nullified. The steps in hypothesis testing there is a significant difference between the means
are: of two groups. The process of finding differences
between two sets of data are analyzed through the
1. State the null and alternative hypotheses (also use of the t-statistic, t-distribution, and degrees of
referred to as a research hypothesis), using only freedom.
population parameters.
The goal is to test the null hypothesis that there is
2. Determine if a one-tailed or two-tailed test should
no statistically significant difference between the
be used. Note: if the alternative hypothesis
means of two groups. If the null hypothesis is reject-
contains a greater than (>) or less than (<) sign,
ed, then there is a statistically meaningful difference
then a one-tailed test should be used. Conversely,
between whatever is being compared.
A Two (or More)-Way ANOVA is a more complex design in which there are two or more independent
variables measured over two or more groups. Consider an operations manager of a financial services firm
who is interested in surveying how committed the firm’s workers are to their jobs. The manager would like
to test that men and women have different levels of commitment and also that people over 40 years of age
have a different level of commitment than those under 40. The manager may use a Two-Way ANOVA in
order to compare two independent variables: gender and over-under 40. The result of this analysis will
lend empirical support for or against the manager’s belief.
In the above example, since both gender and over-under 40 contain two (2) groups, there are four (4)
means (2 x 2) to consider and test for differences. However, this testing is confounded when the presence
of “interaction” is considered. To keep it simple, in the two-variable case (Two-Way ANOVA considered
above), interaction would exist if the effect that being male or female had on commitment depended upon
whether a worker was over 40 or not (or vice versa). A full factorial ANOVA includes both main effects
(gender and above-below 40) and interaction effects (the interaction between gender and above-below 40).
23 Back to Table of Contents
It is important to note that in the absence of inter- References and/or Suggested Reading:
action, there are only two mean differences to test
Kerlinger, F., and Lee, H.B. (2000). Foundations of
for the dependent variable commitment—one due
behavioral research (4th ed.). Orlando, Florida:
to gender and the other from being over-under 40. Hartcourt College Publishers.
However, when interaction is present, there are six
(6) such mean differences to consider (in our ex- Saunders, M., Lewis, P., Thornhill, A. (2015). Re-
search methods for business students (7th ed.) Essex,
ample—see if you can work it out). So, interaction,
England: Pearson Education Unlimited.
while being more realistic from a research perspec-
tive--is also more complex. That is why the first step
in a Two (or More)-Way ANOVA is to test the null Correlation
hypothesis of the absence of interaction, which, if
Correlation is a measure of the degree of “linear”
accepted, would make things a lot easier.
relationship between (or amongst) variables. The
most common is known as Pearson Product Mo-
Here is a link to several video tutorials on how to ment Correlation, noted in the literature as “r.”
do One (or More)-Ways ANOVAs in SPSS: Correlation ranges from -1 (perfectly inverse/neg-
https://www.youtube.com/results?search_que- ative correlation) to a +1 (perfectly direct/positive
ry=SPSS+anova correlation). A correlation that hovers around zero
indicates that variables are not linearly related.
Another common instance of ANOVA, known as
Analysis of Covariance (ANCOVA), is to control Correlation is only a measure of the “linear” or
straight-line relationship between variables. Cor-
for, or “hold constant,” interval or ratio scale vari-
relation does not measure the degree of a non-lin-
ables. For example, continuing with the sex/age
ear relationship. Moreover, researchers must be
example, say the manager thinks that the level of
careful when drawing conclusions using correla-
education might matter in this analysis. He/she
tion, as it does not assume causation (correlation
could use ANCOVA techniques to compare women
does not imply causation).
and men and both age groups, controlling for their
level of education. In this way, any statistical effects There are three types of correlation:
of educational level are removed from the analysis;
1. simple correlation (between one “dependent”
and the manager gets a “purer,” (so to speak), indi-
variable, Y, and one “explanatory” variable, X);
cation of the effects of sex and age on commitment
2. multiple correlation (between one “dependent”
to the employer.
variable, Y, and many “explanatory” variables
X1, X2, X3,…);
Here is a link to several video tutorials on how to
do ANCOVA in SPSS: 3. canonical correlation (between many “depen-
dent” variables and many “explanatory” vari-
https://www.youtube.com/results?search_que-
ables).
ry=SPSS+ancova
Simple correlation is calculated between two vari-
ables. Multiple correlation is computed between
It is important to note that correlation tells us not only the strength of a linear relationship
(close to -1 or +1), but also the direction.
In other words, a positive simple correlation indicates that increases in the X variable is associated with
increases in the Y variable (and vice versa). A negative simple correlation tells us that increases in the X
variable is associated with decreases in the Y variable (and vice versa).
This “scatterplots” (below) show, graphically, the different strengths and directions of linear relationships that can
exist between two variables. The straight blue line going through the points depicts the “linear” relationship.
CORRELATION
(INDICATES THE RELATIONSHIP BETWEEN TWO SETS OF DATA)
MANN
2-SAMPLE
WHITNEY
T-TEST TEST
KRUSKAL
ONE-WAY WALLIS,
MOOD’S
ANOVA
MEDIAN
TEST
FACTORIAL FRIEDMAN
DOE TEST
Below are some general guidelines for applying nonparametric statistical tests to data:
If one’s analysis includes two independent samples, If one’s analysis includes three or more independent
and the data are: samples, and the data are:
Nominal: consider Chi-square test or Fisher Nominal: consider Chi-square test.
exact test. Ordinal: consider Kruskal-Wallis one-way
Ordinal: consider Wilcoxon-Mann-Whitney test analysis of variance.
or Kolmogorov-Smirnov two-sample test.
If one’s analysis includes measuring relationships,
If one’s analysis includes matched (or related) sam- and the data are:
ples, and the data are: Nominal: consider a Phi coefficient or kappa
Nominal: consider McNemar change test. coefficient.
Ordinal: consider Wilcoxon signed ranks test. Ordinal: consider a Spearman correlation coef-
ficient or Kendall’s Tau.
References and/or Suggested Reading: For example, a study finding that job performance
Box, G. E. (2013). An accidental statistician: The life and job satisfaction share a positive correlation of
and memories of George E. P. Box. Hoboken: Wiley r = .31 is rational evidence. Such data could be
bolstered, for example, with other data or research
and Sons.
(e.g., a story of a qualitative prediction by a busi-
Lamorte, W., W. (2017). When to use a nonpara- ness guru stating that to impact job performance, a
metric test. Boston University School of Public Health manager might influence job satisfaction).
Best Practice Module, Retrieved from: http://sphweb.
bumc.bu.edu/otlt/mph-modules/bs/bs704_non- The 6th edition of the Publication Manual of the
parametric/BS704_Nonparametric2.html American Psychological Association (APA) devotes
an entire chapter to “Displaying Results” through
Riegelman, R. (2013). Studying a study and testing a
proper design and placement of tables and figures
test. (6 ed.). Baltimore: Lippincott Williams & Wilkins.
to illustrate findings. In other words, a picture truly
Whitley E., & Ball, J. (2002). Statistics review 6: is worth a thousand words if the narrative augments
Nonparametric methods. Critical Care, 6, 509-512. the displayed evidence. This is true both in a man-
Retrieved from: https://doi.org/10.1186/cc1820 uscript and in a presentation. While in a presenta-
tion, voice may be used to help illustrate the impact
of research findings. The APA manual provides
Presentation of Statistical Results important formatting and presentation guidelines
for tables and figures so that they are not cluttered
and Explaining Quantitative
and have the greatest potential for impact. Exam-
Findings in a Narrative Report ples are also provided for enhancing the visual aids
What a researcher discovers is just as important as with a well-composed narrative. An emphasis is
how they communicate it to readers. Communicat- placed on conciseness of the content of each visual,
ing data and statistical findings is an essential skill as well as standard form.
and an important element in presenting research
findings. If miscommunicated, an audience may be Because the APA understands that most of the
lost or, at a minimum, bored. Proper presentation, information stemming from a quantitative result is
done correctly, can have an enduring impact on usually manipulated by a statistical package, the
audiences—both readers and those who attend charts and graphs usually follow a Microsoft Word
presentations or doctoral dissertation defenses. document-based construction. This construction is
based on the normal APA rules of formatting, such
Data and statistics are left-brained material, in as font, size, and spacing. This can be easily found
that they tap into logical and rational information in the APA 6th edition manual, chapter 5. Most of
processing. However, readers and audiences are the statistical packages will be able to convert these
35 Back to Table of Contents
tables or figures to APA. Even though this is noted, both in text and in charts, tables, or graphs (using
make sure that everything within a chart or figure the correct APA recommended spacing, alignment,
matches the requirements of the correct font, size, and punctuation marks for those particular statis-
and color for APA when used within the dissertation tical tests); and 3) final results, including the null
process. hypothesis testing with the probability of occur-
rence (p-value). These explanations are normally
The student must make sure that everything has con- displayed within the results section (Chapter 4) of
verted correctly and all numbers have been copied the dissertation, but they may also cross over into
over properly. Everything should also look concise Chapter 5.
and clear as it is retrieved from one chart and
re-entered into the dissertation. This may include During the explanation of the sample, the pop-
the use of the same number of decimal points and ulation, sample size, type of sampling, power
similar cell formatting within a table. Students must analysis, instrumentation, and variables should be
realize that some statistical software packages may explained using measures of central tendency and
not typically put a demographic in the same sam- tables, as needed (Creswell & Cresswell, 2018).
pling frame as a frequency. Therefore, a student These explanations of the sampling data should
must have enough forethought to work within the include measures of central tendency that are ap-
Microsoft Word program to be able to add columns propriate to the data: the range, means, medians,
and rows within table properties. modes, standard deviations, etc. This can exist in
The most needed explanations for quantitative
methodology consist of: 1) the explanation of the
sample data (descriptive analysis including inde-
pendent and dependent variables); 2) the statistics,