Professional Documents
Culture Documents
Graduate Level Statistics Compilation
Graduate Level Statistics Compilation
Definition 1: Statistical literacy involves understanding and using the basic language
and tools of statistics: knowing what basic statistical terms mean, understanding the use of
simple statistical symbols and recognizing and being able to interpret different
representations of data. They distinguish statistical literacy, statistical reasoning and
statistical thinking by examining the types of words that are useful in assessing the outcomes
for these terms. They use words such as critique, evaluate and generalize for statistical
thinking (highest levels of Bloom’s taxonomy) and terms such as describe, interpret and
read for statistical literacy. However, when elaborating their list, they write that an
assessment in statistical literacy might include such terms as student interpretation and
critique of selected news articles and media graphs as well as items pertaining to basic terms
and vocabulary. (Garfield, delMas, & Zieffler 2010).
Example 1: In the classroom, students should be able to interpret results from studies
and media reports, pose critical and reflective questions about those reports, and
communicate reactions where required. Even if students do not perform a study,
understanding statistics can help them assess the quality of other studies and the validity of
their findings Watson (2006).
Example 2: For example, exercises that ask students to compute the mean of 10
numbers or to construct a dotplot or boxplot of 20 numbers without context are arithmetic
and graphing exercises. They become statistical problems only when a context gives them
meaning and allows for interpretation. While this makes for a text that may appear “wordy”
when compared to traditional mathematics texts, it is a critical and necessary component of a
modern statistics text.
Page | 1
traditional method. At the end of the study, the researchers compared the average test scores
of the two groups.
Own example: For example, I come across a research study that claims a new
medication is effective in reducing the symptoms of a certain medical condition. Statistical
literacy would involve examining the study's methodology, understanding the statistical tests
used, and interpreting the results accurately.
List of websites:
https://doi.org/10.1080/23265507.2017.1354313
https://doi.org/10.54870/1551-3440.1538
DOI:10.2991/coema-18.2018.41
Example 2: The United States Census Bureau uses statistics to collect and analyze
data on population demographics, such as age, gender, and race (U.S. Census Bureau, n.d.).
Example 3: A researcher studying the effects of exercise on mental health might use
statistics to analyze data on participants' reported levels of anxiety and depression before and
after a 12-week exercise program.
Own Example: A business owner might use statistics to analyze sales data and
identify trends in customer behavior, such as which products are selling well and which are
not.
List of websites:
Page | 2
https://www.investopedia.com/terms/s/statistics.asp
https://www.amstat.org/ASA/Your-Career/What-is-Statistics.aspx
Definition 1: Data refers to factual information that can be used for analysis,
reasoning, or calculation (Investopedia). Variables: In the context of research, a variable is
any characteristic, number, or quantity that can be measured or counted (Segal, 2022).
Example 1: In a study on the effects of exercise on weight loss, the amount of weight
lost would be the data, and the type of exercise performed would be a variable.
Own example: In a study on the relationship between study habits and academic
performance, the number of hours spent studying would be a variable, and the grades
achieved by students would be the data.
List of websites:
https://www.investopedia.com/terms/b/big-data.asp
https://www.simplypsychology.org/variables.html
Page | 3
population, drawing conclusions, and making decisions in the presence of uncertainty
(“Basic Inferential Statistics - Purdue OWL® - Purdue University,” n.d.).
Definition 3: Summary statistics are used to summarize the data using a single
number. They include measures of central tendency, such as the mean and median, which
describe where the center of a dataset is located (2022).
Example 3: An example of summary statistics is calculating the mean and median income of
a sample population to understand the average income and the midpoint of the income
distribution (2022).
Own Example: A teacher collects data on the test scores of students in a class and
uses descriptive statistics to calculate the average score and range, and then uses inferential
statistics to test whether there is a significant difference in scores between two groups of
students (“Basic Inferential Statistics - Purdue OWL® - Purdue University,” n.d.).
List of websites
https://www.scribbr.com/statistics/descriptive-statistics/
https://owl.purdue.edu/owl/research_and_citation/using_research/writing_with_statis
tics/basic_inferential_statistics.html
https://www.statology.org/descriptive-inferential-statistics/
Definition 2: Qualitative Research: This research method involves the collection and
analysis of non-numerical data such as text, images, and videos. It is often used to explore
complex phenomena and gain a deeper understanding of people's experiences, attitudes, and
behaviors.
Page | 4
research problem. It involves collecting and analyzing both numerical and non-numerical
data to answer research questions
Example 2: Survey Research: This research method involves collecting data through
questionnaires or interviews to gather information from a sample of individuals. Surveys can
be conducted in person, over the phone, through mail, or online. Survey research is often
used to gather information about people's opinions, attitudes, behaviors, or characteristics.
For example, conducting a survey to understand public opinion on a political issue
Own Example: An overview of research methods could also cover the differences
between primary and secondary research, as well as the various approaches to data analysis
such as content analysis, thematic analysis, and statistical analysis. It could also discuss the
importance of research questions, hypotheses, and the formulation of a research design.
List of websites
https://www.scribbr.com/methodology/quantitative-research/
https://www.scribbr.com/methodology/qualitative-research/
https://www.scribbr.com/methodology/mixed-methods-research/
https://explorable.com/research-methodology
Page | 5
alternative hypothesis, collecting data, and using statistical tests to determine whether the
evidence supports or rejects the null hypothesis (Majaski, 2023).
Definition 3: Confidence intervals provide a range of values within which the true
population parameter is estimated to lie. They are calculated based on sample data and
provide a measure of the uncertainty associated with the estimate. The level of confidence
(e.g., 95%) indicates the likelihood that the interval contains the true parameter value
(Hayes, 2023).
List of websites
https://courses.lumenlearning.com/suny-hccc-research-methods/chapter/chapter-8-
sampling/
https://www.investopedia.com/terms/h/hypothesistesting.asp
https://www.investopedia.com/terms/c/confidenceinterval.asp
Page | 6
research study and providing detailed descriptions of their relevant attributes, such as age,
gender, health status, and other pertinent factors (Casteel & Bridier, 2021).
Own example: A research study aims to investigate the impact of exercise on mental
health outcomes among adolescents. The study population is defined as adolescents between
the ages of 13 and 18 years attending public high schools in a specific region. To
characterize the population, the researchers collect information on exercise habits, such as
the frequency, duration, and type of physical activity engaged in by the participants. They
also gather data on mental health outcomes, including measures of depression, anxiety, and
self-esteem. Additionally, demographic information such as gender and socioeconomic
status may be collected to further characterize the study population (Casteel & Bridier,
2021).
List of websites
https://ijds.org/Volume16/IJDSv16p339-362Casteel7067.pdf
Page | 7
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3105563/
Definition 2: Sampling principles and procedures aim to ensure that the selected
sample is unbiased and accurately reflects the characteristics of the population (Johnson,
2019).
Own example: Suppose you want to estimate the average height of students at a
university. You can use systematic sampling by selecting every tenth student from the
student registration list (Johnson, 2019).
List of websites
https://www.researchgate.net/publication/351905623_Online_1_PROBABILITY_A
ND_NON-PROBABILITY_SAMPLING_-
AN_ENTRY_POINT_FOR_UNDERGRADUATE_RESEARCHERS
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5325924/
Page | 8
Definition 3: The sampling distribution of proportions was used to estimate the
proportion of voters who supported a particular candidate based on multiple random samples
(Brown, 2020).
Example 2: A researcher conducted multiple random samples of the same size from a
population and created a sampling distribution of the sample means to estimate the
population mean (Smith, 2019).
Own Example: Suppose you want to estimate the average weight of students at a
university. You take multiple random samples of the same size and calculate the sample
mean. By examining the sampling distribution of the sample means, you can make
inferences about the population mean weight.
List of websites
https://www.ijsr.net/archive/v8i1/SUB157990.pdf
https://www.indeed.com/career-advice/career-development/what-is-sampling-
distribution#:~:text=Sampling%20distribution%20is%20a%20statistic,of%20a%20comparat
ively%20larger%20population.
Definition 1: Sample size determination formulas are statistical calculations that help
researchers determine the number of participants or observations required to achieve a
desired level of precision and confidence in their study results.
Definition 2: Sample size determination formulas take into account factors such as
the desired level of statistical power, effect size, significance level, and variability in the
population to estimate the sample size needed.
Definition 3: Sample size determination formulas can differ depending on the study
design and the type of statistical analysis planned, such as for estimating means, proportions,
or differences between groups.
Page | 9
Example 2: A survey researcher used a sample size determination formula that
considered the desired level of precision, expected response rate, and acceptable margin of
error to determine the sample size for a population estimation study.
Own example: Suppose you want to conduct a survey to estimate the proportion of
people who support a certain policy. You can use a sample size determination formula that
takes into account the desired confidence level, margin of error, and estimated proportion
from a pilot study to determine the appropriate sample size.
List of websites
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3148614/
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4148275/
https://www.ashokcharan.com/Marketing-Analytics/~sm-sample-size-comparitive-
studies.php#gsc.tab=0
Definition 2: Sample size adequacy refers to having a sufficiently large sample size
that allows for reliable and valid statistical analysis and generalization of findings to the
population (Andrade, 2020).
Page | 10
Own example: Suppose you are conducting market research on a new product. To
ensure representativeness, you randomly select participants from different geographical
regions and age groups. Additionally, you aim for a sample size of at least 500 to achieve
adequate statistical power (Cash, Isaksson, Maier, & Summers, 2022).
List of websites
https://www.investopedia.com/terms/r/representative-sample.asp
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6970301/
https://www.sciencedirect.com/science/article/pii/S0142694X21000880
Definition 1: Primary data sources are data that is collected firsthand by the
researcher for a specific purpose. This can include surveys, interviews, observations, and
experiments (Smith, 2018).
Definition 2: Secondary data sources are data that has already been collected by
someone else for a different purpose. This can include government reports, academic
journals, and industry publications (Jones, 2017).
Definition 3: Tertiary data sources are compilations and analyses of primary and
secondary sources. These can include encyclopedias, textbooks, and databases (Brown,
2019).
Example 2: A company analyzes sales data from industry reports to identify trends in
consumer behavior (Jones, 2017).
Own example: A historian collects letters and diaries from a specific time period to
gain insights into the daily lives of people during that time.
List of websites
https://hal.science/hal-03741847/document
https://crk.umn.edu/library/primary-secondary-and-tertiary-sources
Page | 11
Definition 1: Measurement and data collection issues refer to the challenges and
complexities encountered in the process of gathering, recording, and analyzing data for
research purposes (Smith, 2018).
Definition 2: Measurement and data collection issues encompass the various factors
that can impact the accuracy, reliability, and validity of collected data, including instrument
calibration, observer bias, and sampling errors (Jones & Brown, 2019).
Definition 3: Measurement and data collection issues involve the identification and
mitigation of potential sources of error or bias in the measurement and collection of data,
such as data incompleteness, measurement variability, and data misinterpretation (Garcia et
al., 2020).
Example 2: In a clinical study, a measurement and data collection issue could arise
from inconsistent application of measurement procedures by different healthcare providers,
leading to variability in recorded patient data (Chen et al., 2021).
List of Websites
https://pubmed.ncbi.nlm.nih.gov/14674574/
https://www.researchgate.net/publication/8959691_Measurement_Issues_Related_to
_Data_Collection_on_the_World_Wide_Web
Definition 1: Criteria for evaluating data quality are the standards and measures used
to assess the accuracy, completeness, consistency, and reliability of collected data, ensuring
its fitness for the intended purpose (Smith, 2018).
Definition 2: Criteria for evaluating data quality encompass the attributes and
characteristics of data, including validity, timeliness, relevance, and accessibility, that
determine its usefulness for decision-making and analysis (Jones & Brown, 2019).
Page | 12
Definition 3: Criteria for evaluating data quality involve the systematic assessment of
data against predefined standards, such as data provenance, data governance, and data
documentation, to ensure its trustworthiness and usability (Garcia et al., 2020).
Own example: In a financial analysis, a criterion for evaluating data quality may
include assessing the integrity and consistency of financial data across different accounting
periods and financial statements to ensure accurate reporting and analysis.
List of websites
https://millenniumindicators.un.org/unsd/dnss/docs-nqaf/Eurostat-
HANDBOOK%20ON%20DATA%20QUALITY%20ASSESSMENT%20METHODS%20
AND%20TOOLS%20%20I.pdf
https://www.researchgate.net/publication/318432363_Data_Quality_Considerations_
for_Big_Data_and_Machine_Learning_Going_Beyond_Data_Cleaning_and_Transformatio
ns
https://www.sciencedirect.com/science/article/pii/S0190052821000699
15. THE NATURE OF THE RESEARCH PROCESS AND THE ROLE OF STATISTICS
Definition 1: The nature of the research process refers to the systematic and
structured approach used to investigate phenomena, develop knowledge, and test
hypotheses, often involving the formulation of research questions, data collection, analysis,
and interpretation (Sirisilla & Sirisilla, 2023).
Definition 3: Statistics play a crucial role in research by providing tools for data
description, hypothesis testing, and modeling, allowing researchers to quantify uncertainty,
identify patterns, and make evidence-based conclusions (Garcia et al., 2020).
Page | 13
Example 2: In a clinical trial, an example of the role of statistics in research is the
use of statistical tests to compare treatment outcomes between different groups of patients
and assess the significance of treatment effects (Chen et al., 2021).
Own example: In social science research, the nature of the research process includes
conducting surveys, interviews, and observations to collect data, followed by the use of
statistical techniques such as regression analysis to examine relationships between variables
and test hypotheses.
List of websites
https://www.enago.com/academy/statistics-in-research-data-analysis/
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5037948/
1. Textual
2. Tabular
3. Graphical
Page | 14
Example 2: Tabular data organization can be used to present data in a structured
table format. For example, a sales report may include a table that displays the sales figures
for different products or regions over a specific time period. This allows for easy
comparison and analysis of the data.
Own example: If you are presenting sales data for different products over time, you
could use a textual format to describe the overall trends, a tabular format to compare the
sales figures for each product, and a graphical format such as a line chart to visualize the
sales performance over time.
List of websites
https://www.geeksforgeeks.org/classification-and-tabulation-of-data/
https://www.vedantu.com/commerce/presentation-of-data
Definition 1: Proportions are ratios that compare a part to a whole, while percentages
are proportions multiplied by 100. They are commonly used to express the relative size or
frequency of a subset of data in comparison to the total dataset (“Probability and Statistics
Topics Index - Statistics How To,” 2023).
Definition 3: Rates are ratios that express the frequency or occurrence of an event
over time. They are often used to compare trends or changes in data over time (“Probability
and Statistics Topics Index - Statistics How To,” 2023).
Definition 4: Indices are numerical values that represent changes or trends in data
over time. They are often used to compare data from different time periods or to track
Page | 15
changes in a specific variable over time (“Probability and Statistics Topics Index - Statistics
How To,” 2023).
Example 1: If a survey of 100 people found that 60% of them preferred coffee over
tea, the proportion of coffee drinkers is 0.6, and the percentage is 60%.
Example 3: If a car travels 200 miles in 4 hours, its average speed is 50 miles per
hour, which is a rate.
Own example: If a class of 30 students has 20 girls and 10 boys, the ratio of girls to
boys is 2:1.
List of websites
https://www.statisticshowto.com/probability-and-statistics/
https://www.investopedia.com/terms/i/index.asp
1. Central Tendency
2. Variability
3. Position
Definition 1: Central tendency refers to the measure that represents the center of a
distribution. Common measures of central tendency include the mean, median, and mode
(Bhandari, 2023).
Definition 3: Position refers to the relative location of a value within a dataset. This
can be described using percentiles, quartiles, and other positional measures (Kurniawan,
2023).
Page | 16
Example 3: An example of position is the 75th percentile, which represents the value
below which 75% of the data fall (Kurniawan, 2023).
Own example: For instance, if we have a dataset of exam scores (75, 80, 85, 90, 95), the
mean would be (75 + 80 + 85 + 90 + 95) / 5 = 85, the standard deviation would measure the
spread of these scores, and the 75th percentile would represent the score below which 75%
of the students fall. (Bhandari, 2023).
List of websites
https://www.scribbr.com/statistics/descriptive-
statistics/?fbclid=IwAR2dbbxtUhur1vnnU5kksJi9V5VpAcmMNMFF6-DjT-
gvl6aaehgFC6Cdp5I#measures-of-central-tendency
Example 2: Another example is interpreting a bar chart that displays the sales
performance of different products. By analyzing the chart, one can identify the products with
the highest sales and compare their performance to others. This information can help
businesses make strategic decisions regarding product inventory and marketing strategies
(Frost, 2021).
Page | 17
Own example: Let's say you are analyzing a scatter plot that shows the relationship
between hours studied and exam scores for a group of students. By interpreting the scatter
plot, you can determine if there is a positive or negative correlation between study hours and
exam scores, which can help identify the importance of studying for academic success
(Frost, 2021).
List of Websites
https://support.minitab.com/en-us/minitab/21/help-and-how-to/statistics/basic-
statistics/how-to/display-descriptive-statistics/interpret-the-results/all-statistics-and-graphs/
https://www.ahajournals.org/doi/full/10.1161/circulationaha.105.584474
https://learningbehavioranalysis.com/section-c-measurement-data-display-and-
interpretation/
https://statisticsbyjim.com/graphs/pie-charts/
Definition 2: Exploratory Data Analysis serves as the initial step in data analysis,
allowing for the detection of mistakes and anomalies as well as the calculation of summary
statistics. Graphical methods summarize the data in visual representations, while non-
graphical methods involve the calculation of summary statistics (“Exploratory Data
Analysis,” n.d.)
Definition 3: Exploratory data analysis is a critical step in the data analysis process.
It involves using various techniques to gain a better understanding of the dataset and its
characteristics. EDA helps in identifying patterns, relationships, and potential outliers in the
data (Mba, 2022).
Page | 18
Example 3: Exploratory data analysis can also be applied to time series data. For
instance, analyzing stock market data using graphical methods like line charts or candlestick
charts can reveal trends, patterns, and potential investment opportunities (Raj, 2022).
Own example: Let’s say you have a dataset containing information about student
performance, including variables such as study hours, test scores, and extracurricular
activities. By performing EDA on this dataset, you can explore the relationship between
study hours and test scores using scatter plots or analyze the distribution of test scores using
histograms (Raj, 2022).
List of websites
https://www.analyticsvidhya.com/blog/2021/02/introduction-to-exploratory-data-
analysis-eda/
https://www.stat.cmu.edu/~hseltman/309/Book/chapter4.pdf
https://towardsdatascience.com/an-extensive-guide-to-exploratory-data-analysis-
ddd99a03199e
https://www.ibm.com/topics/exploratory-data-analysis
https://www.analyticsvidhya.com/blog/2021/02/introduction-to-exploratory-data-
analysis-eda/
Page | 19
Example 2: Another example is conducting a hypothesis test to determine if a new
drug is effective in treating a certain condition. By collecting data from a sample of patients
and comparing it to a control group, statistical inference enables us to draw conclusions
about the drug's effectiveness in a broader population (“The Logic of Statistical Inference:
Making Statements About Populations From Sample Statistics,” 2007).
Example 3: Statistical inference can also be used in quality control processes. For
instance, by sampling a batch of products and analyzing their characteristics, statistical
inference can help determine if the entire batch meets quality standards (Weisburd & Britt,
2013).
Own example: Let's say you are interested in determining the average height of
students in a school. By collecting height measurements from a sample of students and using
statistical inference, you can estimate the average height of all students in the school
(Weisburd & Britt, 2013).
List of websites
https://link.springer.com/chapter/10.1007/978-0-387-34113-2_6
https://link.springer.com/chapter/10.1007/978-1-4614-9170-5_6
https://statisticsbyjim.com/hypothesis-testing/statistical-inference/
Definition 1: Inferential statistics refers to the process of using sample data to make
estimates or draw conclusions about a population. It involves making inferences about
population parameters based on sample statistics (Bhandari, 2023).
Definition 2: Inferential statistics can be categorized into two types: parametric and
nonparametric methods. Parametric methods assume specific distributions for the
population, while nonparametric methods do not make such assumptions. The choice of
method depends on the nature of the data and the research question (Hassan, 2023).
Page | 20
individuals and using inferential statistics, one can make inferences about the proportion of
the entire population that supports the policy (“Inferential Statistics - Definition, Types,
Examples, Formulas,” n.d.).
Example 3: Inferential statistics can also be used in clinical trials to assess the
effectiveness of a new drug. By randomly assigning participants to treatment and control
groups and using statistical tests, researchers can draw conclusions about the effectiveness
of the drug for the larger population (Hassan, 2023).
Own example: Let's say you want to estimate the average height of all students in a
school. You could collect height measurements from a random sample of students and use
inferential statistics to estimate the average height of all students in the school with a certain
level of confidence (Bhandari, 2023).
List of websites
https://www.scribbr.com/statistics/inferential-statistics/
https://researchmethod.net/inferential-statistics/
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8941155/
https://www.cuemath.com/data/inferential-statistics/
Page | 21
techniques, such as the method of moments or maximum likelihood estimation, to obtain
estimates of the population parameters (Hassan, 2023).
Own example: Let's say you want to estimate the average income of all employees in
a company. You could collect salary data from a random sample of employees, use
estimation techniques to calculate the sample mean, and construct a confidence interval for
the population mean income (Bhandari, 2023).
List of websites
https://www.stat.purdue.edu/~qfsong/teaching/511/lecture/Notes-06-point-and-
interval-estimation.pdf
https://www.britannica.com/science/interval-estimation
https://www.scribbr.com/statistics/inferential-statistics/
https://researchmethod.net/inferential-statistics/
Page | 22
data, and using statistical tests to evaluate the likelihood of observing the data under the null
hypothesis. The results of the test help determine whether to accept or reject the null
hypothesis (Bevans, 2023).
Example 1: For instance, let's say you want to test whether there is a relationship
between gender and height. Based on your knowledge of human physiology, you formulate
a hypothesis that men are, on average, taller than women. To test this hypothesis, you restate
it as follows: H0: Men are, on average, not taller than women. You would collect height data
from a sample of men and women, perform statistical tests, and examine the results to
determine whether there is evidence to support or reject the hypothesis (Bevans, 2023).
Own example: For example, you want to test whether there is a significant difference
in average sales between two different advertising strategies. You would collect data on
sales from both groups, formulate null and alternative hypotheses, perform statistical tests,
and analyze the results to determine if there is evidence to support your claim (Bevans,
2023).
List of websites
https://www.scribbr.com/statistics/hypothesis-testing/
Page | 23
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7807926/
Page | 24
either Advertisement A or Advertisement B. After the campaign, they measure the number
of purchases made by each group. Null hypothesis (H0): There is no difference in the
number of purchases between customers exposed to Advertisement A and Advertisement B.
Alternative hypothesis (Ha): There is a significant difference in the number of purchases
between customers exposed to Advertisement A and Advertisement B. After analyzing the
data, the company calculates a p-value of 0.02. Since the p-value is below the predetermined
significance level of 0.05, they conclude that there is a statistically significant difference in
the number of purchases between the two advertising strategies. This suggests that one
advertisement is more effective than the other in driving customer purchases (Moran, 2021).
List of websites
https://www.investopedia.com/terms/s/statistical-significance.asp
https://www.statisticssolutions.com/statistical-significance-cutoffs-for-p-values/
Definition 1: Choosing statistical tests involves identifying the type of data collected,
the research question being addressed, and the assumptions underlying different statistical
tests. Researchers need to consider factors such as the measurement scale, sample size, study
design, and the nature of the variables involved to determine the most appropriate test for
their analysis (“SPSS Statistics Tutorials and Statistical Guides | Laerd Statistics,” n.d.).
Definition 3: Selecting statistical tests refers to the procedure of choosing the most
suitable statistical method or test to analyze data in order to answer a research question or
test a hypothesis. It involves considering various factors such as the type of data, study
design, assumptions of the statistical test, and the nature of the research question being
investigated (Parab & Bhalerao, 2010).
Example 1: Suppose a researcher wants to compare the mean scores of two groups
on a continuous outcome variable. They have independent samples from each group. In this
case, a t-test (independent samples t-test) would be an appropriate statistical test to compare
the means and determine if there is a significant difference between the groups (Parab &
Bhalerao, 2010).
Page | 25
this type of data, a chi-square test of independence would be suitable to determine if there is
a significant association between the variables (Parab & Bhalerao, 2010).
List of websites
https://statistics.laerd.com/
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2996580/
Definition 2: Statistical significance refers to the likelihood that the observed data
would have occurred by random chance if the null hypothesis were true. It is often expressed
as a p-value between 0 and 1 (Mcleod, 2023).
Page | 26
conclude that there is evidence to support the claim that participants can identify bottled
water (“Using P-values to Make Conclusions (Article) | Khan Academy,” n.d.).
Example 2: Let's say you are conducting a study to determine if a new teaching
method improves students' test scores compared to the traditional method. You collect data
from two groups: an experimental group that receives the new teaching method and a
control group that receives the traditional method. After analyzing the data, you calculate a
p-value. If the p-value is below the significance level (e.g., 0.05), you can reject the null
hypothesis and conclude that there is evidence to support the claim that the new teaching
method leads to improved test scores (Bhandari, 2023).
List of websites
https://www.scribbr.com/statistics/p-value/
https://www.simplypsychology.org/p-value.html
https://www.khanacademy.org/math/ap-statistics/xfb5d8e68:inference-categorical-
proportions/idea-significance-tests/a/p-value-conclusions
https://www.scribbr.com/statistics/statistical-significance/
https://www.statology.org/p-values-statistical-significance/
Page | 27
significant differences in means or proportions (“Hypothesis Test: Difference in Means,”
n.d.).
Example 1: A researcher wants to compare the average income levels between two
different professions. They collect data from individuals in both professions and perform a t-
test to analyze the difference in means. If the p-value is below a predetermined significance
level, they can conclude that there is a significant difference in income levels between the
two professions (Trochim, n.d.).
Example 2: In a study examining the relationship between exercise and heart health,
researchers collect data on participants' exercise habits and measure their cardiovascular
health indicators. They use correlation analysis to determine if there is a significant
relationship between exercise frequency and cardiovascular health measures. The correlation
coefficient can indicate the strength and direction of the relationship (MSEd, 2023).
List of websites
https://stattrek.com/hypothesis-test/difference-in-means
https://www.investopedia.com/terms/r/relationship-management.asp
Page | 28
https://statistics.laerd.com/
https://conjointly.com/kb/statistical-student-t-test/
https://www.verywellmind.com/correlational-research-2795774
https://www.datacamp.com/tutorial/multiple-linear-regression-r-tutorial
Definition 2: Analysis of Variance partitions the total variation in a dataset into two
components: variation between groups and variation within groups. By comparing the ratio
of these two components, ANOVA determines if the differences between group means are
statistically significant (Kenton, 2023).
List of websites
https://stattrek.com/tutorials/anova-tutorial
https://www.investopedia.com/terms/a/anova.asp
Page | 29
https://statistics.laerd.com/spss-tutorials/one-way-anova-using-spss-statistics.php
https://www.researchgate.net/publication/243783609_The_Research_Methods_Kno
wledge_Base
https://www.simplypsychology.org/sampling.html
https://www.statology.org/hypothesis-testing-real-life-examples/
Definition 3: Statistical power analysis takes into account factors such as the desired
level of significance, effect size, sample size, and variability in the data. It helps researchers
make informed decisions about study design, sample size determination, and the likelihood
of finding significant results (“Statistical Power Analysis - Statistics Solutions,” 2021).
Page | 30
will have higher mean scores compared to Group B. To conduct a power analysis, the
researcher determines the desired level of significance (e.g., α = 0.05), the effect size they
expect to detect (e.g., Cohen's d = 0.50), and the desired statistical power (e.g., 0.80). Using
a statistical power calculator or software, the researcher performs the power analysis to
determine the required sample size for each group to achieve the desired power level
(“Cohen, J. (1988).
List of websites
https://stattrek.com/hypothesis-test/statistical-power
https://www.investopedia.com/terms/s/statistical-significance.asp
https://www.scirp.org/reference/ReferencesPapers?ReferenceID=2041144
Definition 2: The effect size can be calculated using the Pearson correlation
coefficient, which measures the linear association between two variables. The coefficient
has a value between -1 and 1, where -1 indicates a strong negative relationship, 1 indicates a
strong positive relationship, and 0 indicates no relationship (2021).
Own example: let's consider a study evaluating the effectiveness of a new teaching
method on student achievement in mathematics. The effect size could be calculated to
quantify the difference in math scores between students who received the new teaching
Page | 31
method and those who did not. This effect size would provide an indication of the practical
significance of the intervention and help interpret the findings of the study (Lipsey &
Wilson, 1993).
List of websites
https://www.scribbr.com/statistics/effect-size/
https://www.statology.org/effect-size/
https://statisticsbyjim.com/basics/effect-sizes-statistics/
https://scholar.harvard.edu/files/mkraft/files/kraft_2019_effect_sizes.pdf
Example 1: Let's consider a health sciences research question: "How does phone use
before bedtime affect sleep patterns?" In this case, the structure and planning of the
experiment would involve defining the variables (phone use and sleep patterns), selecting
appropriate measurement methods, and designing a study protocol (Bevans, 2023).
Page | 32
Own example: Suppose you want to investigate how music affects focus. The
experiment would involve selecting a group of participants, exposing them to different types
of music, and measuring their level of focus using appropriate methods (Psychology, 2023).
List of websites
https://conjointly.com/kb/
https://www.scribbr.com/methodology/experimental-design/
https://practicalpie.com/experimental-design-examples/
Page | 33
ANOVA, we can assess if there are significant differences in crop yield based on the type of
fertilizer used, the region where it was applied, or their interaction (Sapkota, 2023).
Own example: Let's say you are studying the relationship between social media use
and sleep patterns. You divide participants into low, medium, and high levels of social
media use and measure their hours of sleep per night. By using a one-way ANOVA, you can
determine if there is a significant difference in sleep duration between the three groups
(Sapkota, 2023).
List of websites
https://www.statology.org/one-way-vs-two-way-anova/
https://theintactone.com/2019/03/06/brm-u5-topic-10-analysis-of-variance-one-way-
and-two-way-classifications/
https://microbenotes.com/anova/
Page | 34
Example 3: Multiple comparison procedures can also be applied in agricultural
research. For instance, researchers may want to compare the yields of different fertilizers
applied to various crops. By using multiple comparison procedures, they can determine if
there are any significant differences in crop yields between the different fertilizer treatments
(2020).
Own example: Let's say you conducted a survey to compare the satisfaction levels of
customers who used three different brands of smartphones. You can utilize multiple
comparison procedures to determine if there are any significant differences in satisfaction
levels between the three brands (Cabral, 2008).
List of websites:
https://online.stat.psu.edu/stat500/lesson/10/10.3
https://www.ahajournals.org/doi/full/10.1161/circulationaha.107.700971
https://stats.libretexts.org/Bookshelves/Computing_and_Modeling/Supplemental_M
odules_(Computing_and_Modeling)/Regression_Analysis/Multiple_Comparison
Page | 35
Example 3: Consider a study examining the effects of different exercise programs on
physical fitness. The same group of participants could be tested for their fitness levels before
starting each program, at regular intervals during the program, and after completing each
program. By using a repeated-measurement design, researchers can evaluate changes in
physical fitness within the same group of participants across different exercise programs
(Rose, n.d.).
Own example: Let's consider a study that aims to evaluate the effectiveness of a new
diet plan for weight loss. The same group of participants could be weighed at regular
intervals before and after starting the diet plan. This repeated-measurement design would
allow researchers to track changes in weight over time within the same group of participants
(“Repeated Measures Designs: Benefits, Challenges, and an ANOVA Example,” n.d.).
List of websites:
https://stattrek.com/anova/repeated-measures/design#google_vignette
https://stattrek.com/anova/repeated-measures/design
https://psycnet.apa.org/record/2007-07583-008
https://study.com/academy/lesson/what-is-repeated-measures-design-definition-
example.html
https://blog.minitab.com/en/adventures-in-statistics-2/repeated-measures-designs-
benefits-challenges-and-an-anova-example
Page | 36
Example 1: An example of multivariate analysis of variance could be a study
investigating the effects of different teaching methods (independent variables) on the
academic performance, motivation, and self-esteem (dependent variables) of students. A
MANOVA would analyze whether there are significant differences in these dependent
variables across the teaching methods (“Reading and Understanding Multivariate Statistics,”
n.d.).
Own example: Let's say a researcher wants to investigate the effects of different
training programs (independent variables) on employee performance measures such as
productivity, job satisfaction, and teamwork skills (dependent variables). The researcher can
use multivariate analysis of variance to examine if there are significant differences in these
dependent variables across the training programs, considering their interrelationships
(Alexopoulos, 2010).
List of websites:
https://www.apa.org/pubs/books/4316510
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9044715/
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3049417/
Page | 37
Definition 3: Factorial designs are highly efficient experimental designs that allow
researchers to investigate the effects of multiple independent variables simultaneously.
These designs provide insights into the main effects and interaction effects of the variables
on the dependent variable(s) (Baker et al., 2017).
Example 1: Suppose a research study aims to investigate the effects of two factors,
such as temperature and humidity, on plant growth. A factorial design would involve
manipulating these two factors at different levels (e.g., high temperature/low humidity, low
temperature/high humidity) to observe their individual and combined effects on plant growth
(Cherry, 2023).
Example 3: Consider a marketing study investigating the effects of price (low vs.
high) and packaging design (simple vs. elaborate) on consumer preferences. A factorial
design would enable researchers to analyze the main effects of price and packaging design,
as well as any potential interaction between these factors (Cherry, 2023).
Own example: Let's say a researcher wants to investigate the effects of two factors,
such as time of day and type of task, on productivity. The researcher could manipulate the
time of day (morning vs. afternoon) and the type of task (simple vs. complex) and observe
their individual and combined effects on productivity. This factorial design would allow the
researcher to analyze the main effects of time of day and type of task, as well as any
potential interaction between these factors (Cherry, 2023)
List of websites:
https://www.explorepsychology.com/factorial-design-definition-examples/
https://link.springer.com/referenceworkentry/10.1007/978-94-007-0753-5_982
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5458623/
Page | 38
categorical independent variables while statistically adjusting for the influence of continuous
covariates (Philippas, 2014).
Own example: In this study, the independent variable would be the different teaching
methods, such as traditional lecture-based teaching, problem-based learning, or a flipped
classroom approach. The dependent variable would be the students' academic performance,
which could be measured using test scores or grades. The covariate, in this case, would be
the students' prior academic achievement, which could be measured using their previous test
scores or GPA.
List of websites:
https://link.springer.com/referenceworkentry/10.1007/978-94-007-0753-5_82
https://www.sciencedirect.com/topics/medicine-and-dentistry/analysis-of-covariance
https://www.statology.org/ancova/
Page | 39
within a study. These threats include confounding variables, selection bias, and history
effects (2023).
Definition 2: Threats to external validity are factors that limit the generalizability of
study findings beyond the specific context or sample used in the research. These threats
include population validity, ecological validity, and temporal validity (Slocum, Joslyn,
Nichols, & Pinkelman, 2022).
Definition 3: Threats to internal validity are sources of error or bias that can lead to
inaccurate conclusions about causal relationships within a study. These threats can include
maturation effects, testing effects, and regression to the mean.
Example 2: History effects can pose a threat to internal validity. These effects refer to
external events or circumstances that occur during the course of a study and may influence
the outcome variable. For example, if a study is examining the effectiveness of a new
teaching method on student performance, but during the study period there are significant
changes in the curriculum or educational policies, these external factors may confound the
results (2023).
Own example: A potential threat to both internal and external validity could be
experimenter bias. Experimenter bias occurs when the researcher's expectations or beliefs
about the outcome of the study influence their behavior or interpretation of the results. This
bias can introduce errors in data collection, analysis, and interpretation, compromising both
the internal validity by introducing confounding factors and the external validity by
potentially limiting generalizability (2023).
List of websites
https://stats.libretexts.org/Courses/Kansas_State_University/EDCEP_917%3A_Expe
rimental_Design_(Yang)/01%3A_Introduction_to_Research_Designs/1.03%3A_Threats_to
_Internal_Validity
Page | 40
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9458797/
Own example: Let's consider a study that examines the relationship between
employee satisfaction (a dependent variable) and job performance (an independent variable).
The research data could include survey responses from employees about their satisfaction
levels and objective performance metrics. Interpreting the data would involve analyzing the
correlation between employee satisfaction and job performance scores to determine if there
is a significant relationship (Ngulube, 2015).
List of websites:
Page | 41
https://www.questionpro.com/blog/data-interpretation/
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5037947/
https://www.researchgate.net/publication/278961843_Qualitative_Data_Analysis_an
d_Interpretation_Systematic_Search_for_Meaning
Definition 2: Writing a conclusion provides closure for the reader by reminding them
of the contents and importance of the research paper. It allows the reader to step back from
the specifics and view the bigger picture of the research, highlighting its relevance and
potential impact (“Walden University Writing Center,” n.d.).
Page | 42
Own example: Let's consider a study on employee satisfaction in a company. The
conclusion may summarize the research findings, such as that high levels of employee
satisfaction are associated with increased productivity and lower turnover rates. The
recommendation could be for the company to prioritize initiatives that enhance employee
satisfaction, such as improving work-life balance or providing professional development
opportunities, to foster a positive work environment (“Walden University Writing Center,”
n.d.).
List of websites:
https://libguides.usc.edu/writingguide/conclusion
https://academicguides.waldenu.edu/writingcenter/writingprocess/conclusions
https://www.sjsu.edu/writingcenter/docs/handouts/Conclusion%20Section%20for%2
0Research%20Papers.pdf
Definition 3: Research quality criteria are a set of standards that researchers must
meet to ensure that their studies produce valid and reliable results. These criteria may
include factors such as the use of appropriate research designs, the selection of
representative study samples, and the use of rigorous statistical techniques (“Help and
Support: Measuring Research Quality and Impact - Research Guide: Citation Metrics,” n.d.).
Example 1: In a quantitative research study, criteria for research quality may include
the use of a randomized controlled trial design, a large and representative sample size, and
appropriate statistical analysis techniques (Lindgreen, Di Benedetto, & Brodie, 2021).
Example 2: In a qualitative research study, criteria for research quality may include
the use of clear and consistent data collection methods, appropriate sampling techniques,
and rigorous data analysis procedures such as thematic analysis (Yadav, 2021).
Page | 43
Example 3: In a mixed-methods research study, criteria for research quality may
include the integration of both quantitative and qualitative data, appropriate sequencing of
data collection methods, and the use of appropriate statistical techniques to analyze the data
(“Help and Support: Measuring Research Quality and Impact - Research Guide: Citation
Metrics,” n.d.).
Own example: In a mixed-methods research study, criteria for research quality may
include the integration of both quantitative and qualitative data, appropriate sequencing of
data collection methods, and the use of appropriate statistical techniques to analyze the data
(“Help and Support: Measuring Research Quality and Impact—Research Guide: Citation
Metrics,” n.d.).
List of websites:
https://www.sciencedirect.com/science/article/abs/pii/S0019850121002121
https://link.springer.com/article/10.1007/s40299-021-00619-0
https://libguides.murdoch.edu.au/measure_research/citation
Definition 2: The research report is a formal document that outlines the research
process, including the research question, literature review, methodology, data analysis, and
conclusions. It provides a detailed account of the research study and its outcomes (Kabir,
2016).
Definition 3: Writing the research report involves organizing and presenting the
research findings in a logical and coherent manner. It includes sections such as an
introduction, literature review, methodology, results, discussion, and conclusion, providing a
complete picture of the research process and its outcomes (Bhat, 2023).
Page | 44
consumer survey data), recommendations (suggestions for marketing strategies based on
findings), and a conclusion (summary of key findings and their implications) (Bhat, 2023).
Own example: let's consider a research report on the impact of social media on
mental health. The report would include sections such as an introduction (background
information and research question), literature review (overview of previous studies on the
topic), methodology (data collection methods and sample selection), results (analysis of
survey data), discussion (interpretation of findings in relation to existing literature), and
conclusion (summary of key findings and recommendations for further study or
interventions) (Kabir, 2016).
List of websites:
https://researchmethod.net/research-report/
https://www.researchgate.net/publication/325546150_WRITING_RESEARCH_REP
ORT
https://www.questionpro.com/blog/research-reports/
Page | 45
Example 1: Researchers often disseminate their work through peer-reviewed journal
articles, which are considered a traditional mode of research dissemination. These articles
undergo rigorous review by experts in the field before publication (Ross-Hellauer et al.,
2020).
Example 3: social media and Online Platforms: With the rise of digital
communication, researchers are increasingly using social media platforms, blogs, and
websites to disseminate their research findings to a wider audience. These platforms allow
for real-time engagement and interaction with both academic and non-academic audiences
(Schillinger, 2010).
List of websites:
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7161944/
https://www.ahrq.gov/sites/default/files/wysiwyg/ncepcr/resources/dissemination-
quick-start-guide.pdf
https://accelerate.ucsf.edu/files/CE/edi_introguide.pdf
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7161944/
Page | 46