Professional Documents
Culture Documents
WK1 Topics - Cs
WK1 Topics - Cs
Classifying samples
Stratified Sample: population is divided into groups (called strata), then a simple
random sample is drawn from each stratum(useful when the STRATA DIFFERS from
one another BUT individuals in the strata tend to be ALIKE)
o The population is divided into groups, with each group made up of members that
share some particular characteristic(s), such as an occupation.
Then, a fixed number of members (but not all members) from each
group are randomly selected for the sample.
Cluster Sample: items are drawn from the population in groups OR clusters(useful when
the population is TOO LARGE/SPREAD OUT for simple random sampling to be
feasible)
o The population is divided into groups that tend to exist naturally, such as cities in
a state. Then, all members of some randomly chosen groups (but not all
groups) are selected for the sample.
Voluntary Response Sample: NOT reliable because people who volunteer an opinion
tend to have STRONGER opinions than the general population & people with
NEGATIVE opinions are more likely to volunteer their response(used by the media to
engage audience)
Samples of Convenience: members of the pop. that are easily accessible (no
consideration of randomly at any point during the selection process)
Sampling Bias: chooses a sample from the population in such a way that certain members
are more likely to be included than others
o Using a sample of convenience (a sample chosen because the members in the sample
are easily accessible) typically gives rise to sampling bias.
o To help prevent sampling bias, when possible, a random sample of
the population should be taken.
Voluntary Response Bias: members of the sample of the population voluntarily choose to
participate(e.g. people are asked to call into a radio show to express their opinion, are asked
to answer a poll on social media, are asked to participate in an advertisement and other
similar situations)
o the bias comes from the tendency for volunteers to not be representative of the whole
population.
o This tendency can occur because volunteers have strong opinions on the focus of the
study
Self-Interest Bias: people who have an interest in the outcome of an experiment have an
incentive to use biased methods
Response Bias: When the way the study is conducted causes the members of the sample to
give inaccurate or untruthful responses
o Questions can also be confusingly worded causing misinterpretations.
o Another source of response bias comes from the fact that people are unlikely to give
truthful answers if those answers are socially unacceptable or reflect negatively on
them in some other way.
o Some ways to reduce response bias include allowing respondents to answer
anonymously and wording questions with clear and neutral language.
Social Acceptability Bias: People are reluctant to admit to behavior that may reflect
negatively on them which affects many surveys
Nonresponse Bias: studies that are ethically conducted, people chosen for the sample by the
researcher can choose not to participate or choose not to answer the questions.
o So if a study has many people who do not respond, we say that the study
has nonresponse bias.
o Depending on the focus and design of the study, some ways to reduce nonresponse
bias include offering an incentive to respond or sending reminders to those who
have not yet responded.
NOTE: choosing a large (or small) sample does not prevent or help with preventing bias
in a study.
1. Determine Quantitative/Categorical
if categorical: determine if you can rank/not the items
if quantitative: determine if the DIFF. of 2 #s is GREATLY diff. from a DIFF. of 0(e.g. a
salary of 50k is GREATLY diff. from a diff. of $0 salary)
STRATEGY - ALEKS Qs: REMEMBER to input the correct #s into the calculator & put () as needed
random sample: each member of the population has an equal chance of being chosen
o a random sample best represents the entire population
o therefore, can be used to make predictions about the population
o if a sample is NOT random, we probably should NOT use it for predictions about the
population
Classification of Variables
Differentiating Between Parameters and Statistics
PART A:
PART B:
The data are collected on the experimental units, which are the people, objects, or other
entities being measured.
The researcher compares two or more treatments for those experimental units.
o For example, different treatments might come in the form of different exercise
programs.
o The aspect being measured for each experimental unit is given by a response
variable (also called a dependent variable).
For instance, a response variable may be a person's resting heart rate after
undertaking an exercise program.
o An explanatory variable (also called an independent variable) is the variable that is
manipulated by the researcher.
Commonly, an explanatory variable defines the groups between which the
researcher wants to examine changes in the response variable.
For example, an explanatory variable may be the type of exercise program
undertaken.
PART C:
The type of conclusion that can be made is different between an observational study and
a designed experiment.
o An observational study does not allow the researcher to conclude that there is
a cause-and-effect relationship between the explanatory and response
variables.
o The researcher may, however, conclude that there is an association between the
explanatory and response variables.
o An association means that as the explanatory variable changes, the response
variable changes systematically, as in the ice cream example above.
o Only through a designed experiment may a researcher possibly conclude that
there is a cause-and-effect relationship between the explanatory and response
variables.
(A cause-and-effect relationship means that changes in one variable are
causing the changes in the other variable.)
If the researcher has carefully designed the experiment and controlled
for the effects of all unwanted influences on the result, then she may
have reason to conclude that there is a cause-and-effect relationship.
Observational studies fail to imply a cause-and-effect relationship because
of the possible effect on the outcome from some other variable that is not
being studied (called a confounding variable).
For example, suppose an observational study finds that coffee
drinkers have a higher percentage of lung cancer than non-coffee
drinkers do. We could not (and should not) conclude that drinking
coffee causes lung cancer.
There could be another variable that differs between the two
groups, beside coffee consumption, that influences the likelihood
of getting lung cancer, such as smoking or age. It could be that
coffee drinkers are also more likely to smoke, or that they tend to
be older than non-coffee drinkers. The difference in the percentage
of people with lung cancer could be attributed to smoking and/or
older age.
Observing a strong association does not mean observing a
cause-and-effect relationship. However, through a designed
experiment, a researcher has the possibility to understand
cause-and-effect, because other variables that are not part of
the study can be controlled through a method such as random
assignment (for example, randomly assigning experimental
units to groups, and randomly assigning groups to treatments).