Professional Documents
Culture Documents
QRM Lecture 2 - Canvas
QRM Lecture 2 - Canvas
MEANS
Sharon Morein
Sharon.morein@anglia.ac.uk
Parametric vs non-parametric
• Parametric tests tend to be more powerful
• Use all the information available
• But also have more constraints (‘rules’ or
assumptions)
• Non-parametric:
• Do no rely on parametric assumptions
• More suited for small n’s
• Allow more levels of measurement of variables
(e.g., ordinal)
Comparing means
Conducting, concluding and reporting instances when SD of
the population is unknown (as we’d do a Z test)
Where we have 2 samples [Comparing to a specific value
you could use one sample t-test, though CI simpler]
Between subjects
Parametric (independent Student’s t-test or Welch’s t-test)
Non-parametric (e.g., Mann-Whitney, Wilcoxon rank-sum)
Within subjects
Parametric (repeated=paired=dependent t-test)
Non parametric (e.g., Wilcoxon signed-rank)
The sampling distribution
• Hypothetical scenario
• Samples are randomly and repeatedly
taken
• Unit of measurement is a sample
(group of observations) rather than a
single observation
• SE=SD of sampling distribution
• “Typical” variability between samples
X1 X2 Difference between
t
s p2 s p2 the means
n1 n2 SE of sampling
distribution of
differences
• variance sum law – variance of a difference between two independent
variables is the sum of variances: variance1+variance2
• The equation will take into account unequal n’s – BUT should we even go
there?
Dependent t-test
Xhappy= 51.1; SDhappy=10.1
For each person: di= Xhappy-Xsad nhappy=nsad=60
Xsad=49.7; SDsad=9.7
Xdiff=51.1-49.7=1.4
SDdiff=15.6
D D
t
t= signal
Difference between noise
the means assumed
sD N to be 0
(estimated) SE
of differences
• Example:
• Memory was not significantly different in the sad (M=51.1,
SD=10.1) vs the happy (M=49.67, SD=9.7) condition. The
difference (M=1.43, 95% CI [-2.18, 4.98]) was not statistically
significant, t(59)=1.45, p=.15. The effect size was small (Cohen’s
d=0.14).
Non parametric
• More robust to outliers
• If the distributions are not normal, then with small n’s the
sampling distribution will likely be non-normal
• Mann-Whitney
[& Wilcoxon rank-sum test]
• Wilcoxon signed-rank test
• Assumptions
• Ordinal scale and above
• Each observation (M-W) or pair (Wilcoxon) is chosen randomly and
independently
Mann-Whitney Rationale
• Ranking – we compute the statistic on ranks rather than
on raw values
Sum of
ranks in
group 1
• Tests,
e.g. Shapiro-Wilk, Kolmogorov-Smirnov (H1 the distribution differs
from the normal distribution), but don’t operate well with smaller n’s
AND can be too sensitive with very large n’s
In practice
If have a problem with normality (no 100% solution):
• Not too bad with equal n’s of sufficient size – just go ahead
• Aim for larger samples (central limit theorem) but depends on
distribution (if heavy tailed need N’s larger than 30)
• Can consider cleaning/trimming/transforming
• Non-parametric
• Bootstrapping (& robust tests if R available) – not currently
available
• If have a problem with HOV
• Not so much a problem with equal n’s
• Welch’s t
• Non-parametric
Warning: Leven’s test doesn’t work so well with small n’s and
unequal n’s
So What to do???
• Normality and HOV holds (and equal n’s)
T-test
• Normality holds but HOV violated
Welch’s t-test
• Normality violated
Look at sample size, equal sample size and HOV
If sample sizes are (roughly) equal and reasonably good and
normality not massively violated – ok to stick with Welch’s t-test
Alternatives:
Non-parametrics
Transforming the data and starting again