Professional Documents
Culture Documents
2015-02-10 MBM Tutorial Combined Diversity
2015-02-10 MBM Tutorial Combined Diversity
metrics
a0
the
unknown
number
of
species
present
in
the
community
but
not
observed.
1.
OTU
richness:
R=
S
0 (no
correcGon
for
taxa
not
observed);
2.
Chao-‐1
index:
assumes
that
the
number
of
observaGons
for
a
taxa
has
a
Poisson
distribuGon
and
corrects
for
variance;
0 R=
S
+
a
0
i.e.
3.
ACE
(abundance-‐based
coverage
esGmators):
involves
an
arbitrary
abundance
threshold
to
label
Sabun
as
the
number
of
abundant
taxa,
Srare
as
the
number
of
rare
taxa;
The
expression
basically
inflates
the
number
of
rare
taxa
and
inflates
again
the
number
of
taxa
with
abundance
1.
Marker-‐based
metagenomic
tutorial
4
Richness
(observed
Richness species)
100 150 200 250 300 350 400
RU-BOL1-1
RU-KOR1-1
GB-EL75-69
FI-N-47
FI-Xinb-3
FI-FHS2-11
FI-FUT1-2
SE-G1-9
Daphnia
clones
FI-FSP1-16
BE-OM-2
RU-RM1-9
DE-K35-Mu11
CH-H-149
DE-K35-Iinb
KE-1-1
Species
richness:
example
of
use
RU-YAK1-1
FR-C1-1
28°C
20°C
IT-ISR1-8
IL-M1-8
Daphnia
microbiota:
5
IR-GG1-7
Influence
of
temperature
on
Alpha
Diversity:
within
sample
diversity
R
=
5
R
=
2
R
=
4
R
=
5
Species richness"
Inverse Simpson"
Berger-Parker"
Sequencing depth!
Sample 2
OTU 1 OTU 2
R = 2
Sample 2
OTU 1 OTU 2
R = 2
Sample 2
OTU 1 OTU 2
R = 2
Sample 2
OTU 1 OTU 2
Species richness"
Diversity
Shannon"
Index Value!
Inverse Simpson"
Berger-Parker"
Sequencing depth!
Richness!
FiltraSon
• Remove
taxa
with
0
count
(prune_taxa)
• Remove
OTUs
that
appear
less
than
n
Gmes
or
in
less
than
n
samples
(genefilter_sample,
filterfun_sample)
Marker-‐based
metagenomic
tutorial
24
Ge]ng
controls
CH-Daphnia
FI-Daphnia
CH-Water
Controls
IsolaGon
Clustering Number
threshold of OTUs
99% 58
98% 40
97% 35
96% 31
95% 30
90% 14
Stock
76
sequences
x
max.607
nuc
Mock
76
strains
Marker-‐based
metagenomic
tutorial
26
β-‐diversity
metrics
Sample
1
Sample
2
R
=
7
Sample 3 Sample 4
Sample 1 Sample 2
Mean
diversity
per
treatment
β
=
γ
/
α
Treatment
A
Treatment
B
R
=
5
R
=
6
Sample 3 Sample 4
Presence/Absence Abundance
Without
a
• Bray-‐CurGs
(PCoA)
phylogeneSc
tree
• Jaccard
• Euclidean
(PCA)
With
a
phylogeneSc
tree
Treatment B
Sample 4
Treatment B
Sample 4
Treatment B
Sample 4
Distance between samples is reduced if phylogeneSc approach is used
• With
unweighted
UniFrac,
only
the
presence
or
absence
of
lineages
are
considered
(community
membership).
• With
weighted
UniFrac,
branch
lengths
are
weighted
based
on
the
relaGve
abundances
of
lineages
within
communiGes
(community
structure).
hep://bmf.colorado.edu/unifrac/$
Beta
Diversity:
between
sample
diversity
Presence/Absence Abundance
Without
a
• Bray-‐CurGs
(PCoA)
phylogeneSc
tree
• Jaccard
• Euclidean
(PCA)
Cluster Dendrogram
OTU_table
Sample1
Sample2
Sample3
Sample4
OTU 1 2 0 1 2
0.7
OTU
2
4
1
2
4
s2
2
OTU
3
1
0
0
2
0.6
OTU
4
1
0
3
2
0.5
OTU
5
1
9
0
0
Height
OTU
6
0
0
2
0
0.4
OTU
7
0
0
0
2
s3
3
0.3
Sample1
A
s1
s4
4
Sample2
A
Hierarchical clustering
Sample3
B
tutorial.data.bray
Marker-‐based
metagenomic
tutorial
hclust (*, "average") 38
Sample4
B
Beta
Diversity:
visualizaGon
/
ordinaGon
Treatment
A
Sample
1
Sample
2
0.2
s1
s4
site4
site1
0.1
PCoA 2: 18.9 %
s2
0.0
site2
-0.1
-0.2
-0.3
s3
site3
Dispersion
Ordina4on
plots
Cluster
analysis
sta4s4cs
Community
Differen4ally
Local
specificity
overlap
abundant
taxa
• Distance = frac4on of the total branch length that is unique to any par4cular
environment
Variance
Euclidean
variables
are
expected
to
have
equal
variance
Manhagan
Mean
Bray–Cur4s
between-‐group
differences
in
average
absolute
[log-‐scale]
Variance
Canberra
differenceare
matched
by
propor4onate
changes
in
average
abundance
Mean
[log-‐scale]
Gower
variables
are
physical
or
chemical
data
Dispersion
Ordina4on
plots
Cluster
analysis
sta4s4cs
Community
Differen4ally
Local
specificity
overlap
abundant
taxa
Unconstrained
tb-‐PCA
CA,
DCA
PCoA,
NMDS
Env.
factors
are
used
post
hoc
by
env.
factors
Unimodal
2. Inves4gate
how
well
the
distances
in
the
ordina4on
graph
represent
the
total
distances
(e.g.
Dispersion
Ordina4on
plots
Cluster
analysis
sta4s4cs
Community
Differen4ally
Local
specificity
overlap
abundant
taxa