Professional Documents
Culture Documents
Introduction To Bioinformatics-7
Introduction To Bioinformatics-7
Introduction To Bioinformatics-7
Bioinformatics
1
Introduction to Bioinformatics.
LECTURE 7: Phylogenetic Trees
2
Introduction to Bioinformatics
LECTURE 7: PHYLOGENETIC TREES
* Dr. Carlo Urbani (WHO) came and concluded that this was
a new and unusual pathogen.
4
Introduction to Bioinformatics
LECTURE 7: PHYLOGENETIC TREES
5
Introduction to Bioinformatics
7.1 SARS: the outbreak
8
Introduction to Bioinformatics
7.1 SARS: the outbreak
10
11
Introduction to Bioinformatics
7.1 SARS: the outbreak
12
Introduction to Bioinformatics
7.1 SARS: the outbreak
The SARS corona virus
13
Introduction to Bioinformatics
7.1 SARS: the outbreak
14
Introduction to Bioinformatics
7.1 SARS: the outbreak
15
Introduction to Bioinformatics
7.1 SARS: the outbreak
16
Introduction to Bioinformatics
7.1 SARS: the outbreak
17
Introduction to Bioinformatics
7.1 SARS: the outbreak
* When and where did the virus cross the species border?
20
Introduction to Bioinformatics
LECTURE 7: PHYLOGENETIC TREES
21
Introduction to Bioinformatics
LECTURE 7: PHYLOGENETIC TREES
22
Introduction to Bioinformatics
LECTURE 7: PHYLOGENETIC TREES
Phylogenetics
phylogenetics is the study of evolutionary relatedness
among various groups of organisms (e.g., species,
populations).
23
Introduction to Bioinformatics
LECTURE 7: PHYLOGENETIC TREES
Cladistics
As treelike relationship-diagrams called "cladogram" is drawn up to
show different hypotheses of relationships.
A cladistic analysis is typically based on morphological data.
24
Introduction to Bioinformatics
LECTURE 7: PHYLOGENETIC TREES
Cladistics
25
Cladistics : tree of life
26
Introduction to Bioinformatics
LECTURE 7: PHYLOGENETIC TREES
Phylogenetic Trees
A phylogenetic tree is a tree showing the evolutionary interrelationships
among various species or other entities that are believed to have a
common ancestor. A phylogenetic tree is a form of a cladogram. In a
phylogenetic tree, each node with descendants represents the most
recent common ancestor of the descendants, and edge lengths
correspond to time estimates.
27
Introduction to Bioinformatics
LECTURE 7: PHYLOGENETIC TREES
28
Introduction to Bioinformatics
LECTURE 7: PHYLOGENETIC TREES
Rooted Phylogenetic Tree
29
Introduction to Bioinformatics
LECTURE 7: PHYLOGENETIC TREES
30
Introduction to Bioinformatics
LECTURE 7: PHYLOGENETIC TREES
Unrooted Phylogenetic Tree
31
Introduction to Bioinformatics
LECTURE 7: PHYLOGENETIC TREES
32
Introduction to Bioinformatics
LECTURE 7: PHYLOGENETIC TREES
33
Tree without Branch Length
34
Tree with Branch Length
35
Introduction to Bioinformatics
LECTURE 7: PHYLOGENETIC TREES
* Multifurcating trees
37
Introduction to Bioinformatics
7.2 - ON TREES AND EVOLUTION
root
internal node
branch
external node
38
Introduction to Bioinformatics
7.2 - ON TREES AND EVOLUTION
unrooted tree
39
Introduction to Bioinformatics
7.2 - ON TREES AND EVOLUTION
40
Introduction to Bioinformatics
7.2 - ON TREES AND EVOLUTION
A B C B A C
rotation invariant
41
Introduction to Bioinformatics
7.2 - ON TREES AND EVOLUTION
* n is number of taxa
42
Introduction to Bioinformatics
7.2 - ON TREES AND EVOLUTION
Representing trees
* Various possibilities
* Listing of nodes
1 2 3 4 5 6 7
44
Introduction to Bioinformatics
7.3 INFERRING TREES
* n taxa {t1,,tn}
45
Introduction to Bioinformatics
7.3 INFERRING TREES
Three-point formula:
A
C
Lx
Lx + Ly = dAB
Lx + Lz = dAC Lz
Ly + Lz = dBC centre
Ly
B
Lx = (dAB+dAC-dBC)/2
Ly = (dAB+dBC-dAC)/2
Lz = (dAC+dBC-dAB)/2
46
Introduction to Bioinformatics
7.3 INFERRING TREES
Four-point formula: 1
when (1,2) and (i,j) are i
Lneighbor-couples
Four-point
x
condition !
d(1,2) + d(i,j) < d(i,1) + d(2,j)
Lz
Ri = j d(ti ,tj) centre
Minimize d(i,j) AND total distance in tree
Lq j
M(i,j) = (n-2)d(i,j) Ri Rj Ly
If i and j are neighbours!
2
M(i,j) < M(i,k) for all k not equal to j
47
NJ algorithm:
Step 2: Join the two taxa ti and tj to a new vertex V - use 3-point formula to
calculate the updates distance matrix D where ti and tj are replaced by V .
Step 4: The distance matrix D now contains n 1 taxa. If there are more than
2 taxa left go to step 1. If two taxa are left join them by an branch of length
d(ti,tj).
Step 5: Define the root node as the branch connecting the outgroup to the
rest of the tree. (Alternatively, determine the so-called mid-point) 48
Introduction to Bioinformatics
7.3 INFERRING TREES
If the distance from the root to all leafs is equal the tree is
ultrametric
Ultrametricity must be valid for the real tee, bur due to noise
this condition will in practice generate erroneous trees.
49
Introduction to Bioinformatics
LECTURE 7: PHYLOGENETIC TREES
* Area of Origin
50
Introduction to Bioinformatics
LECTURE 7: PHYLOGENETIC TREES
51
Introduction to Bioinformatics
LECTURE 7: PHYLOGENETIC TREES
52
Introduction to Bioinformatics
LECTURE 7: PHYLOGENETIC TREES
multidimensional scaling
53
Introduction to Bioinformatics
LECTURE 7: PHYLOGENETIC TREES
The genetic distance of samples from the palm civet increases +/- linearly with time
54
Introduction to Bioinformatics
7.5 THE NEWICK FORMAT
1 2 3 4 5 6 7
55
END of LECTURE 7
56