Professional Documents
Culture Documents
01-Intro To Sequence
01-Intro To Sequence
01-Intro To Sequence
Sequence analysis
Representation is key to understanding
In sequence analysis, macromolecules are represented as strings
QTELATKAGVKQQSIQLIEAGVTK TATACAAGAAAGTTTGTACT
Nucleotide sequences
DNA: 4 bases: A, G, C, T
RNA: 4 bases: A, G, C, U
Ambiguity codes:
N = A or G or C or T or U (also = X)
S (Strong) = G or C, W(Weak) = A or T/U
R (puRine) = G or A, Y (pYrimidine) = C or T/U
M (aMino) = A or C, K (Keto) = G or T/U
B = not A, D = not C, H = not G, V = not T/U
Nucleotide sequences
5- GATCCAGA - 3 5- TCTGGATC - 3 Sequence: 5-GATCCAGA-3
Reverse: 3-AGACCTAG-5
Complement: 3-CTAGGTCT-5 Reverse-complement: 5-TCTGGATC-3
Design further experiments lRestriction mapping lPCR planning non-coding Sequence comparison Search for known motifs
coding
Molecular phylogeny