Professional Documents
Culture Documents
G7 Sequence Alignment
G7 Sequence Alignment
Dynamic programming
Sequence alignment
Introduction:
2. Local Alignment: This type finds the most similar regions within the sequences,
which is useful for sequences of different lengths or those containing only similar
subsequences. The Smith-Waterman algorithm is commonly used for local
alignment.
Key Algorithms:
1
1. Needleman-Wunsch(Global Alignments): This algorithm is used for global
alignment, meaning it aligns the entire length of two sequences. It builds a
matrix where each cell represents the best score for aligning subsequences up
to that point, considering matches, mismatches, and gaps.
Example: Suppose we want to align the sequences "GATTACA" and
"GCATGCU"
1. Initialization:
- Create a matrix with dimensions based on the sequence lengths plus one for
initial gaps.
- Initialize the first row and column with gap penalties (assuming a gap penalty
of -1).
2. Matrix Filling: Fill in the matrix using a scoring scheme (e.g., match = +1,
mismatch = -1, gap = -1).
3. Trace back: Start from the bottom-right cell and trace back to the top-left to get
the alignment.
- G A T T A C A
- 0 - - - - - - - Example Matrix Filling:
1 2 3 4 5 6 7
G -
1
C -
2
A -
3
T -
4
G - The matrix is filled based on the
5
C - recurrence relation:
6
U 2
-
7
F(i , j)
GA-TTACA
| | | | |
G-CATGCU
Example:
1. *Initialization*:
- Initialize a matrix similar to Needleman-Wunsch, but set the first row and
column to zero.
2. Matrix Filling:
- The difference here is to not allow negative scores (set negative values to zero).
3. Traceback: Start from the cell with the highest score and trace back to the first
zero encountered.
3
Example Matrix Filling:
- G A T T A C A
- 0 0 0 0 0 0 0 0
G 0
C 0
A 0
T 0
G 0
C 0
U 0
Using similar recurrence relations but ensuring no cell has a negative score.
TTAC
| | | |
T-AC
4
2. Matrix Filling: Populate the matrix using recurrence relations that consider
match, mismatch, and gap scores.
3. Traceback: Follow the path from the optimal score in the matrix to reconstruct
the alignment.
Scoring Systems:
- Gap Penalties: Negative scores for introducing gaps to account for insertions or
deletions.
Applications:
Challenges:
- Scoring Scheme Sensitivity: Results can vary greatly with different scoring
parameters.
5
Conclusion: