L-8 Global Alignment
L-8 Global Alignment
Global Alignment
Closely related sequences which are of same length are very
much appropriate for global alignment.
Here, the alignment is carried out from beginning till end
of the sequence to find out the best possible alignment
How it is done?
Needleman-Wunsch algorithm-A formula or set of steps to
solve a problem
Developed by Saul B. Needleman and Christian D. Wunsch in
1970
Dynamic programming algorithm for sequence alignment
Local Alignment
Sequences which are suspected to have similarity or even
dissimilar sequences can be compared with local alignment
method.
It finds local regions with high level of similarity
How it is done?
Smith-Waterman algorithms
Scoring matrices
Mostly Needleman-Wunsch and Smith-Waterman algorithms
use scoring system
For nucleotide sequence alignment, the scoring matrices
used are relatively simpler since the frequency of mutation for
all the bases are equal.
Positive or higher value is assigned for a match and a
negative or a lower value is assigned for mismatch.
These assumption based scores can be used for scoring the
matrices
Mainly used predefined matrices are PAM and BLOSUM
Gap score or gap penalty: Dynamic programming algorithms
use gap penalties to maximize the biological meaning.
Gap penalty is subtracted for each gap that has been introduced.
There are different gap penalties such as gap open and gap
extension. The gap score defines a penalty given to alignment
when we have insertion or deletion.
During the evolution, there may be a case where we can see
continuous gaps all along the sequence, so the linear gap penalty
would not be appropriate for the alignment.
Thus gap open and gap extension has been introduced when there
are continuous gaps (five or more).
The open penalty is always applied at the start of the gap, and
then the other gaps following it is given with a gap extension
penalty which will be less compared to the open penalty. Typical
values are –12 for gap opening, and –4 for gap extension.
The dynamic programming matrix is defined with three
different steps.
Match =1
Mismatch =-1
Gap -2
Trace Back
A T G A G
0 -2 -4 -6 -8 -10
A -2
T -4
G -6
C -8
A -10
G -12