Multiple Sequence Alignment: Some Slides From Cuong Dang and Others
✓ Motifs/Patterns discovery
Which parts “do the same thing”?
[Figure: an alignment of three sequences drawn as a path from Start through a 3-D grid]
V S N - S
- S N A -
- - - A S
DP for 3 sequences
There are 7 ways to reach C[i,j,k]
✓ Run time: O(n^3), i.e. about 7n^3 operations
C[i,j,k] is reached from one of:
C[i-1,j,k], C[i,j-1,k], C[i,j,k-1],
C[i-1,j-1,k], C[i-1,j,k-1], C[i,j-1,k-1],
C[i-1,j-1,k-1]
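A minimal sketch of the three-sequence recurrence, to make the "7 ways" concrete. It is not taken from the slides: the function names (`pair_score`, `column_score`, `align3_score`), the sum-of-pairs column score, and the score values (match 10, mismatch -1, indel -2, gap against gap 0) are all assumptions chosen for illustration.

```python
from itertools import product

GAP = "-"

def pair_score(a, b, match=10, mismatch=-1, indel=-2):
    """Score one aligned pair of symbols (assumed values; gap vs gap scores 0)."""
    if a == GAP and b == GAP:
        return 0
    if a == GAP or b == GAP:
        return indel
    return match if a == b else mismatch

def column_score(a, b, c):
    """Sum-of-pairs score of one alignment column of three symbols."""
    return pair_score(a, b) + pair_score(a, c) + pair_score(b, c)

# The 7 moves: each sequence either consumes a symbol (1) or takes a gap (0);
# the all-gap move (0, 0, 0) is excluded.
MOVES = [m for m in product((0, 1), repeat=3) if any(m)]

def align3_score(x, y, z):
    """Best score of a global alignment of x, y and z."""
    n1, n2, n3 = len(x), len(y), len(z)
    neg_inf = float("-inf")
    C = [[[neg_inf] * (n3 + 1) for _ in range(n2 + 1)] for _ in range(n1 + 1)]
    C[0][0][0] = 0
    # Lexicographic order guarantees every predecessor is filled in first.
    for i, j, k in product(range(n1 + 1), range(n2 + 1), range(n3 + 1)):
        if i == j == k == 0:
            continue
        best = neg_inf
        for di, dj, dk in MOVES:
            pi, pj, pk = i - di, j - dj, k - dk
            if pi < 0 or pj < 0 or pk < 0:
                continue  # this move would step outside the cube
            a = x[pi] if di else GAP
            b = y[pj] if dj else GAP
            c = z[pk] if dk else GAP
            best = max(best, C[pi][pj][pk] + column_score(a, b, c))
        C[i][j][k] = best
    return C[n1][n2][n3]

print(align3_score("VSNS", "SNA", "AS"))
```

Each cell looks at up to 7 predecessors, which is where the constant in 7n^3 comes from. The sketch returns only the score; recovering the alignment itself would need a traceback over the same cube.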
Dynamic programming for three sequences
Alignments 1
Alignments 2
CLUSTALW
CLUSTALW: Step 1 - Pairwise alignment
- X_1 = "GGATTGT"
- X_2 = "GGAAGG"
- X_3 = "AAGGTT"
- X_4 = "AGGT"
Scores:
- C(x, x) = 10 (match score)
- C(x, y) = −1 (mismatch score)
- C(x, –) = C(–, x) = −2 (indel score)
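The pairwise step can be sketched with a standard global-alignment (Needleman-Wunsch) dynamic program using the scores above. This is an illustrative sketch, not the CLUSTALW implementation: the function name `nw_score` and the labels `X1` to `X4` are assumptions, and the real program derives distances from its pairwise alignments rather than using raw scores directly.

```python
from itertools import combinations

def nw_score(x, y, match=10, mismatch=-1, indel=-2):
    """Optimal global alignment score of x and y, keeping one DP row at a time."""
    m = len(y)
    prev = [j * indel for j in range(m + 1)]   # row 0: y aligned against gaps
    for i in range(1, len(x) + 1):
        cur = [i * indel] + [0] * m            # column 0: x aligned against gaps
        for j in range(1, m + 1):
            diag = prev[j - 1] + (match if x[i - 1] == y[j - 1] else mismatch)
            cur[j] = max(diag, prev[j] + indel, cur[j - 1] + indel)
        prev = cur
    return prev[m]

seqs = {"X1": "GGATTGT", "X2": "GGAAGG", "X3": "AAGGTT", "X4": "AGGT"}

# Score every pair once; these scores fill the pairwise score table.
for (name_a, a), (name_b, b) in combinations(seqs.items(), 2):
    print(name_a, name_b, nw_score(a, b))
```

With four sequences there are six pairs to align, so this step costs six quadratic-time alignments.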
Problems with SP (sum-of-pairs score)
Some pairwise alignments are more important than others
✓ E.g., it is more important to have a good alignment between mouse and human sequences than between mouse and bird
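The point can be illustrated with a small sum-of-pairs calculation in which each pair of rows optionally carries a weight. Everything here is a hypothetical sketch: the function names, the toy rows standing in for human, mouse and bird, the score values (match 10, mismatch -1, indel -2) and the weight of 2 are assumptions, not data from the slides.

```python
from itertools import combinations

GAP = "-"

def pair_score(a, b, match=10, mismatch=-1, indel=-2):
    """Score one aligned pair of symbols (assumed values; gap vs gap scores 0)."""
    if a == GAP and b == GAP:
        return 0
    if a == GAP or b == GAP:
        return indel
    return match if a == b else mismatch

def sp_score(rows, weights=None):
    """Sum-of-pairs score of an alignment; weights maps (p, q) row pairs to a factor."""
    if len({len(r) for r in rows}) > 1:
        raise ValueError("all alignment rows must have the same length")
    total = 0
    for p, q in combinations(range(len(rows)), 2):
        w = 1 if weights is None else weights.get((p, q), 1)
        total += w * sum(pair_score(a, b) for a, b in zip(rows[p], rows[q]))
    return total

# Hypothetical toy alignment: row 0 = human, row 1 = mouse, row 2 = bird.
rows = ["AGGT", "AGGT", "A-GT"]

plain = sp_score(rows)                    # every pair counts equally
weighted = sp_score(rows, {(0, 1): 2})    # human-mouse pair counts double
print(plain, weighted)
```

Plain SP treats the human-mouse pair exactly like the mouse-bird pair; a per-pair weight is one simple way to express that some pairs matter more.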