Dot Matrix
&
AMINO-ACID SEQUENCES
An alignment is an evolutionarily meaningful
comparison of two or more sequences (DNA, RNA, or
proteins).
GCGGCCCATCAGGTAGTTGGTG-G
GCGTTCCATC--CTGGTTGGTGTG
***..*****  .*.******* *
Positional homology = A pair of nucleotides
from two aligned sequences that have
descended from one nucleotide in the
ancestor of the two sequences.
GCGGCCCATCAGGTAGTTGGTG-G
GCGTTCCATC--CTGGTTGGTGTG
***..*****  .*.******* *
Definition:
Similarity resulting from
common ancestry.
Homology: A qualitative statement
Homology
By comparing homologous
characters, we can reconstruct
the evolutionary events that
have led to the formation of the
extant sequences from the common
ancestor.
Homology
Ancestor: ACTGGGCCCAAATC

Lineage 1: 1 deletion (G), 1 substitution
ACTGGCCCAGATC
Lineage 2: 1 insertion (A), 1 substitution
ACAGGGCCACAAATC

Alignment 1:
ACT-GGCC-CAGATC
ACAGGGCCACAAATC
**.-****-**.***

Alignment 2:
ACTGGCCCAGATC--
ACAGGGCCACAAATC
**.**.***.*..--
Ancestor: unknown
Events in each lineage: unknown
Extant sequences:
ACTGGCCCAGATC
ACAGGGCCACAAATC
- Two DNA sequences: A and B.
- Lengths are m and n, respectively.
- The number of matched pairs is x.
- The number of mismatched pairs is y.
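As a minimal sketch of this notation, the Python function below tallies x and y for two sequences that have already been aligned. The function name `count_pairs`, the use of `-` as the gap symbol, and the extra count of gap columns are assumptions made for illustration; they do not come from the slides.

```python
def count_pairs(aligned_a, aligned_b, gap="-"):
    """Return (x, y, gaps): matched pairs, mismatched pairs, gap columns."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    x = y = gaps = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            gaps += 1          # one residue paired with a gap symbol
        elif a == b:
            x += 1             # matched pair
        else:
            y += 1             # mismatched pair
    return x, y, gaps


# Hypothetical usage with an alignment of two short DNA sequences.
a = "GCGGCCCATCAGGTAGTTGGTG-G"
b = "GCGTTCCATC--CTGGTTGGTGTG"
x, y, gaps = count_pairs(a, b)
m = len(a.replace("-", ""))    # length of sequence A without gaps
n = len(b.replace("-", ""))    # length of sequence B without gaps
```

Each matched or mismatched column holds two residues and each gap column holds one, so under these assumptions m + n should equal 2(x + y) plus the number of gap columns.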
A gap indicates that a deletion or an
insertion has occurred in one of the
two lineages.
GCGG-CCATCAGGTAGTTGGTG--
GCGTTCCATC--CTGGTTGGTGTG
The alignment is the first step in
many evolutionary and functional
studies.
Methods of alignment:
1. Manual
2. Dot matrix
3. Algorithmic (scoring matrices and gap
penalties)
Manual alignment. When there
are few gaps and the two
sequences are not too different
from each other, a reasonable
alignment can be obtained by
visual inspection.
GCG-TCCATCAGGTAGTTGGTGTG
GCGTTCCATCAGGTGGTTGGTGTG
*** **********.*********
Advantages of manual alignment:
(1) use of a powerful and trainable tool
(the brain, well…, some brains).
(2) ability to integrate additional data,
e.g., domain structure, biological
function (e.g., 3D structure).
Disadvantages of manual alignment:
The alignment
is defined by a
path from the
upper-left
element to the
lower-right
element.
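The construction of the matrix is not spelled out in the lines above. The sketch below assumes the usual one: one sequence labels the columns (top), the other labels the rows (left), and a dot marks every element where the two characters are identical. The names `dot_matrix` and `render`, and the choice of `*` for a dot and `.` for an empty element, are hypothetical.

```python
def dot_matrix(top, left):
    """Rows follow `left`, columns follow `top`; True marks a dot."""
    return [[a == b for a in top] for b in left]


def render(top, left):
    """Return a plain-text picture of the matrix ('*' = dot, '.' = empty)."""
    lines = ["  " + " ".join(top)]
    for b, row in zip(left, dot_matrix(top, left)):
        lines.append(b + " " + " ".join("*" if dot else "." for dot in row))
    return "\n".join(lines)


print(render("GCGTC", "GCTC"))
```

For similar sequences, long runs of dots along a diagonal are what the eye follows when tracing a path from the upper-left element to the lower-right element.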
There are 4 possible steps in the path:
(1) a diagonal step
through a dot = match.
(2) a diagonal step
through an empty
element of the matrix =
mismatch.
(3) a horizontal step = a
gap in the sequence on
the left of the matrix
(the path advances along
the top sequence only).
(4) a vertical step = a gap
in the sequence on the
top of the matrix
(the path advances along
the left sequence only).
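The four step types can be sketched as a small path walker. The encoding of a path as a string of `D` (diagonal), `H` (horizontal), and `V` (vertical) moves, and the name `classify_path`, are assumptions for illustration only. The function tallies each step type and checks that the path really ends at the lower-right element.

```python
def classify_path(top, left, steps):
    """Tally step types for a path written as a string of D/H/V moves."""
    i = j = 0  # i counts characters of `left` (rows), j of `top` (columns)
    tally = {"match": 0, "mismatch": 0, "horizontal": 0, "vertical": 0}
    for step in steps:
        if step == "D":
            if i >= len(left) or j >= len(top):
                raise ValueError("diagonal step leaves the matrix")
            # A dot is present exactly where the two characters agree.
            kind = "match" if left[i] == top[j] else "mismatch"
            i += 1
            j += 1
        elif step == "H":
            kind = "horizontal"
            j += 1
        elif step == "V":
            kind = "vertical"
            i += 1
        else:
            raise ValueError(f"unknown step: {step!r}")
        tally[kind] += 1
    if (i, j) != (len(left), len(top)):
        raise ValueError("path does not end at the lower-right element")
    return tally


result = classify_path("GCGTC", "GCTC", "DDHDD")
```

Because only these three moves are accepted, a path can never step backward along either sequence.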
[Figure: forbidden directions and allowed directions for steps in the path]