0% found this document useful (0 votes)

7 views

Early sequence aligment

Uploaded by

carucast

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

7 views

Early sequence aligment

Uploaded by

carucast

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 14

HIGH THROUGHPUT SEQUENCING

Early sequence alignment (1 with 1)

1. Types of alignment

• Global alignment:
§ Tries to align the en2re sequence.
§ Aling all le8ers from the query (reference) and target sequence.
§ Suitable for closely related sequences.
§ Needleman-Wunsch method.

• Local alignment:
§ Aling regions having highest similari2es.
§ Aling substring of target with substring of query.
§ Suitable for more divergent sequences.
§ Smith-Waterman method.

2. Global alignment

• It is based on rewards and penal2es:

§ Match à 1.
§ Mismatch à -1.
§ GAP à -2.
• They are suitable in dynamic programming; from a large and complicated problem you break the problem into
smaller pieces, and it is easier to resolve.
• Steps:
1. Ini2aliza2on.
2. Matrix ﬁlling.
3. Traceback.
3. Needleman-Wunsch alignment – Global alignment

1. Generate a matrix where:

a. 1+M columns where M is the length of the largest sequence.
b. 1+N rows where N is the length of the sequence 2.

2. Ini2aliza2on step:
a. From the first row (point 0: upper leU) propor2onate a reward or penalty in each posi2on for this
row. The penal2es or rewards are accumula2ve in the rows.
b. From the first column (point 0: upper leU) propor2onate a reward or penalty in each posi2on for this
row. The penal2es or rewards are accumula2ve in the rows.
3. Matrix filling step:
a. From the first row (first point of overlapping in sequence, no ma8ers if they are equal or not) select
this square.

b. Calculate the leU value: assign the value form the leU neighbor and for every ver2cal or horizontal
movement you must add a GAP penalty to it (-2).

c. Calculate the up value: assign the value form the up neighbor and for every ver2cal or horizontal
movement you must add a GAP penalty to it (-2).
d. Calculate the diagonal value: assign the value form the diagonal neighbor and for every diagonal
movement you must add a match reward or a mismatch penalty to it (-1), depending on if the le8ers
are the same (match reward) or diﬀerent (mismatch penalty).

e. Take the maximum vale between the leU, up and diagonal value and a8ribute it to the cell.

f. Proceed to the next cell.

g. Fix the matrix completely.

4. Traceback:
§ Begin the traceback from the right bo8om cell in the matrix where the maximum score is present
con2nuing up to the upper leU corner. You can trace the arrows back that lead to the star2ng point.
§ There is an easy way to do it:
o If the le8ers are matched, the traceback will go diagonally.
o If the le8ers are not matched, the traceback will go towards the higher neighbor value
(diagonally, horizontally, or ver2cally).

a. Star2ng from the right bo8om corner; since there is a match (T=T) you have to go diagonally.
b. From this posi2on, since there is a match (C=C) you have to go diagonally.

c. From this posi2on, since there is a match (G=G) you have to go diagonally.
d. From this posi2on, since there is a mismatch (A-T) you have to go to the higher value in any direc2on
(in this case leU horizontally).

e. From this posi2on, since there is a match (A-A) you have to go diagonally.
To do an alignment we have to consider the arrows not the values:
• A diagonal arrow from the larger to the smaller value à match.
• A diagonal arrow from the smaller to the largest value or from same value to same valueà mismatch.
• A horizontal or ver2cal arrow no ma8ers the values à gap.
4. Local alignment

• Align regions having highest similari2es between 2 sequences.

• Stretches of sequences with highest density of matches are aligned.
• More suitable for par2ally similar, diﬀerent length and conserved regions containing sequences.
• Align substring of target sequences with substrings of query sequences.
• Suitable for divergent sequences.
• The most used algorithm is the Smith-Waterman.
• They are suitable in dynamic programming; from a large and complicated problem you break the problem into
smaller pieces, and it is easier to resolve. Steps:
1. Ini2aliza2on.
2. Matrix ﬁlling.
3. Traceback.

5. Smith-Waterman alignment – Local alignment

• It’s similar to the Needleman-Wunsch algorithm (global alignment) despite all the nega2ve values we get
during the matrix prepara2on becomes 0.

• The rewards and penal2es for this algorithm are:

§ Match à 1.
§ Mismatch à -1.
§ Gap à -2.
• Steps:

1. Generate a matrix where:

a. 1+M columns where M is the length of the largest sequence.
b. 1+N rows where N is the length of the sequence 2.

2. Ini2aliza2on:
a. From the first row (point 0: upper leU) propor2onate a reward or penalty in each posi2on for this row.
The penal2es or rewards are accumula2ve in the rows.
b. From the first column (point 0: upper leU) propor2onate a reward or penalty in each posi2on for this
row. The penal2es or rewards are accumula2ve in the rows.
c. You have to change the nega2ve values into 0.
3. Matrix filling step:
a. From the first row (first point of overlapping in sequence, no ma8ers if they are equal or not) select
this square.

b. Calculate the leU value: assign the value form the leU neighbor and for every ver2cal or horizontal
movement you must add a GAP penalty to it (-2).

Value from leU à -2

c. Calculate the up value: assign the value form the up neighbor and for every ver2cal or horizontal
movement you must add a GAP penalty to it (-2).

Value from up à -2
d. Calculate the diagonal value: assign the value form the diagonal neighbor and for every diagonal
movement you must add a match reward or a mismatch penalty to it (-1), depending on if the le8ers
are the same (match reward) or diﬀerent (mismatch penalty).

e. Transform the nega2ve values from leU, up, and diagonal into 0.

f. Take the maximum vale between the leU, up and diagonal value and a8ribute it to the cell.
g. Proceed to the next cell and ﬁx the matrix completely.

4. Traceback:
• You just have to start from the lower right cell or the highest number close to this cell and
sequen2ally generate ver2cal, horizontally, or diagonal arrows towards the neighbor cell with the
highest value.
• Once you ﬁnd a 0 you should stop since you have already found the shared mo2f between these two
sequences.
• You can repeat the process to ﬁnd other matches mo2fs aUer a mismatch from a posi2ve number
un2l you found another 0 àmul2ple local alignments.

Pooja Anshul Saxena Engr 692: Special Topics - Computational Biology
No ratings yet
Pooja Anshul Saxena Engr 692: Special Topics - Computational Biology
24 pages
Sequence Comparison: Motivation: Finding Similarity Between Sequences Is Important For Many Biological Questions
No ratings yet
Sequence Comparison: Motivation: Finding Similarity Between Sequences Is Important For Many Biological Questions
47 pages
L-8 Global Alignment
No ratings yet
L-8 Global Alignment
19 pages
Sequence Comparison and Alignment: Bioinformatics #4 IPB University
No ratings yet
Sequence Comparison and Alignment: Bioinformatics #4 IPB University
37 pages
G7 Sequence Alignment
No ratings yet
G7 Sequence Alignment
6 pages
Bioinfo Generic Skill
No ratings yet
Bioinfo Generic Skill
10 pages
Lecture-7-Dynamic Programming Global-Sequence Alignment
No ratings yet
Lecture-7-Dynamic Programming Global-Sequence Alignment
31 pages
Lecture5 Newest
No ratings yet
Lecture5 Newest
124 pages
Global Alignment
100% (1)
Global Alignment
40 pages
Tabby
No ratings yet
Tabby
11 pages
Bio Ass
No ratings yet
Bio Ass
3 pages
Pattern Matching Techniques and Their Applications To Computational Molecular Biology - A Review
No ratings yet
Pattern Matching Techniques and Their Applications To Computational Molecular Biology - A Review
8 pages
Alignment Methods: Introduction To Global and Local Sequence Alignment Methods
No ratings yet
Alignment Methods: Introduction To Global and Local Sequence Alignment Methods
57 pages
Three Steps in Dynamic Programming
No ratings yet
Three Steps in Dynamic Programming
7 pages
Smithwaterman 130216133804 Phpapp02
No ratings yet
Smithwaterman 130216133804 Phpapp02
15 pages
Needleman-Wunsch and Smith-Waterman Algorithm
67% (9)
Needleman-Wunsch and Smith-Waterman Algorithm
19 pages
Sequence Comparison Part 3
No ratings yet
Sequence Comparison Part 3
22 pages
MIT6 006S20 Lec16
No ratings yet
MIT6 006S20 Lec16
9 pages
Smith Waterman
No ratings yet
Smith Waterman
9 pages
Dynamic Programing-Global Alignment
No ratings yet
Dynamic Programing-Global Alignment
12 pages
lecture2_sequence_alignment
No ratings yet
lecture2_sequence_alignment
26 pages
PCB Lect02 Pairwise Allign
No ratings yet
PCB Lect02 Pairwise Allign
51 pages
Lecture 9 and 10 Pair wise global Alignment.
No ratings yet
Lecture 9 and 10 Pair wise global Alignment.
27 pages
lecture1-2
No ratings yet
lecture1-2
44 pages
Bioinformatics Prof. M. Michael Gromiha Department of Biotechnology Indian Institute of Technology, Madras Lecture - 7b Sequence Alignment II
No ratings yet
Bioinformatics Prof. M. Michael Gromiha Department of Biotechnology Indian Institute of Technology, Madras Lecture - 7b Sequence Alignment II
26 pages
8-5-19-Sequence Alignment in Gpu
No ratings yet
8-5-19-Sequence Alignment in Gpu
26 pages
Sequence Alignment: Lecture 2, Thursday April 3, 2003
No ratings yet
Sequence Alignment: Lecture 2, Thursday April 3, 2003
39 pages
Needlemanwunsch 130216130832 Phpapp01
No ratings yet
Needlemanwunsch 130216130832 Phpapp01
39 pages
Lab5 Ch2 Sequence Similarity PDF
No ratings yet
Lab5 Ch2 Sequence Similarity PDF
95 pages
trees
No ratings yet
trees
6 pages
Notes On Dynamic-Programming Sequence Alignment
No ratings yet
Notes On Dynamic-Programming Sequence Alignment
8 pages
Lecture 4.1 and 4.2 Sequence Alignment (Global and Local)
No ratings yet
Lecture 4.1 and 4.2 Sequence Alignment (Global and Local)
14 pages
DAA Assignment
No ratings yet
DAA Assignment
12 pages
Gap Penalty
No ratings yet
Gap Penalty
5 pages
bioinfo
No ratings yet
bioinfo
6 pages
Daa Assignment 10 Aryan Project
No ratings yet
Daa Assignment 10 Aryan Project
11 pages
Bioinformatics 1: Lecture 3: - Pairwise Alignment - Substitution - Dynamic Programming Algorithm
No ratings yet
Bioinformatics 1: Lecture 3: - Pairwise Alignment - Substitution - Dynamic Programming Algorithm
32 pages
Oxford Mathematics Problem Sheet 1
No ratings yet
Oxford Mathematics Problem Sheet 1
14 pages
Week 9N
No ratings yet
Week 9N
9 pages
Algorithms and Data Structure
No ratings yet
Algorithms and Data Structure
29 pages
Introduction Dynamic Programming
No ratings yet
Introduction Dynamic Programming
52 pages
Dynamic Programming
No ratings yet
Dynamic Programming
28 pages
Dynamic Programming Approach (1)
No ratings yet
Dynamic Programming Approach (1)
32 pages
11 Smith–Waterman Algorithm 06-08-2024
No ratings yet
11 Smith–Waterman Algorithm 06-08-2024
9 pages
Needleman Wunsch PDF
No ratings yet
Needleman Wunsch PDF
3 pages
Lecture # 14 - New
No ratings yet
Lecture # 14 - New
54 pages
Running BLAST Through Perl
No ratings yet
Running BLAST Through Perl
35 pages
CO34563 Assignment 4
No ratings yet
CO34563 Assignment 4
12 pages
Patterns
No ratings yet
Patterns
8 pages
Lecture 1b: The Maximum Contiguous Subarray Problem: Brute Force Reuses Data. Divide-And-Conquer Revisualizing
No ratings yet
Lecture 1b: The Maximum Contiguous Subarray Problem: Brute Force Reuses Data. Divide-And-Conquer Revisualizing
16 pages
COB Sequencealignment
No ratings yet
COB Sequencealignment
49 pages
Lecture 5 Introduction Dynamic Programming
No ratings yet
Lecture 5 Introduction Dynamic Programming
52 pages
SciCom LecNotes
No ratings yet
SciCom LecNotes
28 pages
Lec08 Dynamic Programming2024
No ratings yet
Lec08 Dynamic Programming2024
78 pages
Asst2 Students
No ratings yet
Asst2 Students
4 pages
Eng HuyDQ Chapter-1-Fundementals 2023
No ratings yet
Eng HuyDQ Chapter-1-Fundementals 2023
83 pages
Notebook PDF
No ratings yet
Notebook PDF
26 pages
Week 4
No ratings yet
Week 4
38 pages
Exercises of Function Study
From Everand
Exercises of Function Study
Simone Malacrida
No ratings yet
A-level Maths Revision: Cheeky Revision Shortcuts
From Everand
A-level Maths Revision: Cheeky Revision Shortcuts
Scool Revision
3.5/5 (8)
EXAMEN INNOVATION I
No ratings yet
EXAMEN INNOVATION I
6 pages
GSEA
No ratings yet
GSEA
7 pages
distribution
No ratings yet
distribution
7 pages
Tema 10 Leukocyte Migration
No ratings yet
Tema 10 Leukocyte Migration
36 pages
Ada Ans
No ratings yet
Ada Ans
42 pages
DSA Chapter 07 (Sorting)
No ratings yet
DSA Chapter 07 (Sorting)
64 pages
PPT Unit 2
No ratings yet
PPT Unit 2
56 pages
Searching and Sorting
No ratings yet
Searching and Sorting
28 pages
Some of The Assumption Taken While Working With Linear Programming Are
No ratings yet
Some of The Assumption Taken While Working With Linear Programming Are
2 pages
Bubble Sort Homework
100% (1)
Bubble Sort Homework
8 pages
L4 Slides - Algorithms - KS4
No ratings yet
L4 Slides - Algorithms - KS4
29 pages
Stacks
No ratings yet
Stacks
3 pages
Data Structures Using C, 2e Reema Thareja
No ratings yet
Data Structures Using C, 2e Reema Thareja
23 pages
Clustering
No ratings yet
Clustering
36 pages
Using Fuzzy Logic Controller in Ant Colony Optimization
No ratings yet
Using Fuzzy Logic Controller in Ant Colony Optimization
2 pages
Question Sheet On Assignment Porblem
No ratings yet
Question Sheet On Assignment Porblem
3 pages
Problem 1.1 (I) Shortest Remaining Time
No ratings yet
Problem 1.1 (I) Shortest Remaining Time
4 pages
Midterm Exam Solution
No ratings yet
Midterm Exam Solution
11 pages
Learning Algorithms 1st Edition George Heineman - The ebook with rich content is ready for you to download
100% (1)
Learning Algorithms 1st Edition George Heineman - The ebook with rich content is ready for you to download
47 pages
Design and Analysis of Algorithms
No ratings yet
Design and Analysis of Algorithms
136 pages
File Structures Jan 2018 (2010 Scheme)
No ratings yet
File Structures Jan 2018 (2010 Scheme)
1 page
Design and Analysis of Algorithms: Unit - I
No ratings yet
Design and Analysis of Algorithms: Unit - I
28 pages
Practical 1 BFS
No ratings yet
Practical 1 BFS
4 pages
CS50 Intro To AI With Python - Notes
No ratings yet
CS50 Intro To AI With Python - Notes
11 pages
Example 1. Use Cramer's Rule To Solve
No ratings yet
Example 1. Use Cramer's Rule To Solve
8 pages
Distance Vector Routing and Link State Routing: Expt No:7 DATE:28/06/2021
No ratings yet
Distance Vector Routing and Link State Routing: Expt No:7 DATE:28/06/2021
15 pages
CS350-CH03 - Algorithm Analysis
No ratings yet
CS350-CH03 - Algorithm Analysis
88 pages
CO322: DS & A (Simple Yet) Efficient Algorithms: Dhammika Elkaduwe
No ratings yet
CO322: DS & A (Simple Yet) Efficient Algorithms: Dhammika Elkaduwe
34 pages
Zco2024 Solutions and Cutoffs
100% (1)
Zco2024 Solutions and Cutoffs
15 pages
ADA Syllabus New
No ratings yet
ADA Syllabus New
6 pages
DSAL Print Format
No ratings yet
DSAL Print Format
6 pages
Design and Analysis of Algorithms Lab
No ratings yet
Design and Analysis of Algorithms Lab
62 pages
Moore Tutorial
No ratings yet
Moore Tutorial
20 pages
C++ Plus Data Structures: Nell Dale
No ratings yet
C++ Plus Data Structures: Nell Dale
39 pages

Early sequence aligment

Uploaded by

Early sequence aligment

Uploaded by

HIGH THROUGHPUT SEQUENCING

Early sequence alignment (1 with 1)

• It is based on rewards and penal2es:

1. Generate a matrix where:

f. Proceed to the next cell.

• Align regions having highest similari2es between 2 sequences.

5. Smith-Waterman alignment – Local alignment

• The rewards and penal2es for this algorithm are:

1. Generate a matrix where:

Value from leU à -2

You might also like