0% found this document useful (0 votes)

48 views

Pairwise Sequ Datab: Appos® ©mimfoimrdaifcltes

Uploaded by

Ved Classes

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as RTF, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

48 views

Pairwise Sequ Datab: Appos® ©mimfoimrdaifcltes

Uploaded by

Ved Classes

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as RTF, PDF, TXT or read online on Scribd

You are on page 1/ 25

Pairwise Sequ Datab

AppOS^ MimfoimrDaifcltes

uence
match mismatch GCG!
TA CC TA CC

GCG A

gap

Why make sequence alignments?

The sequences may share a common origin - a common ancestor sequence. If the similarity is sufficiently

convincing or if we have additional evidence for an evolutionary relationship, then we say that the sequences are homologous.

The sequences may have the same or related structure and function.

The difference in the alignments may be linked to the functional changes/diseases.

Approaches in Pairwise Sequence Alignment

Dot Matrix
Global Alignment

Local Alignment

isualization: Dotmatnx
A G A T T C G C A G T T C C G T A C G

rotmatrix - StophylococcuI^pIBermiclis TCP62A and ATCC12228

ignment --StophylococcusTpJBFrmidisRP62A and ATCC12228

A high-quality alignment? For DNA sequences Long runs of identity Few gaps in the aligned regions An overall high degree of identity (>8o%)

For protein sequences Includes most of each sequence A significant proportion of identities throughout the alignment Multiple examples of conservative substitutions Relatively few gaps 50% is very good

=
{92&f

NBed4etn fn and Wunsc

The alternative pathways that could form the maximum match are illustrated. The maximum match terminates at the largest number in the first row or first column, 8 in this case.

Local Alignment: Smith-Waterman Algorithm (1981)

A A A A U 00 0-0

C 00 0-0 0 0 00 00 10 10 0-0 00 00 0-0 00 10 00 00

A 00 10 10 00

C 0-0 0-0 0*7 0-7 10 0-0 0-0 0-7 17 0-3 1-3 0-0 0-7 10 1-7

C 0 0 0*0 00 0-3 0-3 2 0 10 0-3 03 1-3 00 10 10 03 0-7

U 0-0 00 00 00 00 1-3 30 1-7 1-8 10 10 0-3 20 0-7 0-3

C 00 0*0 00 10 0 0 0-3 1-7 2-7 2-7 2-3 1-0 0-7 0-7 1*7 0-3

G 0*0 0-0 0*0 0-7 10 1-3 13 2-3 2-3 20 0 7 17 0-3 1*3 ()() 0-0 0-0 0-0 1-0 0-3 10 10 1-0 2 0

C 0-0 00 00 0-0 00 20 13 0*7 0-7 0-7 2-0

U 00 0-0 00 10 00 0-7 17 1-0 1-7 1-7 17 1-7 27 2-7 1-3

l! 0-0 0-0 00 1-0 0-7 0-3 0 3 13 20 2-7 13 13 13 2 3 2-3

G 0 0 1 (I I 0 0-0 0-7 0-3 0-0 13 10 1-7 2-3 2-3 10 10 2-0 00 0o 7 07 10 0* 3 00 00 10 27 20 20 20 20

4 c
c A
U U G A C G G

o-o 0-0 00 00 00 00 00 00 0-0 00 00 00 00

o-o

o-o 00
0-7 2-0 0-7 0-3 0 0 10 0-0 0-7 0-0

!li 20 t 17 \oso 2-7 13 \

Match: 1.0 Mismatch: -1/3 Gapwk=1.0+l/3*k

'Smith-Waterman Algorithm vs Needleman-Wunsch Algorithm

Database Searching

Similarity searches in sequence databases have become a mainstay of bioinformatics.

A sequence by itself is not information. Comparison can help find the important biological information, e.g. function of unknown genes, structure of query sequences, duplicated genes.

Similar scores: allowing substitutions or residues with similar characteristics (e.g. BLOSUM62, PAM250)

Two programs, which greatly facilitated the similarity search, were developed: FASTA (Pearson and Lipman 1988) and BLAST (Altschul et al. 1990). Many programs have been further developed from them.

Sequence databases, e.g. NCBI.

Basic Local Alignment Search Tool (BLAST)

Basic Local Alignment Search Tool (BLAST) was developed as a new way to perform sequence similarity search. It is a string pattern search.

SA/hat BLAST Tells You

BLAST reports surprising alignments Different than chance

Assumptions Random sequences Constant composition

Conclusions Surprising similarities imply evolutionary homology

Basic Local Alignment Search Tool (BLAST)

Widely used similarity search tool Heuristic approach based on Smith Waterman algorithm Finds best local alignments Provides statistical significance All combinations (DNA/Protein) query and database . DNA vs DNA (BLASTN) DNA translation vs Protein (BLASTX) Protein vs Protein (BLASTP) Protein vs DNA translation (TBLASTN) DNA translation vs DNA translation (TBLASTX) www, standalone, and network clients

Word Size = 11
GTACTGGACAT = 28

Minimum word size = 7

megablast default

blastn default = 11 Make a lookup TACTGGACATG

table of words
ACTGGACATGG CTGGACATGGA TGGACATGGAC

GGACATGGACC GACATGGACCC ACATGGACCCT

Online BLASJ S e a r c h ^

http://blast.ncbi.nlm.nih.gov/Blast.cgi?CMD=Web&PAGE_TYPE=BlastDocs BLAST Home Recent Results Saved Strategies Help a a Apis mellifera Gallus NCBI/ BLAST Home gatlus o Pan BLAST finds regions of similarity between biological troglodyte sequences, more.., ^^2 Aligning Multiple Protein Sequences? Try the COBALT Multiple Alignment Tool. GoJ BLAST Assembled RefSeq Genomes n Oryza sativa D Bos taurus n Human a Mouse a Rat a Arabidopsis thaliana Basic BLAST Choose a BLAST program to run. nucleoti de blast protei n blast Search a nucleotide database using a nucleotide query Algorithms: blastn, megablast. discontiguous megablast Search protein database using a protein query Algorithms: blastp. psi-blast. phi-blast Choose a species genome to search, or list all genomic BLAST databases

II f Skin Inl Register! S e a r c h p r o t e i n d a t a b a

se using a translated nucleotide query Search translated nucleotide database using a protein query Search translated nucleotide database using a translated nucleotide query

Your Recent Results New! Nucleotide Sequence (255 lett... Nucleotide Sequence (25 lette... News Hew SNP BLAST page The dbSNP BLAST page has been updated. Wed, 12 Jan 2011 14:00:00 EST g| More BLAST news... Tip of the Day Use Genomic BLAST to see the genomic context If you are interested in the evolution of a particular gene or gene famity it is often intetesting to examine the intro-exon structure even across species. [^I More tips...

Specialized BLAST Choose a type of specialized search (or database name in parentheses ) Make specific primers with Primer-BLAST o Search trace archives D Find conserved domains in your sequence (cds) Find sequences with similar conserved domain architecture (cdart) D Search sequences that have gene expression profiles (GEO) Search immunoglobulins (IgBLAST) Search using SNP flanks

n Sr.rppn spmipnre fnr \/Rrtnr rontaminatinn iVprsrrppnl

utput: Alignments

>gi|127552|sp|P23367|MUTL_ECOLI DNA mismatch repair protein mutL Length = 615 Score =42.0 bits (97), Expect = 3e-04 Identities = 26/59 (44%), Positives = 33/59 (55%), Gaps = 9/59 (15%) HEVHF-------LHE----ESILEV-QQHIESKL HEVRFHQSRLVHDFIYQGVLSVLQQQLETPL +H+ L PIOSITHPFLYLSLEIS PQNVDVNVH L 338 58 negative positive score + +L V QQ +E+ L Query + P 9 L LEI P VDVNVH substitution LGitfDQQPAFVLYLE IDPHQVDVNVH (conservative) Sbjct 280

Identical match

From NCBI training tutorial

Perform Blast search of the following sequence. In which gene? In the coding region?

Translate it into aa sequence, and perform Blastp search

GGCCGTGCCT GGGGATCCAA GTTCCCCTCT CTCCACCTGT GCTCACCTCT CCTCCGTCCC CAACCCTGCA CAGGCAAGAT CGTGGACGCC GTGATTCAGG AGCACCAGCC CTCCGTGCTG CTGGAGCTGG GGGCCTACTG TGGCTACTCA GCTGTGCGCA TGGCCCGCCT GCTGTCACCA GGGGCGAGGC TCATCACCAT CGAGATCAAC CCCGACTGTG CCGCCATCAC CCAGCGGATG GTGGATTTCG CTGGC

Regulation - Lactase - Gene - Click - Learn - Worksheet
100% (1)
Regulation - Lactase - Gene - Click - Learn - Worksheet
3 pages
Sequence Alignment and Searching
No ratings yet
Sequence Alignment and Searching
54 pages
Bio 2
No ratings yet
Bio 2
39 pages
Sequence Alignment and Searching
No ratings yet
Sequence Alignment and Searching
37 pages
Lecture 4
No ratings yet
Lecture 4
106 pages
UNIT IV _ BLAST (1)
No ratings yet
UNIT IV _ BLAST (1)
21 pages
_second_done_w14b_searching squence databases
No ratings yet
_second_done_w14b_searching squence databases
32 pages
TY-Exercise_4_(35)
No ratings yet
TY-Exercise_4_(35)
8 pages
Sequence Alignments: Felix Sappelt Irina Wagner
100% (1)
Sequence Alignments: Felix Sappelt Irina Wagner
34 pages
Blast 2 S, A New Tool For Comparing Protein and Nucleotide Sequences
No ratings yet
Blast 2 S, A New Tool For Comparing Protein and Nucleotide Sequences
4 pages
Bioinformatics Lab 2 (Evelyn)
No ratings yet
Bioinformatics Lab 2 (Evelyn)
9 pages
Diploma - Practical
No ratings yet
Diploma - Practical
11 pages
Blast Fasta
No ratings yet
Blast Fasta
27 pages
Bioinformatics Lab 2
No ratings yet
Bioinformatics Lab 2
9 pages
Running BLAST Through Perl
No ratings yet
Running BLAST Through Perl
35 pages
Introduction To Different Resources of Bioinformatics and Application PDF
No ratings yet
Introduction To Different Resources of Bioinformatics and Application PDF
55 pages
BLAST
No ratings yet
BLAST
30 pages
Lecture - 02 - Comparative Sequence Analysis
No ratings yet
Lecture - 02 - Comparative Sequence Analysis
28 pages
Basic Local Alignment
No ratings yet
Basic Local Alignment
36 pages
Lecture 4: Blast: Ly Le, PHD
No ratings yet
Lecture 4: Blast: Ly Le, PHD
60 pages
Lab 2.1
No ratings yet
Lab 2.1
21 pages
TY-Exercise_4_(35)(Updated)
No ratings yet
TY-Exercise_4_(35)(Updated)
7 pages
Bs982 l08 Basic Blast
No ratings yet
Bs982 l08 Basic Blast
38 pages
Blast 2 Sequences: Salman Khan Current Gpa in Bioinf 4 Gpa
No ratings yet
Blast 2 Sequences: Salman Khan Current Gpa in Bioinf 4 Gpa
45 pages
Bioinformatics: Blast and Sequence Analysis
No ratings yet
Bioinformatics: Blast and Sequence Analysis
45 pages
ALLIENU Blast and Fasta
No ratings yet
ALLIENU Blast and Fasta
27 pages
Blast
No ratings yet
Blast
18 pages
Blast ND Fasta
No ratings yet
Blast ND Fasta
28 pages
Fundamentals of bioinformatics_L5
No ratings yet
Fundamentals of bioinformatics_L5
56 pages
Lectures_9-12
No ratings yet
Lectures_9-12
39 pages
BLAST Background
100% (1)
BLAST Background
27 pages
Sequence Analysis - Alignment
No ratings yet
Sequence Analysis - Alignment
57 pages
BLAST (Basic Local Alignment Search Tool)
100% (1)
BLAST (Basic Local Alignment Search Tool)
23 pages
Basic Local Alignment Search Tool-BLAST
No ratings yet
Basic Local Alignment Search Tool-BLAST
9 pages
Genomic Sequence Alignment
No ratings yet
Genomic Sequence Alignment
25 pages
BLAST and Sequence Alignment
No ratings yet
BLAST and Sequence Alignment
36 pages
Bioinformatics Is The Inter-Disciplinary Branch of Biology Which Merges Computer Science, Mathematics and Engineering To Study The Biological Data
No ratings yet
Bioinformatics Is The Inter-Disciplinary Branch of Biology Which Merges Computer Science, Mathematics and Engineering To Study The Biological Data
26 pages
Lecture 8- BLAST_MSA
No ratings yet
Lecture 8- BLAST_MSA
15 pages
05 CAP5510 Fall21
No ratings yet
05 CAP5510 Fall21
40 pages
Blast Introduction
No ratings yet
Blast Introduction
42 pages
Algorithms For Biological Sequence Analysis: Class Presentation
No ratings yet
Algorithms For Biological Sequence Analysis: Class Presentation
40 pages
Heuristic Local Alignerers: The Basic Indexing & Extension Technique
No ratings yet
Heuristic Local Alignerers: The Basic Indexing & Extension Technique
39 pages
Database Similarity Searching
No ratings yet
Database Similarity Searching
4 pages
Lecture 05
No ratings yet
Lecture 05
36 pages
BLAST
No ratings yet
BLAST
17 pages
Sequence Alignment
No ratings yet
Sequence Alignment
14 pages
Using BLAST: FASTA Format
0% (1)
Using BLAST: FASTA Format
3 pages
BLAST
100% (1)
BLAST
4 pages
Retrieval of Data
No ratings yet
Retrieval of Data
22 pages
About Basic Local Alignment Search Tool
No ratings yet
About Basic Local Alignment Search Tool
17 pages
Blast
100% (1)
Blast
21 pages
Bi205: Genetics & Evolution: Bioinformatics 1 & 2
No ratings yet
Bi205: Genetics & Evolution: Bioinformatics 1 & 2
14 pages
BLAST - A Heuristic Algorithm
No ratings yet
BLAST - A Heuristic Algorithm
18 pages
Brutlag 98
No ratings yet
Brutlag 98
6 pages
Week 3 LocalAlignment
No ratings yet
Week 3 LocalAlignment
25 pages
Lecture/Lab: BLAST: Materials Last Updated June 2007
No ratings yet
Lecture/Lab: BLAST: Materials Last Updated June 2007
11 pages
Introduction to Bioinformatics Using Action Labs
From Everand
Introduction to Bioinformatics Using Action Labs
Jean-Louis Lassez
5/5 (1)
Production System: Fundamentals and Applications
From Everand
Production System: Fundamentals and Applications
Fouad Sabry
No ratings yet
Introduction to Bioinformatics, Sequence and Genome Analysis
From Everand
Introduction to Bioinformatics, Sequence and Genome Analysis
Jerry H. Swift
No ratings yet
Gene Expression Programming: Fundamentals and Applications
From Everand
Gene Expression Programming: Fundamentals and Applications
Fouad Sabry
No ratings yet
Técnicas Estadísticas para la Ciencia de Datos a través de R. Aprendizaje Supervisado: Análisis Discriminante, Árboles de Decisión, Redes Neuronales y Modelos Lineales Generalizados
From Everand
Técnicas Estadísticas para la Ciencia de Datos a través de R. Aprendizaje Supervisado: Análisis Discriminante, Árboles de Decisión, Redes Neuronales y Modelos Lineales Generalizados
César Pérez López
No ratings yet
Ap Biology Interactive Cornell Notes 7.1-7.5-1
No ratings yet
Ap Biology Interactive Cornell Notes 7.1-7.5-1
9 pages
The Cell As A Machine 1st Edition PDF
No ratings yet
The Cell As A Machine 1st Edition PDF
22 pages
Logbook On Exp 6 - Zayed Hassan
No ratings yet
Logbook On Exp 6 - Zayed Hassan
3 pages
Verona SOP
No ratings yet
Verona SOP
1 page
Session 5 Biologics Patent Drafting and IP Strategy
No ratings yet
Session 5 Biologics Patent Drafting and IP Strategy
36 pages
Characteristics of Saccharomyces Cerevisiae
100% (1)
Characteristics of Saccharomyces Cerevisiae
1 page
Lab 2 Electrophoresis
No ratings yet
Lab 2 Electrophoresis
3 pages
Biology SQP-01 2024 Exam
No ratings yet
Biology SQP-01 2024 Exam
8 pages
Call For Papers: Peptide-Based Immunotherapeutics and Vaccines 2019
No ratings yet
Call For Papers: Peptide-Based Immunotherapeutics and Vaccines 2019
1 page
A.P. Chapter 8 WebTest
No ratings yet
A.P. Chapter 8 WebTest
9 pages
7es New DLL For Observation
No ratings yet
7es New DLL For Observation
14 pages
Current Topics in Developmental Biology 74 1st Edition Gerald P. Schatten (Eds.) - The full ebook with complete content is ready for download
No ratings yet
Current Topics in Developmental Biology 74 1st Edition Gerald P. Schatten (Eds.) - The full ebook with complete content is ready for download
81 pages
ucsc_proteome
No ratings yet
ucsc_proteome
20,508 pages
Chapter - 6 DNA Recombination
No ratings yet
Chapter - 6 DNA Recombination
68 pages
Quorum Sensing - Wikipedia
No ratings yet
Quorum Sensing - Wikipedia
75 pages
BIOLOGY syllabus Imat
No ratings yet
BIOLOGY syllabus Imat
6 pages
Molecular Biology MCQ
No ratings yet
Molecular Biology MCQ
197 pages
C Value Paradox
No ratings yet
C Value Paradox
4 pages
UM Professors Aquaculoture
No ratings yet
UM Professors Aquaculoture
8 pages
Unit I: An Introduction To Biotechnology
No ratings yet
Unit I: An Introduction To Biotechnology
22 pages
Module 2 Agribusiness in Retrospect in Entrep Elect 2 1st Sem 2024
No ratings yet
Module 2 Agribusiness in Retrospect in Entrep Elect 2 1st Sem 2024
18 pages
Full download Bioinformatics in Rice Research Theories and Techniques 1st Edition Manoj Kumar Gupta (Editor) pdf docx
100% (4)
Full download Bioinformatics in Rice Research Theories and Techniques 1st Edition Manoj Kumar Gupta (Editor) pdf docx
37 pages
SeatingArrangement (7)
No ratings yet
SeatingArrangement (7)
60 pages
Certificate For COVID-19 Vaccination: Beneficiary Details
No ratings yet
Certificate For COVID-19 Vaccination: Beneficiary Details
1 page
CHE631-Module 4 - Enzymes
No ratings yet
CHE631-Module 4 - Enzymes
26 pages
Pes Cells
No ratings yet
Pes Cells
7 pages
Acne Vulgaris: New Evidence in Pathogenesis and Future Modalities of Treatment
No ratings yet
Acne Vulgaris: New Evidence in Pathogenesis and Future Modalities of Treatment
11 pages
Dnarna Protein Synthesis
No ratings yet
Dnarna Protein Synthesis
69 pages
Recombinant Interferon Production
No ratings yet
Recombinant Interferon Production
8 pages

Pairwise Sequ Datab: Appos® ©mimfoimrdaifcltes

Uploaded by

Pairwise Sequ Datab: Appos® ©mimfoimrdaifcltes

Uploaded by

Pairwise Sequ Datab

Why make sequence alignments?

The difference in the alignments may be linked to the functional changes/diseases.

Approaches in Pairwise Sequence Alignment

rotmatrix - StophylococcuI^pIBermiclis TCP62A and ATCC12228

ignment --StophylococcusTpJBFrmidisRP62A and ATCC12228

NBed4etn fn and Wunsc

Local Alignment: Smith-Waterman Algorithm (1981)

C 00 0-0 0 0 00 00 10 10 0-0 00 00 0-0 00 10 00 00

C 0 0 0*0 00 0-3 0-3 2 0 10 0-3 03 1-3 00 10 10 03 0-7

U 0-0 00 00 00 00 1-3 30 1-7 1-8 10 10 0-3 20 0-7 0-3

C 0-0 00 00 0-0 00 20 13 0*7 0-7 0-7 2-0

U 00 0-0 00 10 00 0-7 17 1-0 1-7 1-7 17 1-7 27 2-7 1-3

l! 0-0 0-0 00 1-0 0-7 0-3 0 3 1*3 20 2-7 13 1*3 13 2 3 2-3

G 0 0 1 (I I 0 0-0 0-7 0-3 0-0 13 10 1-7 2-3 2-3 10 10 2-0 00 0o 7 07 10 0* 3 00 00 10 27 20 20 20 20

o-o 0-0 00 00 00 00 00 00 0-0 00 00 00 00

!li 20 t 17 \oso 2-7 13 \

Match: 1.0 Mismatch: -1/3 Gapwk=1.0+l/3*k

'Smith-Waterman Algorithm vs Needleman-Wunsch Algorithm

Similarity searches in sequence databases have become a mainstay of bioinformatics.

Sequence databases, e.g. NCBI.

Basic Local Alignment Search Tool (BLAST)

SA/hat BLAST Tells You

Assumptions Random sequences Constant composition

Conclusions Surprising similarities imply evolutionary homology

Basic Local Alignment Search Tool (BLAST)

Minimum word size = 7

blastn default = 11 Make a lookup TACTGGACATG

GGACATGGACC GACATGGACCC ACATGGACCCT

II f Skin Inl Register! S e a r c h p r o t e i n d a t a b a

n Sr.rppn spmipnre fnr \/Rrtnr rontaminatinn iVprsrrppnl

From NCBI training tutorial

Translate it into aa sequence, and perform Blastp search

You might also like

l! 0-0 0-0 00 1-0 0-7 0-3 0 3 13 20 2-7 13 13 13 2 3 2-3