Lecture 5 Protein Sequence Database

UniProt is a protein sequence database that consists of UniProtKB (a curated and annotated database), UniRef (non-redundant clusters of sequences), and UniParc (a comprehensive archive of all publicly available sequences). Pfam and Prosite are protein family and domain databases that group similar protein sequences and define common protein domains and families. The Protein Information Resource (PIR) was established in 1984 to provide a public resource for protein sequence identification and interpretation.

Uploaded by

Bhawna Rathi


Topic Name – Protein Sequence Databases

PROTEIN/PROTEOMICS DATABASES
• Protein Information Resource (PIR)
• UniProt - Protein Knowledge Database
• Pfam - Protein Family And Domain
• Prosite - Protein Family And Domain
UNIPROT Database

• The Swiss-Prot, TrEMBL, and PIR protein database activities have united to form the Universal Protein Resource (UniProt)
– UniProt Knowledgebase (UniProtKB): curated sequence information and annotations, linked to other databases.
– UniProt Reference Clusters (UniRef): removes sequence redundancy by merging sequences at the 100%, 90% and 50% identity levels; no annotations; linked to Knowledgebase and UniParc records.
– UniProt Archive (UniParc): history of sequences; no annotation; linked to source records.

UNIPROT SEQUENCE DATABASES

UniProt Archive (UniParc)
• Stable, comprehensive, non-redundant collection of all protein sequences ever published
• Merged from PIR, SwissProt, TREMBL, DDBJ/EMBL/GenBank proteins and proteomes, PDB, International Protein Index, RefSeq translations and other organism proteomes not yet in DDBJ/EMBL/GenBank

UniProt Reference (UniRef)
Three non-redundant collections based on sequence similarity clusters
• UniRef100 merges all identical sequences and identical overlapping subsequences into one entry
• UniRef90 merges all protein sequence clusters with at least 90% sequence identity into a single entry
• UniRef50 merges all protein sequence clusters with at least 50% sequence identity into a single entry

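The three identity levels can be illustrated with a toy clustering sketch. This is not the procedure UniProt uses to build UniRef; it assumes the sequences are already aligned to the same length, and the names `percent_identity`, `greedy_cluster` and the `toy` sequences are made up for illustration.

```python
def percent_identity(a: str, b: str) -> float:
    """Percent of identical positions between two equal-length, pre-aligned sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    if not a:
        return 0.0
    # Gap characters never count as a match.
    matches = sum(1 for x, y in zip(a, b) if x == y and x != "-")
    return 100.0 * matches / len(a)


def greedy_cluster(seqs: dict, threshold: float) -> dict:
    """Put each sequence into the first cluster whose representative it matches
    at >= threshold percent identity; otherwise start a new cluster."""
    clusters = {}
    for name, seq in seqs.items():
        for rep in clusters:
            if percent_identity(seqs[rep], seq) >= threshold:
                clusters[rep].append(name)
                break
        else:
            clusters[name] = [name]
    return clusters


# Hypothetical toy sequences, not real database entries.
toy = {
    "seqA": "MKTAYIAKQR",
    "seqB": "MKTAYIAKQR",  # identical to seqA
    "seqC": "MKTAYIAKQK",  # 9 of 10 positions match seqA
    "seqD": "MKTAYLLLLL",  # 5 of 10 positions match seqA
    "seqE": "GGGGGGGGGG",  # unrelated
}

for level in (100, 90, 50):
    print(level, greedy_cluster(toy, level))
```

Lowering the threshold from 100 to 90 to 50 folds more sequences into the cluster represented by `seqA`, which mirrors how UniRef50 entries are broader than UniRef90 entries, which in turn are broader than UniRef100 entries.
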
UniProt Sequence Databases (cont.)
• UniProt Knowledgebase (UniProtKB)
– UniProt/SwissProt
  • Manually curated, highly annotated sequences from SwissProt & PIRSF, including descriptions, taxonomy, citations, GO terms, motifs, functional and structural classifications, and residue-specific annotations including variations.
  • Some automatic rule-based annotations, including InterPro domains and motifs, PROSITE, PRINTS, Prodom, SMART, PFAM, PIRSF, Superfamily and TIGRFAMS classifications.
– UniProt/TREMBL
  • Automatically translated from genomes, including predicted as well as RefSeq genes.
  • Automated rule-based annotations.

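In FASTA files downloaded from UniProtKB, the header line starts with `sp` for Swiss-Prot entries and `tr` for TrEMBL entries. A minimal sketch of telling the two apart is shown here; the function name `parse_uniprot_header` is an assumption for illustration, and the TrEMBL header in the example is a made-up placeholder, not a real entry.

```python
import re

# >db|ACCESSION|ENTRY_NAME free-text description (with OS=, OX=, ... tags)
HEADER_RE = re.compile(
    r"^>(?P<db>sp|tr)\|(?P<accession>[^|]+)\|(?P<entry_name>\S+)\s+(?P<description>.*)$"
)


def parse_uniprot_header(line: str) -> dict:
    """Split a UniProtKB FASTA header into its main fields."""
    m = HEADER_RE.match(line.strip())
    if m is None:
        raise ValueError("not a UniProtKB-style FASTA header: " + line)
    fields = m.groupdict()
    # 'sp' marks manually curated entries, 'tr' automatically annotated ones.
    fields["section"] = (
        "Swiss-Prot (reviewed)" if fields["db"] == "sp" else "TrEMBL (unreviewed)"
    )
    return fields


reviewed = parse_uniprot_header(
    ">sp|P69905|HBA_HUMAN Hemoglobin subunit alpha OS=Homo sapiens OX=9606 GN=HBA1 PE=1 SV=2"
)
# Hypothetical TrEMBL-style header, used only to exercise the 'tr' branch.
unreviewed = parse_uniprot_header(
    ">tr|X0X0X0|X0X0X0_EXAMPLE Uncharacterized protein OS=Example organism OX=0 PE=4 SV=1"
)

print(reviewed["accession"], reviewed["section"])
print(unreviewed["accession"], unreviewed["section"])
```

The description field is left unparsed, so tags such as `OS=` and `GN=` stay inside it.
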
PROTEIN INFORMATION RESOURCE

• PIR was established in 1984 by the National Biomedical Research Foundation (NBRF) as a resource to assist researchers in the identification and interpretation of protein sequence information.
• The Protein Information Resource (PIR) is an integrated public bioinformatics resource to support genomic, proteomic and systems biology research and scientific studies.

PFAM

• Pfam is a database of curated protein families, each of which is defined by two alignments and a profile hidden Markov model (HMM).
• In Pfam, the profile HMM is searched against a large sequence collection, based on UniProt Knowledgebase (UniProtKB), to find all instances of the family.

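The idea of building a model from an alignment and then scanning sequences with it can be sketched in a much simpler form. The code below is not a profile HMM and is not how Pfam is searched: it builds a position-specific scoring matrix from an ungapped seed alignment (match positions only, no insert or delete states) and slides it along a sequence. The seed alignment, the target sequence and the function names are all hypothetical.

```python
import math
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def build_profile(alignment, pseudocount=1.0):
    """Per-column log2-odds scores against a uniform background."""
    length = len(alignment[0])
    if any(len(row) != length for row in alignment):
        raise ValueError("seed alignment rows must have equal length")
    background = 1.0 / len(AMINO_ACIDS)
    total = len(alignment) + pseudocount * len(AMINO_ACIDS)
    profile = []
    for col in range(length):
        counts = Counter(row[col] for row in alignment)
        profile.append({
            aa: math.log2(((counts[aa] + pseudocount) / total) / background)
            for aa in AMINO_ACIDS
        })
    return profile


def best_hit(profile, sequence):
    """Slide the profile along the sequence; return (best_score, start_index)."""
    width = len(profile)
    best_score, best_start = float("-inf"), -1
    for start in range(len(sequence) - width + 1):
        window = sequence[start:start + width]
        # Residues outside the 20 standard letters score 0 (neutral).
        score = sum(col.get(aa, 0.0) for col, aa in zip(profile, window))
        if score > best_score:
            best_score, best_start = score, start
    return best_score, best_start


# Hypothetical seed alignment and target sequence.
seed = ["HCGKC", "HCGRC", "HCAKC", "YCGKC"]
profile = build_profile(seed)
score, start = best_hit(profile, "MKTLLHCGKCAAA")
print(round(score, 2), start)
```

A real profile HMM also models insertions and deletions relative to the alignment, which is what lets it find family members whose lengths differ from the seed sequences.
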
PROSITE DATABASE

PROSITE is a database of protein families and domains. It is based on the observation that, while there is a huge number of different proteins, most of them can be grouped, on the basis of similarities in their sequences, into a limited number of families.

Proteins or protein domains belonging to a particular family generally share functional attributes and are derived from a common ancestor.