Databases

NCBI houses several biomedical databases including GenBank for DNA sequences and PubMed. It is directed by David Lipman and located in Bethesda, Maryland. EMBL maintains the nucleotide sequence database in collaboration with DDBJ and GenBank. Entrez is NCBI's retrieval system that integrates data from various databases through cross-referencing.

Uploaded by

Nandni Jha

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

26 views

Databases

Uploaded by

Nandni Jha

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 28

NCBI

• NCBI stands for National Centre for

Biotechnology Information.
• It is part of the United States National Library of
Medicine (NLM), a branch of the National
Institutes of Health.
• The NCBI is located in Bethesda, Maryland was
founded in 1988.
• The NCBI houses a series of databases relevant
to biotechnology and biomedicine.
• Major databases include Genebank for DNA
sequences and PubMed, a bibliographic
database for the biomedical literature.
• Other databases include the NCBI
Epigenomics database. All these databases are
available online through the Entrez search
engine.
• NCBI is directed by David Lipman, one of the
original authors of the BLAST sequence
alignment program.
EMBL
• The EMBL Nucleotide Sequence Database
(http://www.ebi.ac.uk/embl/), maintained at the European
Bioinformatics Institute (EBI), incorporates, organizes and
distributes nucleotide sequences from public sources.
• The database is a part of an international collaboration with DDBJ
(Japan) and GenBank (USA).
• Data are exchanged between the collaborating databases on a daily
basis.
• The web-based tool, Webin, is the preferred system for individual
submission of nucleotide sequences, including Annotation and
alignment data.
• Automatic submission procedures are used for submission of data
from large-scale genome sequencing centers and from the
European Patent Office.
• The latest data collection can be accessed via FTP, email
and WWW interfaces.
• The EBI's Sequence Retrieval System (SRS) integrates and
links the main nucleotide and protein databases as well as
many other specialist molecular biology databases.
• For sequence similarity searching, a variety of tools (e.g.
FASTA and BLAST) are available that allow external users
to compare their own sequences against the data in the
EMBL Nucleotide Sequence Database, the complete
genomic component subsection of the database, the WGS
data sets and other databases.
• All available resources can be accessed via the EBI home
page at http://www.ebi.ac.uk.
Home page of EMBL-ENA database
Result
Text file format
Fasta sequence
Home page of EMBL-EBI database
Total hits
Results
DDBJ
• DNA Data Bank of Japan is a biological database that collects DNA
sequences.
• It is located at National Institute of Genetics (NIG) in the Shizuoka
prefecture of Japan.
• It is also member of the International Nucleotide Sequence
Database Collaboration or INSDC.
• It exchanges its data with European Molecular Biology Laboratory
at European Bioinformatics Institute and with Genbank at the
National Center for Biotechnology Information on a daily basis.
• Thus these three databanks contains the same data at any given
time.
• DDBJ began data bank activities in 1986 at NIG and remains the
only nucleotide sequence data bank in Asia.
• Although DDBJ mainly receives its data from Japanese
researchers, it can accept data from contributors from any other
country.
• DDBJ is primarily founded by Japanese Ministry of Education,
Culture, Sports, Science and Technology.
• DDBJ has an international advisory committee which consists of
nine members, 3 member each from Europe, US and Japan.
• This committee advises DDBJ about its maintenance,
management and future plans once a year.
• Apart from this DDBJ also has an international collaborative
committee which advices on various technical issues related to
international collaboration and consists of working level
participants.
Home page of DDBJ
Search and analysis
Flat file of DDBJ
Nucleotide Fasta sequence
Amino acid fasta sequence
Results of ARSA
Entrez
• The NCBI developed and maintains Entrez, a biological
database retrieval system.
• It is a gateway that allows text-based searches for a
wide variety of data, including annotated genetic
sequence information, structural information, as well as
citations and abstracts, full papers, and taxonomic data.
• The key feature of Entrez is its ability to integrate
information, which comes from cross-referencing
between NCBI databases based on pre-existing and
logical relationships between individual entries.
• This is highly convenient: users do not have to
visit multiple databases located in disparate
places.
• For example, in a nucleotide sequence page,
one may find cross-referencing links to the
translated protein sequence, genome mapping
data, or to the related PubMed literature
information, and to protein structures if
available.
• Effective use of Entrez requires an understanding of the main
features of the search engine.
• There are several options common to all NCBI databases that
help to narrow the search.
• One option is “Limits,” which helps to restrict the search to a
subset of a particular database.
• It can also be set to restrict a search to a particular database
(e.g., the field for author or publication date) or a particular
type of data (e.g., chloroplast DNA/RNA).
• The search can also be limited to a particular search field (e.g.,
gene name or accession number).
• The “History” option provides a record of the previous searches
so that the user can review, revise, or combine the results of
earlier searches.
• One of the databases accessible from Entrez is a
biomedical literature database known as PubMed,
which contains abstracts and in some cases the full text
articles from nearly 4,000 journals.
• An important feature of PubMed is the retrieval of
information based on medical subject headings (MeSH)
terms.
• The MeSH system consists of a collection of more than
20,000 controlled and standardized vocabulary terms
used for indexing articles.
• In other words, it is a thesaurus that helps convert
search keywords into standardized terms to describe a
concept.
• By doing so, it allows “smart” searches in
which a group of accepted synonyms are
employed so that the user not only gets exact
matches, but also related matches on the
same topic that otherwise might have been
missed.
• Another way to broaden the retrieval is by
using the “Related Articles” option.
• For a complex search, a user can use the Boolean operators or a
combination of Limits and Preview/Index features to conduct
complex searches.
• Alternatively, field tags can be used to improve the efficiency of
obtaining the search results.
• The tags are identifiers for each field and are placed in brackets.
For example, [AU] limits the search for author name, and [JID] for
journal name.
• PubMed uses a list of tags for literature searches. The search
terms can be specified by the tags which are joined by Boolean
operators.
• Another unique database accessible from Entrez is Online
Mendelian Inheritance in Man(OMIM),which is a non-sequence-
based database of human disease genes and human genetic
disorders.
• Each entry in OMIM contains summary information about a
particular disease as well as genes related to the disease. The text
contains numerous hyperlinks to literature citations, primary
sequence records, as well as chromosome loci of the disease genes.
• The database can serve as an excellent starting point to study genes
related to a disease.
• NCBI also maintains a taxonomy database that contains the names
and taxonomic positions of over 100,000 organisms with at least
one nucleotide or protein sequence represented in the GenBank
database.
• The taxonomy database has a hierarchical classification scheme. The
root level is Archaea, Eubacteria, and Eukaryota.
• The database allows the taxonomic tree for a particular organism to
be displayed. The tree is based on molecular phylogenetic data,
namely, the small ribosomal RNA data.
SRS
• Sequence retrieval system (SRS;available at
http://srs6.ebi.ac.uk/) is a retrieval system maintained
by the EBI, which is comparable to NCBI Entrez.
• It is not as integrated as Entrez, but allows the user to
query multiple databases simultaneously, another
good example of database integration.
• It also offers direct access to certain sequence analysis
applications such as sequence similarity searching and
Clustal sequence alignment.
• Queries can be launched using “Quick Text Search”
with only one query box in which to enter information.
• There are also more elaborate submission forms, the
“Standard Query Form” and the “Extended Query
Form.”
• The standard form allows four criteria (fields) to be
used, which are linked by Boolean operators.
• The extended form allows many more diversified
criteria and fields to be used.
• The search results contain the query sequence and
sequence annotation as well as links to literature,
metabolic pathways, and other biological databases.

The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
From Everand
The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
Mark Manson
4/5 (6407)
Principles: Life and Work
From Everand
Principles: Life and Work
Ray Dalio
4/5 (640)
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
From Everand
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
Brené Brown
4/5 (1173)
Never Split the Difference: Negotiating As If Your Life Depended On It
From Everand
Never Split the Difference: Negotiating As If Your Life Depended On It
Chris Voss
4.5/5 (990)
The Glass Castle: A Memoir
From Everand
The Glass Castle: A Memoir
Jeannette Walls
4.5/5 (1849)
Grit: The Power of Passion and Perseverance
From Everand
Grit: The Power of Passion and Perseverance
Angela Duckworth
4/5 (650)
Sing, Unburied, Sing: A Novel
From Everand
Sing, Unburied, Sing: A Novel
Jesmyn Ward
4/5 (1267)
The Perks of Being a Wallflower
From Everand
The Perks of Being a Wallflower
Stephen Chbosky
4.5/5 (4101)
Her Body and Other Parties: Stories
From Everand
Her Body and Other Parties: Stories
Carmen Maria Machado
4/5 (887)
Shoe Dog: A Memoir by the Creator of Nike
From Everand
Shoe Dog: A Memoir by the Creator of Nike
Phil Knight
4.5/5 (627)
The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
From Everand
The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
Ben Horowitz
4.5/5 (361)
Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
From Everand
Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
Margot Lee Shetterly
4/5 (1015)
Steve Jobs
From Everand
Steve Jobs
Walter Isaacson
4.5/5 (1138)
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
From Everand
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
Ashlee Vance
4.5/5 (581)
The Emperor of All Maladies: A Biography of Cancer
From Everand
The Emperor of All Maladies: A Biography of Cancer
Siddhartha Mukherjee
4.5/5 (297)
A Man Called Ove: A Novel
From Everand
A Man Called Ove: A Novel
Fredrik Backman
4.5/5 (5142)
Angela's Ashes: A Memoir
From Everand
Angela's Ashes: A Memoir
Frank McCourt
4.5/5 (943)
The Art of Racing in the Rain: A Novel
From Everand
The Art of Racing in the Rain: A Novel
Garth Stein
4/5 (4355)
The Yellow House: A Memoir (2019 National Book Award Winner)
From Everand
The Yellow House: A Memoir (2019 National Book Award Winner)
Sarah M. Broom
4/5 (100)
The Little Book of Hygge: Danish Secrets to Happy Living
From Everand
The Little Book of Hygge: Danish Secrets to Happy Living
Meik Wiking
3.5/5 (460)
Brooklyn: A Novel
From Everand
Brooklyn: A Novel
Colm Tóibín
3.5/5 (2126)
Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
From Everand
Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
Gilbert King
4.5/5 (278)
Yes Please
From Everand
Yes Please
Amy Poehler
4/5 (2001)
The World Is Flat 3.0: A Brief History of the Twenty-first Century
From Everand
The World Is Flat 3.0: A Brief History of the Twenty-first Century
Thomas L. Friedman
3.5/5 (2283)
Bad Feminist: Essays
From Everand
Bad Feminist: Essays
Roxane Gay
4/5 (1087)
The Woman in Cabin 10
From Everand
The Woman in Cabin 10
Ruth Ware
3.5/5 (2785)
A Tree Grows in Brooklyn
From Everand
A Tree Grows in Brooklyn
Betty Smith
4.5/5 (2032)
The Outsider: A Novel
From Everand
The Outsider: A Novel
Stephen King
4/5 (2876)
The Sympathizer: A Novel (Pulitzer Prize for Fiction)
From Everand
The Sympathizer: A Novel (Pulitzer Prize for Fiction)
Viet Thanh Nguyen
4.5/5 (141)
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
From Everand
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
Dave Eggers
3.5/5 (233)
Team of Rivals: The Political Genius of Abraham Lincoln
From Everand
Team of Rivals: The Political Genius of Abraham Lincoln
Doris Kearns Goodwin
4.5/5 (244)
Wolf Hall: A Novel
From Everand
Wolf Hall: A Novel
Hilary Mantel
4/5 (4087)
On Fire: The (Burning) Case for a Green New Deal
From Everand
On Fire: The (Burning) Case for a Green New Deal
Naomi Klein
4/5 (78)
Fear: Trump in the White House
From Everand
Fear: Trump in the White House
Bob Woodward
3.5/5 (835)
Manhattan Beach: A Novel
From Everand
Manhattan Beach: A Novel
Jennifer Egan
3.5/5 (918)
Q1 a Write a program to construct a dot plot for the alignment of human and chicken+haemoglobin β chain. Identify the segments, which are same in both sequences
No ratings yet
Q1 a Write a program to construct a dot plot for the alignment of human and chicken+haemoglobin β chain. Identify the segments, which are same in both sequences
5 pages
Rise of ISIS: A Threat We Can't Ignore
From Everand
Rise of ISIS: A Threat We Can't Ignore
Jay Sekulow
3.5/5 (144)
John Adams
From Everand
John Adams
David McCullough
4.5/5 (2546)
The Light Between Oceans: A Novel
From Everand
The Light Between Oceans: A Novel
M.L. Stedman
4.5/5 (814)
The Unwinding: An Inner History of the New America
From Everand
The Unwinding: An Inner History of the New America
George Packer
4/5 (45)
Little Women
From Everand
Little Women
Louisa May Alcott
4.5/5 (2369)
The Constant Gardener: A Novel
From Everand
The Constant Gardener: A Novel
John le Carré
4/5 (277)
Biotechnology: Presented by M Qasim
No ratings yet
Biotechnology: Presented by M Qasim
15 pages
Microbial Genome Sequencing Projects
No ratings yet
Microbial Genome Sequencing Projects
23 pages
Presentation A - Using Restriction Enzymes
No ratings yet
Presentation A - Using Restriction Enzymes
12 pages
Product Note Curio Seeker 3x3 - 10x10 1
No ratings yet
Product Note Curio Seeker 3x3 - 10x10 1
4 pages
Multiplex PCR Improves Efficiency For Determining CRB Haplotype and Presence of NUDIVIRUS Multiplex Poster
No ratings yet
Multiplex PCR Improves Efficiency For Determining CRB Haplotype and Presence of NUDIVIRUS Multiplex Poster
1 page
Mzo-005 Genomics and Proteomics
No ratings yet
Mzo-005 Genomics and Proteomics
3 pages
Lecture 3-Molecular analysis
No ratings yet
Lecture 3-Molecular analysis
41 pages
2100292014DNA Replication in Prokaryotes
No ratings yet
2100292014DNA Replication in Prokaryotes
31 pages
Introduction to Applied Biology-3!5!231111_203803
No ratings yet
Introduction to Applied Biology-3!5!231111_203803
3 pages
Chemostat
No ratings yet
Chemostat
39 pages
M.tech. Biological Engineering Curriculum IIT
No ratings yet
M.tech. Biological Engineering Curriculum IIT
3 pages
Bioinformatics Exercises Print
No ratings yet
Bioinformatics Exercises Print
6 pages
DNA Sequencing
No ratings yet
DNA Sequencing
45 pages
ASU BIO 340 Exam 3 Questions With Complete Solutions
No ratings yet
ASU BIO 340 Exam 3 Questions With Complete Solutions
6 pages
Larone S Medically Important Fungi - 2018 - Walsh - Selected Websites
No ratings yet
Larone S Medically Important Fungi - 2018 - Walsh - Selected Websites
3 pages
MCQ Bio
No ratings yet
MCQ Bio
6 pages
Course Coordinator & Lecturer Qihui Jiang (Vivi Kasim), PHD: Academic Backgrounds
No ratings yet
Course Coordinator & Lecturer Qihui Jiang (Vivi Kasim), PHD: Academic Backgrounds
11 pages
Mls 522 Assignment
No ratings yet
Mls 522 Assignment
3 pages
SBT1043 Biotechnology Concepts and Techniques Test 1
No ratings yet
SBT1043 Biotechnology Concepts and Techniques Test 1
7 pages
Genetics Diagrams
No ratings yet
Genetics Diagrams
3 pages
Bif 401 PPT 1to 80 by M.habib
No ratings yet
Bif 401 PPT 1to 80 by M.habib
588 pages
Lec_Introduction
No ratings yet
Lec_Introduction
35 pages
Data Retrieval
67% (3)
Data Retrieval
17 pages
Cse Q
No ratings yet
Cse Q
8 pages
Lab 5 - 3D Structure Modelling
No ratings yet
Lab 5 - 3D Structure Modelling
21 pages
Bioinformatics A. Multiple Choice
No ratings yet
Bioinformatics A. Multiple Choice
3 pages
Lab Report
No ratings yet
Lab Report
2 pages
Unsoed Usft Lecture
No ratings yet
Unsoed Usft Lecture
29 pages
BT Practical Spotter
No ratings yet
BT Practical Spotter
2 pages

Databases

Uploaded by

Databases

Uploaded by

NCBI

• NCBI stands for National Centre for

You might also like