المحاضرة 2
المحاضرة 2
Biological Databases
Biological databases are vast digital repositories that store and organize
a wide range of biological data, including DNA sequences, protein
structures, gene expression profiles, and other molecular information.
These databases play a crucial role in enabling scientific research and
advancing our understanding of life.
Raja Alwami
Types of Biological Databases
Nucleotide Databases Protein Databases Specialized Databases
Store and manage DNA and Contain information on Focus on specific types of
RNA sequence data, such as protein sequences, biological data, such as gene
GenBank, EMBL, and DDBJ. structures, and functions, expression, pathways, and
including UniProt and the disease-related information.
Protein Data Bank (PDB).
Data Curation and Annotation
Data Preprocessing
Cleaning, standardizing, and formatting the data for integration.
Data Integration
Combining and linking the data to create a comprehensive and interoperable
resource.
Visualization and Analysis Tools
Sequence Alignment Structural Viewers Network Visualization
Tools
Render and interact with 3D Depict complex biological
Visualize and compare DNA protein structures to study pathways and interactions as
or protein sequences to their physical properties. interactive networks.
identify similarities and
differences.
Applications of Biological Databases
Drug Discovery Personalized Medicine
Identify potential drug targets and test Analyze patient genomic data to guide
candidate compounds using database tailored treatments and interventions.
information.
•Data storage and retrieval: Efficiently store and retrieve vast amounts of biological data.
•Data curation and annotation: Experts curate and annotate data for accuracy and consistency.
•Data analysis tools: Provide tools for analyzing and visualizing data, such as sequence alignment
or phylogenetic analysis.
•Data integration: Link data from different sources to provide a comprehensive view of biological
systems.
Benefits of Biological Databases:
•Data heterogeneity: Biological data comes from various sources and formats, making
integration challenging.
•Keeping up with data growth: The rapid growth of biological data requires constant
development and expansion of databases.
Despite these challenges, biological databases are essential resources for modern
biological research, enabling scientists to explore the complexities of life and develop new
solutions to global challenges in healthcare, agriculture, and the environment.
The National Center for Biotechnology Information (NCBI) database is a massive, publicly
available collection of biomedical data managed by the National Institutes of Health (NIH). It's a
cornerstone resource for researchers, clinicians, and anyone interested in exploring biological
information.
Key Databases within NCBI:
•GenBank: The gold standard for DNA sequences. Researchers submit genetic
data directly to GenBank, making it a constantly growing repository.
•PubMed: A comprehensive index of biomedical literature. It covers millions of
journal articles, books, and other resources, allowing users to search for specific
topics or authors.
•Protein: This database houses protein sequences from various sources,
including translations of GenBank coding regions and submissions from
researchers.
https://www.ncbi.nlm.nih.gov/
•Structure: Here, you'll find 3D structures of proteins, nucleic acids, and
complex assemblies, predominantly from techniques like X-ray
crystallography and NMR.
1. Research at EMBL:
•Cutting-edge Research: EMBL is renowned for its world-leading research across a wide spectrum
of life sciences, including:
• Genomics and gene regulation
• Cell biology and developmental biology
• Structural biology and biophysics
• Computational biology and bioinformatics
• Systems biology and disease modeling
•International Collaboration: EMBL fosters a highly collaborative environment, with scientists from
over 80 countries working together.
•Interdisciplinary Approach: Research at EMBL often transcends traditional boundaries,
integrating expertise from different fields to address complex biological questions.