Introduction To Bioinformatics: High-Throughput Biological Data and Evolution

This document provides an introduction to high-throughput biological data and bioinformatics algorithms. It discusses how the amount of biological data from sources like genome sequencing, gene expression data, and protein structure data is growing exponentially. This data deluge creates both opportunities and challenges for data analysis and knowledge discovery using bioinformatics algorithms. Common algorithms discussed include clustering, dynamic programming, and machine learning approaches. The document also covers topics like protein structure, protein folding, evolution, and algorithms for tasks like sequence analysis and gene finding.

Centre for Integrative Bioinformatics VU

Introduction to bioinformatics
Lecture 3

High-throughput Biological Data
-data deluge, bioinformatics algorithms-
and evolution
Last lecture:
• Many different genomics datasets:
– Genome sequencing: more than 300 species completely
sequenced and data in public domain (i.e. information is
freely available), virus genome can be sequenced in a day
– Gene expression (microarray) data: many microarrays
measured per day
– Proteomics: Protein Data Bank (PDB) - as of Tuesday,
February 7, 2006, there are 35026 structures.
http://www.rcsb.org/pdb/
– Protein-protein interaction data: many databases worldwide
– Metabolic pathway, regulation and signaling data, many
databases worldwide
Growth in number of protein
tertiary structures
The data deluge
Although a lot of tertiary structural data is being
produced (preceding slide), there is the

SEQUENCE-STRUCTURE-FUNCTION GAP

The gap between sequence data on the one hand, and
structure or function data on the other, is widening
rapidly: sequence data grows much faster
High-throughput Biological Data
The data deluge
• Hidden in all these data classes is
information that reflects
– existence, organization, activity,
functionality …… of biological machineries
at different levels in living organisms

Utilising and analysing this information
computationally, as effectively as possible,
is essential for Bioinformatics
Data issues: from data to
distributed knowledge
• Data collection: getting the data
• Data representation: data standards, data normalisation …..
• Data organisation and storage: database issues …..
• Data analysis and data mining: discovering “knowledge”,
patterns/signals, from data, establishing associations among
data patterns
• Data utilisation and application: from data patterns/signals to
models for bio-machineries
• Data visualization: viewing complex data ……
• Data transmission: data collection, retrieval, …..
• ……
Bio-Data Analysis and Data Mining
• Analysis and mining tools exist and are developed for:
– DNA sequence assembly
– Genetic map construction
– Sequence comparison and database searching
– Gene finding
– Gene expression data analysis
– Phylogenetic tree analysis, e.g. to infer horizontally-
transferred genes
– Mass spectrometry data analysis for protein complex
characterization
– ……
Bio-Data Analysis and Data Mining
• As the amount and types of data and their cross
connections increase rapidly
• the number of analysis tools needed will go up
“exponentially” if we do not reuse techniques
– blast, blastp, blastx, blastn, … from BLAST family
of tools (we will cover BLAST later)
– gene finding tools for human, mouse, fly, rice,
cyanobacteria, …..
– tools for finding various signals in genomic
sequences, protein-binding sites, splice junction
sites, translation start sites, …..
Bio-Data Analysis and Data Mining
Many of these data analysis problems are
fundamentally the same problem(s) and can
be solved using the same set of tools
e.g.
•clustering or
•optimal segmentation by Dynamic
Programming

We will cover both of these techniques in later lectures

Bio-data Analysis, Data
Mining and Integrative
Bioinformatics
To have analysis capabilities covering a wide
range of problems, we need to discover the
common fundamental structures of these
problems;
HOWEVER in biology one size does NOT fit all…

An important goal of bioinformatics is the
development of a data analysis
infrastructure in support of Genomics and
beyond
Protein structure hierarchical levels
PRIMARY STRUCTURE (amino acid sequence):

VHLTPEEKSAVTALWGKVNVD
EVGGEALGRLLVVYPWTQRFF
ESFGDLSTPDAVMGNPKVKAH
GKKVLGAFSDGLAHLDNLKGTF
ATLSELHCDKLHVDPENFRLLG
NVLVCVLAHHFGKEFTPPVQAA
YQKVVAGVANALAHKYH

SECONDARY STRUCTURE (helices, strands)
TERTIARY STRUCTURE (fold)
QUATERNARY STRUCTURE (oligomers)

Figure: Protein complexes for photosynthesis in plants
Protein folding problem
Each protein sequence “knows” how to fold into its tertiary
structure. We still do not understand exactly how and why.

PRIMARY STRUCTURE (amino acid sequence):

VHLTPEEKSAVTALWGKVNVD
EVGGEALGRLLVVYPWTQRFF
ESFGDLSTPDAVMGNPKVKAH
GKKVLGAFSDGLAHLDNLKGTF
ATLSELHCDKLHVDPENFRLLG
NVLVCVLAHHFGKEFTPPVQAA
YQKVVAGVANALAHKYH

1-step process: PRIMARY STRUCTURE -> TERTIARY STRUCTURE (fold)
2-step process: PRIMARY STRUCTURE -> SECONDARY STRUCTURE (helices, strands) -> TERTIARY STRUCTURE (fold)

The 1-step process is based on a hydrophobic collapse; the 2-step
process, more common in forming larger proteins, is called the
framework model of folding
Protein folding: a step on the way
is secondary structure prediction
• Long history -- first widely used algorithm
was by Chou and Fasman (1974)
• Different algorithms have been developed over
the years to crack the problem:
– Statistical approaches
– Neural networks (first from speech recognition)
– K-nearest neighbour algorithms
– Support Vector machines
Algorithms in bioinformatics
(recap)
• Sometimes the same basic algorithm can be
re-used for different problems (1-method-
multiple-problem)
• Normally, biological problems are
approached by different researchers using a
variety of methods (1-problem-multiple-
method)
Algorithms in bioinformatics
• string algorithms
• dynamic programming
• machine learning (Neural Networks, k-Nearest Neighbour, Support
Vector Machines, Genetic Algorithm, ..)
• Markov chain models, hidden Markov models, Markov Chain Monte
Carlo (MCMC) algorithms
• molecular mechanics, e.g. molecular dynamics, Monte Carlo,
simplified force fields
• stochastic context free grammars
• EM algorithms
• Gibbs sampling
• clustering
• tree algorithms
• text analysis
• hybrid/combinatorial techniques and more…
Sequence analysis and homology searching
Finding genes and regulatory elements

There are many different regulation signals such as start, stop and skip
messages hidden in the genome for each gene, but what and where are they?
Expression data
Functional genomics

• Monte Carlo
Protein translation
What is life?
• NASA astrobiology program:
“Life is a self-sustained chemical system
capable of undergoing Darwinian
evolution”
Evolution
Four requirements:
• Template structure providing stability (DNA)
• Copying mechanism (DNA replication, with transmission to offspring via meiosis)
• Mechanism providing variation (mutations;
insertions and deletions; crossing-over; etc.)
• Selection: some traits lead to greater fitness of one
individual relative to another. Darwin adopted the phrase
“survival of the fittest” (coined by Herbert Spencer)

Evolution is a conservative process: the vast majority of mutations
will not be selected (i.e. will not make it as they lead to worse
performance or are even lethal); this is called negative (or
purifying) selection
Orthology/paralogy

Orthologous genes are homologous
(corresponding) genes in different
species
Paralogous genes are homologous genes
within the same species (genome)
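The two definitions above can be written as a small rule. This is only a sketch under the slide's simplified definitions: the `Gene` record, its `family` label (standing in for "known to be homologous") and the example gene entries are assumptions made for illustration. Stricter definitions separate the two cases by whether the genes diverged through a speciation event or a duplication event, which this sketch does not attempt.

```python
from typing import NamedTuple


class Gene(NamedTuple):
    name: str
    species: str
    family: str  # hypothetical label: genes sharing it are treated as homologous


def relationship(a: Gene, b: Gene) -> str:
    """Classify a gene pair using the simplified slide definitions."""
    if a.family != b.family:
        return "not homologous"
    # Homologous genes: same species -> paralogs, different species -> orthologs.
    return "paralogs" if a.species == b.species else "orthologs"


# Illustrative records (the family label is an assumption of this sketch).
human_hbb = Gene("HBB", "Homo sapiens", "beta-globin-like")
human_hbd = Gene("HBD", "Homo sapiens", "beta-globin-like")
mouse_hbb = Gene("Hbb", "Mus musculus", "beta-globin-like")

print(relationship(human_hbb, mouse_hbb))
print(relationship(human_hbb, human_hbd))
```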
Changing molecular sequences
• Mutations: changing nucleotides (‘letters’)
within DNA, also called ‘point mutations’
• A & G: purines, C & T/U: pyrimidines:
– Transition: purine -> purine or pyrimidine ->
pyrimidine
– Transversion: purine -> pyrimidine or
pyrimidine -> purine
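The transition/transversion rule can be sketched in a few lines. The function name `classify_substitution` is hypothetical; U is treated as equivalent to T so that DNA and RNA letters can be mixed.

```python
PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def classify_substitution(ref: str, alt: str) -> str:
    """Return 'transition', 'transversion' or 'identical' for two nucleotides."""
    # Normalise case and map RNA uracil onto thymine.
    ref = ref.upper().replace("U", "T")
    alt = alt.upper().replace("U", "T")
    valid = PURINES | PYRIMIDINES
    if ref not in valid or alt not in valid:
        raise ValueError(f"not a nucleotide: {ref!r} or {alt!r}")
    if ref == alt:
        return "identical"
    # Same chemical class on both sides means a transition.
    same_class = (ref in PURINES) == (alt in PURINES)
    return "transition" if same_class else "transversion"


print(classify_substitution("A", "G"))  # purine -> purine
print(classify_substitution("A", "C"))  # purine -> pyrimidine
```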
Types of point mutation
• Synonymous mutation: mutation that does
not lead to an amino acid change (where in
the codon are these expected?)
• Non-synonymous mutation: does lead to
an amino acid change
– Missense mutation: one amino acid (a.a.) replaced
by another a.a.
– Nonsense mutation: a.a. replaced by stop
codon (what happens with protein?)
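The categories above can be checked mechanically against the standard genetic code. The sketch below builds the standard codon table and classifies a single-base change within one codon; the function names are hypothetical, and the "stop-loss" label for a stop codon turning into an amino acid is an addition of this sketch, not a category from the slide. The tally function gives one way to explore the question of where in the codon synonymous changes occur.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... ('*' = stop).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def classify_point_mutation(codon: str, position: int, new_base: str) -> str:
    """Classify replacing codon[position] (0, 1 or 2) with new_base."""
    codon = codon.upper().replace("U", "T")
    new_base = new_base.upper().replace("U", "T")
    if len(codon) != 3 or position not in (0, 1, 2):
        raise ValueError("need a 3-letter codon and a position in 0..2")
    mutated = codon[:position] + new_base + codon[position + 1:]
    if mutated == codon:
        raise ValueError("new base equals the original base")
    before, after = CODON_TABLE[codon], CODON_TABLE[mutated]
    if before == after:
        return "synonymous"
    if after == "*":
        return "nonsense"
    if before == "*":
        return "stop-loss"  # outside the slide's categories
    return "missense"


def synonymous_counts_by_position() -> list:
    """Count synonymous single-base changes of sense codons per codon position."""
    counts = [0, 0, 0]
    for codon, aa in CODON_TABLE.items():
        if aa == "*":
            continue
        for pos in range(3):
            for base in BASES:
                if base != codon[pos] and \
                        classify_point_mutation(codon, pos, base) == "synonymous":
                    counts[pos] += 1
    return counts


print(classify_point_mutation("GAA", 2, "G"))  # GAA (Glu) -> GAG (Glu)
print(classify_point_mutation("GAA", 0, "T"))  # GAA (Glu) -> TAA (stop)
print(classify_point_mutation("GAG", 1, "T"))  # GAG (Glu) -> GTG (Val)
print(synonymous_counts_by_position())
```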
Ka/Ks Ratios
• Ks is defined as the number of synonymous
nucleotide substitutions per synonymous site
• Ka is defined as the number of nonsynonymous
nucleotide substitutions per nonsynonymous site
• The Ka/Ks ratio is used to estimate the type of
selection exerted on a given gene or DNA
fragment
• Aligned orthologous sequences are needed to
calculate Ka/Ks ratios (we will talk about
alignment later).
Ka/Ks ratios

The frequency of different values of Ka/Ks for 835 mouse–rat
orthologous genes. Figures on the x axis represent the middle figure of
each bin; that is, the 0.05 bin collects data from 0 to 0.1
Ka/Ks ratios

Three types of selection:

1. Negative (purifying) selection -> Ka/Ks < 1
2. Neutral evolution (Kimura) -> Ka/Ks ~= 1
3. Positive selection -> Ka/Ks > 1
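A minimal sketch of the ratio and its interpretation, assuming the substitution and site counts have already been obtained from an alignment. The function names and the `tolerance` band used for "~= 1" are assumptions of this sketch, and the raw proportions are used directly, without the correction for multiple substitutions at the same site that practical methods usually apply.

```python
def ka_ks_ratio(nonsyn_subs: float, nonsyn_sites: float,
                syn_subs: float, syn_sites: float) -> float:
    """Ka/Ks from raw counts: substitutions per site for each class."""
    ka = nonsyn_subs / nonsyn_sites  # nonsynonymous substitutions per nonsynonymous site
    ks = syn_subs / syn_sites        # synonymous substitutions per synonymous site
    if ks == 0:
        raise ValueError("Ks is zero; the ratio is undefined")
    return ka / ks


def selection_type(ratio: float, tolerance: float = 0.1) -> str:
    """Map a Ka/Ks ratio to the three cases; `tolerance` is an arbitrary band."""
    if ratio < 1 - tolerance:
        return "negative (purifying) selection"
    if ratio > 1 + tolerance:
        return "positive selection"
    return "neutral"


# Made-up counts for illustration only.
ratio = ka_ks_ratio(nonsyn_subs=5, nonsyn_sites=500, syn_subs=20, syn_sites=200)
print(ratio, selection_type(ratio))
```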
Human Evolution
Divergent Evolution
Ancestral sequence: ABCD

Descendant 1: ACCD (B -> C), mutation
Descendant 2: ABD (C -> ø), deletion

Pairwise Alignment, two candidates:

ACCD        ACCD
AB─D   or   A─BD

The first candidate (ACCD over AB─D) is the true alignment: the gap sits at the position of the deleted C.
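To see why the sequences alone cannot single out the true alignment, the two candidates can be scored with a simple column-wise scheme. This is a sketch: the scoring values (match +1, mismatch -1, gap -2) are arbitrary assumptions, and the ASCII "-" stands in for the gap symbol on the slide.

```python
def score_alignment(top: str, bottom: str,
                    match: int = 1, mismatch: int = -1, gap: int = -2) -> int:
    """Sum column scores of a gapped pairwise alignment ('-' marks a gap)."""
    if len(top) != len(bottom):
        raise ValueError("aligned sequences must have equal length")
    score = 0
    for a, b in zip(top, bottom):
        if a == "-" or b == "-":
            score += gap
        elif a == b:
            score += match
        else:
            score += mismatch
    return score


# Both candidates contain two matches, one mismatch and one gap,
# so any column-wise scheme of this kind scores them equally.
print(score_alignment("ACCD", "AB-D"))
print(score_alignment("ACCD", "A-BD"))
```

Under this kind of scoring the two candidates tie, so knowledge of the evolutionary history is what identifies the true one.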
Consequence of evolution
• Notion of comparative analysis (Darwin)
• What you know about one species might be
transferable to another, for example from
mouse to human
• Provides a framework to do the multi-level
large-scale analysis of the genomics data
plethora
Flavodoxin-cheY Multiple Sequence Alignment
Human Yeast

We need to be able to
do automatic pathway
comparison (pathway
alignment)

This pathway diagram shows a comparison of pathways in (left) Homo sapiens
(human) and (right) Saccharomyces cerevisiae (baker’s yeast). Changes in
controlling enzymes (square boxes in red) and the pathway itself have occurred
(yeast has one altered (‘overtaking’) path in the graph)
The citric-acid cycle

http://en.wikipedia.org/wiki/Krebs_cycle
The citric-acid cycle
Fig. 1. (a) A graphical representation of the reactions of the
citric-acid cycle (CAC), including the connections with
pyruvate and phosphoenolpyruvate, and the glyoxylate
shunt. When there are two enzymes that are not homologous
to each other but that catalyse the same reaction (non-
homologous gene displacement), one is marked with a solid
line and the other with a dashed line. The oxidative direction
is clockwise. The enzymes with their EC numbers are as
follows: 1, citrate synthase (4.1.3.7); 2, aconitase (4.2.1.3);
3, isocitrate dehydrogenase (1.1.1.42); 4, 2-ketoglutarate
dehydrogenase (solid line; 1.2.4.2 and 2.3.1.61) and 2-
ketoglutarate ferredoxin oxidoreductase (dashed line;
1.2.7.3); 5, succinyl- CoA synthetase (solid line; 6.2.1.5) or
succinyl-CoA–acetoacetate-CoA transferase (dashed line;
2.8.3.5); 6, succinate dehydrogenase or fumarate reductase
(1.3.99.1); 7, fumarase (4.2.1.2) class I (dashed line) and
class II (solid line); 8, bacterial-type malate dehydrogenase
(solid line) or archaeal-type malate dehydrogenase (dashed
line) (1.1.1.37); 9, isocitrate lyase (4.1.3.1); 10, malate
synthase (4.1.3.2); 11, phosphoenolpyruvate carboxykinase
(4.1.1.49) or phosphoenolpyruvate carboxylase (4.1.1.32);
12, malic enzyme (1.1.1.40 or 1.1.1.38); 13, pyruvate
carboxylase or oxaloacetate decarboxylase (6.4.1.1); 14,
pyruvate dehydrogenase (solid line; 1.2.4.1 and 2.3.1.12)
and pyruvate ferredoxin oxidoreductase (dashed line;
1.2.7.1).

M. A. Huynen, T. Dandekar and P. Bork ``Variation and evolution of the citric acid cycle: a genomic approach'' Trends Microbiol, 7, 281-29
(1999)
The citric-acid cycle
b) Individual species might not have a
complete CAC. This diagram shows
the genes for the CAC for each
unicellular species for which a
genome sequence has been published,
together with the phylogeny of the
species. The distance-based
phylogeny was constructed using the
fraction of genes shared between
genomes as a similarity criterion (ref. 29 in the cited paper).
The major kingdoms of life are
indicated in red (Archaea), blue
(Bacteria) and yellow (Eukarya).
Question marks represent reactions for
which there is biochemical evidence
in the species itself or in a related
species but for which no genes could
be found. Genes that lie in a single
operon are shown in the same color.
Genes were assumed to be located in a
single operon when they were
transcribed in the same direction and
the stretches of non-coding DNA
separating them were less than 50
nucleotides in length.

M. A. Huynen, T. Dandekar and P. Bork ``Variation and evolution of the citric acid cycle: a genomic approach'' Trends Microbiol, 7, 281-29
(1999)
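The operon assumption in the caption (same transcription direction, less than 50 non-coding nucleotides between neighbours) can be sketched as a grouping rule. The `Gene` record, the 1-based inclusive coordinates and the example genes are assumptions made for illustration, not data from the paper.

```python
from typing import List, NamedTuple


class Gene(NamedTuple):
    name: str
    start: int   # 1-based, inclusive
    end: int     # 1-based, inclusive
    strand: str  # "+" or "-"


def group_operons(genes: List[Gene], max_gap: int = 50) -> List[List[Gene]]:
    """Group neighbouring genes on the same strand separated by < max_gap nt."""
    operons: List[List[Gene]] = []
    for gene in sorted(genes, key=lambda g: g.start):
        if operons:
            prev = operons[-1][-1]
            gap = gene.start - prev.end - 1  # non-coding nucleotides in between
            if gene.strand == prev.strand and gap < max_gap:
                operons[-1].append(gene)
                continue
        operons.append([gene])
    return operons


# Hypothetical gene coordinates.
genes = [
    Gene("geneA", 1, 100, "+"),
    Gene("geneB", 130, 300, "+"),  # 29 nt after geneA, same strand
    Gene("geneC", 400, 500, "+"),  # 99 nt after geneB
    Gene("geneD", 520, 600, "-"),  # close to geneC but opposite strand
]
for operon in group_operons(genes):
    print([g.name for g in operon])
```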
Thinking about evolution
• Is the evolutionary model applicable to other
systems?
– Story telling in old cultures
– Richard Dawkins’ book The Selfish Gene introduces
memes
• The Genetic Algorithm (GA) is arguably the best
computational optimisation strategy around, and is
based entirely on Darwinian evolution
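A minimal genetic algorithm sketch, mapped onto the requirements listed for evolution: a template (a list of bits), copying with variation (crossover and mutation) and selection (the fitter half become parents). The toy problem (maximise the number of 1 bits), all parameter values and the function names are assumptions chosen for illustration; this is one simple variant and makes no claim about how well GAs compare with other optimisers.

```python
import random


def fitness(bits):
    """Toy fitness: the number of 1 bits."""
    return sum(bits)


def evolve(n_bits=20, pop_size=30, generations=40, mutation_rate=0.02, seed=0):
    """Return the best individual and the best fitness seen per generation."""
    rng = random.Random(seed)
    population = [[rng.randint(0, 1) for _ in range(n_bits)]
                  for _ in range(pop_size)]
    history = []
    for _ in range(generations):
        population.sort(key=fitness, reverse=True)
        history.append(fitness(population[0]))
        parents = population[: pop_size // 2]  # selection: keep the fitter half
        children = [population[0][:]]          # elitism: copy the best unchanged
        while len(children) < pop_size:
            mum, dad = rng.sample(parents, 2)
            cut = rng.randrange(1, n_bits)     # single-point crossover
            child = mum[:cut] + dad[cut:]
            # Point mutations: flip each bit with a small probability.
            child = [1 - b if rng.random() < mutation_rate else b for b in child]
            children.append(child)
        population = children
    best = max(population, key=fitness)
    history.append(fitness(best))
    return best, history


best, history = evolve()
print(fitness(best), history[0], history[-1])
```

Because the best individual is always copied into the next generation, the best fitness in this sketch never decreases from one generation to the next.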
