Bioinformatics Week 1: Play Video Starting At:4:13 and Follow Transcript4:13

Based on the information provided in the document, the biological process this gene is involved with according to the Gene Ontology terms is "regulation of transcription, DNA-templated".

Uploaded by

SawDust pH Indicator

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

50 views

Bioinformatics Week 1: Play Video Starting At:4:13 and Follow Transcript4:13

Based on the information provided in the document, the biological process this gene is involved with according to the Gene Ontology terms is "regulation of transcription, DNA-templated".

Uploaded by

SawDust pH Indicator

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 7

BIOINFORMATICS WEEK 1

Introduction
Instructor : Nicholas Provart

What is bioinformatics?
Basically, it's the use of computational tools to manage all kinds of biological data.
Here we use computers for storage, retrieval, to manipulate and to distribute information
related to biological molecules, such as DNA, RNA, proteins, and metabolites.
Here, we're generally talking about sequence information, structural information,
functional analysis of genes and genomes, and their corresponding products such as
transcripts, so gene expression levels.
It's sometimes called computational molecular biology.
This field has really developed in the past 10 years due to the efforts of genome
sequencing projects, such as the human genome sequencing project which you may have
heard of ;-)
How do we deal with
three billion pieces of sequence information?
So why do we need bioinformatics?
Well, if you can imagine
three billion letters in the human genome,
three billion nucleotides, how do you really
make sense of that without using computers?
So this is just a small section of
a human genome encompassing the human globin gene.
We would like to know about
which parts of the genome are important,
that code for proteins for instance.
Without using computers, we would
never know that this region here or
this region here actually
comprise an exon of the globin gene.
That is a piece of the gene that
actually codes for protein.
The other thing that bioinformatics is
about is biological databases,
how we can store these biological data.
We'll talk a little bit about what a database is,
data structures, flat file
databases versus relational databases.
We'll talk also about accession numbers and identifiers,
and we'll go over the GenBank flat file format,
and we'll just touch briefly on a practical example of
utility using NCBI's Entrez / GQuery / Search.

Play video starting at :4:13 and follow transcript4:13

BIOINFORMATICS WEEK 1

So if we look at the planet Earth from afar,

we see a lot of green means that life's present,
and this background that I've placed here
is actually an output
from a next generation sequencing machine,
you see small spots, clusters in an Illumina flow cell,
and it's possible now to generate
a universe of information about the organisms
that are on our planet.
We might be interested in genome and genomic sequences,
gene sequences and mutations,
gene regulation,
where a given gene is expressed and when.
That can tell us about the function of the gene,
what happens when introns aren't spliced
properly, or when they are
spliced properly but create variants.
We can think about
protein sequences and
some post-translational modifications,
such as phosphorylation of proteins,
and look at how the proteins fold up
to create small machines,
basically that do the things
that we need them to do inside our bodies.
These machines don't operate in isolation,
often they operate in networks so we're interested
in how proteins function together in networks,
where the proteins are localized,
the kinetics of enzymes, which are a sub-class proteins,
the metabolites that some of these proteins produce,
and when things go awry,
what kinds of diseases are caused by
defects in genes and proteins.
Of course, we would like to tie
all of these together with some academic framework,
so we want to have access to the literature.
So basically, we need databases
to archive accumulated knowledge
and to provide scientists
with easy access to biological data.
How can we store this data?
You can store them in a flat-file format
with the field separated by some kind of a delimiter.
So here we've got four records of professors,
BIOINFORMATICS WEEK 1

University of Toronto in

this case, some former professors.
Basically, that's the first name,
separated by a pipe character,
last name and then the department,
the university, and the address in this case.
We could store those data in a spreadsheet,
so this is maybe a more familiar way for
you to think about
storing data and you're all familiar with Excel, I'm sure.
Here we've got a column that contains the first_name,
the last_name, the institution,
the department, and the address.
There are problems with this kind of flat file format,
this kind of database.
One of these problems is that there's some redundancy.
So that for instance if we look at
this record here and this record here,
we've got two entries, which is taking
up extra storage space.
If the physical building
changes where these professors are housed in,
we'll have to update all of
the records in this flat file database.
If we miss one of them, that would be an error.
So relational databases actually offer a solution,
and they are commonly used in biology.
What we've got is a series of tables,
relations, that contain attributes,
which are fields or columns of the table,
and each row in a table is known as a tuple or a record,
and the information in these tables should
be normalized so that it's non-redundant.
So we can do this in a couple of different ways.
One common way to do this is to use
a foreign key to link tables.
The second table here,
the first table, we've got the table of Professors,
we've got a link to
another table of Contacts down here by
a foreign key to the primary key of the Contacts table.
So in fact here we would only represent
the Department of Botany once in the table of Contacts,
instead of having entered multiple
times as we did in the flat file field.
BIOINFORMATICS WEEK 1

SQL can be used to query relational databases,

and there's a very large body
of research and development on SQL databases,
how to index things efficiently
and query these databases efficiently.
When we create biological databases,
often we use different identifiers to index records.
A couple of different ways of identifying records in
a database, in GenBank for
instance, are using identifiers or accession codes.
In the case of identifiers,
typically a string of letters and digits
that's understandable in some meaningful way by a human.
They're not stable as accession numbers, mainly
because they can be
changed by curators if the function of the,
presumed function of the protein is found,
is changed, is updated as research advances.
In the case of GenBank,

The Answer for the Quiz

1. 1
2. 65 points …
3. Xm ..621 udah dicoba Xm… 721 xm..521 bv udahh
BIOINFORMATICS WEEK 1

a. What is the taxonomic lineage of your organism?

LINEAGE

cellular;organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embry
ophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliopsida; Mesangios
permae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales;
Brassicaceae; Camelineae; Arabidopsis
b. Has the genome of this organism been sequenced, i.e., is there a Genome Project?

Yes, there is a genome project. If i clicked a bio project of this organism, I can get 6,937 results.

c. If so, can you find the accession for the full sequence or one of the chromosomes?

Yes, I can. Example, the accession PRJNA795329

a. Where did this take you or what happened when you did this?
BIOINFORMATICS WEEK 1

This link took me to focused on the origin, as you can see the origin of this organism is
coloured by brown colour. So, the origin of this organism started by 1 until 541. Furthermore,
we can know the details about this organism such as gene synonym, inference (similar to
RNA sequence), domain etc.
a. Where is your gene’s location in the genome? (Tip: hover with your cursor over the green
bars in the “Genomic regions, transcripts, and products” section; the green bars represent
the gene in the sequence viewer)
Location : chromosome 2
Location complement(12,368,220..12,370,420)
:

b. How many exons do you see in this gene? Tip: how many green boxes are there?
Exon count : 4
c. What are the names of the genes surrounding it (i.e. what is its “Genomic context”)?
NC_003071.7
d. Does it have any conserved domains? What are they called? (Tip: use the “Related
Information” link to Conserved Domains on the right of the Gene page)
Yes, it does. There is 50 results of conserved domain in this organism.
BIOINFORMATICS WEEK 1

e. After exploring conserved domains go back to the Gene page. What biological process (Gene
Ontology terms) is this gene involved with (scroll down!)?

The Human Genome: Mapping the Blueprint of Human Life
From Everand
The Human Genome: Mapping the Blueprint of Human Life
Carla Mooney
No ratings yet
Bioinformatics and Quantumcomputing: Bio Informatics
No ratings yet
Bioinformatics and Quantumcomputing: Bio Informatics
10 pages
Bio in For Ma Tics
No ratings yet
Bio in For Ma Tics
52 pages
Computacional Biology
No ratings yet
Computacional Biology
201 pages
SN Brain Storm
No ratings yet
SN Brain Storm
3 pages
Essential Info Notes-1
No ratings yet
Essential Info Notes-1
57 pages
Bio in For Ma Tics
No ratings yet
Bio in For Ma Tics
8 pages
Bioinformatics Manual
No ratings yet
Bioinformatics Manual
117 pages
Plagiarism2 - Report
No ratings yet
Plagiarism2 - Report
6 pages
Computers Are From Mars, Organisms Are From Venus: Interrelationship Guide To Biology and Computer Science
No ratings yet
Computers Are From Mars, Organisms Are From Venus: Interrelationship Guide To Biology and Computer Science
8 pages
TB
No ratings yet
TB
143 pages
Algae Bioinformatics
No ratings yet
Algae Bioinformatics
10 pages
RAJU
No ratings yet
RAJU
24 pages
Bioinformatics Definition
No ratings yet
Bioinformatics Definition
11 pages
AP Bio Lab 3
No ratings yet
AP Bio Lab 3
18 pages
Bioinformatics Past Paper-WPS Office
No ratings yet
Bioinformatics Past Paper-WPS Office
19 pages
Bioinformatic Paper WPS Office
No ratings yet
Bioinformatic Paper WPS Office
20 pages
[Ebooks PDF] download Plant Epigenetics Methods and Protocols 1st Edition Andrea M. Foerster full chapters
100% (11)
[Ebooks PDF] download Plant Epigenetics Methods and Protocols 1st Edition Andrea M. Foerster full chapters
77 pages
SCU BIOL18 Midterm Study Guide
No ratings yet
SCU BIOL18 Midterm Study Guide
3 pages
Metagenomics Thesis PDF
75% (4)
Metagenomics Thesis PDF
11 pages
Introduction To Bioinformatics
No ratings yet
Introduction To Bioinformatics
10 pages
Bioinformatics: Nadiya Akmal Binti Baharum (PHD)
No ratings yet
Bioinformatics: Nadiya Akmal Binti Baharum (PHD)
54 pages
Introduction To NCBI Resources
No ratings yet
Introduction To NCBI Resources
39 pages
Chapter 1 - Genetics-An Introduction - 2
No ratings yet
Chapter 1 - Genetics-An Introduction - 2
17 pages
2006 09 01 - Lect01 - ch1 2 PDF
No ratings yet
2006 09 01 - Lect01 - ch1 2 PDF
104 pages
Annurev 2earplant 2E56 2E032604 2E144103
No ratings yet
Annurev 2earplant 2E56 2E032604 2E144103
29 pages
Bioinformatica
No ratings yet
Bioinformatica
10 pages
Bio in For Ma Tics
No ratings yet
Bio in For Ma Tics
7 pages
Unit I
No ratings yet
Unit I
28 pages
Biological Databases
No ratings yet
Biological Databases
39 pages
Bio PPT
No ratings yet
Bio PPT
35 pages
Molecular Genetics (Biology) - An Overview
No ratings yet
Molecular Genetics (Biology) - An Overview
13 pages
Full
No ratings yet
Full
235 pages
BIOINFORMATICS
No ratings yet
BIOINFORMATICS
22 pages
Bioinformatics Made Easy
No ratings yet
Bioinformatics Made Easy
232 pages
Bio in For Matics
No ratings yet
Bio in For Matics
232 pages
Lab 1 - Introduction and Protocol
No ratings yet
Lab 1 - Introduction and Protocol
28 pages
Instant Download Plant Genomics Databases Methods and Protocols 1st Edition Aalt D.J Van Dijk (Eds.) PDF All Chapters
100% (2)
Instant Download Plant Genomics Databases Methods and Protocols 1st Edition Aalt D.J Van Dijk (Eds.) PDF All Chapters
55 pages
Coursera BioinfoMethods-I Lab01 PDF
No ratings yet
Coursera BioinfoMethods-I Lab01 PDF
22 pages
Genome Project (1)
No ratings yet
Genome Project (1)
11 pages
Instant Download Phylogenomics A Primer 1st Edition Rob Desalle (Author) PDF All Chapters
100% (8)
Instant Download Phylogenomics A Primer 1st Edition Rob Desalle (Author) PDF All Chapters
60 pages
Human Molecular Genetics_ English
No ratings yet
Human Molecular Genetics_ English
340 pages
[FREE PDF sample] (Ebook) Plant Genomics Databases: Methods and Protocols by Aalt D.J van Dijk (eds.) ISBN 9781493966561, 9781493966585, 1493966561, 1493966588 ebooks
100% (8)
[FREE PDF sample] (Ebook) Plant Genomics Databases: Methods and Protocols by Aalt D.J van Dijk (eds.) ISBN 9781493966561, 9781493966585, 1493966561, 1493966588 ebooks
65 pages
Tics and Homology Modeling
No ratings yet
Tics and Homology Modeling
36 pages
Biological Databases
No ratings yet
Biological Databases
13 pages
Bio in For Ma Tics
No ratings yet
Bio in For Ma Tics
26 pages
Exercise 7 Bioinformatics
No ratings yet
Exercise 7 Bioinformatics
8 pages
Tics - A Brief Introduction
No ratings yet
Tics - A Brief Introduction
4 pages
Bio-Rad Explorer Cloning and Sequencing Explorer Series: Curriculum Manual
No ratings yet
Bio-Rad Explorer Cloning and Sequencing Explorer Series: Curriculum Manual
332 pages
Introduction to Bioinformatics Using Action Labs
From Everand
Introduction to Bioinformatics Using Action Labs
Jean-Louis Lassez
5/5 (1)
Bioinformatics for DNA sequence analysis 1st Edition Kit J. Menlove - The ebook is available for quick download, easy access to content
100% (1)
Bioinformatics for DNA sequence analysis 1st Edition Kit J. Menlove - The ebook is available for quick download, easy access to content
57 pages
BioinformaticsProjects Introduction
No ratings yet
BioinformaticsProjects Introduction
2 pages
Genome Annotation
No ratings yet
Genome Annotation
24 pages
CSC 821 - Bioinformatics
No ratings yet
CSC 821 - Bioinformatics
5 pages
Introduction to Bioinformatics, Sequence and Genome Analysis
From Everand
Introduction to Bioinformatics, Sequence and Genome Analysis
Jerry H. Swift
No ratings yet
Mastering Java through Biology: A Bioinformatics Project Book
From Everand
Mastering Java through Biology: A Bioinformatics Project Book
Peter Garst
3/5 (2)
Nature vs Nurture: Or Is It Neither? Could It Be the Ether?
From Everand
Nature vs Nurture: Or Is It Neither? Could It Be the Ether?
Marlon O Cole
No ratings yet
Bioinformatics: Merging Biology and Technology
From Everand
Bioinformatics: Merging Biology and Technology
Mani Devar
No ratings yet
The Tree with Many Branches: A Collection of Essays in Computational Phylogenetics
From Everand
The Tree with Many Branches: A Collection of Essays in Computational Phylogenetics
Tommy Rodriguez
No ratings yet
Ingenious Genes: How Gene Regulation Networks Evolve to Control Development
From Everand
Ingenious Genes: How Gene Regulation Networks Evolve to Control Development
Roger Sansom
No ratings yet
Dna Barcode Dan Analisis Filogenetik Molekuler Beberapa Jenis Bivalvia Asal Perairan Sulawesi Utara Berdasarkan Gen Coi
No ratings yet
Dna Barcode Dan Analisis Filogenetik Molekuler Beberapa Jenis Bivalvia Asal Perairan Sulawesi Utara Berdasarkan Gen Coi
7 pages
Epigenetic Biomarkers and Diagnostics 1st Edition Jose Luis Garca-Gimenez - Download the full set of chapters carefully compiled
100% (1)
Epigenetic Biomarkers and Diagnostics 1st Edition Jose Luis Garca-Gimenez - Download the full set of chapters carefully compiled
77 pages
Universal GenomeWalker 2.0 User Manual - 040314
No ratings yet
Universal GenomeWalker 2.0 User Manual - 040314
24 pages
Cleavable Linkers
100% (1)
Cleavable Linkers
12 pages
PRTN
No ratings yet
PRTN
16 pages
The Case of The Druid Dracula - PCR Lab
No ratings yet
The Case of The Druid Dracula - PCR Lab
12 pages
Fundamental Medical Science 1 Final Report (Genomic)
No ratings yet
Fundamental Medical Science 1 Final Report (Genomic)
14 pages
Southern Blot
No ratings yet
Southern Blot
2 pages
Diabetes Pathway
No ratings yet
Diabetes Pathway
12 pages
Guide To Electropherogram v3
No ratings yet
Guide To Electropherogram v3
12 pages
Bioinformatics Companies
100% (1)
Bioinformatics Companies
12 pages
What Is Pyrosequencing
No ratings yet
What Is Pyrosequencing
6 pages
2018 Julia Joo CV
No ratings yet
2018 Julia Joo CV
2 pages
Biol 3320 - Ames Test Lab Report - Travis Rempel
No ratings yet
Biol 3320 - Ames Test Lab Report - Travis Rempel
13 pages
Virfinder: A Novel K-Mer Based Tool For Identifying Viral Sequences From Assembled Metagenomic Data
No ratings yet
Virfinder: A Novel K-Mer Based Tool For Identifying Viral Sequences From Assembled Metagenomic Data
20 pages
Download Complete Molecular Diagnostics 1st Edition George Patrinos (Editor) PDF for All Chapters
100% (3)
Download Complete Molecular Diagnostics 1st Edition George Patrinos (Editor) PDF for All Chapters
43 pages
Archer Fusionplex Ngs Assays Brochure
No ratings yet
Archer Fusionplex Ngs Assays Brochure
4 pages
AI Can Help To Speed Up Drug Discovery - But Only If We Give It The Right Data
No ratings yet
AI Can Help To Speed Up Drug Discovery - But Only If We Give It The Right Data
4 pages
Nucleic Acid Biotechnology Techniques: Mary K. Campbell Shawn O. Farrell
No ratings yet
Nucleic Acid Biotechnology Techniques: Mary K. Campbell Shawn O. Farrell
40 pages
DANC113 Lec Notes 4
No ratings yet
DANC113 Lec Notes 4
6 pages
Molecular Characterization of Wild Mushroom
No ratings yet
Molecular Characterization of Wild Mushroom
5 pages
s41591-022-01717-2
No ratings yet
s41591-022-01717-2
8 pages
Databases in Bioinformatics - An Introduction
No ratings yet
Databases in Bioinformatics - An Introduction
11 pages
(Ebook) Parasite Genomics Protocols by Daniella Bartholomeu, Najib M. El-Sayed (auth.), Sara E. Melville (eds.) ISBN 9781588290625, 158829062Xdownload
100% (3)
(Ebook) Parasite Genomics Protocols by Daniella Bartholomeu, Najib M. El-Sayed (auth.), Sara E. Melville (eds.) ISBN 9781588290625, 158829062Xdownload
55 pages
Biotechnology Timeline
No ratings yet
Biotechnology Timeline
4 pages
Molecular Biology Structure and Dynamics of Genomes and Proteomes 2e by Jordanka Zlatanova
No ratings yet
Molecular Biology Structure and Dynamics of Genomes and Proteomes 2e by Jordanka Zlatanova
732 pages
Senol Cali Et Al., 2018
No ratings yet
Senol Cali Et Al., 2018
18 pages
MOLBIO-Syllabus
No ratings yet
MOLBIO-Syllabus
13 pages
(생명과학) (포스터) (경기과학고등학교) (정예찬)
No ratings yet
(생명과학) (포스터) (경기과학고등학교) (정예찬)
1 page

Bioinformatics Week 1: Play Video Starting At:4:13 and Follow Transcript4:13

Uploaded by

Bioinformatics Week 1: Play Video Starting At:4:13 and Follow Transcript4:13

Uploaded by

BIOINFORMATICS WEEK 1

Play video starting at :4:13 and follow transcript4:13

So if we look at the planet Earth from afar,

University of Toronto in

SQL can be used to query relational databases,

The Answer for the Quiz

a. What is the taxonomic lineage of your organism?

Yes, I can. Example, the accession PRJNA795329

You might also like