Bioinformatic Practice
Bioinformatic Practice
Class: …………………..
Student-ID: …………….
Introduction to Ensembl
https://www.youtube.com/watch?time_continue=1442&v=lA2xq3YkWko
Exercise 1 – Panda
(a) Go to the species homepage for Giant Panda. What is the name of
the genome assembly for Panda?
(b) Click on More information and statistics. How long is the Panda
genome (in bp)? How many coding genes have been annotated?
Exercise 2 – Zebrafish
Exercise 3 – Mosquitoes
(b) When was the current Anopheles gambiae genome assembly last
revised?
Exercise 4 – Bacteria
Go to Ensembl Bacteria and find the species Belliella baltica. How
many coding and non-coding genes does it have?
(a) Find the human MYH9 (myosin, heavy chain 9, non-muscle) gene,
and go to the Gene
(b) Click on Phenotype at the left side of the page. Are there any
diseases associated with this gene, according to O-MIM (Online
Mendelian Inheritance in Man)?
(c) In the transcript table, click on the transcript ID for MYH9-201, and
go to the Transcript tab.
(b) How many protein coding transcripts does this gene have? View all
of these in the transcript comparison view.
(c) What is the OMIM gene identifier for this gene?
The SNP rs1738074 in the 5’ UTR of the human TAGAP gene has been
identified as a genetic risk factor for a few diseases.
(b) What is the least frequent genotype for this SNP in the Yoruba (YRI)
population from the 1000 Genomes phase 3?
(d) With which diseases is this SNP associated? Are there any known
risk (or associated alleles?
(c) Zoom in on the largest gene EFI27358. How many exons does this
gene have?