CS444: BIO INFORMATICS (Lab 1 - Manual) Bioinformatics Databases and Key Online Resources
Overview:
The purpose of this lab session is to introduce a range of bioinformatics databases and
associated services available on the Web.
The following transcript was found to be abundant in a human patient’s blood sample.
>example1
ATGGTGCATCTGACTCCTGTGGAGAAGTCTGCCGTTACTGCCCTGTGGGGCAAGGTGAACGTGGA
TGAAGTTGGTGGTGAGGCCCTGGGCAGGCTGCTGGTGGTCTACCCTTGGACCCAGAGGTTCTTTG
AGTCCTTTGGGGATCTGTCCACTCCTGATGCAGTTATGGGCAACCCTAAGGTGAAGGCTCATGGC
AAGAAAGTGCTCGGTGCCTTTAGTGATGGCCTGGCTCACCTGGACAACCTCAAGGGCACCTTTGC
CACACTGAGTGAGCTGCACTGTGACAAGCTGCACGTGGATCCTGAGAACTTCAGGCTCCTGGGCA
ACGTGCTGGTCTGTGTGCTGGCCCATCACTTTGGCAAAGAATTCACCCCACCAGTGCAGGCTGCC
TATCAGAAAGTGGTGGCTGGTGTGGCTAATGCCCTGGCCCACAAGTATCACTAAGCTCGCTTTCT
TGCTGTCCAATTT
The only information you are given is the above sequence, so you must begin your
investigation with a sequence search. For this example we will use NCBI’s
BLAST service at: http://blast.ncbi.nlm.nih.gov/
Note that there are several different “basic BLAST” programs available at NCBI
(including nucleotide BLAST, protein BLAST, and blastx). Because the query here is a
nucleotide sequence, nucleotide BLAST is the natural starting point.
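Before pasting the record into the BLAST query box, it can be useful to confirm that it is well-formed FASTA. The following is a minimal Python sketch, not part of the lab itself. The helper name parse_fasta is hypothetical and only the standard library is used. It splits the record into its header and sequence, reports the sequence length, and checks that only the four nucleotide letters are present.

```python
# Hypothetical helper for illustration; not part of any NCBI tool.
RECORD = """>example1
ATGGTGCATCTGACTCCTGTGGAGAAGTCTGCCGTTACTGCCCTGTGGGGCAAGGTGAACGTGGA
TGAAGTTGGTGGTGAGGCCCTGGGCAGGCTGCTGGTGGTCTACCCTTGGACCCAGAGGTTCTTTG
AGTCCTTTGGGGATCTGTCCACTCCTGATGCAGTTATGGGCAACCCTAAGGTGAAGGCTCATGGC
AAGAAAGTGCTCGGTGCCTTTAGTGATGGCCTGGCTCACCTGGACAACCTCAAGGGCACCTTTGC
CACACTGAGTGAGCTGCACTGTGACAAGCTGCACGTGGATCCTGAGAACTTCAGGCTCCTGGGCA
ACGTGCTGGTCTGTGTGCTGGCCCATCACTTTGGCAAAGAATTCACCCCACCAGTGCAGGCTGCC
TATCAGAAAGTGGTGGCTGGTGTGGCTAATGCCCTGGCCCACAAGTATCACTAAGCTCGCTTTCT
TGCTGTCCAATTT
"""


def parse_fasta(text):
    """Return (header, sequence) for a single-record FASTA string."""
    lines = [line.strip() for line in text.strip().splitlines()]
    if not lines or not lines[0].startswith(">"):
        raise ValueError("FASTA record must start with a '>' header line")
    header = lines[0][1:]                   # drop the leading '>'
    sequence = "".join(lines[1:]).upper()   # join the wrapped sequence lines
    return header, sequence


header, sequence = parse_fasta(RECORD)
print(header, len(sequence))
# Any character outside A, C, G, T would suggest a copy-and-paste problem.
print(set(sequence) <= set("ACGT"))
```

For this record the length is 468 nucleotides, the same as the alignment length in the 466/468 identity reported for the top hit.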
Q2: What are the names and accession numbers of the top four hits from your
BLAST search?
Sol: NM_000518, Homo sapiens hemoglobin subunit beta (HBB)
XM_508242, Pan troglodytes hemoglobin subunit beta (HBB)
XM_003819029, Pan paniscus hemoglobin subunit beta (LOC100976465)
AY136510, Homo sapiens hemoglobin beta chain variant Hb S-Wake (HBB)
Q3: What are the percent identities for the top few hits?
[HINT: scroll down to the alignment section of your BLAST result page for
details of matched nucleotides]
Sol: 466/468(99%),
465/468(99%),
465/468(99%),
465/468(99%)
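The percentages above follow directly from the identities fraction that BLAST prints for each alignment. As a minimal sketch (the function name percent_identity is hypothetical), the arithmetic can be reproduced in Python. Note that 466/468 is about 99.57%, so the displayed 99% appears to be truncated rather than rounded to the nearest whole number.

```python
def percent_identity(identical, aligned_length):
    """Percent of aligned positions that are identical."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return 100.0 * identical / aligned_length


# Identity fractions taken from the solution above.
for identical, aligned in [(466, 468), (465, 468)]:
    pct = percent_identity(identical, aligned)
    # int() truncates, which reproduces the whole-number value in the answers.
    print(f"{identical}/{aligned} = {pct:.2f}% (shown as {int(pct)}%)")
```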
Q4: How many identical and non-identical nucleotides are there in your top hit
compared to your last reported hit?
Sol: Top: 466 identical, 2 non-identical
Last: 398 identical, 40 non-identical
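The non-identical count is simply the alignment length minus the number of identities. The sketch below shows how such counts could be obtained from a pair of aligned sequences. It is an illustration under stated assumptions: the function name count_matches and the short toy alignment are made up, and gap characters are treated as non-identical, which is a simplification of how BLAST reports gaps separately.

```python
def count_matches(query, subject):
    """Count identical and non-identical columns of two aligned sequences.

    Gap characters ('-') are counted as non-identical in this sketch.
    """
    if len(query) != len(subject):
        raise ValueError("aligned sequences must have equal length")
    identical = sum(1 for q, s in zip(query, subject) if q == s and q != "-")
    return identical, len(query) - identical


# Toy alignment, made up for illustration (not taken from the BLAST result).
query = "ATGGTGCATCTG"
subject = "ATGGTGCACCTG"
print(count_matches(query, subject))
```

Applied to the numbers above: 468 aligned positions minus 466 identities leaves 2 non-identical positions for the top hit, and for the last hit 398 + 40 gives an alignment length of 438.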
From the results of your BLAST search you can link to the GENE entry for one of your top
hits. This link is located under the “Related Information” heading at the right-hand side of
each displayed alignment (scroll down to the “Alignments” section to find it).
Q5: What is the “Official Symbol” and “Official Full Name” for this gene?
Sol: HBB, hemoglobin subunit beta
Q10: Does the protein have a role in human disease(s)? If so, what diseases?
Sol: Sickle cell anemia. The disease is caused by mutations affecting the gene represented
in this entry.
[HINT: Scroll down to the “Phenotypes” section of the GENE entry page and also
explore the link to the OMIM database]