NIH Public Access

Author Manuscript
Nat Genet. Author manuscript; available in PMC 2011 February 10.
Published in final edited form as:
Nat Genet. 2000 May; 25(1): 25–29. doi:10.1038/75556.

Gene Ontology: tool for the unification of biology

The Gene Ontology Consortium, Michael Ashburner1, Catherine A. Ball3, Judith A. Blake4,
David Botstein3, Heather Butler1, J. Michael Cherry3, Allan P. Davis4, Kara Dolinski3,
Selina S. Dwight3, Janan T. Eppig4, Midori A. Harris3, David P. Hill4, Laurie Issel-Tarver3,
Andrew Kasarskis3, Suzanna Lewis2, John C. Matese3, Joel E. Richardson4, Martin
Ringwald4, Gerald M. Rubin2, and Gavin Sherlock3
1FlyBase (http://www.flybase.bio.indiana.edu)
2Berkeley Drosophila Genome Project (http://fruitfly.bdgp.berkeley.edu)
3Saccharomyces Genome Database (http://genome-www.stanford.edu)
4Mouse Genome Database and Gene Expression Database (http://www.informatics.jax.org)

Abstract
Genomic sequencing has made it clear that a large fraction of the genes specifying the core
biological functions are shared by all eukaryotes. Knowledge of the biological role of such shared
proteins in one organism can often be transferred to other organisms. The goal of the Gene
Ontology Consortium is to produce a dynamic, controlled vocabulary that can be applied to all
eukaryotes even as knowledge of gene and protein roles in cells is accumulating and changing. To
this end, three independent ontologies accessible on the World-Wide Web
(http://www.geneontology.org) are being constructed: biological process, molecular function and
cellular component.

The accelerating availability of molecular sequences, particularly the sequences of entire
genomes, has transformed both the theory and practice of experimental biology. Where once
biochemists characterized proteins by their diverse activities and abundances, and geneticists
characterized genes by the phenotypes of their mutations, all biologists now acknowledge
that there is likely to be a single limited universe of genes and proteins, many of which are
conserved in most or all living cells. This recognition has fuelled a grand unification of
biology; the information about the shared genes and proteins contributes to our
understanding of all the diverse organisms that share them. Knowledge of the biological role
of such a shared protein in one organism can certainly illuminate, and often provide strong
inference of, its role in other organisms.

Progress in the way that biologists describe and conceptualize the shared biological elements
has not kept pace with sequencing. For the most part, the current systems of nomenclature
for genes and their products remain divergent even when the experts appreciate the
underlying similarities. Interoperability of genomic databases is limited by this lack of
progress, and it is this major obstacle that the Gene Ontology (GO) Consortium was formed
to address.

© 2000 Nature America Inc.

Correspondence should be addressed to J.M.C. ([email protected]) and D.B. ([email protected]), Department of
Genetics, Stanford University School of Medicine, Stanford, California, USA.

Functional conservation requires a common language for annotation

Nowhere is the impact of the grand biological unification more evident than in the
eukaryotes, where the genomic sequences of three model systems are already available
(budding yeast, Saccharomyces cerevisiae, completed in 1996 (ref. 1); the nematode worm
Caenorhabditis elegans, completed in 1998 (ref. 2); and the fruitfly Drosophila
melanogaster, completed earlier this year (ref. 3)) and two more (the flowering plant Arabidopsis
thaliana (ref. 4) and fission yeast Schizosaccharomyces pombe) are imminent. The complete
sequence of the human genome is expected in a year or two, and the sequence of
the mouse (Mus musculus) will likely follow shortly thereafter.

The first comparison between two complete eukaryotic genomes (budding yeast and worm; ref. 5)
revealed that a surprisingly large fraction of the genes in these two organisms displayed
evidence of orthology. About 12% of the ~18,000 worm genes encode proteins whose
biological roles could be inferred from their similarity to their putative orthologues in yeast,
which comprise about 27% of the ~5,700 yeast genes. Most of these proteins have been found
to have a role in the ‘core biological processes’ common to all eukaryotic cells, such as
DNA replication, transcription and metabolism. A three-way comparison among budding
yeast, worm and fruitfly shows that this relationship can be extended; the same subset of
yeast genes generally has recognizable homologues in the fly genome (ref. 6). Estimates of
sequence and functional conservation between the genes of these model systems and those
of mammals are less reliable, as no mammalian genome sequence is yet known in its
entirety. Nevertheless, it is clear that a high level of sequence and functional conservation
will extend to all eukaryotes, with the likelihood that genes and proteins that carry out the
core biological processes will again be probable orthologues. Furthermore, since the late
1980s, many experimental confirmations of functional conservation between mammals and
model organisms (commonly yeast) have been published (refs 7–12).
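
To give the percentages quoted in this paragraph a sense of scale, the following minimal sketch converts them into approximate gene counts. It assumes that the parenthetical figures (~18,000 and ~5,700) are the total gene counts for worm and yeast; the variable names are illustrative only.

```python
# Approximate genome-wide gene totals quoted in the text (assumed to be totals).
WORM_GENES = 18_000
YEAST_GENES = 5_700

# Integer arithmetic avoids floating-point rounding surprises.
worm_with_yeast_orthologue = 12 * WORM_GENES // 100    # about 12% of worm genes
yeast_with_worm_orthologue = 27 * YEAST_GENES // 100   # about 27% of yeast genes

print(worm_with_yeast_orthologue, yeast_with_worm_orthologue)
```

Under these assumptions, roughly two thousand worm genes and roughly fifteen hundred yeast genes fall into the shared set.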

This astonishingly high degree of sequence and functional conservation presents both
opportunities and challenges. The main opportunity lies in the possibility of automated
transfer of biological annotations from the experimentally tractable model organisms to the
less tractable organisms based on gene and protein sequence similarity. Such information
can be used to improve human health or agriculture. The challenge lies in meeting the
requirements for a largely or entirely computational system for comparing or transferring
annotation among different species. Although robust methods for sequence comparison are
at hand (refs 13–15), many of the other elements for such a system remain to be developed.
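
The idea of automated annotation transfer can be sketched as follows. This is a minimal illustration, not the Consortium's method: the gene names, the orthologue table and the evidence wording are all hypothetical, and the orthologue pairs are assumed to come from a separate sequence-comparison step.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Annotation:
    term: str      # controlled-vocabulary term
    evidence: str  # how the annotation was established


# Hypothetical, experimentally derived annotations in a model organism.
model_annotations = {
    "model_gene_1": [Annotation("DNA replication", "experiment")],
    "model_gene_2": [Annotation("transcription", "experiment")],
}

# Hypothetical orthologue pairs from a sequence comparison:
# gene in the less tractable organism -> gene in the model organism.
orthologues = {
    "target_gene_A": "model_gene_1",
    "target_gene_B": "model_gene_2",
    "target_gene_C": "model_gene_9",  # orthologue has no annotation yet
}


def transfer_annotations(orthologues, model_annotations):
    """Copy each model-organism annotation to its orthologue, marked as inferred."""
    transferred = {}
    for target, model in orthologues.items():
        source = model_annotations.get(model, [])
        transferred[target] = [
            Annotation(a.term, f"inferred from similarity to {model}") for a in source
        ]
    return transferred


result = transfer_annotations(orthologues, model_annotations)
print(result["target_gene_A"])
```

Recording the inference in the evidence field keeps transferred annotations distinguishable from experimentally supported ones.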

A dynamic gene ontology

The GO Consortium is a joint project of three model organism databases: FlyBase (ref. 16), Mouse
Genome Informatics (MGI; refs 17,18) and the Saccharomyces Genome Database (SGD; ref. 19). It is
expected that other organism databases will join in the near future. The goal of the
Consortium is to produce a structured, precisely defined, common, controlled vocabulary for
describing the roles of genes and gene products in any organism. Early considerations of the
problems posed by the diversity of activities that characterize the cells of yeast, flies and
mice made it clear that extensions of standard indexing methods (for example, keywords)
are likely to be both unwieldy and, in the end, unworkable. Although these resources remain
essential, and our proposed system will continue to link to and depend on them, they are not
sufficient in themselves to allow automatic transfers of annotation.

Each node in the GO ontologies will be linked to other kinds of information, including the
many gene and protein keyword databases such as SwissPROT (ref. 20), GenBank (ref. 21),
EMBL (ref. 22), DDBJ (ref. 23), PIR (ref. 24), MIPS (ref. 25), YPD & WormPD (ref. 26),
Pfam (ref. 27), SCOP (ref. 28) and ENZYME (ref. 29). One reason for this is that the state
of biological knowledge of what genes and proteins do is very incomplete and changing
rapidly. Discoveries that change our understanding of the roles of gene products in cells are
published on a daily basis. To illustrate this, consider annotating two different proteins. One
is known to be a transmembrane receptor serine/threonine kinase involved in p53-induced
apoptosis; the other is known only to be a membrane-bound protein. In one case, the
knowledge about the protein is substantial, whereas in the other it is minimal. We need to be
able to organize, describe, query and visualize biological knowledge at vastly different
stages of completeness. Any system must be flexible and tolerant of this constantly changing
level of knowledge and allow updates on a continuing basis.

Similar considerations suggested that a static hierarchical system, such as the Enzyme
Commission (EC; ref. 30) hierarchy, although computationally tractable, was also likely to be
inadequate to describe the role of a gene or a protein in biology in a manner that would be
either intuitive or helpful for biologists. The hierarchical EC numbering system for enzymes
is the standard resource for classifying enzymatic chemical reactions. The EC system does
not address the classification of non-enzymatic proteins or the ability to describe the role of
a gene product within a cell; also, the system has little facility for describing diverse protein
interactions. The vagueness of the term ‘function’ when applied to genes or proteins
emerged as a particular problem, as this term is colloquially used to describe biochemical
activities, biological goals and cellular structure. It is commonplace today to refer to the
function of a protein such as tubulin as ‘GTPase’ or ‘constituent of the mitotic spindle’. For
all these reasons, we are constructing three independent ontologies.

Three categories of GO
Biological process refers to a biological objective to which the gene or gene product
contributes. A process is accomplished via one or more ordered assemblies of molecular
functions. Processes often involve a chemical or physical transformation, in the sense that
something goes into a process and something different comes out of it. Examples of broad
(high level) biological process terms are ‘cell growth and maintenance’ or ‘signal
transduction’. Examples of more specific (lower level) process terms are ‘translation’,
‘pyrimidine metabolism’ or ‘cAMP biosynthesis’.

Molecular function is defined as the biochemical activity (including specific binding to
ligands or structures) of a gene product. This definition also applies to the capability that a
gene product (or gene product complex) carries as a potential. It describes only what is done
without specifying where or when the event actually occurs. Examples of broad functional
terms are ‘enzyme’, ‘transporter’ or ‘ligand’. Examples of narrower functional terms are
‘adenylate cyclase’ or ‘Toll receptor ligand’.

Cellular component refers to the place in the cell where a gene product is active. These
terms reflect our understanding of eukaryotic cell structure. As is true for the other
ontologies, not all terms are applicable to all organisms; the set of terms is meant to be
inclusive. Cellular component includes such terms as ‘ribosome’ or ‘proteasome’,
specifying where multiple gene products would be found. It also includes terms such as
‘nuclear membrane’ or ‘Golgi apparatus’.

Ontologies have long been used in an attempt to describe all entities within an area of reality
and all relationships between those entities. An ontology comprises a set of well-defined
terms with well-defined relationships. The structure itself reflects the current representation
of biological knowledge as well as serving as a guide for organizing new data. Data can be
annotated to varying levels depending on the amount and completeness of available
information. This flexibility also allows users to narrow or widen the focus of queries.
Ultimately, an ontology can be a vital tool enabling researchers to turn data into knowledge.

Computer scientists have made significant contributions to linguistic formalisms and
computational tools for developing complex vocabulary systems using reason-based
structures, and we hope that our ontologies will be useful in providing a well-developed data
set for this community to test their systems. The Molecular Biology Ontology Working
Group (http://www-smi.stanford.edu/projects/bio-ontology/) is actively attempting to develop
standards in this general field.

Biological process, molecular function and cellular component are all attributes of genes,
gene products or gene-product groups. Each of these may be assigned independently and,
indeed, we believe that simply recognizing that biological process, molecular function and
cellular location represent independent attributes is by itself clarifying in many situations, as
in the annotation of gene-expression data. The relationships between a gene product (or
gene-product group) and biological process, molecular function and cellular component are
one-to-many, reflecting the biological reality that a particular protein may function in
several processes, contain domains that carry out diverse molecular functions, and
participate in multiple alternative interactions with other proteins, organelles or locations in
the cell.
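
The independence of the three attributes, and their one-to-many character, can be pictured with a small record type. This is only a sketch: the class and field names are hypothetical, and the term strings paraphrase the MCM and membrane-protein examples in the text rather than quoting official GO term names.

```python
from dataclasses import dataclass, field


@dataclass
class GeneProduct:
    name: str
    # Each aspect is assigned independently and may hold several terms.
    biological_process: set = field(default_factory=set)
    molecular_function: set = field(default_factory=set)
    cellular_component: set = field(default_factory=set)


# A well-characterized product: several terms in more than one aspect.
mcm = GeneProduct("MCM protein (illustrative record)")
mcm.biological_process.add("DNA replication initiation")
mcm.molecular_function.update({"chromatin binding", "ATP-dependent DNA helicase"})
mcm.cellular_component.update({"pre-replicative complex", "cytoplasm"})

# A protein known only to be membrane-bound: one aspect filled, the others empty.
sparse = GeneProduct("poorly characterized protein", cellular_component={"membrane"})


def annotated_aspects(gene_product):
    """Return the names of the aspects that carry at least one term."""
    aspects = ("biological_process", "molecular_function", "cellular_component")
    return [a for a in aspects if getattr(gene_product, a)]


print(annotated_aspects(mcm))
print(annotated_aspects(sparse))
```

Because the aspects are separate sets, a record can be complete in one aspect and empty in another, which matches the requirement to represent knowledge at different stages of completeness.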

The ontologies are developed for a generic eukaryotic cell; accordingly, specialized organs
or body parts are not represented. Full integration of the ontologies with anatomical
structures will occur as the ontologies are incorporated into each species’ database and are
related to anatomical data within each database. GO terms are connected as nodes of a
network, so the connections between each term and its parents and children are known; these
networks form what are technically described as directed acyclic graphs. The ontologies are dynamic, in the sense
that they exist as a network that is changed as more information accumulates, but have
sufficient uniqueness and precision so that databases based on the ontologies can
automatically be updated as the ontologies mature. The ontologies are flexible in another
way, so that they can reflect the many differences in the biology of the diverse organisms,
such as the breakdown of the nucleus during mitosis. In this way the GO Consortium has
built up a system that supports a common language with specific, agreed-on terms with
definitions and supporting documentation (the GO ontologies) that can be understood and
used by a wide biological community.
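
One practical consequence of the directed-acyclic-graph structure is that it can be checked mechanically whenever terms are added or moved. The sketch below is an illustration under stated assumptions: the term names are placeholders, the child-to-parents mapping is a hypothetical storage layout, and the check relies on Python's standard `graphlib` module (Python 3.9 or later).

```python
from graphlib import CycleError, TopologicalSorter

# Hypothetical ontology fragment: child term -> set of parent terms.
parents = {
    "root": set(),
    "term_a": {"root"},
    "term_b": {"root"},
    "term_c": {"term_a", "term_b"},  # a node may have more than one parent
}

# A faulty update that makes the root a child of its own descendant.
broken = dict(parents, root={"term_c"})


def is_acyclic(parent_map):
    """Return True if the parent links contain no cycle."""
    try:
        # static_order() raises CycleError when no topological order exists.
        list(TopologicalSorter(parent_map).static_order())
        return True
    except CycleError:
        return False


print(is_acyclic(parents), is_acyclic(broken))
```

A check of this kind is one way a database could guard automatic updates as the ontologies change.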

Examples of GO annotation
As one example, consider DNA metabolism, a biological process carried out by largely (but
not entirely) shared elements in eukaryotes. The part of the process ontology (with selected
gene names from S. cerevisiae, Drosophila and M. musculus) shown is largely one parent to
many children (Fig. 1a). One notable exception is the process of DNA ligation, which is a
child of three processes: DNA replication, DNA repair and DNA recombination. The yeast
gene product Cdc9p is able to carry out the ligation step for all three processes, whereas it is
uncertain whether the same enzyme is used in the other species. From the point of view of
the ontology, it matters not, and a computer (or a human searcher) will find the appropriate
nodes in either case using as the query either the enzyme, the gene name(s) or the GO term
(or, if available, the unique GO identifier, in this case, GO:0003910).
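
The search behaviour described here can be sketched in a few lines. The links from the three parent processes up to ‘DNA metabolism’ are an assumption based on the description of Fig. 1a, and the function names and data layout are hypothetical.

```python
# Child term -> parent terms. 'DNA ligation' has three parents, as in the text.
PARENTS = {
    "DNA metabolism": [],
    "DNA replication": ["DNA metabolism"],
    "DNA repair": ["DNA metabolism"],
    "DNA recombination": ["DNA metabolism"],
    "DNA ligation": ["DNA replication", "DNA repair", "DNA recombination"],
}

# Gene product -> directly annotated terms (Cdc9p is the example from the text).
ANNOTATIONS = {"Cdc9p": {"DNA ligation"}}


def ancestors(term, parents=PARENTS):
    """Return every term reachable by following parent links upward."""
    seen = set()
    stack = list(parents.get(term, []))
    while stack:
        current = stack.pop()
        if current not in seen:
            seen.add(current)
            stack.extend(parents.get(current, []))
    return seen


def genes_for(term, annotations=ANNOTATIONS):
    """Find gene products annotated to the term itself or to any descendant."""
    return sorted(
        gene
        for gene, terms in annotations.items()
        if any(t == term or term in ancestors(t) for t in terms)
    )


print(genes_for("DNA repair"))
print(genes_for("DNA metabolism"))
```

A query on any of the three parent processes, or on the broader term above them, returns the same gene product, which is the sense in which a searcher finds the appropriate nodes regardless of the entry point.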

Also shown are the molecular function ontology for the MCM protein complex members
that are known to regulate initiation of DNA replication in the three organisms (Fig. 1b), and
a portion of the cellular component ontology for these proteins (Fig. 1c). These ontologies
reflect the finding that Mcm2–7 proteins are components of the pre-replicative complex in
several model organisms, as well as sometimes localizing to the cytoplasm (ref. 31). The ontology
supports both biological realities, and yet the molecular functions and the biological
processes of the MCM homologues are conserved nevertheless.

The usefulness of the GO ontologies for annotation received its first major test in the
annotation of the recently completed sequence of the Drosophila genome. Little human
intervention was required to annotate 50% of the genes to the molecular function and
biological process ontologies using the GO method. Another use for GO ontologies that is
gaining rapid adherence is the annotation of gene-expression data, especially after these
have been clustered by similarities in pattern of gene expression (refs 32,33). The results of
clustering about 100 yeast experiments (of which about half are shown; Fig. 2) grouped
together a subset of genes which, by name alone, convey little to most biologists. When the
full short GO annotations for process, molecular function and location are added, however,
the biological reason for, and import of, the co-expression of these genes become evident.
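
A simple way to see how annotations make a cluster interpretable is to count the terms attached to its members. The following sketch uses placeholder gene names and made-up annotations, chosen only to mirror the protein-folding cluster described for Fig. 2; it is not the analysis used to produce that figure.

```python
from collections import Counter

# Hypothetical expression cluster: gene -> biological process annotation.
cluster = {
    "gene_1": "protein folding",
    "gene_2": "protein folding",
    "gene_3": "protein folding",
    "gene_4": "translation",
}


def dominant_term(annotations):
    """Return the most frequent term and the fraction of genes that carry it."""
    counts = Counter(annotations.values())
    term, n = counts.most_common(1)[0]
    return term, n / len(annotations)


term, fraction = dominant_term(cluster)
print(term, fraction)
```

The same count could be repeated for the molecular function and cellular component annotations to see which aspect tracks the expression pattern most closely.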

The GO project is currently using a flat file format to store the ontologies, definitions of
terms and gene associations. The ontologies, gene associations, definitions and
documentation are available from the GO web site (http://www.geneontology.org), which
also describes the principles and objectives used by the project. The ontologies are by no
means complete. They are being expanded during the association of gene products from the
collaborating databases and we expect them to continue to evolve for many years. GO
requires that all gene associations to the ontologies be attributed to the literature; for
each citation the type of evidence will be encoded. As of early April 2000 there were 1,923,
2,094 and 490 nodes in the process, function and component ontologies, respectively. The
three organism databases have made substantial progress in linking gene products to these
nodes. Thus far the process, function and component ontologies have associations with 1,624, 1,602 and 1,577
yeast genes; 741, 2,334 and 1,061 fly genes; and 1,933, 2,896 and 1,696 mouse genes,
respectively. A running table of these statistics can be found at the web site.
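
The requirement that every association carry a literature attribution and an evidence type can be enforced when a flat file is read. The text does not specify the file layout, so the four-column, tab-delimited format below is purely hypothetical, as are the reference placeholder and the evidence label; only the gene product and identifier come from the example in the text.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Association:
    gene: str
    go_id: str
    reference: str  # literature citation the association is attributed to
    evidence: str   # type of evidence supporting the association


def parse_association(line):
    """Parse one tab-delimited line; reject records lacking reference or evidence."""
    fields = line.rstrip("\n").split("\t")
    if len(fields) != 4:
        raise ValueError(f"expected 4 tab-separated fields, got {len(fields)}")
    gene, go_id, reference, evidence = (f.strip() for f in fields)
    if not reference or not evidence:
        raise ValueError("an association needs a literature reference and an evidence type")
    return Association(gene, go_id, reference, evidence)


# Hypothetical line: the reference and evidence values are placeholders.
record = parse_association("Cdc9p\tGO:0003910\tREF:placeholder\texperimental")
print(record)
```

Rejecting incomplete records at load time keeps unattributed associations out of the database rather than relying on later curation to find them.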

The GO concept is intended to make possible, in a flexible and dynamic way, the annotation
of homologous gene and protein sequences in multiple organisms using a common
vocabulary that results in the ability to query and retrieve genes and proteins based on their
shared biology. The GO ontologies produce a controlled vocabulary that can be used for
dynamic maintenance and interoperability between genome databases. The ontologies are a
work in progress. They can be consulted at any time on the World-Wide Web; indeed, their
availability to human and machine alike is essential to maintain their flexibility and allow
their evolution along with increased understanding of the underlying biology. It is hoped
that the GO concepts, especially the distinctions between biological process, molecular
function and cellular component, will find favour among biologists so that we can all
facilitate, in our writing as well as our thinking, the grand unification of biology that the
genome sequences portend.

Acknowledgments

We thank K. Fasman and M. Rebhan for useful discussions, and Astra Zeneca for financial support. SGD is
supported by a P41, National Resources, grant from National Human Genome Research Institute (NHGRI) grant
HG01315; MGD by a P41 from NHGRI grant HG00330; GXD by National Institute of Child Health and Human
Development grant HD33745; and FlyBase by a P41 from NHGRI grant HG00739 and the Medical Research
Council, London.

References
1. Goffeau A, et al. Life with 6000 genes. Science 1996;274:546. [PubMed: 8849441]
2. Worm Sequencing Consortium; The C. elegans Sequencing Consortium. Genome sequence of the
nematode C. elegans: a platform for investigating biology. Science 1998;282:2012–2018. [PubMed:
9851916]
3. Adams MD, et al. The genome sequence of Drosophila melanogaster. Science 2000;287:2185–
2195. [PubMed: 10731132]
4. Meinke DW, et al. Arabidopsis thaliana: a model plant for genome analysis. Science 1998;282:662–
682. [PubMed: 9784120]
5. Chervitz SA, et al. Using the Saccharomyces Genome Database (SGD) for analysis of protein
similarities and structure. Nucleic Acids Res 1999;27:74–78. [PubMed: 9847146]
6. Rubin GM, et al. Comparative genomics of the eukaryotes. Science 2000;287:2204–2215. [PubMed:
10731134]
7. Tang Z, Kuo T, Shen J, Lin RJ. Biochemical and genetic conservation of fission yeast Dsk1 and
human SR protein-specific kinase 1. Mol. Cell. Biol 2000;20:816–824. [PubMed: 10629038]
8. Vajo Z, et al. Conservation of the Caenorhabditis elegans timing gene clk-1 from yeast to human: a
gene required for ubiquinone biosynthesis with potential implications for aging. Mamm. Genome
1999;10:1000–1004. [PubMed: 10501970]
9. Ohi R, et al. Myb-related Schizosaccharomyces pombe cdc5p is structurally and functionally
conserved in eukaryotes. Mol. Cell. Biol 1998;18:4097–4108. [PubMed: 9632794]
10. Bassett DE Jr, et al. Genome cross-referencing and XREFdb: implications for the identification
and analysis of genes mutated in human disease. Nature Genet 1997;15:339–344. [PubMed:
9090377]
11. Kataoka T, et al. Functional homology of mammalian and yeast RAS genes. Cell 1985;40:19–26.
[PubMed: 2981628]
12. Botstein D, Fink GR. Yeast: an experimental organism for modern biology. Science
1988;240:1439–1443. [PubMed: 3287619]
13. Tatusov RL, Galperin MY, Natale DA, Koonin EV. The COG database: a tool for genome-scale
analysis of protein functions and evolution. Nucleic Acids Res 2000;28:33–36. [PubMed:
10592175]
14. Andrade MA, et al. Automated genome sequence analysis and annotation. Bioinformatics
1999;15:391–412. [PubMed: 10366660]
15. Fleischmann W, Moller S, Gateau A, Apweiler R. A novel method for automatic functional
annotation of proteins. Bioinformatics 1999;15:228–233. [PubMed: 10222410]
16. The FlyBase Consortium. The FlyBase database of the Drosophila Genome Projects and
community literature. Nucleic Acids Res 1999;27:85–88. [PubMed: 9847148]
17. Blake JA, et al. The Mouse Genome Database (MGD): expanding genetic and genomic resources
for the laboratory mouse. Nucleic Acids Res 2000;28:108–111. [PubMed: 10592195]
18. Ringwald M, et al. GXD: a gene expression database for the laboratory mouse—current status and
recent enhancements. Nucleic Acids Res 2000;28:115–119. [PubMed: 10592197]
19. Ball CA, et al. Integrating functional genomic information into the Saccharomyces Genome
Database. Nucleic Acids Res 2000;28:77–80. [PubMed: 10592186]
20. Bairoch A, Apweiler R. The SWISS-PROT protein sequence database and its supplement TrEMBL
in 2000. Nucleic Acids Res 2000;28:45–48. [PubMed: 10592178]
21. Benson DA, et al. GenBank. Nucleic Acids Res 2000;28:15–18. [PubMed: 10592170]
22. Baker W, et al. The EMBL Nucleotide Sequence Database. Nucleic Acids Res 2000;28:19–23.
[PubMed: 10592171]
23. Tateno Y, et al. DNA Data Bank of Japan (DDBJ) in collaboration with mass sequencing teams.
Nucleic Acids Res 2000;28:24–26. [PubMed: 10592172]
24. Barker WC, et al. The Protein Information Resource (PIR). Nucleic Acids Res 2000;28:41–44.
[PubMed: 10592177]
25. Mewes HW, et al. MIPS: a database for genomes and protein sequences. Nucleic Acids Res
2000;28:37–40. [PubMed: 10592176]
26. Costanzo MC, et al. The Yeast Proteome Database (YPD) and Caenorhabditis elegans Proteome
Database (WormPD): comprehensive resources for the organization and comparison of model
organism protein information. Nucleic Acids Res 2000;28:73–76. [PubMed: 10592185]
27. Bateman A, et al. The Pfam protein families database. Nucleic Acids Res 2000;28:263–266.
[PubMed: 10592242]
28. Lo Conte L, et al. SCOP: a structural classification of proteins database. Nucleic Acids Res
2000;28:257–259. [PubMed: 10592240]
29. Bairoch A. The ENZYME database in 2000. Nucleic Acids Res 2000;28:304–305. [PubMed:
10592255]
30. Enzyme Nomenclature. Recommendations of the Nomenclature Committee of the International
Union of Biochemistry and Molecular Biology on the Nomenclature and Classification of
Enzymes. NC-IUBMB. Academic; New York: 1992.
31. Tye BK. MCM proteins in DNA replication. Annu. Rev. Biochem 1999;68:649–686. [PubMed:
10872463]
32. Eisen M, Spellman PT, Brown PO, Botstein D. Cluster analysis and display of genome-wide
expression patterns. Proc. Natl Acad. Sci. USA 1998;95:14863–14868. [PubMed: 9843981]
33. Spellman PT, et al. Comprehensive identification of cell cycle-regulated genes of the yeast
Saccharomyces cerevisiae by microarray hybridization. Mol. Biol. Cell 1998;9:3273–3297.
[PubMed: 9843569]

Fig. 1.
Examples of Gene Ontology. Three examples illustrate the structure and style used by GO to
represent the gene ontologies and to associate genes with nodes within an ontology. The
ontologies are built from a structured, controlled vocabulary. The illustrations are the
products of work in progress and are subject to change when new evidence becomes
available. For simplicity, not all known gene annotations have been included in the figures.
a, Biological process ontology. This section illustrates a portion of the biological process
ontology describing DNA metabolism. Note that a node may have more than one parent; for
example, ‘DNA ligation’ has three parents, ‘DNA-dependent DNA replication’, ‘DNA
repair’ and ‘DNA recombination’. b, Molecular function ontology. The ontology is not
intended to represent a reaction pathway, but instead reflects conceptual categories of
gene-product function. A gene product can be associated with more than one node within an
ontology, as illustrated by the MCM proteins. These proteins have been shown to bind
chromatin and to possess ATP-dependent DNA helicase activity, and are annotated to both
nodes. c, Cellular component ontology. The ontologies are designed for a generic eukaryotic
cell, and are flexible enough to represent the known differences between diverse organisms.

Fig. 2.
Correspondence between hierarchical clustering of expression microarray experiments and
GO terms. The coloured matrix represents the results of clustering many microarray
expression experiments (ref. 32). In the matrix, each row represents the yeast gene described to the
right, and each column represents the expression of that gene in a particular microarray
hybridization. For each gene in the matrix, the table at right lists the systematic ORF name,
the standard gene name (if known), and the GO biological process, molecular function and
cellular component annotations for that gene. The GO annotations suggest that this
experimental expression cluster groups gene products involved in the biological process of
protein folding. In contrast, the molecular function and cellular component annotations of
these gene products correlate less well with the clustered expression patterns of these gene
products.
Roy Carroll
4/5 (2)
Comparative Genomics IACD QBT FCQ
No ratings yet
Comparative Genomics IACD QBT FCQ
14 pages
Human Genome Project
No ratings yet
Human Genome Project
16 pages
Systems and Computational Biology Molecular and Cellular Experimental Systems PDF
No ratings yet
Systems and Computational Biology Molecular and Cellular Experimental Systems PDF
344 pages
Yeast Systems Biology Methods And Protocols 1st Edition Juan I Castrillo pdf download
100% (2)
Yeast Systems Biology Methods And Protocols 1st Edition Juan I Castrillo pdf download
90 pages
Intrinsically Disordered Proteins: Exploring Structural Dynamics and Functional Roles in Cellular Mechanisms
From Everand
Intrinsically Disordered Proteins: Exploring Structural Dynamics and Functional Roles in Cellular Mechanisms
Fouad Sabry
No ratings yet
Genomes and Their Evolution: Biology
No ratings yet
Genomes and Their Evolution: Biology
94 pages
Unknome
No ratings yet
Unknome
31 pages
Chapter 3 - Bioinformatics Intervention in Functional Ge - 2022 - Bioinformatics
No ratings yet
Chapter 3 - Bioinformatics Intervention in Functional Ge - 2022 - Bioinformatics
10 pages
Bridging Biological Insights From Mus Musculus and Beyond. Introduction & Objective Methodology Results Introduction & Objective Methodology Results
No ratings yet
Bridging Biological Insights From Mus Musculus and Beyond. Introduction & Objective Methodology Results Introduction & Objective Methodology Results
1 page
Introduction To NCBI Resources
No ratings yet
Introduction To NCBI Resources
39 pages
(Ebook) Genetics and Philosophy: An Introduction by Paul Griffiths, Karola Stotz ISBN 9781107002128, 1107002125 - The ebook in PDF format is ready for download
100% (1)
(Ebook) Genetics and Philosophy: An Introduction by Paul Griffiths, Karola Stotz ISBN 9781107002128, 1107002125 - The ebook in PDF format is ready for download
31 pages
Biology Unleashed: A Comprehensive Guide to Mastering the Science of Life
From Everand
Biology Unleashed: A Comprehensive Guide to Mastering the Science of Life
Dominic Front
No ratings yet
KOBAS 2.0: A Web Server For Annotation and Identification of Enriched Pathways and Diseases
No ratings yet
KOBAS 2.0: A Web Server For Annotation and Identification of Enriched Pathways and Diseases
7 pages
Bioinformatics: Applications Note
No ratings yet
Bioinformatics: Applications Note
2 pages
The Reactome Pathway Knowledgebase
No ratings yet
The Reactome Pathway Knowledgebase
6 pages
STRING v11: Protein-Protein Association Networks With Increased Coverage, Supporting Functional Discovery in Genome-Wide Experimental Datasets
No ratings yet
STRING v11: Protein-Protein Association Networks With Increased Coverage, Supporting Functional Discovery in Genome-Wide Experimental Datasets
7 pages
CH - 14 Biomolecules
No ratings yet
CH - 14 Biomolecules
14 pages
Structure of DNA
No ratings yet
Structure of DNA
62 pages
Physical Science Module 4
No ratings yet
Physical Science Module 4
9 pages
Introduction To Plant Biotechnology-Lecture 2
No ratings yet
Introduction To Plant Biotechnology-Lecture 2
31 pages
The Molecules of Life Physical and Chemical Principles 1st Edition John Kuriyan pdf download
100% (3)
The Molecules of Life Physical and Chemical Principles 1st Edition John Kuriyan pdf download
63 pages
ARWEN Bioinformatika
No ratings yet
ARWEN Bioinformatika
4 pages
Molecular Biology 3rd Edition David P. Clark All Chapters Instant Download
100% (1)
Molecular Biology 3rd Edition David P. Clark All Chapters Instant Download
52 pages
Genetics and Genomics for Nursing 1st Edition Kenner Solutions Manualinstant download
100% (6)
Genetics and Genomics for Nursing 1st Edition Kenner Solutions Manualinstant download
48 pages
Nucleic Acid Structure: Components of DNA and RNA
No ratings yet
Nucleic Acid Structure: Components of DNA and RNA
18 pages
NCERT Solutions For Class 12 Chemistry Chapter 14 Biomolecules
No ratings yet
NCERT Solutions For Class 12 Chemistry Chapter 14 Biomolecules
7 pages
Exercise and Problems Types of Nucleic Acids (Section 22.1)
No ratings yet
Exercise and Problems Types of Nucleic Acids (Section 22.1)
7 pages
!README
No ratings yet
!README
8 pages
Dr.Nesreen SAT2Bio
No ratings yet
Dr.Nesreen SAT2Bio
168 pages
Sbi4u Molecgenetics 1 Stripped
No ratings yet
Sbi4u Molecgenetics 1 Stripped
31 pages
Lecture Notes On Molecular Biology
No ratings yet
Lecture Notes On Molecular Biology
94 pages
Test Bank for Microbiology: An Introduction, 12th Edition, Gerard J. Tortora, Berdell R. Funke Christine L. Case - Full Version With All Chapters Is Ready For Download
100% (8)
Test Bank for Microbiology: An Introduction, 12th Edition, Gerard J. Tortora, Berdell R. Funke Christine L. Case - Full Version With All Chapters Is Ready For Download
49 pages
Bio Molecules
No ratings yet
Bio Molecules
15 pages
Biochem Reviewer
No ratings yet
Biochem Reviewer
9 pages
Cambridge A-Level Biology (9700) BIOLOGY P2 Biology (9700) Exam-Mate
100% (2)
Cambridge A-Level Biology (9700) BIOLOGY P2 Biology (9700) Exam-Mate
1,140 pages
Topic 1.1 Test (Science 10) : Relevance Extending Proficient Developing Emerging
No ratings yet
Topic 1.1 Test (Science 10) : Relevance Extending Proficient Developing Emerging
2 pages
Lec 3 Terms and Definitions in Bioinformatics
No ratings yet
Lec 3 Terms and Definitions in Bioinformatics
8 pages
4TH Quarter - Genbio
No ratings yet
4TH Quarter - Genbio
30 pages
Botany Senior Inter
No ratings yet
Botany Senior Inter
5 pages
WAJA JPN Perak
100% (3)
WAJA JPN Perak
2 pages
A 1.2 HL Nucleic Acids - Student Notes
No ratings yet
A 1.2 HL Nucleic Acids - Student Notes
9 pages
Cellular Aberration: Merchie Lissa T. Alabat, RN June 13, 2013
No ratings yet
Cellular Aberration: Merchie Lissa T. Alabat, RN June 13, 2013
105 pages
Form 4 Biology Meanings
No ratings yet
Form 4 Biology Meanings
15 pages
Biomolecules Act
No ratings yet
Biomolecules Act
2 pages

© 2000 Nature America Inc.
