A Comparison of Upper Ontologies
A Comparison of Upper Ontologies
Abstract. Upper Ontologies are quickly becoming a key technology for inte-
grating heterogeneous knowledge coming from different sources. In this techni-
cal report we analyse 7 Upper Ontologies, namely BFO, Cyc, DOLCE, GFO,
PROTON, Sowa’s ontology, and SUMO, according to a set of standard software
engineering criteria, and we synthesise our analysis in form of a comparative ta-
ble. A summary of some existing comparisons drawn among subsets of the 7
Upper Ontologies that we deal with in this document, is also provided.
1 Introduction
Upper ontologies are quickly becoming a key technology for integrating heterogeneous
knowledge coming from different sources. In fact, they may be used by different par-
ties involved in a knowledge integration and exchange process as a reference, common
model of the reality.
The definition of upper ontology (also named top-level ontology, or foundation on-
tology) given by Wikipedia [22] is “an attempt to create an ontology which describes
very general concepts that are the same across all domains. The aim is to have a large
number on ontologies accessible under this upper ontology”.
In this report, we have described 7 upper ontologies along different criteria that
include dimension, implementation language(s), modularity, developed applications,
alignment with the WordNet lexical resource, and licensing. We have chosen these cri-
teria for three reasons:
– They are software engineering criteria useful for the developer of a knowledge-
based system that has to choose the most suitable Upper Ontology for his/her needs,
among a set of existing ones. Since all of us have a computer science background,
these criteria are more familiar to us than philosophical ones.
– They take into account some of the evaluation questions proposed by the IEEE
Standard Upper Ontology Working Group (http://suo.ieee.org/SUO/Eva-
luations/), and they also extend the criteria considered in an existing compari-
son among SUMO, Cyc, and DOLCE [18], thus allowing us to “reuse”, and to be
consistent with, the results already obtained there.
– They are not (easily) scientifically falsifiable.
The choice of the 7 upper ontologies we have described, namely BFO, Cyc, DOLCE,
GFO, PROTON, Sowa’s ontology, and SUMO, is based on how much, to the best of our
knowledge, they are visible and used inside the research community. For example, we
have discussed all the Upper Ontologies referenced by Wikipedia, apart from WordNet
that we consider a lexical resource rather than an Upper Ontology, and from the Global
Justice XML Data Model and National Information Exchange Model, that addresses the
specific application domain of justice and public safety. We have reported alignments
between the Upper Ontologies and WordNet, when existing. To the 5 Upper Ontologies
considered by Wikipedia, we have added PROTON and Sowa’s ontology. We have also
cited three attempts to merge existing Upper Ontologies, namely COSMO, MSO, and
OntoMap, although we have not described them in detail since the first two ones are
still work in progress, and the last one is over since four years.
The methodology followed to draw this report consisted in checking the existing
literature, producing a first draft of the comparison based on the retrieved literature,
submitting it to the attention of the developers of all the 7 upper ontologies under com-
parison, and integrating the obtained answers and suggestions into the current version
of the report. Due to time constraints, we were not able to experiment with the upper
ontologies by our own. This “on the field” experimentation is part of our near future
work, and its results will be described in a companion technical report.
The report is organised in the following way: Section 2 provides a description of
the 7 upper ontologies, and Section 3 surveys some existing, partial comparisons drawn
in the past years among subsets of the Upper Ontologies that we describe in Section 2,
and provides a synthesis of the results of our comparison among them.
2 Description
Cyc
– Status of this description. Validated by L. Lefkowitz, executive director for busi-
ness solutions at Cycorp.
– Home page. http://www.cyc.com/.
– Developers. Cycorp.
– Description. The Cyc Knowledge Base (KB) is a formalised representation of
facts, rules of thumb, and heuristics for reasoning about the objects and events
of everyday life. The KB consists of terms and assertions which relate those terms.
These assertions include both simple ground assertions and rules. The Cyc KB is
divided into thousands of “microtheories” focused on a particular domain of knowl-
edge, a particular level of detail, a particular interval in time, etc.
– History. The Cyc project was founded in 1984 by D. Leant as a lead project in the
Microelectronics and Computer Technology Corporation (MCC). In 1994, Cycorp
was founded to further develop, commercialize, and apply the Cyc technology. Cy-
corp offers a no-cost license to its semantic technologies development toolkit to
the research community (ResearchCyc). Additionally, it has placed the core Cyc
ontology (OpenCyc) into the public domain.
– Dimensions. The Cyc KB (including Cyc’s microtheories) contains more than
300,000 concepts and nearly 3,000,000 assertions (facts and rules), using more
than 15,000 relations.
– Implementation language(s). Cyc is represented in the CycL formal language
(http://www.cyc.com/cycdoc/ref/cycl-syntax.html). The latest release
of Cyc includes an Ontology Exporter that allows to export specified portions of
Cyc to OWL files.
– Modularity. The “microtheory” approach supports modularity.
– Applications. Cyc has been used in the domains of natural language processing,
in particular for the tasks of word sense disambiguation [4] and question answer-
ing [5], of network risk assessment [19], and of representation of terrorism-related
knowledge [6].
– Alignment with WordNet. The last release of Cyc (as well as of OpenCyc and
ResearchCyc) includes links between Cyc concepts and about 12,000 WordNet
synsets.
– Licensing. Cyc is a commercial product, but Cycorp also released OpenCyc (http:
//www.opencyc.org/), the open source version of the Cyc technology, and Re-
searchCyc (http://research.cyc.com/), namely the Full Cyc ontology, but
with a research-only license.
DOLCE (a Descriptive Ontology for Linguistic and Cognitive Engineering)
Sowa’s Ontology
Merging Upper Level Ontologies. Three attempts to merge some of the upper level on-
tologies, thus leading to an “upper-upper level ontology”, are COSMO (COmmon Se-
mantic MOdel, http://colab.cim3.net/cgi-bin/wiki.pl?CosmoWG/TopLe-
vel), MSO (Multi-Source Ontology, http://www.webkb.org/doc/MSO.html), and
the OntoMap Project [11].
COSMO results from the efforts of the COSMO working group (COSMO-WG)
and its parent group, the Ontology and Taxonomy Coordinating Working Group (ON-
TACWG). COSMO is viewed as consisting of a lattice of ontologies which will serve as
a set of basic logically-specified concepts (classes, relations, functions, instances) with
which the meanings of all terms and concepts in domain ontologies can be specified.
The use of a common set of defining concepts will permit accurate interoperability of
knowledge-based systems using the logical relations of their ontologies as the basis for
reasoning in the system. Currently, COSMO integrates concepts from the OpenCyc and
SUMO ontologies, with some classes from DOLCE and BFO. The work on COSMO is
in progress.
MSO is the Multi-Source Ontology of WebKB-2, a knowledge server that permits
Web users to browse and update private knowledge bases on their machines, or alterna-
tively, a large shared knowledge base on the server machine. The ontology of the shared
knowledge base is currently an integration of various top-level ontologies and a lexical
ontology derived from an extension and correction of the noun-related part of Word-
Net 1.7. The semantics of some categories from WordNet has been modified in order
to fix inconsistencies, while the semantics of categories from other sources (e.g. Sowa,
DOLCE) has been kept. Also the work regarding the MSO is still in progress. In partic-
ular, the integration of the SUMO is still far from being complete. This integration links
the SUMO categories to the existing categories of the MSO, adds some structure when
needed, adds equivalent categories the names of which are better suited for knowledge
representation conventions that are “common” in the communities using graph-based
or frame-based notations, and finally translates the axioms from KIF to more intuitive
notations that permit people to more easily understand the meanings of the categories
and their relationships.
Finally, OntoMap was a project with the goal to facilitate the access, understand-
ing, and reuse of such resources. A semantic framework on conceptual level was im-
plemented that was small and easy enough to be learned on-the-fly. Technically, On-
toMap was implemented as a web-site providing access to several upper-level ontolo-
gies and manual mapping between them. OntoMap was similar in spirit to COSMO
and MSO, but only the very top concepts of each of the Upper Ontologies consid-
ered there were aligned. Unfortunately, OntoMap was over 4 years ago, and no main-
tenance was guaranteed to it. The web-portal which was allowing online browsing
is no longer available, but the stand-alone viewer may be downloaded from http:
//www.ontotext.com/projects/OntoMapViewer/install.htm.
3 Comparison
Some partial comparisons exist among subsets of the Upper Ontologies that we have
considered in Section 2. In the next paragraphs, we have summarised them in the most
faithful way. The interested reader should go to the source, always cited, in order to
have a complete picture of the conclusions reached by the comparisons’ authors. The
last paragraph, instead, provides a synthesis of the description we have given in Section
2.
Pease’s comparison of DOLCE and SUMO. In [15] and [16], Pease draws a comparison
between DOLCE and SUMO. His conclusions are that DOLCE has a similar purpose
and business process to SUMO in that it is a free research project for use in both natural
language tasks and inference. DOLCE has been carefully crafted with respect to strong
principles. DOLCE is an “ontology of particulars”; it does have universals (classes and
properties), but the claim is that they are only employed in the service of describing
particulars. In contrast, SUMO could be described as an ontology of both particulars
and universals. It has a hierarchy of properties as well as classes. This is a very im-
portant feature for practical knowledge engineering, as it allows common features like
transitivity to be applied to a set of properties, with an axiom that is written once and
inherited by those properties, rather than having to be rewritten, specific to each prop-
erty. Other differences include DOLCE’s use of a set of meta-properties as a guiding
methodology, as opposed to SUMO’s use and formal definition of such meta-properties
directly in the ontology itself. With respect to SUMO, DOLCE does not include such
items as a hierarchy of process types, physical objects, organisms, units and measures,
and event roles.
MITRE’s comparison of SUMO, Upper Cyc, and DOLCE. In [18], Semy, Pulvermacher
and Obrst compare SUMO, Upper Cyc, and DOLCE according to the existence of an
open license, modularity and evidence of use. We have adopted these criteria inside our
analysis, which thus subsumes Semy, Pulvermacher and Obrst’s one.
1
The forthcoming release of GFO, expected by early 2007, will include some refinements of
the notion of persistence which will make this statement no longer valid.
Grenon’s comparison of DOLCE and BFO. Grenon made a careful comparison be-
tween DOLCE and BFO [7]. The conclusion is that both ontologies contain a category
of endurants and perdurants and an eternalist stance, and that the theory of parthood and
the theory of dependence are similar in the two ontologies. Despite these similarities,
there are also many differences, including:
– DOLCE is methodologically fundamentally conceptualist while BFO is method-
ologically fundamentally realist;
– DOLCE seems to be oriented toward commonsense, and BFO’s naïve realism is
in the same spirit. However, DOLCE distinguishes between abstract and concrete
entities, and it includes agents and intentionality. BFO is deliberately not committed
to these distinctions. In particular, the physical / non-physical endurants distinction
in DOLCE is absent in BFO.
– As already mentioned, DOLCE is intended as an ontology of particulars. BFO is
intended to be an ontology of both universals and particulars.
– In DOLCE, qualities are abstract entities which may not be found in space or time,
and do not have parts. For BFO, the proxies of DOLCE’s qualities (“tropes”) are
located in space and exist at a time in the very same way that the entities in which
they inhere.
Another source of information about the similarities and differences between DOLCE
and BFO is [12], where Masolo, Borgo, Gangemi, Guarino, and Oltramari of the Lab-
oratory For Applied Ontology (LOA) compare DOLCE and BFO (besides the OCHRE
object-centered ontology, [17], that we did not consider in our analysis) by representing
the assertion “A statute of clay exists for a period of time going from t1 to t2 . Between
t2 and t3 , the statue is crashed and so ceases to exists although the clay is still there.” in
both of them.
Our comparison. The description of the 7 Upper Ontologies given in Section 2 is syn-
thesised here in Tables 1 and 2.
Acknowledgments
We want to acknowledge all the researchers that helped in drawing this comparison
with their constructive comments and useful advices. In particular, many thanks go to J.
Euzenat, A. Kiryakov, L. Lefkowitz, F. Loebe, A. Pease, J. Schoening, P. Shvaiko, and
H. Stenzhorn.
We also acknowledge the research projects TIN2006-15265-C06-04 and “Iniziativa
Software” CINI-FINMECCANICA that partially funded this work.
Home page Developers Dimensions Language(s)
http://www. Smith, Grenon,
36 classes related via the
BFO ifomis.org/ Stenzhorn, Spear OWL
is_a relation
bfo (IFOMIS)
About 300,000 concepts,
3,000,000 assertions (facts
http://www.
Cyc Cycorp and rules), 15,000 relations CycL, OWL
cyc.com/
(these numbers include mi-
crotheories)
http://www. Guarino and other First Order
About 100 concepts and
DOLCE loa-cnr.it/ researchers of the Logic, KIF,
100 axioms
DOLCE.html LOA OWL
First Order
http://www.
Logic and
onto-med.de/ The Onto-Med Re- 79 classes, 97 subclass-
GFO KIF (forth-
ontologies/ search Group relations, 67 properties
coming);
gfo.html
OWL
http:
//proton. 300 concepts and 100
PROTON Ontotext Lab, Sirma OWL Lite
semanticweb. properties
org/
First Or-
http://www.
30 classes, 5 relationships, der Modal
Sowa’s jfsowa.com/ Sowa
30 axioms Language,
ontology/
KIF
http://www. 20,000 terms and 60,000
Niles, Pease, and SUO-KIF,
SUMO ontologyportal. axioms (including domain
Menzel OWL
org/ ontologies)
[1] American National Standard. KIF Knowledge Interchange Format – draft pro-
posed American National Standard (dpANS) NCITS.T2/98-004, 1998.
[2] P. Burek, R. Hoehndorf, F. Loebe, J. Visagie, H. Herre, and J. Kelso. A top-level
ontology of functions and its application in the open biomedical ontologies. In
ISMB (Supplement of Bioinformatics), volume 22, pages 66–73, 2006.
[3] N. Casellas, M. Blázquez, A. Kiryakov, P. Casanovas, M. Poblet, and R. Ben-
jamins. OPJK into PROTON: Legal domain ontology integration into an upper-
level ontology. In R. Meersman and et al., editors, Proceedings of the 3rd In-
ternational Workshop on Regulatory Ontologies (WORM 2005), volume 3762 of
Lecture Notes in Computer Science, pages 846–855. Springer, 2005.
[4] J. Curtis, D. Baxter, and J. Cabral. On the application of the Cyc ontology to word
sense disambiguation. In Proceedings of the 19th International Florida Artificial
Intelligence Research Society Conference, pages 652–657, 2006.
[5] J. Curtis, G. Matthews, and D. Baxter. On the effective use of Cyc in a question
answering system. In Proceedings of the IJCAI Workshop on Knowledge and
Reasoning for Answering Questions (KRAQ’05), 2005.
[6] C. Deaton, B. Shepard, C. Klein, C. Mayans, B. Summers, A. Brusseau, and
M. Witbrock. The comprehensive terrorism knowledge base in Cyc. In Pro-
ceedings of the 2005 International Conference on Intelligence Analysis, 2005.
[7] P. Grenon. BFO in a nutshell: A bi-categorial axiomatization of BFO and com-
parison with DOLCE. Technical Report 06/2003, IFOMIS, University of Leipzig,
2003.
[8] P. Grenon, B. Smith, and L. Goldberg. Biodynamic ontology: Applying BFO
in the biomedical domain. In D. M. Pisanelli, editor, Ontologies in Medicine,
volume 102 of Studies in Health Technology and Informatics, pages 20–38. IOS
Press, 2004.
[9] G. Guizzardi, H. Herre, and G. Wagner. On the general ontological foundations of
conceptual modeling. In S. Spaccapietra, S. T. March, and Y. Kambayashi, editors,
Proceedings of the 21st International Conference on Conceptual Modeling (ER
2002), volume 2503 of Lecture Notes in Computer Science, pages 65–78. Springer,
2002.
[10] H. Herre, B. Heller, P. Burek, R. Hoehndorf, F. Loebe, and H. Michalek. General
formal ontology (GFO) – part I: Basic principles. Technical Report 8, Onto-Med,
University of Leipzig, 2006.
[11] A. Kiryakov, K. Ivanov Simov, and M. Dimitrov. OntoMap: portal for upper-
level ontologies. In Proceedings of the 2nd International Conference on Formal
Ontology in Information Systems (FOIS 2001), pages 47–58. ACM, 2001.
[12] C. Masolo, S. Borgo, A. Gangemi, N. Guarino, and A. Oltramari. Ontology library
(final). IST Project 2001-33052 WonderWeb Deliverable D18, 2003.
[13] G. Nagypál and J. Lemcke. A business data ontology. Data, Information and Pro-
cess Integration with Semantic Web Services Project, FP6 Ű 507483, Deliverable
D3.3, 2005.
[14] I. Niles and A. Pease. Towards a standard upper ontology. In C. Welty and
B. Smith, editors, Proceedings of the 2nd International Conference on Formal
Ontology in Information Systems (FOIS-2001), pages 2–9. ACM Press, 2001.
[15] A. Pease. Formal representation of concepts: The Suggested Upper Merged On-
tology and its use in linguistics. In A. C. Schalley and D. Zaefferer, editors, On-
tolinguistics. How Ontological Status Shapes the Linguistic Coding of Concepts.
Mouton de Gruyter, 2006.
[16] A. Pease and C. Fellbaum. Formal ontology as interlingua: The SUMO and Word-
Net linking project and GlobalWordNet. To appear.
[17] L. Schneider. How to build a foundational ontology: The object-centered high-
level reference ontology OCHRE. In A. Günter, R. Kruse, and B. Neumann,
editors, Proceedings of the 26th Annual German Conference on AI, KI 2003:
Advances in Artificial Intelligence, volume 2821 of Lecture Notes in Computer
Science, pages 120–134. Springer, 2003.
[18] S. K. Semy, M. K. Pulvermacher, and L. J. Obrst. Toward the use of an upper on-
tology for U.S. government and U.S. military domains: An evaluation. Technical
Report MTR 04B0000063, The MITRE Corporation, 2004.
[19] B. Shepard, C. Matuszek, C. B. Fraser, W. Wechtenhiser, D. Crabbe, Z. Güngördü,
J. Jantos, T. Hughes, L. Lefkowitz, M. J. Witbrock, D. B. Lenat, and E. Larson.
A knowledge-based approach to network security: Applying Cyc in the domain
of network risk assessment. In M. M. Veloso and S. Kambhampati, editors, Pro-
ceedings of the 20th National Conference on Artificial Intelligence and the 17th
Innovative Applications of Artificial Intelligence Conference, pages 1563–1568.
AAAI Press AAAI Press / The MIT Press, 2005.
[20] J. F. Sowa. In Knowledge Representation: Logical, Philosophical, and Computa-
tional Foundations. Brooks Cole Publishing, 1999.
[21] W3C. OWL Web Ontology Language Overview – W3C Recommendation 10
February 2004 , 2004.
[22] Wikipedia. Upper ontology – wikipedia, the free encyclopedia, 2006. [Online;
accessed 15-December-2006].