SlideShare a Scribd company logo
Knowledge Graphs vs. Property
Graphs — Similarities, Differences
and Some Guidance on Capabilities
TOPQUADRANT COMPANY
FOUNDATION
▪ TopQuadrant was founded in 2001
▪ Strong commitment to standards-based approaches to data semantics
MISSION
▪ Empower people and drive results — by making enterprise information meaningful
FOCUS
▪ Provide comprehensive data governance solutions using knowledge graph technologies
TOPQUADRANT COMPANY
© Copyright 2020 TopQuadrant Inc. Slide 3
I. A brief overview of:
– Property Graphs
– Knowledge Graphs
II. Differences in Terminology and Capabilities of these models
III. Strengths and Limitations of each approach
IV. Why Knowledge Graphs provide a strong foundation for data
centric enterprise applications
– E.g. enterprise data governance and metadata management
▪ Q & A
Today’s Agenda
Irene Polikoff
© Copyright 2020 TopQuadrant Inc. Slide 4
Some Definitions of a Knowledge Graph
▪ A knowledge graph (i) mainly describes real world entities and their
interrelations, organized in a graph, (ii) defines possible classes and relations of
entities in a schema, (iii) allows for potentially interrelating arbitrary entities
with each other and (iv) covers various topical domains.
H. Paulheim. Knowledge Graph Refinement: A Survey of Approaches and Evaluation Methods.
Semantic Web Journal, 1–20, 2016
▪ A knowledge graph acquires and integrates information into an ontology
and applies a reasoner to derive new knowledge.
Lisa Ehrlinger and Wolfram Wöß, Towards a Definition of Knowledge Graphs,
Institute for Application Oriented Knowledge Processing at SEMANTiCS 2016, http://ceur-ws.org/Vol-1695/paper4.pdf
▪ A knowledge graph is a knowledge base that uses a graph-structured data
model or topology to integrate knowledge and data. Knowledge graphs are often
used to store interlinked descriptions of entities — real-world objects, events,
situations or abstract concepts, including their semantics.
Wikipedia
© Copyright 2020 TopQuadrant Inc. Slide 5
TopQuadrant: What Are Knowledge Graphs?
▪ A Knowledge Graph represents a knowledge domain
– With “Active Models” that can be consulted at run-time
▪ It represents all information as a graph
– A network of nodes and links
– Not tables of rows and columns
▪ It represents facts (data) and models (metadata) in the
same way – as part of a graph
– Rich rules and inferencing
▪ It is based on open standards, from top to bottom
– Readily connects to knowledge in private and public clouds
A knowledge graph, with its Active Models, grows and evolves over time.
© Copyright 2020 TopQuadrant Inc. Slide 6
Knowledge Graph Recipe
Vehicle
Car
Organization
Year
Design/Model
brand
type
type type
type
BMW X2
BMW
2019
BMW
228
Ontology
+ Instance Data
----------------------------
Knowledge Graph
*Blueprint *Actuals
An Ontology is a data model (schema) with classes, properties, and rules - a precise blueprint for data
© Copyright 2020 TopQuadrant Inc. Slide 7
Property Graphs
aka Labeled Property Graphs
▪ Caveat: There is no standard property graph data model
– Each implementation of a Property Graph is, thus, somewhat different
– We will focus on the common aspects
▪ In general, the Property Graphs data model consists of three elements:
– Nodes – entities in a graph
– Edges – direct links aka relationships between nodes
– Properties – key value pairs that can be associated with either nodes or edges
© Copyright 2020 TopQuadrant Inc. Slide 8
Property Graph by Example
© Copyright 2020 TopQuadrant Inc. Slide 9
Property Graph by Example
© Copyright 2020 TopQuadrant Inc. Slide 10
Property Graph by Example
© Copyright 2020 TopQuadrant Inc. Slide 11
RDF Knowledge Graph by Example
56853
http://www.wikidata.org/entity/Q2263
wikidata:
© Copyright 2020 TopQuadrant Inc. Slide 12
RDF Knowledge Graph by Example
56853
© Copyright 2020 TopQuadrant Inc. Slide 13
RDF Knowledge Graph by Example
© Copyright 2020 TopQuadrant Inc. Slide 14
Property Graph vs RDF Knowledge Graph
Property Graph Knowledge Graph
IDs are internal to a graph database, user has no control
over them.
IDs are global – URIs, meant to be under users control to
enable combining different graphs
Properties are literal values. They are fundamentally
different from nodes and relationships.
Canonical structure. Everything is stored as nodes and
links connecting them. A literal value is a node like any
other. Property is any link – to a resource or a literal.
© Copyright 2020 TopQuadrant Inc. Slide 15
“Schema” as part of a Knowledge Graph
© Copyright 2020 TopQuadrant Inc. Slide 16
“Schema” as part of a Knowledge Graph
© Copyright 2020 TopQuadrant Inc. Slide 17
More “Schema”
sh:maxCount
1
© Copyright 2020 TopQuadrant Inc. Slide 18
Standards-based Ontologies and Vocabularies are
Knowledge Graphs Increasingly Used across Industries
Key enablers of transparency, interoperability
and the emerging “cognitive enterprise”
These ontologies uniquely identify and describe things and
relationships in healthcare, web search services,
manufacturing, financial services, …
*
© Copyright 2020 TopQuadrant Inc. Slide 19
Property Graph vs RDF Knowledge Graph
Property Graph Knowledge Graph
IDs are internal to a graph database, user has no
control over them.
IDs are global – URIs, meant to be under user’s control to
enable combining different graphs.
Properties are literal values. They are fundamentally
different from nodes and relationships.
Canonical structure. Everything is stored as nodes and
links connecting them. A literal value is a node like any
other. Property is any link - to a resource or a literal.
Schema (semantics of data) is not a part of the graph. Rich schemas, including rules are a part of the graph.
© Copyright 2020 TopQuadrant Inc. Slide 20
RDF Knowledge Graphs:
How to add information to triples
© Copyright 2020 TopQuadrant Inc. Slide 21
RDF Knowledge Graphs:
How to add information to triples
© Copyright 2020 TopQuadrant Inc. Slide 22
Property Graph vs RDF Knowledge Graph
Property Graph Knowledge Graph
IDs are internal to a graph database, user has no control
over them.
IDs are global – URIs, meant to be under user’s control to
enable combining different graphs.
Properties are literal values. They are fundamentally
different from nodes and relationships.
Canonical structure. Everything is stored as nodes and links
connecting them. A literal value is a node like any other.
Property is any link - to a resource or a literal.
Schema (semantics of data) is not a part of the graph. Rich schemas, including rules are a part of the graph.
Each relationship (link or arch) uniquely identifies “node
– link – node” combination. Relationships can be
annotated with additional facts, but properties can’t be.
IDs of properties (links) are re-used. Thus, they do not
uniquely identify a “node – link - node” combination that
uses it. There is a way to give these triples identity. Any
triple can be annotated with additional facts.
© Copyright 2020 TopQuadrant Inc. Slide 23
Property Graphs:
How to add information to a “property” value
Can we say what year is the population for?
© Copyright 2020 TopQuadrant Inc. Slide 24
type: POPULATION_SIZE type: POPULATION_SIZE
label: Population
label: Population
131 132“Size” 58811
“Year” 2018
“Size” 56853
“Year” 2010
Property Graphs:
How to add information to a “property” value
Can’t be done. Must turn a “property” into a relationship
© Copyright 2020 TopQuadrant Inc. Slide 25
RDF Knowledge Graph: How to add
information to a triple with a literal object
Or, as it more commonly said, “to a property with a literal value”
ex:year
2010
ex:100058811 56853
© Copyright 2020 TopQuadrant Inc. Slide 26
Property Graph: How to use a node as a
“property” value
Let’s say we want to capture information
about Ben Bradlee so we want to turn
the value of “role” into a node.
© Copyright 2020 TopQuadrant Inc. Slide 27
Property Graph: How to use a node as a
“property” value
Must remove the edge
annotation and create a
relationship to a node
labeled Person.
© Copyright 2020 TopQuadrant Inc. Slide 28
RDF Knowledge Graph: How to turn a
literal value into a resource reference
© Copyright 2020 TopQuadrant Inc. Slide 29
RDF Knowledge Graph: How to turn a
literal value into a resource reference
© Copyright 2020 TopQuadrant Inc. Slide 30
Property Graph vs RDF Knowledge Graph
Property Graph Knowledge Graph
IDs are internal to a graph database, user has no control
over them.
IDs are global – URIs, meant to be under user’s control to
enable combining different graphs.
Properties are literal values. They are fundamentally
different from nodes and relationships.
Canonical structure. Everything is stored as nodes and links
connecting them. A literal value is a node like any other.
Property is any link - to a resource or a literal.
Schema (semantics of data) is not a part of the graph. Rich schemas, including rules are a part of the graph.
Each relationship (link or arch) uniquely identifies “node –
link – node” combination. Relationships can be annotated
with additional facts, but properties can’t be.
IDs of properties (links) are re-used. Thus, they do not uniquely
identify a “node – link - node” combination that uses it. There is
a way to give these triples identity. Any triple can be annotated
with additional facts.
Changes in the graph design require restructure/re-load of
the data and changes to all impacted queries.
Graphs can evolve and changes in the design can often be
done with minimal impact on existing data and queries.
Product-specific query languages, variants of Cypher,
increasingly, GraphQL support, introspection not integrated
Query standard – SPARQL. Increasingly, GraphQL support. In
EDG: introspection and auto-generation of GraphQL Schemas.
© Copyright 2020 TopQuadrant Inc. Slide 31
wikidata:Q2263
rdf:type entont:Actor ;
rdf:type entont:Director ;
entont:actedIn entertainment_data:ThePost
[[ entont:role “Ben Bradlee” ]] ;
schema:birthDate "1956-07-09"^^xsd:date ;
entont:directed entertainment_data:ALeagueOfTheirOwn ;
schema:familyName "Tom" @en;
schema:givenName "Hanks" @en;
rdfs:label "Tom Hanks" @en; .
Example of RDF Serialization
@prefix entertainment_data: <http://example.org/data-graphs/Entertainment_Data#> .
@prefix entont: <http://example.org/ontologies/Entertainment#> .
@prefix wikidata: <http://www.wikidata.org/entity/> .
@prefix schema: <http://schema.org/> .
@prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .
@prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> .
@prefix xsd: <http://www.w3.org/2001/XMLSchema#> .
Subject Node
Object NodeEdge
© Copyright 2020 TopQuadrant Inc. Slide 32
Property Graph vs RDF Knowledge Graph
Property Graph Knowledge Graph
IDs are internal to a graph database, user has no control
over them.
IDs are global – URIs, meant to be under user’s control to
enable combining different graphs.
Properties are literal values. They are fundamentally
different from nodes and relationships.
Canonical structure. Everything is stored as nodes and links
connecting them. A literal value is a node like any other.
Property is any link - to a resource or a literal.
Schema (semantics of data) is not a part of the graph. Rich schemas, including rules are a part of the graph.
Each relationship (link or arch) uniquely identifies “node –
link – node” combination. Relationships can be annotated
with additional facts, but properties can’t be.
IDs of properties (links) are re-used. Thus, they do not uniquely
identify a “node – link - node” combination that uses it. There is
a way to give these triples identity. Any triple can be annotated
with additional facts.
Changes in the graph design require restructure/re-load of
the data and changes to all impacted queries.
Graphs can evolve organically. Changes in the design can often
be accomplished with minimal impact on existing data and
queries.
Product-specific query languages, variants of Cypher,
increasingly, GraphQL support, introspection not integrated
Query standard – SPARQL. Increasingly, GraphQL support. In
EDG: introspection and auto-generation of GraphQL Schemas.
No standard serialization for export.
Standard serializations supported by all products –
RDF/XML, Turtle, N3 and JSON-LD formats.
© Copyright 2020 TopQuadrant Inc. Slide 33
Knowledge Graphs are Self-Organizing –
they can be composed
Data
Element
Dataset Data
Center
Location
element of located in
KG1 KG3
located in
CMS
CMS
Cloud
Service
hosted by
KG2
App.
Server
VM
resourcedrunning on
stored in
© Copyright 2020 TopQuadrant Inc. Slide 34
Knowledge Graphs are Self-Organizing –
they can be composed
With consistent Identifiers for both Nodes and Links,
Graph queries and rules can give insights and make new knowledge.
Data
Element
Dataset
Data
Center
Location
element of located in
KG1 KG3
located in
CMSCMS Cloud
Service
hosted by
KG2
App.
Server
VM
resourcedrunning onstored in regioned in
© Copyright 2020 TopQuadrant Inc. Slide 35
Property Graph vs RDF Knowledge Graph
Property Graph Knowledge Graph
IDs are internal to a graph database, user has no control over them. IDs are global – URIs, meant to be under user’s control to enable
combining different graphs.
Properties are literal values. They are fundamentally different from
nodes and relationships.
Canonical structure. Everything is stored as nodes and links connecting
them. A literal value is a node as any other. Property is any link - to a
resource or a literal.
Schema (semantics of data) is not a part of the graph. Rich schemas, including rules are a part of the graph.
Each relationship (link or arch) uniquely identifies “node – link –
node” combination. Relationships can be annotated with additional
facts, but properties can’t be.
IDs of properties (links) are re-used. Thus, they do not uniquely identify
a “node – link - node” combination that uses it. There is a way to give
these triples identity. Any triple can be annotated with additional facts.
Changes in the graph design require restructure/re-load of the data
and changes to all impacted queries.
Graphs can evolve organically. Changes in the design can often be
accomplished with minimal impact on existing data and queries.
Product-specific query languages, variants of Cypher, increasingly,
GraphQL support, introspection not integrated.
Query standard – SPARQL. Increasingly, GraphQL support. In EDG:
introspection and auto-generation of GraphQL Schemas.
No standard serialization for export. Standard serializations supported by all products – RDF/XML, Turtle, N3
and JSON-LD formats.
No partitioning of graphs nor merging or connectivity
across graphs.
Named graphs as a partitioning approach. Graphs are “self-
composing”, can be connected and merged.
© Copyright 2019 TopQuadrant Inc. Slide 36
Knowledge Graphs support the move to
Data Centric
Systems and Architectures
© Copyright 2020 TopQuadrant Inc. Slide 37
Data Centric vs Application Centric
Application Centric
Data is a second-class citizen, and the
application is the main thing
Data models are not self describing, can’t be
understood without understanding
application’s logic
All business rules and semantics are hard
coded in the applications
Data models are very rigid
Changing application’s data model is
expensive – requires changes to the
application code due to tight coupling
One data model per application
Data sharing is an afterthought
Data Centric
Data is the main asset, applications are
“model-driven”
Data is self-describing and can stand on its
own
Data can be “active” with accessible
semantics, business rules and even
executable code
Data models are flexible and evolvable
Changing data models is inexpensive and may
not require application changes
Multiple applications may share some parts of
a data model, extending it as needed
Data sharing is “built in”
© Copyright 2020 TopQuadrant Inc. Slide 38
Why Self Describing Data is Important?
Open Data Example: Road Safety - Vehicles by Make and Model
We see the numbers, but what do they mean?
Data is only as good as its metadata
© Copyright 2020 TopQuadrant Inc. Slide 39
Metadata adds Context & Definition –
Making Data sharable and increasing its Value
39
First Name Last Name Company City
Year
Purchased
Joe Smith Komputers R Us New York 1970
Mary Jones The Lord’s Store London 1999
Proful Bishwal The Lady’s Store Mumbai 1998
Ming Lee My Favorite Store Beijing 2001
Customer Definition
Last Name represents the surname or family name of
an individual.
Business Rules
In the Chinese market, family name is listed first in
salutations.
Format VARCHAR(30)
Abbreviation LNAME
Required YES
More …
Numerous technical & business metadata including
security, privacy, permissible values, etc.
Is this the city where the customer lives
or where the store is located?
• If metadata is not readily available, data can’t be understood.
• If metadata is kept separate from data, applications can’t access it.
• Further, data will change and metadata will get stale.
• If metadata is captured in a proprietary way, it is not open.
Key Facts
© Copyright 2017 TopQuadrant Inc. Slide 40
TopBraid EDG – Composition of Knowledge Graphs
to Support Adaptive Data Governance
Business Logic
UX/UI
Access Control
Compose UI
Behavior,
Queries,
Rules and
Constraints
Enterprise
Assets
Technical
Assets
Big Data
Assets
Data Assets
Glossaries
Terminologies
Governance
Model
DataSchema
Compose
Data and
Schema
Over 300 of pre-defined Asset Types
© Copyright 2020 TopQuadrant Inc. Slide 41
How Applications can be Model Driven –
an example from TopBraid EDG
Models drive the User Experience (UX) i.e., what fields will appear on an (edit, search or browse) form,
possible values, order of the fields, their division into sections and other relevant information.
edg:FileAsset
rdf:type edg:AssetClass ;
dash:abstract "false"^^xsd:boolean ;
rdfs:label "File Asset" ;
rdfs:subClassOf edg:AccessControllable ;
rdfs:subClassOf edg:ComplianceAspect ;
rdfs:subClassOf edg:Identifiable ;
⋮ ⋮
sh:property edg:File-formatType ;
sh:property edg:File-name ;
sh:property edg:File-residesIn ;
sh:property edg:File-sourceLink ;
.
edg:File-formatType
rdf:type sh:PropertyShape ;
sh:path edg:format ;
sh: description “Specifies the format of a file, or dataset, using an enumeration of values”
sh:class edg:FormatType ;
.
edg:File-name
rdf:type sh:PropertyShape ;
sh:path edg:name ;
sh:datatype xsd:string ;
.
edg:File-residesIn
rdf:type sh:PropertyShape ;
sh:path edg:residesIn ;
sh:class edg:FileSystem ;.
An active model (ontology)
describing a File Asset as it is
captured in a Knowledge Graph –
open to access at run time, sharable,
making data self describing – for
people and machines
A form for capturing data about files – if a
model changes, a form automatically
adjusts.
The same form can display information
about other assets – whatever the model
says, it will display and will let you edit
according to the definitions in the model
We know the meaning of each data value
because it is described in the ontology that
is as accessible as the data
© Copyright 2020 TopQuadrant Inc. Slide 42
Some Limitations of Property Graphs
You are Likely to Face
▪ No Capture of Schema in a Graph
▪ Minimal Support for Validation and Data Integrity
▪ No Capture of Rich Rules in a Graph
▪ No Support for Inheritance and Inference in a Graph
▪ No Globally Unique Identifiers
▪ No Resolvable Identifiers
▪ No Inherent Connectivity Across Graphs
▪ Difficulties with Graph Evolvability
© Copyright 2020 TopQuadrant Inc. Slide 43
Steps for Transitioning from Property
Graphs to a Knowledge Graph
1. Get your data out – “as-is”
– It is fairly easy to generate one of the RDF standard serializations from a property
graph. In fact, Neo4J offers a library for doing this
2. Create a model for your data
– You will not be able to get the semantics of the data; this is due to the fact that
the data model only exists in your initial design sketches and, partially, within
Cypher queries and programs.
– TopBraid EDG can introspect the data to help you generate the initial model
© Copyright 2020 TopQuadrant Inc. Slide 44
Steps for Transitioning from Property
Graphs to a Knowledge Graph
3. Iterate
– The initial model and data is likely to benefit from some adjustment as the
structure of the graph data may be influenced by the specific limitations of the
property graph data model and optimizations that were required due to its
architecture
4. Migrate application code and queries
– Take advantage of the data validation, reasoning and model-based introspection
capabilities of the knowledge graphs – you will end up with smaller and more
maintainable code base
– If you already use GraphQL, this will likely simplify the migration
© Copyright 2020 TopQuadrant Inc. Slide 45
Summary of Key Take Away Points
▪ Increasingly graphs are seen as an excellent choice for data / metadata representation
and management - for the data centric enterprise
– Flexibility is one strong driver: heterogeneous data, integrating new data sources, analytics, and metadata
management and governance all require flexibility and evolvability
▪ The two main graph data models are: Property Graphs and Knowledge (RDF) Graphs
▪ Property Graphs are offered in mature implementations (e.g. Neo4J), that are easy to get started with
– but there are fundamental limitations that are not addressed in the design of property graphs
– E.g., Property Graphs are not self-describing; the meaning of the data they store is not a part of a graph
▪ Knowledge Graphs support the move to Data Centric systems and architectures
– by offering robust capabilities that support flexibility and evolvability
– E.g., Knowledge Graphs capture not only data, but the meaning or semantics of data,
including rich constraints and highly expressive rules
– In the past, implementations of Knowledge Graphs were considered to be more academic, but this has
changed with products like TopBraid EDG
▪ It is possible to move from a Property Graph to a Knowledge Graph in logical steps
– Get your data out – “as-is”; Create a model for your data; iterate and optimize …
© Copyright 2020 TopQuadrant Inc. Slide 46
… Questions?
Thank You !
© Copyright 2020 TopQuadrant Inc. Slide 47
Benefits of a Knowledge Graph based
Platform for Data Governance 2.0
As an enterprise knowledge graph infrastructure, TopBraid EDG supports
Data Governance 2.0 and applications of AI / ML
TopBraid Enterprise Data Governance (EDG):
▪ Is flexible and extensible, based on standards
▪ Integrates reasoning and machine learning
▪ Enables people (UI) and software (APIs/web
services) to view, follow and query
▪ Bridges data and metadata “silos” for a
seamless data governance
▪ Delivers Knowledge-driven data governance
© Copyright 2020 TopQuadrant Inc. Slide 48
To Learn More about TopBraid EDG and Knowledge Graphs:
EDG Product Info:
▪ TopBraid Enterprise Data Governance (TopBraid EDG)
(https://www.topquadrant.com/products/topbraid-enterprise-data-governance/)
Contact us: at info@topquadrant.com to:
▪ Discuss data governance and knowledge graphs
▪ Request a more targeted demo of TopBraid EDG
▪ Ask for a free EDG evaluation account
© Copyright 2020 TopQuadrant Inc. Slide 49
To Learn More about TopBraid EDG and Knowledge Graphs:
EDG Product Info:
▪ TopBraid Enterprise Data Governance (TopBraid EDG)
(https://www.topquadrant.com/products/topbraid-enterprise-data-governance/)
What Are Knowledge Graphs?
A Knowledge Graph turns Data into Knowledge
(https://www.topquadrant.com/technology/knowledge-graphs/)
Contact us: at info@topquadrant.com to:
▪ Discuss data governance and knowledge graphs
▪ Request a more targeted demo of TopBraid EDG
▪ Ask for a free EDG evaluation account
© Copyright 2020 TopQuadrant Inc. Slide 50© Copyright 2020 TopQuadrant Inc. Slide 50
Latest Whitepaper
Download a copy on our website at:
topquadrant.com/knowledge-assets/whitepapers/
© Copyright 2019 TopQuadrant Inc. Slide 51
More Resources ...
More Webinar Recordings, Slides, Q&A:
▪ https://www.topquadrant.com/knowledge-assets/topquadrant-
webinars/
Short Videos:
▪ TopBraid EDG “Quick Grok” Videos
https://www.topquadrant.com/knowledge-assets/videos/
▪ TopBraid EDG Animated Video
https://www.topquadrant.com/project/edg_agile_modular/
Blog:
▪ https://www.topquadrant.com/the-semantic-ecosystems-journal/
Data Governance White Papers
▪ https://www.topquadrant.com/knowledge-assets/whitepapers/

More Related Content

What's hot (20)

PDF
Property graph vs. RDF Triplestore comparison in 2020
Ontotext
 
PDF
Introduction to Knowledge Graphs: Data Summit 2020
Enterprise Knowledge
 
PDF
Introduction to Knowledge Graphs
mukuljoshi
 
PDF
Workshop - Neo4j Graph Data Science
Neo4j
 
PDF
The perfect couple: Uniting Large Language Models and Knowledge Graphs for En...
Neo4j
 
PDF
Data Catalog for Better Data Discovery and Governance
Denodo
 
PDF
Data Mesh Part 4 Monolith to Mesh
Jeffrey T. Pollock
 
PPTX
Deep dive into LangChain integration with Neo4j.pptx
TomazBratanic1
 
PDF
Time to Talk about Data Mesh
LibbySchulze
 
PDF
Knowledge Graphs & Graph Data Science, More Context, Better Predictions - Neo...
Neo4j
 
PPTX
Neo4j GraphSummit London - The Path To Success With Graph Database and Data S...
Neo4j
 
PDF
Intro to Graphs and Neo4j
jexp
 
PDF
Data Mesh in Practice - How Europe's Leading Online Platform for Fashion Goes...
Dr. Arif Wider
 
PDF
Knowledge Graphs - The Power of Graph-Based Search
Neo4j
 
PDF
Graphs for Finance - AML with Neo4j Graph Data Science
Neo4j
 
PPTX
Databricks Platform.pptx
Alex Ivy
 
PDF
stackconf 2022: Introduction to Vector Search with Weaviate
NETWAYS
 
PDF
Webinar Data Mesh - Part 3
Jeffrey T. Pollock
 
PDF
Building End-to-End Delta Pipelines on GCP
Databricks
 
PDF
Getting Started with Knowledge Graphs
Peter Haase
 
Property graph vs. RDF Triplestore comparison in 2020
Ontotext
 
Introduction to Knowledge Graphs: Data Summit 2020
Enterprise Knowledge
 
Introduction to Knowledge Graphs
mukuljoshi
 
Workshop - Neo4j Graph Data Science
Neo4j
 
The perfect couple: Uniting Large Language Models and Knowledge Graphs for En...
Neo4j
 
Data Catalog for Better Data Discovery and Governance
Denodo
 
Data Mesh Part 4 Monolith to Mesh
Jeffrey T. Pollock
 
Deep dive into LangChain integration with Neo4j.pptx
TomazBratanic1
 
Time to Talk about Data Mesh
LibbySchulze
 
Knowledge Graphs & Graph Data Science, More Context, Better Predictions - Neo...
Neo4j
 
Neo4j GraphSummit London - The Path To Success With Graph Database and Data S...
Neo4j
 
Intro to Graphs and Neo4j
jexp
 
Data Mesh in Practice - How Europe's Leading Online Platform for Fashion Goes...
Dr. Arif Wider
 
Knowledge Graphs - The Power of Graph-Based Search
Neo4j
 
Graphs for Finance - AML with Neo4j Graph Data Science
Neo4j
 
Databricks Platform.pptx
Alex Ivy
 
stackconf 2022: Introduction to Vector Search with Weaviate
NETWAYS
 
Webinar Data Mesh - Part 3
Jeffrey T. Pollock
 
Building End-to-End Delta Pipelines on GCP
Databricks
 
Getting Started with Knowledge Graphs
Peter Haase
 

Similar to Slides: Knowledge Graphs vs. Property Graphs (20)

PPTX
AI, Knowledge Representation and Graph Databases -
 Key Trends in Data Science
Optum
 
PDF
Introduction_to_knowledge_graph.pdf
JaberRad1
 
PDF
En un mundo hiperconectado, las bases de datos de grafos son tu arma secreta
javier ramirez
 
PDF
ESWC 2017 Tutorial Knowledge Graphs
Peter Haase
 
PDF
A Graph is a Graph is a Graph: Equivalence, Transformation, and Composition o...
Joshua Shinavier
 
ODP
Graph Data -- RDF and Property Graphs
andyseaborne
 
PDF
Introduction to Knowledge Graphs for Information Architects.pdf
Heather Hedden
 
PDF
Two graph data models : RDF and Property Graphs
andyseaborne
 
PDF
2016-02 Graphs - PG+RDF
andyseaborne
 
PPTX
The Semantic Knowledge Graph
Trey Grainger
 
PDF
Enterprise Knowledge Graphs - Data Summit 2024
Enterprise Knowledge
 
PDF
Introduction to Graph Databases
Paolo Pareti
 
PDF
Knowledge Graphs and Their Application.pdf
Kabul Kurniawan
 
PDF
Semantic Web and It's Application - Kabul Kurniawan
Kabul Kurniawan
 
PDF
GraphConnect 2014 SF: From Zero to Graph in 120: Model
Neo4j
 
PDF
Distributed graph processing
Bartosz Konieczny
 
PDF
A Brief Introduction to Knowledge Graphs
Heather Hedden
 
PDF
Ten myths about knowledge graphs.pdf
Elena Simperl
 
PPTX
Graph_Database_Prepared_by_Ali_Rajab.pptx
removed_d60a659b5ed8cd7a5e0931e92447dc14
 
AI, Knowledge Representation and Graph Databases -
 Key Trends in Data Science
Optum
 
Introduction_to_knowledge_graph.pdf
JaberRad1
 
En un mundo hiperconectado, las bases de datos de grafos son tu arma secreta
javier ramirez
 
ESWC 2017 Tutorial Knowledge Graphs
Peter Haase
 
A Graph is a Graph is a Graph: Equivalence, Transformation, and Composition o...
Joshua Shinavier
 
Graph Data -- RDF and Property Graphs
andyseaborne
 
Introduction to Knowledge Graphs for Information Architects.pdf
Heather Hedden
 
Two graph data models : RDF and Property Graphs
andyseaborne
 
2016-02 Graphs - PG+RDF
andyseaborne
 
The Semantic Knowledge Graph
Trey Grainger
 
Enterprise Knowledge Graphs - Data Summit 2024
Enterprise Knowledge
 
Introduction to Graph Databases
Paolo Pareti
 
Knowledge Graphs and Their Application.pdf
Kabul Kurniawan
 
Semantic Web and It's Application - Kabul Kurniawan
Kabul Kurniawan
 
GraphConnect 2014 SF: From Zero to Graph in 120: Model
Neo4j
 
Distributed graph processing
Bartosz Konieczny
 
A Brief Introduction to Knowledge Graphs
Heather Hedden
 
Ten myths about knowledge graphs.pdf
Elena Simperl
 
Graph_Database_Prepared_by_Ali_Rajab.pptx
removed_d60a659b5ed8cd7a5e0931e92447dc14
 
Ad

More from DATAVERSITY (20)

PDF
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
DATAVERSITY
 
PDF
Data at the Speed of Business with Data Mastering and Governance
DATAVERSITY
 
PDF
Exploring Levels of Data Literacy
DATAVERSITY
 
PDF
Building a Data Strategy – Practical Steps for Aligning with Business Goals
DATAVERSITY
 
PDF
Make Data Work for You
DATAVERSITY
 
PDF
Data Catalogs Are the Answer – What is the Question?
DATAVERSITY
 
PDF
Data Catalogs Are the Answer – What Is the Question?
DATAVERSITY
 
PDF
Data Modeling Fundamentals
DATAVERSITY
 
PDF
Showing ROI for Your Analytic Project
DATAVERSITY
 
PDF
How a Semantic Layer Makes Data Mesh Work at Scale
DATAVERSITY
 
PDF
Is Enterprise Data Literacy Possible?
DATAVERSITY
 
PDF
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
DATAVERSITY
 
PDF
Emerging Trends in Data Architecture – What’s the Next Big Thing?
DATAVERSITY
 
PDF
Data Governance Trends - A Look Backwards and Forwards
DATAVERSITY
 
PDF
Data Governance Trends and Best Practices To Implement Today
DATAVERSITY
 
PDF
2023 Trends in Enterprise Analytics
DATAVERSITY
 
PDF
Data Strategy Best Practices
DATAVERSITY
 
PDF
Who Should Own Data Governance – IT or Business?
DATAVERSITY
 
PDF
Data Management Best Practices
DATAVERSITY
 
PDF
MLOps – Applying DevOps to Competitive Advantage
DATAVERSITY
 
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
DATAVERSITY
 
Data at the Speed of Business with Data Mastering and Governance
DATAVERSITY
 
Exploring Levels of Data Literacy
DATAVERSITY
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
DATAVERSITY
 
Make Data Work for You
DATAVERSITY
 
Data Catalogs Are the Answer – What is the Question?
DATAVERSITY
 
Data Catalogs Are the Answer – What Is the Question?
DATAVERSITY
 
Data Modeling Fundamentals
DATAVERSITY
 
Showing ROI for Your Analytic Project
DATAVERSITY
 
How a Semantic Layer Makes Data Mesh Work at Scale
DATAVERSITY
 
Is Enterprise Data Literacy Possible?
DATAVERSITY
 
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
DATAVERSITY
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
DATAVERSITY
 
Data Governance Trends - A Look Backwards and Forwards
DATAVERSITY
 
Data Governance Trends and Best Practices To Implement Today
DATAVERSITY
 
2023 Trends in Enterprise Analytics
DATAVERSITY
 
Data Strategy Best Practices
DATAVERSITY
 
Who Should Own Data Governance – IT or Business?
DATAVERSITY
 
Data Management Best Practices
DATAVERSITY
 
MLOps – Applying DevOps to Competitive Advantage
DATAVERSITY
 
Ad

Recently uploaded (20)

PDF
InformaticsPractices-MS - Google Docs.pdf
seshuashwin0829
 
PPT
Growth of Public Expendituuure_55423.ppt
NavyaDeora
 
PDF
OOPs with Java_unit2.pdf. sarthak bookkk
Sarthak964187
 
PPTX
03_Ariane BERCKMOES_Ethias.pptx_AIBarometer_release_event
FinTech Belgium
 
PPTX
BinarySearchTree in datastructures in detail
kichokuttu
 
PDF
Technical-Report-GPS_GIS_RS-for-MSF-finalv2.pdf
KPycho
 
PPTX
What Is Data Integration and Transformation?
subhashenia
 
PPTX
Listify-Intelligent-Voice-to-Catalog-Agent.pptx
nareshkottees
 
PPTX
b6057ea5-8e8c-4415-90c0-ed8e9666ffcd.pptx
Anees487379
 
PDF
apidays Singapore 2025 - Surviving an interconnected world with API governanc...
apidays
 
PDF
The European Business Wallet: Why It Matters and How It Powers the EUDI Ecosy...
Lal Chandran
 
PPTX
big data eco system fundamentals of data science
arivukarasi
 
PPTX
apidays Singapore 2025 - The Quest for the Greenest LLM , Jean Philippe Ehre...
apidays
 
PPTX
apidays Helsinki & North 2025 - From Chaos to Clarity: Designing (AI-Ready) A...
apidays
 
PPTX
Powerful Uses of Data Analytics You Should Know
subhashenia
 
PPT
tuberculosiship-2106031cyyfuftufufufivifviviv
AkshaiRam
 
PDF
apidays Singapore 2025 - Streaming Lakehouse with Kafka, Flink and Iceberg by...
apidays
 
PPTX
SlideEgg_501298-Agentic AI.pptx agentic ai
530BYManoj
 
PDF
Business implication of Artificial Intelligence.pdf
VishalChugh12
 
PPTX
apidays Helsinki & North 2025 - API access control strategies beyond JWT bear...
apidays
 
InformaticsPractices-MS - Google Docs.pdf
seshuashwin0829
 
Growth of Public Expendituuure_55423.ppt
NavyaDeora
 
OOPs with Java_unit2.pdf. sarthak bookkk
Sarthak964187
 
03_Ariane BERCKMOES_Ethias.pptx_AIBarometer_release_event
FinTech Belgium
 
BinarySearchTree in datastructures in detail
kichokuttu
 
Technical-Report-GPS_GIS_RS-for-MSF-finalv2.pdf
KPycho
 
What Is Data Integration and Transformation?
subhashenia
 
Listify-Intelligent-Voice-to-Catalog-Agent.pptx
nareshkottees
 
b6057ea5-8e8c-4415-90c0-ed8e9666ffcd.pptx
Anees487379
 
apidays Singapore 2025 - Surviving an interconnected world with API governanc...
apidays
 
The European Business Wallet: Why It Matters and How It Powers the EUDI Ecosy...
Lal Chandran
 
big data eco system fundamentals of data science
arivukarasi
 
apidays Singapore 2025 - The Quest for the Greenest LLM , Jean Philippe Ehre...
apidays
 
apidays Helsinki & North 2025 - From Chaos to Clarity: Designing (AI-Ready) A...
apidays
 
Powerful Uses of Data Analytics You Should Know
subhashenia
 
tuberculosiship-2106031cyyfuftufufufivifviviv
AkshaiRam
 
apidays Singapore 2025 - Streaming Lakehouse with Kafka, Flink and Iceberg by...
apidays
 
SlideEgg_501298-Agentic AI.pptx agentic ai
530BYManoj
 
Business implication of Artificial Intelligence.pdf
VishalChugh12
 
apidays Helsinki & North 2025 - API access control strategies beyond JWT bear...
apidays
 

Slides: Knowledge Graphs vs. Property Graphs

  • 1. Knowledge Graphs vs. Property Graphs — Similarities, Differences and Some Guidance on Capabilities
  • 2. TOPQUADRANT COMPANY FOUNDATION ▪ TopQuadrant was founded in 2001 ▪ Strong commitment to standards-based approaches to data semantics MISSION ▪ Empower people and drive results — by making enterprise information meaningful FOCUS ▪ Provide comprehensive data governance solutions using knowledge graph technologies TOPQUADRANT COMPANY
  • 3. © Copyright 2020 TopQuadrant Inc. Slide 3 I. A brief overview of: – Property Graphs – Knowledge Graphs II. Differences in Terminology and Capabilities of these models III. Strengths and Limitations of each approach IV. Why Knowledge Graphs provide a strong foundation for data centric enterprise applications – E.g. enterprise data governance and metadata management ▪ Q & A Today’s Agenda Irene Polikoff
  • 4. © Copyright 2020 TopQuadrant Inc. Slide 4 Some Definitions of a Knowledge Graph ▪ A knowledge graph (i) mainly describes real world entities and their interrelations, organized in a graph, (ii) defines possible classes and relations of entities in a schema, (iii) allows for potentially interrelating arbitrary entities with each other and (iv) covers various topical domains. H. Paulheim. Knowledge Graph Refinement: A Survey of Approaches and Evaluation Methods. Semantic Web Journal, 1–20, 2016 ▪ A knowledge graph acquires and integrates information into an ontology and applies a reasoner to derive new knowledge. Lisa Ehrlinger and Wolfram Wöß, Towards a Definition of Knowledge Graphs, Institute for Application Oriented Knowledge Processing at SEMANTiCS 2016, http://ceur-ws.org/Vol-1695/paper4.pdf ▪ A knowledge graph is a knowledge base that uses a graph-structured data model or topology to integrate knowledge and data. Knowledge graphs are often used to store interlinked descriptions of entities — real-world objects, events, situations or abstract concepts, including their semantics. Wikipedia
  • 5. © Copyright 2020 TopQuadrant Inc. Slide 5 TopQuadrant: What Are Knowledge Graphs? ▪ A Knowledge Graph represents a knowledge domain – With “Active Models” that can be consulted at run-time ▪ It represents all information as a graph – A network of nodes and links – Not tables of rows and columns ▪ It represents facts (data) and models (metadata) in the same way – as part of a graph – Rich rules and inferencing ▪ It is based on open standards, from top to bottom – Readily connects to knowledge in private and public clouds A knowledge graph, with its Active Models, grows and evolves over time.
  • 6. © Copyright 2020 TopQuadrant Inc. Slide 6 Knowledge Graph Recipe Vehicle Car Organization Year Design/Model brand type type type type BMW X2 BMW 2019 BMW 228 Ontology + Instance Data ---------------------------- Knowledge Graph *Blueprint *Actuals An Ontology is a data model (schema) with classes, properties, and rules - a precise blueprint for data
  • 7. © Copyright 2020 TopQuadrant Inc. Slide 7 Property Graphs aka Labeled Property Graphs ▪ Caveat: There is no standard property graph data model – Each implementation of a Property Graph is, thus, somewhat different – We will focus on the common aspects ▪ In general, the Property Graphs data model consists of three elements: – Nodes – entities in a graph – Edges – direct links aka relationships between nodes – Properties – key value pairs that can be associated with either nodes or edges
  • 8. © Copyright 2020 TopQuadrant Inc. Slide 8 Property Graph by Example
  • 9. © Copyright 2020 TopQuadrant Inc. Slide 9 Property Graph by Example
  • 10. © Copyright 2020 TopQuadrant Inc. Slide 10 Property Graph by Example
  • 11. © Copyright 2020 TopQuadrant Inc. Slide 11 RDF Knowledge Graph by Example 56853 http://www.wikidata.org/entity/Q2263 wikidata:
  • 12. © Copyright 2020 TopQuadrant Inc. Slide 12 RDF Knowledge Graph by Example 56853
  • 13. © Copyright 2020 TopQuadrant Inc. Slide 13 RDF Knowledge Graph by Example
  • 14. © Copyright 2020 TopQuadrant Inc. Slide 14 Property Graph vs RDF Knowledge Graph Property Graph Knowledge Graph IDs are internal to a graph database, user has no control over them. IDs are global – URIs, meant to be under users control to enable combining different graphs Properties are literal values. They are fundamentally different from nodes and relationships. Canonical structure. Everything is stored as nodes and links connecting them. A literal value is a node like any other. Property is any link – to a resource or a literal.
  • 15. © Copyright 2020 TopQuadrant Inc. Slide 15 “Schema” as part of a Knowledge Graph
  • 16. © Copyright 2020 TopQuadrant Inc. Slide 16 “Schema” as part of a Knowledge Graph
  • 17. © Copyright 2020 TopQuadrant Inc. Slide 17 More “Schema” sh:maxCount 1
  • 18. © Copyright 2020 TopQuadrant Inc. Slide 18 Standards-based Ontologies and Vocabularies are Knowledge Graphs Increasingly Used across Industries Key enablers of transparency, interoperability and the emerging “cognitive enterprise” These ontologies uniquely identify and describe things and relationships in healthcare, web search services, manufacturing, financial services, … *
  • 19. © Copyright 2020 TopQuadrant Inc. Slide 19 Property Graph vs RDF Knowledge Graph Property Graph Knowledge Graph IDs are internal to a graph database, user has no control over them. IDs are global – URIs, meant to be under user’s control to enable combining different graphs. Properties are literal values. They are fundamentally different from nodes and relationships. Canonical structure. Everything is stored as nodes and links connecting them. A literal value is a node like any other. Property is any link - to a resource or a literal. Schema (semantics of data) is not a part of the graph. Rich schemas, including rules are a part of the graph.
  • 20. © Copyright 2020 TopQuadrant Inc. Slide 20 RDF Knowledge Graphs: How to add information to triples
  • 21. © Copyright 2020 TopQuadrant Inc. Slide 21 RDF Knowledge Graphs: How to add information to triples
  • 22. © Copyright 2020 TopQuadrant Inc. Slide 22 Property Graph vs RDF Knowledge Graph Property Graph Knowledge Graph IDs are internal to a graph database, user has no control over them. IDs are global – URIs, meant to be under user’s control to enable combining different graphs. Properties are literal values. They are fundamentally different from nodes and relationships. Canonical structure. Everything is stored as nodes and links connecting them. A literal value is a node like any other. Property is any link - to a resource or a literal. Schema (semantics of data) is not a part of the graph. Rich schemas, including rules are a part of the graph. Each relationship (link or arch) uniquely identifies “node – link – node” combination. Relationships can be annotated with additional facts, but properties can’t be. IDs of properties (links) are re-used. Thus, they do not uniquely identify a “node – link - node” combination that uses it. There is a way to give these triples identity. Any triple can be annotated with additional facts.
  • 23. © Copyright 2020 TopQuadrant Inc. Slide 23 Property Graphs: How to add information to a “property” value Can we say what year is the population for?
  • 24. © Copyright 2020 TopQuadrant Inc. Slide 24 type: POPULATION_SIZE type: POPULATION_SIZE label: Population label: Population 131 132“Size” 58811 “Year” 2018 “Size” 56853 “Year” 2010 Property Graphs: How to add information to a “property” value Can’t be done. Must turn a “property” into a relationship
  • 25. © Copyright 2020 TopQuadrant Inc. Slide 25 RDF Knowledge Graph: How to add information to a triple with a literal object Or, as it more commonly said, “to a property with a literal value” ex:year 2010 ex:100058811 56853
  • 26. © Copyright 2020 TopQuadrant Inc. Slide 26 Property Graph: How to use a node as a “property” value Let’s say we want to capture information about Ben Bradlee so we want to turn the value of “role” into a node.
  • 27. © Copyright 2020 TopQuadrant Inc. Slide 27 Property Graph: How to use a node as a “property” value Must remove the edge annotation and create a relationship to a node labeled Person.
  • 28. © Copyright 2020 TopQuadrant Inc. Slide 28 RDF Knowledge Graph: How to turn a literal value into a resource reference
  • 29. © Copyright 2020 TopQuadrant Inc. Slide 29 RDF Knowledge Graph: How to turn a literal value into a resource reference
  • 30. © Copyright 2020 TopQuadrant Inc. Slide 30 Property Graph vs RDF Knowledge Graph Property Graph Knowledge Graph IDs are internal to a graph database, user has no control over them. IDs are global – URIs, meant to be under user’s control to enable combining different graphs. Properties are literal values. They are fundamentally different from nodes and relationships. Canonical structure. Everything is stored as nodes and links connecting them. A literal value is a node like any other. Property is any link - to a resource or a literal. Schema (semantics of data) is not a part of the graph. Rich schemas, including rules are a part of the graph. Each relationship (link or arch) uniquely identifies “node – link – node” combination. Relationships can be annotated with additional facts, but properties can’t be. IDs of properties (links) are re-used. Thus, they do not uniquely identify a “node – link - node” combination that uses it. There is a way to give these triples identity. Any triple can be annotated with additional facts. Changes in the graph design require restructure/re-load of the data and changes to all impacted queries. Graphs can evolve and changes in the design can often be done with minimal impact on existing data and queries. Product-specific query languages, variants of Cypher, increasingly, GraphQL support, introspection not integrated Query standard – SPARQL. Increasingly, GraphQL support. In EDG: introspection and auto-generation of GraphQL Schemas.
  • 31. © Copyright 2020 TopQuadrant Inc. Slide 31 wikidata:Q2263 rdf:type entont:Actor ; rdf:type entont:Director ; entont:actedIn entertainment_data:ThePost [[ entont:role “Ben Bradlee” ]] ; schema:birthDate "1956-07-09"^^xsd:date ; entont:directed entertainment_data:ALeagueOfTheirOwn ; schema:familyName "Tom" @en; schema:givenName "Hanks" @en; rdfs:label "Tom Hanks" @en; . Example of RDF Serialization @prefix entertainment_data: <http://example.org/data-graphs/Entertainment_Data#> . @prefix entont: <http://example.org/ontologies/Entertainment#> . @prefix wikidata: <http://www.wikidata.org/entity/> . @prefix schema: <http://schema.org/> . @prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> . @prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> . @prefix xsd: <http://www.w3.org/2001/XMLSchema#> . Subject Node Object NodeEdge
  • 32. © Copyright 2020 TopQuadrant Inc. Slide 32 Property Graph vs RDF Knowledge Graph Property Graph Knowledge Graph IDs are internal to a graph database, user has no control over them. IDs are global – URIs, meant to be under user’s control to enable combining different graphs. Properties are literal values. They are fundamentally different from nodes and relationships. Canonical structure. Everything is stored as nodes and links connecting them. A literal value is a node like any other. Property is any link - to a resource or a literal. Schema (semantics of data) is not a part of the graph. Rich schemas, including rules are a part of the graph. Each relationship (link or arch) uniquely identifies “node – link – node” combination. Relationships can be annotated with additional facts, but properties can’t be. IDs of properties (links) are re-used. Thus, they do not uniquely identify a “node – link - node” combination that uses it. There is a way to give these triples identity. Any triple can be annotated with additional facts. Changes in the graph design require restructure/re-load of the data and changes to all impacted queries. Graphs can evolve organically. Changes in the design can often be accomplished with minimal impact on existing data and queries. Product-specific query languages, variants of Cypher, increasingly, GraphQL support, introspection not integrated Query standard – SPARQL. Increasingly, GraphQL support. In EDG: introspection and auto-generation of GraphQL Schemas. No standard serialization for export. Standard serializations supported by all products – RDF/XML, Turtle, N3 and JSON-LD formats.
  • 33. © Copyright 2020 TopQuadrant Inc. Slide 33 Knowledge Graphs are Self-Organizing – they can be composed Data Element Dataset Data Center Location element of located in KG1 KG3 located in CMS CMS Cloud Service hosted by KG2 App. Server VM resourcedrunning on stored in
  • 34. © Copyright 2020 TopQuadrant Inc. Slide 34 Knowledge Graphs are Self-Organizing – they can be composed With consistent Identifiers for both Nodes and Links, Graph queries and rules can give insights and make new knowledge. Data Element Dataset Data Center Location element of located in KG1 KG3 located in CMSCMS Cloud Service hosted by KG2 App. Server VM resourcedrunning onstored in regioned in
  • 35. © Copyright 2020 TopQuadrant Inc. Slide 35 Property Graph vs RDF Knowledge Graph Property Graph Knowledge Graph IDs are internal to a graph database, user has no control over them. IDs are global – URIs, meant to be under user’s control to enable combining different graphs. Properties are literal values. They are fundamentally different from nodes and relationships. Canonical structure. Everything is stored as nodes and links connecting them. A literal value is a node as any other. Property is any link - to a resource or a literal. Schema (semantics of data) is not a part of the graph. Rich schemas, including rules are a part of the graph. Each relationship (link or arch) uniquely identifies “node – link – node” combination. Relationships can be annotated with additional facts, but properties can’t be. IDs of properties (links) are re-used. Thus, they do not uniquely identify a “node – link - node” combination that uses it. There is a way to give these triples identity. Any triple can be annotated with additional facts. Changes in the graph design require restructure/re-load of the data and changes to all impacted queries. Graphs can evolve organically. Changes in the design can often be accomplished with minimal impact on existing data and queries. Product-specific query languages, variants of Cypher, increasingly, GraphQL support, introspection not integrated. Query standard – SPARQL. Increasingly, GraphQL support. In EDG: introspection and auto-generation of GraphQL Schemas. No standard serialization for export. Standard serializations supported by all products – RDF/XML, Turtle, N3 and JSON-LD formats. No partitioning of graphs nor merging or connectivity across graphs. Named graphs as a partitioning approach. Graphs are “self- composing”, can be connected and merged.
  • 36. © Copyright 2019 TopQuadrant Inc. Slide 36 Knowledge Graphs support the move to Data Centric Systems and Architectures
  • 37. © Copyright 2020 TopQuadrant Inc. Slide 37 Data Centric vs Application Centric Application Centric Data is a second-class citizen, and the application is the main thing Data models are not self describing, can’t be understood without understanding application’s logic All business rules and semantics are hard coded in the applications Data models are very rigid Changing application’s data model is expensive – requires changes to the application code due to tight coupling One data model per application Data sharing is an afterthought Data Centric Data is the main asset, applications are “model-driven” Data is self-describing and can stand on its own Data can be “active” with accessible semantics, business rules and even executable code Data models are flexible and evolvable Changing data models is inexpensive and may not require application changes Multiple applications may share some parts of a data model, extending it as needed Data sharing is “built in”
  • 38. © Copyright 2020 TopQuadrant Inc. Slide 38 Why Self Describing Data is Important? Open Data Example: Road Safety - Vehicles by Make and Model We see the numbers, but what do they mean? Data is only as good as its metadata
  • 39. © Copyright 2020 TopQuadrant Inc. Slide 39 Metadata adds Context & Definition – Making Data sharable and increasing its Value 39 First Name Last Name Company City Year Purchased Joe Smith Komputers R Us New York 1970 Mary Jones The Lord’s Store London 1999 Proful Bishwal The Lady’s Store Mumbai 1998 Ming Lee My Favorite Store Beijing 2001 Customer Definition Last Name represents the surname or family name of an individual. Business Rules In the Chinese market, family name is listed first in salutations. Format VARCHAR(30) Abbreviation LNAME Required YES More … Numerous technical & business metadata including security, privacy, permissible values, etc. Is this the city where the customer lives or where the store is located? • If metadata is not readily available, data can’t be understood. • If metadata is kept separate from data, applications can’t access it. • Further, data will change and metadata will get stale. • If metadata is captured in a proprietary way, it is not open. Key Facts
  • 40. © Copyright 2017 TopQuadrant Inc. Slide 40 TopBraid EDG – Composition of Knowledge Graphs to Support Adaptive Data Governance Business Logic UX/UI Access Control Compose UI Behavior, Queries, Rules and Constraints Enterprise Assets Technical Assets Big Data Assets Data Assets Glossaries Terminologies Governance Model DataSchema Compose Data and Schema Over 300 of pre-defined Asset Types
  • 41. © Copyright 2020 TopQuadrant Inc. Slide 41 How Applications can be Model Driven – an example from TopBraid EDG Models drive the User Experience (UX) i.e., what fields will appear on an (edit, search or browse) form, possible values, order of the fields, their division into sections and other relevant information. edg:FileAsset rdf:type edg:AssetClass ; dash:abstract "false"^^xsd:boolean ; rdfs:label "File Asset" ; rdfs:subClassOf edg:AccessControllable ; rdfs:subClassOf edg:ComplianceAspect ; rdfs:subClassOf edg:Identifiable ; ⋮ ⋮ sh:property edg:File-formatType ; sh:property edg:File-name ; sh:property edg:File-residesIn ; sh:property edg:File-sourceLink ; . edg:File-formatType rdf:type sh:PropertyShape ; sh:path edg:format ; sh: description “Specifies the format of a file, or dataset, using an enumeration of values” sh:class edg:FormatType ; . edg:File-name rdf:type sh:PropertyShape ; sh:path edg:name ; sh:datatype xsd:string ; . edg:File-residesIn rdf:type sh:PropertyShape ; sh:path edg:residesIn ; sh:class edg:FileSystem ;. An active model (ontology) describing a File Asset as it is captured in a Knowledge Graph – open to access at run time, sharable, making data self describing – for people and machines A form for capturing data about files – if a model changes, a form automatically adjusts. The same form can display information about other assets – whatever the model says, it will display and will let you edit according to the definitions in the model We know the meaning of each data value because it is described in the ontology that is as accessible as the data
  • 42. © Copyright 2020 TopQuadrant Inc. Slide 42 Some Limitations of Property Graphs You are Likely to Face ▪ No Capture of Schema in a Graph ▪ Minimal Support for Validation and Data Integrity ▪ No Capture of Rich Rules in a Graph ▪ No Support for Inheritance and Inference in a Graph ▪ No Globally Unique Identifiers ▪ No Resolvable Identifiers ▪ No Inherent Connectivity Across Graphs ▪ Difficulties with Graph Evolvability
  • 43. © Copyright 2020 TopQuadrant Inc. Slide 43 Steps for Transitioning from Property Graphs to a Knowledge Graph 1. Get your data out – “as-is” – It is fairly easy to generate one of the RDF standard serializations from a property graph. In fact, Neo4J offers a library for doing this 2. Create a model for your data – You will not be able to get the semantics of the data; this is due to the fact that the data model only exists in your initial design sketches and, partially, within Cypher queries and programs. – TopBraid EDG can introspect the data to help you generate the initial model
  • 44. © Copyright 2020 TopQuadrant Inc. Slide 44 Steps for Transitioning from Property Graphs to a Knowledge Graph 3. Iterate – The initial model and data is likely to benefit from some adjustment as the structure of the graph data may be influenced by the specific limitations of the property graph data model and optimizations that were required due to its architecture 4. Migrate application code and queries – Take advantage of the data validation, reasoning and model-based introspection capabilities of the knowledge graphs – you will end up with smaller and more maintainable code base – If you already use GraphQL, this will likely simplify the migration
  • 45. © Copyright 2020 TopQuadrant Inc. Slide 45 Summary of Key Take Away Points ▪ Increasingly graphs are seen as an excellent choice for data / metadata representation and management - for the data centric enterprise – Flexibility is one strong driver: heterogeneous data, integrating new data sources, analytics, and metadata management and governance all require flexibility and evolvability ▪ The two main graph data models are: Property Graphs and Knowledge (RDF) Graphs ▪ Property Graphs are offered in mature implementations (e.g. Neo4J), that are easy to get started with – but there are fundamental limitations that are not addressed in the design of property graphs – E.g., Property Graphs are not self-describing; the meaning of the data they store is not a part of a graph ▪ Knowledge Graphs support the move to Data Centric systems and architectures – by offering robust capabilities that support flexibility and evolvability – E.g., Knowledge Graphs capture not only data, but the meaning or semantics of data, including rich constraints and highly expressive rules – In the past, implementations of Knowledge Graphs were considered to be more academic, but this has changed with products like TopBraid EDG ▪ It is possible to move from a Property Graph to a Knowledge Graph in logical steps – Get your data out – “as-is”; Create a model for your data; iterate and optimize …
  • 46. © Copyright 2020 TopQuadrant Inc. Slide 46 … Questions? Thank You !
  • 47. © Copyright 2020 TopQuadrant Inc. Slide 47 Benefits of a Knowledge Graph based Platform for Data Governance 2.0 As an enterprise knowledge graph infrastructure, TopBraid EDG supports Data Governance 2.0 and applications of AI / ML TopBraid Enterprise Data Governance (EDG): ▪ Is flexible and extensible, based on standards ▪ Integrates reasoning and machine learning ▪ Enables people (UI) and software (APIs/web services) to view, follow and query ▪ Bridges data and metadata “silos” for a seamless data governance ▪ Delivers Knowledge-driven data governance
  • 48. © Copyright 2020 TopQuadrant Inc. Slide 48 To Learn More about TopBraid EDG and Knowledge Graphs: EDG Product Info: ▪ TopBraid Enterprise Data Governance (TopBraid EDG) (https://www.topquadrant.com/products/topbraid-enterprise-data-governance/) Contact us: at [email protected] to: ▪ Discuss data governance and knowledge graphs ▪ Request a more targeted demo of TopBraid EDG ▪ Ask for a free EDG evaluation account
  • 49. © Copyright 2020 TopQuadrant Inc. Slide 49 To Learn More about TopBraid EDG and Knowledge Graphs: EDG Product Info: ▪ TopBraid Enterprise Data Governance (TopBraid EDG) (https://www.topquadrant.com/products/topbraid-enterprise-data-governance/) What Are Knowledge Graphs? A Knowledge Graph turns Data into Knowledge (https://www.topquadrant.com/technology/knowledge-graphs/) Contact us: at [email protected] to: ▪ Discuss data governance and knowledge graphs ▪ Request a more targeted demo of TopBraid EDG ▪ Ask for a free EDG evaluation account
  • 50. © Copyright 2020 TopQuadrant Inc. Slide 50© Copyright 2020 TopQuadrant Inc. Slide 50 Latest Whitepaper Download a copy on our website at: topquadrant.com/knowledge-assets/whitepapers/
  • 51. © Copyright 2019 TopQuadrant Inc. Slide 51 More Resources ... More Webinar Recordings, Slides, Q&A: ▪ https://www.topquadrant.com/knowledge-assets/topquadrant- webinars/ Short Videos: ▪ TopBraid EDG “Quick Grok” Videos https://www.topquadrant.com/knowledge-assets/videos/ ▪ TopBraid EDG Animated Video https://www.topquadrant.com/project/edg_agile_modular/ Blog: ▪ https://www.topquadrant.com/the-semantic-ecosystems-journal/ Data Governance White Papers ▪ https://www.topquadrant.com/knowledge-assets/whitepapers/