TAS - Graph Database - June 2020
TAS - Graph Database - June 2020
Graph Database
Based upon the concept of a mathematical graph, a graph database contains a collection of nodes and edges
A node represents an object, and an edge represents the connection or relationship between two objects
Each node in a graph database is identified by a unique identifier that expresses key value pairs
Additionally, each edge is defined by a unique identifier that details a starting or ending node, along with a set of
properties
Graph Database | Technology Assessment Services | June 2020 Source: CBR online, Neo4j, © Capgemini 2020. All rights reserved | 2
Graph Query Languages
Graph Database Query Language (GraphQL), is a concrete mechanism for creating, manipulating and querying graph data in a
graph database
Graph query languages are SQL equivalents for Graph DBMS
GraphQL is actually an API Query Language while Gremlin, SPARQL and now GQL are all Query Languages for Graph Databases
Most common and widely-used Originally developed by Neo4j as Originally developed by the W3C
graph query language a graph query language that to query data stored in the
allows users to store and retrieve Resource Description Framework
It is the query language of data from the graph database (RDF) format for metadata
Apache TinkerPop graph
computing framework Opensource since 2015 SPARQL (SPARQL Protocol And
and openCypher project RDF Query Language) is a W3C
Gremlin is a functional, data- provides an open language standard designed to meet the
flow language that enables users specification, technical use cases identified by the RDF
to succinctly express complex compatibility kit, and reference Data Access Working Group
traversals on (or queries of) their implementation of the parser,
application's property graph planner, and runtime for Cypher Even though its a protocol, for
most use cases SPARQL's
Widely adopted and supported by OpenCypher has industry greatest value is a query
nearly all graph databases support, most prominently by language for RDF
supporting Property Graphs (PG) SAP graphs (another W3C standard)
Graph Database | Technology Assessment Services | June 2020 © Capgemini 2020. All rights reserved | 3
Use cases of Knowledge graphs
It powers Google’s search engine, as the original page rank algorithm is based on a form of
knowledge graph, as well as later additions to its search technology
Uses knowledge graph technology to organize information on its vast catalog of content, drawing
connections between movies and TV shows and the actors, directors or producers who put them
together. This helps them to predict what customers might like to watch next, and foster the "binge-
watching" model of consumption it has built its business around
Uses knowledge graphs to build accessible models of all of the data it generates and
stores, and use it for risk management, process monitoring and building “digital
twins” – simulated versions of real-world systems which can be used for design,
prototyping and training
Graph Database | Technology Assessment Services | June 2020 © Capgemini 2020. All rights reserved | 4
List of Top 5 Graph
Database
Neo4j - Leading native graph database and graph platform
Competitive Advantages
Database combines everything needed for performance and trust in Implementation Language
applications that bring data relationships to the fore Java, Scala
Native graph storage, native graph processing, graph scalability, high Server operating systems
availability, graph clustering, graphs in the cloud, graphs on Spark,
Linux, OS X, Solaris,
built-in ETL, and integration support, plus Cypher, a powerful and
Windows
expressive language for queries using vastly less code than SQL
Graph Database | Technology Assessment Services | June 2020 © Capgemini 2020. All rights reserved | 6
OrientDB - First Multi-Model Distributed DBMS with a True
Graph Engine
Competitive Advantages
It is touted to be the fastest graph database and OrientDB’s query Implementation Language
language is built on SQL Java
Can be used as a pure Graph Database or as a Multi-Model, avoiding Server operating systems
using multiple DBMS products in the same application All OS with a Java JDK (>=
Supports the creation of schemas around graphs JDK 6)
Scalability
Open source , Multi-model Supports a Multi-Master + Applications / Use cases
Sharded architecture: all the APIs and other access
DBMS (Document, Graph, Fraud detection
servers are masters methods
Key/Value) Fighting Crime
Manages relationships without Tinkerpop technology stack
Investigation, Fraud
using JOINs, but rather direct with Blueprints
Multi-Model means 2nd Detection and prevention
pointers. This allows to have Gremlin, Pipes
generation NoSQL able to Data Governance, Master
constant performance on Java API
manage complex domain Data Management
traversing relationships, no RESTful HTTP/JSON API
with incredible Traffic Management
performance matter the database size
Graph Database | Technology Assessment Services | June 2020 © Capgemini 2020. All rights reserved | 7
ArangoDB - Fast growing native multi-model NoSQL
database
Competitive Advantages
As a native multi-model database, can be used as a full blown
document store, graph graph database, search engine or any Implementation Language
combination of these technologies C++
Strong Data Consistency and Simplified Performance Scaling Server operating systems
Deployment is very easy with the ArangoDB Starter and as well on Linux, OS X, Windows
Kubernetes with the ArangoDB Operator
Open source , native Scalability Applications / Use cases APIs and other access
multi-model DBMS for Scales both vertically and Single View of everything methods
graph, document, horizontally Cybersecurity AQL
key/value and search If performance needs Simulations in manufacturing Foxx Framework
decrease, it can be easily Identity & Access Mgmt Graph API (Gremlin)
All in one engine and scale down the backend Fraud detection GraphQL query language
accessible with one query system to save on hardware Recommendation Engines HTTP API
language and operational Feature Engineering in ML & Java & SpringData
requirements. AI JSON style queries
Designed to store data Network Mgmt & Surveillance VelocyPack/VelocyStream
natively as key-value pairs,
graphs and JSON documents Major Customers
that can be accessed with Cisco, Barclays, Refinitive, Siemens Mentor, Kabbage, Liaison, Douglas, Supported programming
one declarative query MakeMyTrip, Kaseware, Demonware, Brainhub, Oxford University, IC languages
language - AQL Manage, Actify C#, C++, lojure, Elixir, Go, Java,
JavaScript (Node.js), PHP,
Initial Release – 2012 Gartner Peer Insight recognizes ArangoDB as one of the highest Python, R, Rust
Current Release - 3.6.0, rated operational databases
January 2020
Graph Database | Technology Assessment Services | June 2020 © Capgemini 2020. All rights reserved | 8
Microsoft Azure CosmosDB - Native support for NoSQL
choices
Competitive Advantages
Offers multiple well-defined consistency models Implementation Language
Guarantees single-digit-millisecond latencies at the 99th percentile, C++
and guarantees high availability with multi-homing capabilities and Server operating systems
low latencies anywhere in the world Hosted
Graph Database | Technology Assessment Services | June 2020 © Capgemini 2020. All rights reserved | 9
Amazon Neptune - Fully-managed graph database service
Competitive Advantages
Fast, reliable, fully-managed graph database service that makes it Implementation Language
easy to build and run applications that work with highly connected Java, Scala
datasets Server operating systems
The core of Amazon Neptune is a purpose-built, high-performance Hosted
graph database engine optimized for storing billions of relationships
and querying the graph with milliseconds latency
Graph Database | Technology Assessment Services | June 2020 © Capgemini 2020. All rights reserved | 10
Other Popular graph databases
Graph Database | Technology Assessment Services | June 2020 © Capgemini 2020. All rights reserved | 11
About Capgemini
A global leader in consulting, technology services and digital transformation,
Capgemini is at the forefront of innovation to address the entire breadth of clients’
opportunities in the evolving world of cloud, digital and platforms. Building on its
strong 50-year heritage and deep industry-specific expertise, Capgemini enables
organizations to realize their business ambitions through an array of services from
strategy to operations. Capgemini is driven by the conviction that the business
value of technology comes from and through people. It is a multicultural company
of almost 220,000 team members in more than 40 countries. The Group reported
2019 global revenues of EUR 14.1 billion.
www.capgemini.com