SlideShare a Scribd company logo
Data 
Science 
in 
Marke-ng 
rik@neotechnology.com 
@rvanbruggen
Graphs 
for 
Recommenda-ons 
rik@neotechnology.com 
@rvanbruggen
Agenda 
• About 
Graphs 
• About 
Graph 
Databases 
• Why 
Graph 
Databases 
ma=er 
for 
Recommenda?ons 
– Short 
demonstra?on 
• Case 
Studies 
• Q&A
Introduc?on: 
about 
Graphs
Using graphs for recommendations
Meet  
Leonhard Euler 
• Swiss 
mathema?cian 
• Inventor 
of 
Graph 
Theory 
(1736)
Königsberg 
(Prussia) 
-­‐ 
1736
A 
B 
D 
C
A 
B 
D 
C 
1 
2 
3 
4 
7 
6 
5
About 
Graph 
Databases
So 
what 
is 
a 
graph 
database? 
• OLTP 
database 
– “end-­‐user” 
transac?ons 
• Model, 
store, 
manage 
data 
as 
a 
graph
What 
is 
a 
graph? 
Node 
Rela?onship
Contrast 
with 
Rela?onal 
Graphs are often referred to as “Whiteboard Friendly”. The 
data model reflects the way a domain expert would naturally 
draw their data on a whiteboard 
“The schema is the data”. Schema flexibility allows the system 
to change in response to a changing environment
What 
are 
graphs 
good 
for? 
Complex 
Querying
Examples 
of 
complex 
queries? 
1. 
Semi-­‐structure 
in 
datasets 
1 
– Normaliza?on 
introduces 
complexity 
– Forces 
developers 
to 
develop 
all 
kinds 
of 
logic 
to 
deal 
with 
this 
variability 
in 
their 
applica?on 
logic
Examples 
of 
complex 
queries: 
2. 
Connectedness 
in 
data 
Lots 
of 
normalized 
rela?onships 
between 
the 
different 
en??es, 
forces 
developers 
to 
do 
• Deep 
joins 
• Recursive 
joins 
• Pathfinding 
opera?ons 
• “open-­‐ended” 
queries
Examples 
of 
Connectedness
Graphs 
in 
Recommenda-ons?
Recommender 
system 
• Not 
new 
• Prime 
BI 
applica?on 
– Mining 
data 
– Calcula?ng 
recommenda?ons 
/ 
promo?ons 
in 
batch 
– Presen?ng 
these 
to 
the 
user 
online 
/ 
offline 
1
Recommender 
systems
Graphs 
in 
Recommender 
Systems 
• Real 
?me 
aspect 
• Recommenda?on 
Prac?ces 
rely 
on 
Graph 
Algorithms 
• Opera?onal 
efficiency
Real 
-me 
recommenda?ons? 
• Context 
is 
everything 
– You 
don’t 
want 
to 
be 
recommending 
a 
fitness 
subscrip?on 
when 
you 
have 
just 
been 
admi=ed 
to 
hospital… 
• Relevance 
is 
King 
– More 
‘s?cky’ 
applica?ons 
– Increase 
user 
sa?sfac?on 
– Drive 
traffic: 
get 
users 
to 
do 
more 
stuff 
with 
your 
applica?on
Recommenda?on 
prac?ces: 
Graph 
Algorithms 
● Helpful 
for 
naviga?ng 
complex 
networks 
● tell 
me 
how 
A 
and 
B 
are 
related 
● The 
things 
on 
the 
path 
between 
A 
and 
B 
could 
very 
well 
be 
interes?ng 
● ShortestPath, 
AllShortestPaths, 
Weighted 
ShortestPath 
(Dijkstra, 
A*) 
● Helpful 
for 
understanding 
the 
important 
parts 
of 
a 
network 
● Clusters 
● Bridges 
● Centrality 
● Betweenness 
Centrality 
● (Page)Ranking
Recommenda?on 
prac?ces: 
Making 
triangles 
● Close 
the 
triangle 
to 
make 
a 
recommenda?on 
● Olen 
looking 
for 
“missing 
links” 
● e.g. 
making 
a 
recommenda?on 
of 
movies 
based 
on 
what 
similar 
users 
have 
enjoyed
Opera-onal 
Efficiency 
• Graph 
datamodel 
removes 
the 
need 
for 
many 
“batch 
opera?ons” 
– No 
need 
to 
precalculate 
– 
just 
feed 
it 
into 
the 
graph 
• Complex 
pa=ern 
matching 
in 
milliseconds 
• Graph 
Locality 
== 
Predictability 
 
Speed, 
even 
over 
large 
datasets
Short 
demo
Use 
Cases 
(neo4j.com/use-­‐cases)
Customers 
(neo4j.com/customers)
Graph 
Gists 
(h=p://gist.neo4j.org/)
Neo4j 
versions 
/ 
licenses 
Neo4j License Overview 
Developer! 
Seats! 
Personal 
 
Startup 
/ 
Departmental 
 
Enterprise 
deployment 
models 
($6K*/Developer/Year) 
Test! 
Instances! 
($6K/Instance/Year) 
Production! 
Instances! 
(Bundle / Core Pricing) 
Open 
source 
 
Commercial 
license 
terms 
available 
Specific 
OEM 
models 
Instances whose purpose is to 
ensure that the software accessing 
Neo4j is meeting specification.! 
! 
(e.g. System Test, Integration Test, 
UAT, Performance Test, Staging) 
Instances that store and process 
data in a way that benefits and 
advances an organization’s goals.! 
! 
May be accessed by applications 
and/or end users 
Includes access by programmers 
to licensed test instances, and 
private instances on the 
programmer’s personal machine 
for the sole purpose of writing, 
debugging, or testing software 
designed to access Neo4j 
*Or otherwise, depending on the Bundle, and negotiation
Future 
trainings 
 
events! 
3
Conway 
Hall, 
November 
13th 
bit.ly/graphday
QA, 
Conclusion, 
Next 
Steps 
Neo 
Technology 
www.neotechnology.com 
Neo4j 
www.neo4j.org 
rik@neotechnology.com 
or 
+32 
478 
686800

More Related Content

What's hot (20)

PDF
Tensors Are All You Need: Faster Inference with Hummingbird
Databricks
 
PPTX
Data Science for Dummies - Data Engineering with Titanic dataset + Databricks...
Rodney Joyce
 
PDF
Machine Learning Pipelines
jeykottalam
 
PPTX
Virtualizing Analytics with Apache Spark: Keynote by Arsalan Tavakoli
Spark Summit
 
PPTX
Query-time Nonparametric Regression with Temporally Bounded Models - Patrick ...
Lucidworks
 
PDF
From discovering to trusting data
markgrover
 
PDF
NLP Text Recommendation System Journey to Automated Training
Databricks
 
PDF
Data Driven-Toyota Customer 360 Insights on Apache Spark and MLlib-(Brian Kur...
Spark Summit
 
PPTX
Aditya Bhattacharya - Enterprise DL - Accelerating Deep Learning Solutions to...
Aditya Bhattacharya
 
PDF
Tuning ML Models: Scaling, Workflows, and Architecture
Databricks
 
PDF
Operationalizing Edge Machine Learning with Apache Spark with Nisha Talagala ...
Databricks
 
PDF
Conversational AI with Transformer Models
Databricks
 
PPTX
Accelerating Data Science and Machine Learning Workflow with Azure Machine Le...
Aditya Bhattacharya
 
PPTX
Jethro for tableau webinar (11 15)
Remy Rosenbaum
 
PDF
How to obtain the Cloudera Data Engineer Certification
elephantscale
 
PDF
Machine learning model to production
Georg Heiler
 
PPTX
ODSC East 2018
Cameron Sim
 
PPTX
Graph Databases
Girish Khanzode
 
PPTX
Improving Search in Workday Products using Natural Language Processing
DataWorks Summit
 
PDF
Introduction to Machine Learning for Oracle Database Professionals
Alex Gorbachev
 
Tensors Are All You Need: Faster Inference with Hummingbird
Databricks
 
Data Science for Dummies - Data Engineering with Titanic dataset + Databricks...
Rodney Joyce
 
Machine Learning Pipelines
jeykottalam
 
Virtualizing Analytics with Apache Spark: Keynote by Arsalan Tavakoli
Spark Summit
 
Query-time Nonparametric Regression with Temporally Bounded Models - Patrick ...
Lucidworks
 
From discovering to trusting data
markgrover
 
NLP Text Recommendation System Journey to Automated Training
Databricks
 
Data Driven-Toyota Customer 360 Insights on Apache Spark and MLlib-(Brian Kur...
Spark Summit
 
Aditya Bhattacharya - Enterprise DL - Accelerating Deep Learning Solutions to...
Aditya Bhattacharya
 
Tuning ML Models: Scaling, Workflows, and Architecture
Databricks
 
Operationalizing Edge Machine Learning with Apache Spark with Nisha Talagala ...
Databricks
 
Conversational AI with Transformer Models
Databricks
 
Accelerating Data Science and Machine Learning Workflow with Azure Machine Le...
Aditya Bhattacharya
 
Jethro for tableau webinar (11 15)
Remy Rosenbaum
 
How to obtain the Cloudera Data Engineer Certification
elephantscale
 
Machine learning model to production
Georg Heiler
 
ODSC East 2018
Cameron Sim
 
Graph Databases
Girish Khanzode
 
Improving Search in Workday Products using Natural Language Processing
DataWorks Summit
 
Introduction to Machine Learning for Oracle Database Professionals
Alex Gorbachev
 

Similar to Using graphs for recommendations (20)

PDF
201411203 goto night on graphs for fraud detection
Rik Van Bruggen
 
PDF
20141015 how graphs revolutionize access management
Rik Van Bruggen
 
PPTX
Bitkom Cray presentation - on HPC affecting big data analytics in FS
Philip Filleul
 
PDF
Neo4j GraphTalk Düsseldorf - Building intelligent solutions with Graphs
Neo4j
 
PDF
Apache ® Spark™ MLlib 2.x: How to Productionize your Machine Learning Models
Anyscale
 
PDF
Whither the Hadoop Developer Experience, June Hadoop Meetup, Nitin Motgi
Felicia Haggarty
 
PDF
The Analytics Frontier of the Hadoop Eco-System
inside-BigData.com
 
PPTX
20131111 - Santa Monica - BigDataCamp - Big Data Design Patterns
Allen Day, PhD
 
PDF
Neo4j GraphTalk Basel - Building intelligent Software with Graphs
Neo4j
 
PPTX
Webinar: Scaling MongoDB
MongoDB
 
PDF
A Maturing Role of Workflows in the Presence of Heterogenous Computing Archit...
Ilkay Altintas, Ph.D.
 
PDF
Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...
Precisely
 
PDF
PGConf.ASIA 2019 Bali - Keynote Speech 2 - Ivan Pachenko
Equnix Business Solutions
 
PPTX
Neo4j GraphTour New York_EY Presentation_Michael Moore
Neo4j
 
PPTX
GraphTalk Wien - Intelligente Lösungen mit Graphen erstellen
Neo4j
 
PDF
Your Roadmap for An Enterprise Graph Strategy
Neo4j
 
PPTX
Apache Spark Model Deployment
Databricks
 
PPTX
Your Roadmap for An Enterprise Graph Strategy
Neo4j
 
PDF
Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...
Perficient, Inc.
 
PDF
Makine Öğrenmesi, Yapay Zeka ve Veri Bilimi Süreçlerinin Otomatikleştirilmesi...
Ali Alkan
 
201411203 goto night on graphs for fraud detection
Rik Van Bruggen
 
20141015 how graphs revolutionize access management
Rik Van Bruggen
 
Bitkom Cray presentation - on HPC affecting big data analytics in FS
Philip Filleul
 
Neo4j GraphTalk Düsseldorf - Building intelligent solutions with Graphs
Neo4j
 
Apache ® Spark™ MLlib 2.x: How to Productionize your Machine Learning Models
Anyscale
 
Whither the Hadoop Developer Experience, June Hadoop Meetup, Nitin Motgi
Felicia Haggarty
 
The Analytics Frontier of the Hadoop Eco-System
inside-BigData.com
 
20131111 - Santa Monica - BigDataCamp - Big Data Design Patterns
Allen Day, PhD
 
Neo4j GraphTalk Basel - Building intelligent Software with Graphs
Neo4j
 
Webinar: Scaling MongoDB
MongoDB
 
A Maturing Role of Workflows in the Presence of Heterogenous Computing Archit...
Ilkay Altintas, Ph.D.
 
Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...
Precisely
 
PGConf.ASIA 2019 Bali - Keynote Speech 2 - Ivan Pachenko
Equnix Business Solutions
 
Neo4j GraphTour New York_EY Presentation_Michael Moore
Neo4j
 
GraphTalk Wien - Intelligente Lösungen mit Graphen erstellen
Neo4j
 
Your Roadmap for An Enterprise Graph Strategy
Neo4j
 
Apache Spark Model Deployment
Databricks
 
Your Roadmap for An Enterprise Graph Strategy
Neo4j
 
Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...
Perficient, Inc.
 
Makine Öğrenmesi, Yapay Zeka ve Veri Bilimi Süreçlerinin Otomatikleştirilmesi...
Ali Alkan
 
Ad

More from Rik Van Bruggen (11)

PDF
2 dirk vermeylen - modeling with neo4 j
Rik Van Bruggen
 
PDF
1 rik van bruggen - intro and state of the graph
Rik Van Bruggen
 
PDF
3 surya gupta - tabloid proteome
Rik Van Bruggen
 
PDF
4 tom michiels - graph platform enabler
Rik Van Bruggen
 
PDF
Reinventing Identity and Access Management with Graph Databases
Rik Van Bruggen
 
PDF
Cevora ICT Symposium - Graph Databases
Rik Van Bruggen
 
PDF
20150624 Belgian GraphDB meetup at Ordina
Rik Van Bruggen
 
PDF
20150619 GOTO Amsterdam Conference - What Business can learn from Dating
Rik Van Bruggen
 
PDF
Intro to Graphs for Fedict
Rik Van Bruggen
 
PDF
20150326 data innovation summit IGNITE talk
Rik Van Bruggen
 
PDF
20150121 wolters kluwer innovation pitch
Rik Van Bruggen
 
2 dirk vermeylen - modeling with neo4 j
Rik Van Bruggen
 
1 rik van bruggen - intro and state of the graph
Rik Van Bruggen
 
3 surya gupta - tabloid proteome
Rik Van Bruggen
 
4 tom michiels - graph platform enabler
Rik Van Bruggen
 
Reinventing Identity and Access Management with Graph Databases
Rik Van Bruggen
 
Cevora ICT Symposium - Graph Databases
Rik Van Bruggen
 
20150624 Belgian GraphDB meetup at Ordina
Rik Van Bruggen
 
20150619 GOTO Amsterdam Conference - What Business can learn from Dating
Rik Van Bruggen
 
Intro to Graphs for Fedict
Rik Van Bruggen
 
20150326 data innovation summit IGNITE talk
Rik Van Bruggen
 
20150121 wolters kluwer innovation pitch
Rik Van Bruggen
 
Ad

Recently uploaded (20)

PPTX
ChessBase 18.02 Crack + Serial Key Free Download
cracked shares
 
PDF
Notification System for Construction Logistics Application
Safe Software
 
PDF
Optimizing Tiered Storage for Low-Latency Real-Time Analytics at AI Scale
Alluxio, Inc.
 
PDF
Odoo Customization Services by CandidRoot Solutions
CandidRoot Solutions Private Limited
 
PPTX
Cutting Optimization Pro 5.18.2 Crack With Free Download
cracked shares
 
PDF
Virtual Threads in Java: A New Dimension of Scalability and Performance
Tier1 app
 
PDF
Show Which Projects Support Your Strategy and Deliver Results with OnePlan df
OnePlan Solutions
 
PDF
Australian Enterprises Need Project Service Automation
Navision India
 
PDF
Step-by-Step Guide to Install SAP HANA Studio | Complete Installation Tutoria...
SAP Vista, an A L T Z E N Company
 
PPTX
Function & Procedure: Function Vs Procedure in PL/SQL
Shani Tiwari
 
PDF
Code and No-Code Journeys: The Maintenance Shortcut
Applitools
 
PPTX
Operations Profile SPDX_Update_20250711_Example_05_03.pptx
Shane Coughlan
 
PDF
Ready Layer One: Intro to the Model Context Protocol
mmckenna1
 
PPTX
BB FlashBack Pro 5.61.0.4843 With Crack Free Download
cracked shares
 
PPTX
MiniTool Partition Wizard Crack 12.8 + Serial Key Download Latest [2025]
filmoracrack9001
 
PDF
Top 10 AI Use Cases Every Business Should Know.pdf
nicogonzalez1075
 
PPTX
Smart Doctor Appointment Booking option in odoo.pptx
AxisTechnolabs
 
PPT
Brief History of Python by Learning Python in three hours
adanechb21
 
PDF
Dialora AI Voice Agent for Customer Support
Dialora. Ai
 
PPTX
UI5con_2025_Accessibility_Ever_Evolving_
gerganakremenska1
 
ChessBase 18.02 Crack + Serial Key Free Download
cracked shares
 
Notification System for Construction Logistics Application
Safe Software
 
Optimizing Tiered Storage for Low-Latency Real-Time Analytics at AI Scale
Alluxio, Inc.
 
Odoo Customization Services by CandidRoot Solutions
CandidRoot Solutions Private Limited
 
Cutting Optimization Pro 5.18.2 Crack With Free Download
cracked shares
 
Virtual Threads in Java: A New Dimension of Scalability and Performance
Tier1 app
 
Show Which Projects Support Your Strategy and Deliver Results with OnePlan df
OnePlan Solutions
 
Australian Enterprises Need Project Service Automation
Navision India
 
Step-by-Step Guide to Install SAP HANA Studio | Complete Installation Tutoria...
SAP Vista, an A L T Z E N Company
 
Function & Procedure: Function Vs Procedure in PL/SQL
Shani Tiwari
 
Code and No-Code Journeys: The Maintenance Shortcut
Applitools
 
Operations Profile SPDX_Update_20250711_Example_05_03.pptx
Shane Coughlan
 
Ready Layer One: Intro to the Model Context Protocol
mmckenna1
 
BB FlashBack Pro 5.61.0.4843 With Crack Free Download
cracked shares
 
MiniTool Partition Wizard Crack 12.8 + Serial Key Download Latest [2025]
filmoracrack9001
 
Top 10 AI Use Cases Every Business Should Know.pdf
nicogonzalez1075
 
Smart Doctor Appointment Booking option in odoo.pptx
AxisTechnolabs
 
Brief History of Python by Learning Python in three hours
adanechb21
 
Dialora AI Voice Agent for Customer Support
Dialora. Ai
 
UI5con_2025_Accessibility_Ever_Evolving_
gerganakremenska1
 

Using graphs for recommendations

  • 1. Data Science in Marke-ng [email protected] @rvanbruggen
  • 3. Agenda • About Graphs • About Graph Databases • Why Graph Databases ma=er for Recommenda?ons – Short demonstra?on • Case Studies • Q&A
  • 6. Meet Leonhard Euler • Swiss mathema?cian • Inventor of Graph Theory (1736)
  • 8. A B D C
  • 9. A B D C 1 2 3 4 7 6 5
  • 11. So what is a graph database? • OLTP database – “end-­‐user” transac?ons • Model, store, manage data as a graph
  • 12. What is a graph? Node Rela?onship
  • 13. Contrast with Rela?onal Graphs are often referred to as “Whiteboard Friendly”. The data model reflects the way a domain expert would naturally draw their data on a whiteboard “The schema is the data”. Schema flexibility allows the system to change in response to a changing environment
  • 14. What are graphs good for? Complex Querying
  • 15. Examples of complex queries? 1. Semi-­‐structure in datasets 1 – Normaliza?on introduces complexity – Forces developers to develop all kinds of logic to deal with this variability in their applica?on logic
  • 16. Examples of complex queries: 2. Connectedness in data Lots of normalized rela?onships between the different en??es, forces developers to do • Deep joins • Recursive joins • Pathfinding opera?ons • “open-­‐ended” queries
  • 19. Recommender system • Not new • Prime BI applica?on – Mining data – Calcula?ng recommenda?ons / promo?ons in batch – Presen?ng these to the user online / offline 1
  • 21. Graphs in Recommender Systems • Real ?me aspect • Recommenda?on Prac?ces rely on Graph Algorithms • Opera?onal efficiency
  • 22. Real -me recommenda?ons? • Context is everything – You don’t want to be recommending a fitness subscrip?on when you have just been admi=ed to hospital… • Relevance is King – More ‘s?cky’ applica?ons – Increase user sa?sfac?on – Drive traffic: get users to do more stuff with your applica?on
  • 23. Recommenda?on prac?ces: Graph Algorithms ● Helpful for naviga?ng complex networks ● tell me how A and B are related ● The things on the path between A and B could very well be interes?ng ● ShortestPath, AllShortestPaths, Weighted ShortestPath (Dijkstra, A*) ● Helpful for understanding the important parts of a network ● Clusters ● Bridges ● Centrality ● Betweenness Centrality ● (Page)Ranking
  • 24. Recommenda?on prac?ces: Making triangles ● Close the triangle to make a recommenda?on ● Olen looking for “missing links” ● e.g. making a recommenda?on of movies based on what similar users have enjoyed
  • 25. Opera-onal Efficiency • Graph datamodel removes the need for many “batch opera?ons” – No need to precalculate – just feed it into the graph • Complex pa=ern matching in milliseconds • Graph Locality == Predictability Speed, even over large datasets
  • 30. Neo4j versions / licenses Neo4j License Overview Developer! Seats! Personal Startup / Departmental Enterprise deployment models ($6K*/Developer/Year) Test! Instances! ($6K/Instance/Year) Production! Instances! (Bundle / Core Pricing) Open source Commercial license terms available Specific OEM models Instances whose purpose is to ensure that the software accessing Neo4j is meeting specification.! ! (e.g. System Test, Integration Test, UAT, Performance Test, Staging) Instances that store and process data in a way that benefits and advances an organization’s goals.! ! May be accessed by applications and/or end users Includes access by programmers to licensed test instances, and private instances on the programmer’s personal machine for the sole purpose of writing, debugging, or testing software designed to access Neo4j *Or otherwise, depending on the Bundle, and negotiation
  • 31. Future trainings events! 3
  • 32. Conway Hall, November 13th bit.ly/graphday
  • 33. QA, Conclusion, Next Steps Neo Technology www.neotechnology.com Neo4j www.neo4j.org [email protected] or +32 478 686800