SlideShare a Scribd company logo
Graphs & AI
A Path for Data Science
Amy E. Hodler
Director, Graph Analytics & AI Programs
Neo4j
@amyhodler
It’s Not What You Know
It’s Who You Know And Where They Are
Whose pay will
increase the most?
Photo by Helena Lopes on Unsplash
Network Structure
is highly predictive of
pay and promotions
• People Near Structural Holes
• Organizational Misfits
“Organizational Misfits and the Origins of Brokerage in Intrafirm Networks” A. Kleinbaum
“Structural Holes and Good Ideas” R. Burt
Relationships and Network Structure
Strongest Predictors of Behavior & Complex Outcomes
“Research into networks reveal that,
surprisingly, the most connected
people inside a tight group within a
single industry are less valuable than
the people who span the gaps ...”
6
“…jumping from ladder to ladder is a
more effective strategy, and that lateral
or even downward moves across an
organization are more promising in the
longer run . . .”
It’s a counter-intuitive
notion
7
Which is why network
science is so powerful
8
Overview
Network Structure and
Predictions
Neo4j for Graph Data Science
Steps of Graph Data Science
GraphTour London 2020  - Graphs for AI, Amy Hodler
Relationships
The Strongest Predictors of Behavior!
“Increasingly we're learning that you can
make better predictions about people
by getting all the information from their
friends and their friends’ friends than
you can from the information you have
about the person themselves”
James Fowler
11
823
1607
2439
3765
5824
0
1000
2000
3000
4000
5000
6000
7000
2010 2011 2012 2013 2014 2015 2016 2017 2018 2019
Graph Is Accelerating AI Innovation
12
AI Research Papers Featuring Graph
Data Source: Dimensions knowledge system
Graph Technology
graph neural network
graph convolutional
graph embedding
graph learning
graph attention
graph kernel
graph completion
Better Predictions with Graphs
Using the Data You Already Have
• Current data science models ignore network structure
• Graphs add highly predictive features to ML models, increasing accuracy
• Otherwise unattainable predictions based on relationships
Machine Learning Pipeline
13
14
• 27 Million warranty & service documents
parsed for text to knowledge graph
• Graph is context for AI to learn “prime
examples” and anticipate maintenance
• Improves satisfaction and equipment lifespan
• Connecting 50 research databases, 100k’s of
Excel workbooks, 30 bio-sample databases
• Bytes 4 Diabetes Award for use of a
knowledge graph, graph analytics, and AI
• Customized views for flexible research angles
• Almost 70% of credit card fraud was missed
• ~1B Nodes and +1B Relationships to analyze
• Graph analytics with queries & algorithms
help find $ millions of fraud in 1st year
Neo4j for Graph Analytics, AI and Data Science
Caterpillar’s AI Supply
Chain & Maintenance
German Center for
Diabetes Research (DZD)
Financial Fraud
Detection & Recovery Top 10
Bank
Predictive
Maintenance
Churn
Prediction
Fraud
Detection
Life
Sciences
Recommendations
Cybersecurity
Customer
Segmentation
Search/MDM
Graph Data Science Applications
Just a few examples…
A Path for Graph Data Science
The Steps of Graph Data Science
Decision
Support
Graph Based
Predictions
Graph Native
Learning
17
Graph Feature
Engineering
Graph
Embeddings
Graph
Networks
Knowledge
Graphs
Graph
Analytics
The Steps of Graph Data Science
Graph Feature
Engineering
Graph
Embeddings
Graph
Networks
18
Graph
AnalyticsKnowledge
Graphs
Graph search
and queries
Support domain
experts
Knowledge Graph
Connecting the Dots has become...
19
Multiple graph layers of financial information
Includes corporate data with cross-relationships and external news
Knowledge Graph with Queries
Connecting the Dots
Dashboards and tools
• Credit risk
• Investment risk
• Portfolio news recommendations
• Typical analyst portfolio is 200
companies
• Custom relative weights
1 Week Snapshot:
800,000 shortest path calculations for the
ranked newsfeed. Each calculation
optimized to take approximately 10 ms.
has become...
20
The Steps of Graph Data Science
Graph Feature
Engineering
Graph
Embeddings
Graph
Networks
21
Knowledge
Graphs
Graph
Analytics
Graph queries &
algorithms for
offline analysis
Understanding
Structures
Query
(e.g. Cypher)
Fast, local decisioning
and pattern matching
Graph Algorithms
(e.g. Neo4j Algorithms Library)
Global analysis
and iterations
You know what you’re
looking for and
making a decision
You’re learning the overall
structure of a network, updating
data, and predicting
Local Patterns Global Computation
22
Deceptively Simple Queries
How many flagged accounts are in the
applicant’s network 4+ hops out?
How many login / account variables
in common?
Add these metrics to your approval
process
Difficult for RDMS systems over 3 hops
Graph Analytics via Queries
Detecting Financial Fraud
Improving existing pipelines to identify fraud via heuristics
23
Graph Analytics via Algorithms
Generally Unsupervised
24
A subset of data science algorithms that come from network science,
Graph Algorithms enable reasoning about network structure.
Pathfinding
and Search
Centrality
(Importance)
Community
Detection
Heuristic
Link Prediction
Similarity
• Degree Centrality
• Closeness Centrality
• CC Variations: Harmonic, Dangalchev,
Wasserman & Faust
• Betweenness Centrality
• Approximate Betweenness Centrality
• PageRank
• Personalized PageRank
• ArticleRank
• Eigenvector Centrality
• Triangle Count
• Clustering Coefficients
• Connected Components (Union Find)
• Strongly Connected Components
• Label Propagation
• Louvain Modularity
• Balanced Triad (identification)
Graph Algorithms & Functions in Neo4j
• Shortest Path
• Single-Source Shortest Path
• All Pairs Shortest Path
• A* Shortest Path
• Yen’s K Shortest Path
• Minimum Weight Spanning Tree
• K-Spanning Tree (MST)
• Random Walk
• Degree Centrality
• Closeness Centrality
• CC Variations: Harmonic, Dangalchev,
Wasserman & Faust
• Betweenness Centrality & Approximate
• PageRank
• Personalized PageRank
• ArticleRank
• Eigenvector Centrality
• Triangle Count
• Clustering Coefficients
• Connected Components (Union Find)
• Strongly Connected Components
• Label Propagation
• Louvain Modularity
• K-1 Coloring
• Euclidean Distance
• Cosine Similarity
• Node Similarity (Jaccard)
• Overlap Similarity
• Pearson Similarity
• Approximate KNN
Pathfinding
& Search
Centrality /
Importance
Community
Detection
Similarity
Link
Prediction
• Adamic Adar
• Common Neighbors
• Preferential Attachment
• Resource Allocations
• Same Community
• Total Neighbors
...and also Auxiliary Functions:
• Random graph generation
• One hot encoding
• Distributions & metrics
45
Graph Algorithms
Detecting Financial Fraud
Graph algorithms enable reasoning
about network structure
Louvain to identify communities
that frequently interact
PageRank to measure influence
and transaction volumes
Connected components
identify disjointed group
sharing identifiers
Jaccard to measure account
similarity
26
The Steps of Graph Data Science
Graph
Embeddings
Graph
Networks
27
Knowledge
Graphs
Graph
Analytics
Graph Feature
Engineering
Graph algorithms
& queries for
machine learning
Improve Prediction
Accuracy
Graph Feature Engineering
Feature Engineering is combines and processes data to create new,
more meaningful features, such as clustering or connectivity metrics.
EXTRACTION
28
Client
Betweenness
Centrality
Unique Shared
Identifiers
Weighted
Score
Known
Fraudster?
Jacob Olsen 0 1 1 No
Kaylee Roach 32 2 4 Yes
Mackenzie Burns 0 0 0 No
Kayla Knowles 192 3 4 Yes
Nicholas Jones 0 1 2 No
John Smith 0.08 2 10 YesPaySim Dataset
Graph Feature Engineering
Feature Engineering is combines and processes data to create new,
more meaningful features, such as clustering or connectivity metrics.
29
Client
Betweenness
Centrality
Shared
Identifiers
Weighted
PageRank
Known
Fraudster?
Jacob Olsen 0 1 1 No
Kaylee Roach 32 2 4 Yes
Mackenzie Burns 0 0 0 No
Kayla Knowles 192 3 4 Yes
Nicholas Jones 0 1 2 No
John Smith 0.08 2 10 Yes
Machine Learning on
this
To Build a Predictive Model
The Steps of Graph Data Science
Decision
Support
Graph Based
Predictions
Graph Native
Learning
30
Graph Feature
Engineering
Graph
Embeddings
Graph
Networks
Knowledge
Graphs
Graph
Analytics
FUTURE
Neo4j GDS Library
Evolving the Graph Algorithms Library for Data Scientists
• Run optimized, parallel algorithms over 10’s Billions
of nodes
• Production features like seeding for consistency
• Scalable in-memory graph model that loads in
parallel, can flexibly aggregate & reshape underlying
data models
• Simplified syntax & API with easy to understand
guides, warnings, & errors messages
• Extensive documentation with examples, tips, and
browser guides
Preview
for Enterprise Graph Data Science
Neo4j Graph Data
Science Library
Practical, Scalable
Graph Data Science
Native Graph
Creation & Persistence
Neo4j
Database
Graph Exploration
& Prototyping
Neo4j
Bloom
Preview
Business
neo4j.com/use-cases/
artificial-intelligence-analytics/
Data Scientists
neo4j.com/sandbox
Developers
neo4j.com/download
neo4j.com
/graph-algorithms-book
Free Until April 15
34
“AI is not all about Machine
Learning.
Context, structure, and
reasoning are necessary
ingredients, and Knowledge
Graphs and Linked Data are
key technologies for this.”
Wais Bashir
Managing Editor, Onyx Advisory
35
Amy E. Hodler
@amyhodler
amy.hodler@neo4j.com

More Related Content

PDF
GraphTour 2020 - Graphs & AI: A Path for Data Science
PDF
Graph Data Science DEMO for fraud analysis
PDF
How Graph Algorithms Answer your Business Questions in Banking and Beyond
PDF
AI, ML and Graph Algorithms: Real Life Use Cases with Neo4j
PDF
3. Relationships Matter: Using Connected Data for Better Machine Learning
PDF
Relationships Matter: Using Connected Data for Better Machine Learning
PDF
4. Document Discovery with Graph Data Science
PDF
Real World Guide to Building Your Knowledge Graph
GraphTour 2020 - Graphs & AI: A Path for Data Science
Graph Data Science DEMO for fraud analysis
How Graph Algorithms Answer your Business Questions in Banking and Beyond
AI, ML and Graph Algorithms: Real Life Use Cases with Neo4j
3. Relationships Matter: Using Connected Data for Better Machine Learning
Relationships Matter: Using Connected Data for Better Machine Learning
4. Document Discovery with Graph Data Science
Real World Guide to Building Your Knowledge Graph

What's hot (20)

PDF
Neo4j: What's Under the Hood & How Knowing This Can Help You
PDF
Graph technology meetup slides
PDF
GraphTour London 2020 - Customer Journey
PDF
Real World Guide to Building Your Knowledge Graph
PDF
What Is GDS and Neo4j’s GDS Library
PDF
Leveraging Graphs for Better AI
ODP
How do You Graph
PDF
Illustrating Graphs Visually through Neo4j Bloom
PDF
Graph Databases and Graph Data Science in Neo4j
PDF
The Future is Big Graphs: A Community View on Graph Processing Systems
PPTX
Graphs in the Real World
PDF
GraphTour 2020 - Opening Keynote
PDF
Digital Graph tour Rome: "Connect the Dots, Lorenzo Speranzoni
PDF
A Picture is Worth 1,000 Rows
PDF
Knowledge Graphs - The Power of Graph-Based Search
PPTX
Graphs and Financial Services Analytics
PDF
Experiments With Knowledge Graphs in Fisheries & Oceans Canada
PDF
Insights Driven Intelligence through Knowledge Graphs
PDF
Improve ml predictions using graph algorithms (webinar july 23_19).pptx
PDF
Detecting eCommerce Fraud with Neo4j and Linkurious
Neo4j: What's Under the Hood & How Knowing This Can Help You
Graph technology meetup slides
GraphTour London 2020 - Customer Journey
Real World Guide to Building Your Knowledge Graph
What Is GDS and Neo4j’s GDS Library
Leveraging Graphs for Better AI
How do You Graph
Illustrating Graphs Visually through Neo4j Bloom
Graph Databases and Graph Data Science in Neo4j
The Future is Big Graphs: A Community View on Graph Processing Systems
Graphs in the Real World
GraphTour 2020 - Opening Keynote
Digital Graph tour Rome: "Connect the Dots, Lorenzo Speranzoni
A Picture is Worth 1,000 Rows
Knowledge Graphs - The Power of Graph-Based Search
Graphs and Financial Services Analytics
Experiments With Knowledge Graphs in Fisheries & Oceans Canada
Insights Driven Intelligence through Knowledge Graphs
Improve ml predictions using graph algorithms (webinar july 23_19).pptx
Detecting eCommerce Fraud with Neo4j and Linkurious
Ad

Similar to GraphTour London 2020 - Graphs for AI, Amy Hodler (20)

PPTX
How Graphs are Changing AI
PDF
How Graph Technology is Changing AI
PDF
How Graphs Enhance AI
PDF
Transforming AI with Graphs: Real World Examples using Spark and Neo4j
PDF
Transforming AI with Graphs: Real World Examples using Spark and Neo4j
PDF
Leveraging Graphs for Better AI
PPTX
Using Connected Data and Graph Technology to Enhance Machine Learning and Art...
PDF
01 introduction to graph data science
PDF
Graph Data Science with Neo4j: Nordics Webinar
PDF
Are You Underestimating the Value Within Your Data? A conversation about grap...
PDF
Leveraging Graphs for AI and ML - Alicia Frame, Neo4j
PDF
Graph Data Science: The Secret to Accelerating Innovation with AI/ML
PDF
Graphs for Data Science and Machine Learning
PDF
Graph Algorithms for Developers
PDF
Workshop - Neo4j Graph Data Science
PPTX
Graphs for Ai and ML
PDF
El camino hacia el éxito con las bases de datos de grafos, la ciencia de dato...
PDF
GraphSummit Toronto: Leveraging Graphs for AI and ML
PDF
Graphs for Finance - AML with Neo4j Graph Data Science
PDF
GPT and Graph Data Science to power your Knowledge Graph
How Graphs are Changing AI
How Graph Technology is Changing AI
How Graphs Enhance AI
Transforming AI with Graphs: Real World Examples using Spark and Neo4j
Transforming AI with Graphs: Real World Examples using Spark and Neo4j
Leveraging Graphs for Better AI
Using Connected Data and Graph Technology to Enhance Machine Learning and Art...
01 introduction to graph data science
Graph Data Science with Neo4j: Nordics Webinar
Are You Underestimating the Value Within Your Data? A conversation about grap...
Leveraging Graphs for AI and ML - Alicia Frame, Neo4j
Graph Data Science: The Secret to Accelerating Innovation with AI/ML
Graphs for Data Science and Machine Learning
Graph Algorithms for Developers
Workshop - Neo4j Graph Data Science
Graphs for Ai and ML
El camino hacia el éxito con las bases de datos de grafos, la ciencia de dato...
GraphSummit Toronto: Leveraging Graphs for AI and ML
Graphs for Finance - AML with Neo4j Graph Data Science
GPT and Graph Data Science to power your Knowledge Graph
Ad

More from Neo4j (20)

PDF
MASTERDECK GRAPHSUMMIT SYDNEY (Public).pdf
PDF
Jin Foo - Prospa GraphSummit Sydney Presentation.pdf
PDF
GraphSummit Singapore Master Deck - May 20, 2025
PPTX
Graphs & GraphRAG - Essential Ingredients for GenAI
PPTX
Neo4j Knowledge for Customer Experience.pptx
PPTX
GraphTalk New Zealand - The Art of The Possible.pptx
PDF
Neo4j: The Art of the Possible with Graph
PDF
Smarter Knowledge Graphs For Public Sector
PDF
GraphRAG and Knowledge Graphs Exploring AI's Future
PDF
Matinée GenAI & GraphRAG Paris - Décembre 24
PDF
ANZ Presentation: GraphSummit Melbourne 2024
PDF
Google Cloud Presentation GraphSummit Melbourne 2024: Building Generative AI ...
PDF
Telstra Presentation GraphSummit Melbourne: Optimising Business Outcomes with...
PDF
Hands-On GraphRAG Workshop: GraphSummit Melbourne 2024
PDF
Démonstration Digital Twin Building Wire Management
PDF
Swiss Life - Les graphes au service de la détection de fraude dans le domaine...
PDF
Démonstration Supply Chain - GraphTalk Paris
PDF
The Art of Possible - GraphTalk Paris Opening Session
PPTX
How Siemens bolstered supply chain resilience with graph-powered AI insights ...
PDF
Knowledge Graphs for AI-Ready Data and Enterprise Deployment - Gartner IT Sym...
MASTERDECK GRAPHSUMMIT SYDNEY (Public).pdf
Jin Foo - Prospa GraphSummit Sydney Presentation.pdf
GraphSummit Singapore Master Deck - May 20, 2025
Graphs & GraphRAG - Essential Ingredients for GenAI
Neo4j Knowledge for Customer Experience.pptx
GraphTalk New Zealand - The Art of The Possible.pptx
Neo4j: The Art of the Possible with Graph
Smarter Knowledge Graphs For Public Sector
GraphRAG and Knowledge Graphs Exploring AI's Future
Matinée GenAI & GraphRAG Paris - Décembre 24
ANZ Presentation: GraphSummit Melbourne 2024
Google Cloud Presentation GraphSummit Melbourne 2024: Building Generative AI ...
Telstra Presentation GraphSummit Melbourne: Optimising Business Outcomes with...
Hands-On GraphRAG Workshop: GraphSummit Melbourne 2024
Démonstration Digital Twin Building Wire Management
Swiss Life - Les graphes au service de la détection de fraude dans le domaine...
Démonstration Supply Chain - GraphTalk Paris
The Art of Possible - GraphTalk Paris Opening Session
How Siemens bolstered supply chain resilience with graph-powered AI insights ...
Knowledge Graphs for AI-Ready Data and Enterprise Deployment - Gartner IT Sym...

Recently uploaded (20)

PPTX
Data-Driven-Credit-Card-Launch-A-Wells-Fargo-Case-Study.pptx
PDF
Mastering Financial Analysis Materials.pdf
PPTX
Purple and Violet Modern Marketing Presentation (1).pptx
PPTX
lec_5(probability).pptxzzjsjsjsjsjsjjsjjssj
PPTX
Machine Learning Solution for Power Grid Cybersecurity with GraphWavelets
PPTX
batch data Retailer Data management Project.pptx
PPTX
Extract Transformation Load (3) (1).pptx
PPTX
Global journeys: estimating international migration
PPTX
Lecture 1 Intro in Inferential Statistics.pptx
PPTX
Web dev -ppt that helps us understand web technology
PDF
Company Presentation pada Perusahaan ADB.pdf
PDF
Data Analyst Certificate Programs for Beginners | IABAC
PPTX
1intro to AI.pptx AI components & composition
PPTX
LESSON-1-NATURE-OF-MATHEMATICS.pptx patterns
PPTX
Bharatiya Antariksh Hackathon 2025 Idea Submission PPT.pptx
PDF
Launch Your Data Science Career in Kochi – 2025
PDF
Master Databricks SQL with AccentFuture – The Future of Data Warehousing
PDF
Taxes Foundatisdcsdcsdon Certificate.pdf
PPT
Performance Implementation Review powerpoint
PPTX
Business Acumen Training GuidePresentation.pptx
Data-Driven-Credit-Card-Launch-A-Wells-Fargo-Case-Study.pptx
Mastering Financial Analysis Materials.pdf
Purple and Violet Modern Marketing Presentation (1).pptx
lec_5(probability).pptxzzjsjsjsjsjsjjsjjssj
Machine Learning Solution for Power Grid Cybersecurity with GraphWavelets
batch data Retailer Data management Project.pptx
Extract Transformation Load (3) (1).pptx
Global journeys: estimating international migration
Lecture 1 Intro in Inferential Statistics.pptx
Web dev -ppt that helps us understand web technology
Company Presentation pada Perusahaan ADB.pdf
Data Analyst Certificate Programs for Beginners | IABAC
1intro to AI.pptx AI components & composition
LESSON-1-NATURE-OF-MATHEMATICS.pptx patterns
Bharatiya Antariksh Hackathon 2025 Idea Submission PPT.pptx
Launch Your Data Science Career in Kochi – 2025
Master Databricks SQL with AccentFuture – The Future of Data Warehousing
Taxes Foundatisdcsdcsdon Certificate.pdf
Performance Implementation Review powerpoint
Business Acumen Training GuidePresentation.pptx

GraphTour London 2020 - Graphs for AI, Amy Hodler

  • 1. Graphs & AI A Path for Data Science Amy E. Hodler Director, Graph Analytics & AI Programs Neo4j @amyhodler
  • 2. It’s Not What You Know
  • 3. It’s Who You Know And Where They Are
  • 5. Photo by Helena Lopes on Unsplash Network Structure is highly predictive of pay and promotions • People Near Structural Holes • Organizational Misfits “Organizational Misfits and the Origins of Brokerage in Intrafirm Networks” A. Kleinbaum “Structural Holes and Good Ideas” R. Burt
  • 6. Relationships and Network Structure Strongest Predictors of Behavior & Complex Outcomes “Research into networks reveal that, surprisingly, the most connected people inside a tight group within a single industry are less valuable than the people who span the gaps ...” 6 “…jumping from ladder to ladder is a more effective strategy, and that lateral or even downward moves across an organization are more promising in the longer run . . .”
  • 8. Which is why network science is so powerful 8
  • 9. Overview Network Structure and Predictions Neo4j for Graph Data Science Steps of Graph Data Science
  • 11. Relationships The Strongest Predictors of Behavior! “Increasingly we're learning that you can make better predictions about people by getting all the information from their friends and their friends’ friends than you can from the information you have about the person themselves” James Fowler 11
  • 12. 823 1607 2439 3765 5824 0 1000 2000 3000 4000 5000 6000 7000 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 Graph Is Accelerating AI Innovation 12 AI Research Papers Featuring Graph Data Source: Dimensions knowledge system Graph Technology graph neural network graph convolutional graph embedding graph learning graph attention graph kernel graph completion
  • 13. Better Predictions with Graphs Using the Data You Already Have • Current data science models ignore network structure • Graphs add highly predictive features to ML models, increasing accuracy • Otherwise unattainable predictions based on relationships Machine Learning Pipeline 13
  • 14. 14 • 27 Million warranty & service documents parsed for text to knowledge graph • Graph is context for AI to learn “prime examples” and anticipate maintenance • Improves satisfaction and equipment lifespan • Connecting 50 research databases, 100k’s of Excel workbooks, 30 bio-sample databases • Bytes 4 Diabetes Award for use of a knowledge graph, graph analytics, and AI • Customized views for flexible research angles • Almost 70% of credit card fraud was missed • ~1B Nodes and +1B Relationships to analyze • Graph analytics with queries & algorithms help find $ millions of fraud in 1st year Neo4j for Graph Analytics, AI and Data Science Caterpillar’s AI Supply Chain & Maintenance German Center for Diabetes Research (DZD) Financial Fraud Detection & Recovery Top 10 Bank
  • 16. A Path for Graph Data Science
  • 17. The Steps of Graph Data Science Decision Support Graph Based Predictions Graph Native Learning 17 Graph Feature Engineering Graph Embeddings Graph Networks Knowledge Graphs Graph Analytics
  • 18. The Steps of Graph Data Science Graph Feature Engineering Graph Embeddings Graph Networks 18 Graph AnalyticsKnowledge Graphs Graph search and queries Support domain experts
  • 19. Knowledge Graph Connecting the Dots has become... 19 Multiple graph layers of financial information Includes corporate data with cross-relationships and external news
  • 20. Knowledge Graph with Queries Connecting the Dots Dashboards and tools • Credit risk • Investment risk • Portfolio news recommendations • Typical analyst portfolio is 200 companies • Custom relative weights 1 Week Snapshot: 800,000 shortest path calculations for the ranked newsfeed. Each calculation optimized to take approximately 10 ms. has become... 20
  • 21. The Steps of Graph Data Science Graph Feature Engineering Graph Embeddings Graph Networks 21 Knowledge Graphs Graph Analytics Graph queries & algorithms for offline analysis Understanding Structures
  • 22. Query (e.g. Cypher) Fast, local decisioning and pattern matching Graph Algorithms (e.g. Neo4j Algorithms Library) Global analysis and iterations You know what you’re looking for and making a decision You’re learning the overall structure of a network, updating data, and predicting Local Patterns Global Computation 22
  • 23. Deceptively Simple Queries How many flagged accounts are in the applicant’s network 4+ hops out? How many login / account variables in common? Add these metrics to your approval process Difficult for RDMS systems over 3 hops Graph Analytics via Queries Detecting Financial Fraud Improving existing pipelines to identify fraud via heuristics 23
  • 24. Graph Analytics via Algorithms Generally Unsupervised 24 A subset of data science algorithms that come from network science, Graph Algorithms enable reasoning about network structure. Pathfinding and Search Centrality (Importance) Community Detection Heuristic Link Prediction Similarity
  • 25. • Degree Centrality • Closeness Centrality • CC Variations: Harmonic, Dangalchev, Wasserman & Faust • Betweenness Centrality • Approximate Betweenness Centrality • PageRank • Personalized PageRank • ArticleRank • Eigenvector Centrality • Triangle Count • Clustering Coefficients • Connected Components (Union Find) • Strongly Connected Components • Label Propagation • Louvain Modularity • Balanced Triad (identification) Graph Algorithms & Functions in Neo4j • Shortest Path • Single-Source Shortest Path • All Pairs Shortest Path • A* Shortest Path • Yen’s K Shortest Path • Minimum Weight Spanning Tree • K-Spanning Tree (MST) • Random Walk • Degree Centrality • Closeness Centrality • CC Variations: Harmonic, Dangalchev, Wasserman & Faust • Betweenness Centrality & Approximate • PageRank • Personalized PageRank • ArticleRank • Eigenvector Centrality • Triangle Count • Clustering Coefficients • Connected Components (Union Find) • Strongly Connected Components • Label Propagation • Louvain Modularity • K-1 Coloring • Euclidean Distance • Cosine Similarity • Node Similarity (Jaccard) • Overlap Similarity • Pearson Similarity • Approximate KNN Pathfinding & Search Centrality / Importance Community Detection Similarity Link Prediction • Adamic Adar • Common Neighbors • Preferential Attachment • Resource Allocations • Same Community • Total Neighbors ...and also Auxiliary Functions: • Random graph generation • One hot encoding • Distributions & metrics 45
  • 26. Graph Algorithms Detecting Financial Fraud Graph algorithms enable reasoning about network structure Louvain to identify communities that frequently interact PageRank to measure influence and transaction volumes Connected components identify disjointed group sharing identifiers Jaccard to measure account similarity 26
  • 27. The Steps of Graph Data Science Graph Embeddings Graph Networks 27 Knowledge Graphs Graph Analytics Graph Feature Engineering Graph algorithms & queries for machine learning Improve Prediction Accuracy
  • 28. Graph Feature Engineering Feature Engineering is combines and processes data to create new, more meaningful features, such as clustering or connectivity metrics. EXTRACTION 28 Client Betweenness Centrality Unique Shared Identifiers Weighted Score Known Fraudster? Jacob Olsen 0 1 1 No Kaylee Roach 32 2 4 Yes Mackenzie Burns 0 0 0 No Kayla Knowles 192 3 4 Yes Nicholas Jones 0 1 2 No John Smith 0.08 2 10 YesPaySim Dataset
  • 29. Graph Feature Engineering Feature Engineering is combines and processes data to create new, more meaningful features, such as clustering or connectivity metrics. 29 Client Betweenness Centrality Shared Identifiers Weighted PageRank Known Fraudster? Jacob Olsen 0 1 1 No Kaylee Roach 32 2 4 Yes Mackenzie Burns 0 0 0 No Kayla Knowles 192 3 4 Yes Nicholas Jones 0 1 2 No John Smith 0.08 2 10 Yes Machine Learning on this To Build a Predictive Model
  • 30. The Steps of Graph Data Science Decision Support Graph Based Predictions Graph Native Learning 30 Graph Feature Engineering Graph Embeddings Graph Networks Knowledge Graphs Graph Analytics FUTURE
  • 31. Neo4j GDS Library Evolving the Graph Algorithms Library for Data Scientists • Run optimized, parallel algorithms over 10’s Billions of nodes • Production features like seeding for consistency • Scalable in-memory graph model that loads in parallel, can flexibly aggregate & reshape underlying data models • Simplified syntax & API with easy to understand guides, warnings, & errors messages • Extensive documentation with examples, tips, and browser guides Preview
  • 32. for Enterprise Graph Data Science Neo4j Graph Data Science Library Practical, Scalable Graph Data Science Native Graph Creation & Persistence Neo4j Database Graph Exploration & Prototyping Neo4j Bloom Preview
  • 34. 34 “AI is not all about Machine Learning. Context, structure, and reasoning are necessary ingredients, and Knowledge Graphs and Linked Data are key technologies for this.” Wais Bashir Managing Editor, Onyx Advisory