SlideShare a Scribd company logo
Hands-On Network
Science
Colleen M. Farrelly, Post Urban Ventures
Yae Ulrich Gaba, Quantum Leap Africa
Franck Kalala Mutumbo, University of Lubumbashi
• Set with defined relationships across
items in the set
• Examples:
• People connected to each other on
social media
• Geographic areas connected by
animal migration patterns
• Stocks connected by buyer behavior
• Goods connected by supply chains
• Ideas connected semantically
Network Structures
• Hubs
• Densely-connected regions
• Tight-knit friends groups, cities with
many international flights, watering
holes where many animals
congregate
• Bridges
• Connections between regions
• Individuals that span many social
groups, manufacturers that provide
common parts to many industries,
common food sources for many
types of animals
• More computationally feasible for many data
science problems than traditional approaches
• Spatial regression vs. network science for change point
detection
• Time series methods vs. network science methods
• Nice visual representation of data and algorithms
• Many deep connections to mathematics
• Topology
• Geometry
• Dynamic systems
Case Studies
Case 1:
Epidemic
Spread
Problem
Predict and stop epidemic
starting on a friendship
network
Case 1:
Epidemic
Spread Data
Collected dataset of friendships
Static relationships (no changes over
time)
Represents medical school friendships
and veterans’ group friendships
Theoretical disease spread to predict
severity within network and strategies
to prevent disease spread
Case 1: Epidemic
Spread Methods
• SIR model
• System of differential
equations
• Adapted for connectivity of
network
• Forman-Ricci curvature
• Geometric measurement of
centrality
• Removal of highest-ranked
vertex (highest risk for
epidemic spread)
Case 2: Stock Market
Prediction Problem
• American stock exchange crash
forecasting
• Change-point problem in time series
analytics
• Caveats of non-stationary data
• Difficult to model time series data at scale
Case 2: Stock Market
Prediction Data
• Apple, Alphabet, Nvidia, and Microsoft
• 8/19/2004-4/1/2020
• Periodic trends of constant growth, crashes, and accelerated growth that
sometimes overlap across stocks (and sometimes doesn’t!)
Case 2: Stock Market
Prediction Methods
• Overlapping time windows
• Thresholded correlation
networks
• Changes in Forman-Ricci
curvature, betweenness
centrality, PageRank
centrality, and degree
centrality to assess risk
• Predicting millet price in markets
across Burkina Faso
• Impacted by supply chain and
global trends (COVID 19, war in
Ukraine…)
• Spatiotemporal aspects
• Computational cost of spatial
regression models
• Non-stationarity
• Quarterly millet prices
• Time period of 2015
(Quarter 2) to 2022
(Quarter 2)
• 45 administrative
provinces (averaged
market prices)
Case 3: Food Pricing
Methods
• Overlapping time windows
• Local Moran statistic thresholding to
create network
• PageRank and Forman-Ricci curvature
centrality to assess risk
• Benefits of network science
approaches
• Computational feasibility
• Easy visualizations
• Interpretable results
• Future directions
• Spatiotemporal data applications
• Temporal data applications
• Scaling of problems
Software Packages
• Python
• igraph
• networkX
• R
• Igraph
• Books
• The Shape of Data

More Related Content

Similar to Hands-On Network Science, PyData Global 2023 (20)

PDF
Social And Economic Networks Matthew O Jackson
depybuchta
 
PPTX
Small Worlds Social Graphs Social Media
suresh sood
 
PDF
Social Media Mining _indian edition available.pdf
RohithVaidya
 
PDF
Social Media Mining _indian edition available.pdf
RohithVaidya
 
PPTX
Network sampling, community detection
roberval mariano
 
PPT
Financial Networks and Financial Stability
Kimmo Soramaki
 
PDF
Graph algorithms - Notes
Thach-Ngoc TRAN
 
PDF
Visualize Big Graph Data
Mathieu Bastian
 
PDF
Descobrindo o tesouro escondido nos seus dados usando grafos.
Ana Appel
 
PDF
Thinking in networks: what it means for policy makers
Alberto Cottica
 
PPTX
Session 09 learning relationships.pptx
Sara-Jayne Terp
 
PPTX
Session 09 learning relationships.pptx
bodaceacat
 
PDF
Gephi icwsm-tutorial
csedays
 
PDF
SP1: Exploratory Network Analysis with Gephi
John Breslin
 
PPT
An Introduction to Network Theory
Socialphysicist
 
PPTX
social.pptx
NISHASOMSCS113
 
PPTX
Network analysis lecture
Sara-Jayne Terp
 
PPTX
[20240610_LabSeminar_Huy]CausalGNN: Causal-Based Graph Neural Networks for Sp...
thanhdowork
 
PPTX
06 Community Detection
Duke Network Analysis Center
 
PDF
Adaptive network models of socio-cultural dynamics
Hiroki Sayama
 
Social And Economic Networks Matthew O Jackson
depybuchta
 
Small Worlds Social Graphs Social Media
suresh sood
 
Social Media Mining _indian edition available.pdf
RohithVaidya
 
Social Media Mining _indian edition available.pdf
RohithVaidya
 
Network sampling, community detection
roberval mariano
 
Financial Networks and Financial Stability
Kimmo Soramaki
 
Graph algorithms - Notes
Thach-Ngoc TRAN
 
Visualize Big Graph Data
Mathieu Bastian
 
Descobrindo o tesouro escondido nos seus dados usando grafos.
Ana Appel
 
Thinking in networks: what it means for policy makers
Alberto Cottica
 
Session 09 learning relationships.pptx
Sara-Jayne Terp
 
Session 09 learning relationships.pptx
bodaceacat
 
Gephi icwsm-tutorial
csedays
 
SP1: Exploratory Network Analysis with Gephi
John Breslin
 
An Introduction to Network Theory
Socialphysicist
 
social.pptx
NISHASOMSCS113
 
Network analysis lecture
Sara-Jayne Terp
 
[20240610_LabSeminar_Huy]CausalGNN: Causal-Based Graph Neural Networks for Sp...
thanhdowork
 
06 Community Detection
Duke Network Analysis Center
 
Adaptive network models of socio-cultural dynamics
Hiroki Sayama
 

More from Colleen Farrelly (20)

PPTX
Generative AI for Social Good at Open Data Science East 2024
Colleen Farrelly
 
PPTX
Modeling Climate Change.pptx
Colleen Farrelly
 
PPTX
Natural Language Processing for Beginners.pptx
Colleen Farrelly
 
PPTX
The Shape of Data--ODSC.pptx
Colleen Farrelly
 
PPTX
Generative AI, WiDS 2023.pptx
Colleen Farrelly
 
PPTX
Emerging Technologies for Public Health in Remote Locations.pptx
Colleen Farrelly
 
PPTX
Applications of Forman-Ricci Curvature.pptx
Colleen Farrelly
 
PPTX
Geometry for Social Good.pptx
Colleen Farrelly
 
PPTX
Topology for Time Series.pptx
Colleen Farrelly
 
PPTX
Time Series Applications AMLD.pptx
Colleen Farrelly
 
PPTX
An introduction to quantum machine learning.pptx
Colleen Farrelly
 
PPTX
An introduction to time series data with R.pptx
Colleen Farrelly
 
PPTX
NLP: Challenges and Opportunities in Underserved Areas
Colleen Farrelly
 
PPTX
Geometry, Data, and One Path Into Data Science.pptx
Colleen Farrelly
 
PPTX
Topological Data Analysis.pptx
Colleen Farrelly
 
PPTX
Transforming Text Data to Matrix Data via Embeddings.pptx
Colleen Farrelly
 
PPTX
Natural Language Processing in the Wild.pptx
Colleen Farrelly
 
PPTX
SAS Global 2021 Introduction to Natural Language Processing
Colleen Farrelly
 
PPTX
2021 American Mathematical Society Data Science Talk
Colleen Farrelly
 
PPTX
Technical aspects of writing poetry II--sounds
Colleen Farrelly
 
Generative AI for Social Good at Open Data Science East 2024
Colleen Farrelly
 
Modeling Climate Change.pptx
Colleen Farrelly
 
Natural Language Processing for Beginners.pptx
Colleen Farrelly
 
The Shape of Data--ODSC.pptx
Colleen Farrelly
 
Generative AI, WiDS 2023.pptx
Colleen Farrelly
 
Emerging Technologies for Public Health in Remote Locations.pptx
Colleen Farrelly
 
Applications of Forman-Ricci Curvature.pptx
Colleen Farrelly
 
Geometry for Social Good.pptx
Colleen Farrelly
 
Topology for Time Series.pptx
Colleen Farrelly
 
Time Series Applications AMLD.pptx
Colleen Farrelly
 
An introduction to quantum machine learning.pptx
Colleen Farrelly
 
An introduction to time series data with R.pptx
Colleen Farrelly
 
NLP: Challenges and Opportunities in Underserved Areas
Colleen Farrelly
 
Geometry, Data, and One Path Into Data Science.pptx
Colleen Farrelly
 
Topological Data Analysis.pptx
Colleen Farrelly
 
Transforming Text Data to Matrix Data via Embeddings.pptx
Colleen Farrelly
 
Natural Language Processing in the Wild.pptx
Colleen Farrelly
 
SAS Global 2021 Introduction to Natural Language Processing
Colleen Farrelly
 
2021 American Mathematical Society Data Science Talk
Colleen Farrelly
 
Technical aspects of writing poetry II--sounds
Colleen Farrelly
 

Recently uploaded (20)

PDF
Hello I'm "AI" Your New _________________
Dr. Tathagat Varma
 
PPTX
MARTSIA: A Tool for Confidential Data Exchange via Public Blockchain - Pitch ...
Michele Kryston
 
PDF
Darley - FIRST Copenhagen Lightning Talk (2025-06-26) Epochalypse 2038 - Time...
treyka
 
PPTX
𝙳𝚘𝚠𝚗𝚕𝚘𝚊𝚍—Wondershare Filmora Crack 14.0.7 + Key Download 2025
sebastian aliya
 
PPTX
Practical Applications of AI in Local Government
OnBoard
 
PDF
Redefining Work in the Age of AI - What to expect? How to prepare? Why it mat...
Malinda Kapuruge
 
PDF
Kubernetes - Architecture & Components.pdf
geethak285
 
PDF
Python Conference Singapore - 19 Jun 2025
ninefyi
 
PPTX
reInforce 2025 Lightning Talk - Scott Francis.pptx
ScottFrancis51
 
PPTX
Simplifica la seguridad en la nube y la detección de amenazas con FortiCNAPP
Cristian Garcia G.
 
PDF
Database Benchmarking for Performance Masterclass: Session 2 - Data Modeling ...
ScyllaDB
 
PDF
EIS-Webinar-Engineering-Retail-Infrastructure-06-16-2025.pdf
Earley Information Science
 
PDF
“Scaling i.MX Applications Processors’ Native Edge AI with Discrete AI Accele...
Edge AI and Vision Alliance
 
PDF
The Future of Product Management in AI ERA.pdf
Alyona Owens
 
PDF
Database Benchmarking for Performance Masterclass: Session 1 - Benchmarking F...
ScyllaDB
 
PDF
How to Visualize the ​Spatio-Temporal Data Using CesiumJS​
SANGHEE SHIN
 
PPTX
Curietech AI in action - Accelerate MuleSoft development
shyamraj55
 
PDF
Hyderabad MuleSoft In-Person Meetup (June 21, 2025) Slides
Ravi Tamada
 
PDF
5 Things to Consider When Deploying AI in Your Enterprise
Safe Software
 
PDF
Why aren't you using FME Flow's CPU Time?
Safe Software
 
Hello I'm "AI" Your New _________________
Dr. Tathagat Varma
 
MARTSIA: A Tool for Confidential Data Exchange via Public Blockchain - Pitch ...
Michele Kryston
 
Darley - FIRST Copenhagen Lightning Talk (2025-06-26) Epochalypse 2038 - Time...
treyka
 
𝙳𝚘𝚠𝚗𝚕𝚘𝚊𝚍—Wondershare Filmora Crack 14.0.7 + Key Download 2025
sebastian aliya
 
Practical Applications of AI in Local Government
OnBoard
 
Redefining Work in the Age of AI - What to expect? How to prepare? Why it mat...
Malinda Kapuruge
 
Kubernetes - Architecture & Components.pdf
geethak285
 
Python Conference Singapore - 19 Jun 2025
ninefyi
 
reInforce 2025 Lightning Talk - Scott Francis.pptx
ScottFrancis51
 
Simplifica la seguridad en la nube y la detección de amenazas con FortiCNAPP
Cristian Garcia G.
 
Database Benchmarking for Performance Masterclass: Session 2 - Data Modeling ...
ScyllaDB
 
EIS-Webinar-Engineering-Retail-Infrastructure-06-16-2025.pdf
Earley Information Science
 
“Scaling i.MX Applications Processors’ Native Edge AI with Discrete AI Accele...
Edge AI and Vision Alliance
 
The Future of Product Management in AI ERA.pdf
Alyona Owens
 
Database Benchmarking for Performance Masterclass: Session 1 - Benchmarking F...
ScyllaDB
 
How to Visualize the ​Spatio-Temporal Data Using CesiumJS​
SANGHEE SHIN
 
Curietech AI in action - Accelerate MuleSoft development
shyamraj55
 
Hyderabad MuleSoft In-Person Meetup (June 21, 2025) Slides
Ravi Tamada
 
5 Things to Consider When Deploying AI in Your Enterprise
Safe Software
 
Why aren't you using FME Flow's CPU Time?
Safe Software
 

Hands-On Network Science, PyData Global 2023

  • 1. Hands-On Network Science Colleen M. Farrelly, Post Urban Ventures Yae Ulrich Gaba, Quantum Leap Africa Franck Kalala Mutumbo, University of Lubumbashi
  • 2. • Set with defined relationships across items in the set • Examples: • People connected to each other on social media • Geographic areas connected by animal migration patterns • Stocks connected by buyer behavior • Goods connected by supply chains • Ideas connected semantically
  • 3. Network Structures • Hubs • Densely-connected regions • Tight-knit friends groups, cities with many international flights, watering holes where many animals congregate • Bridges • Connections between regions • Individuals that span many social groups, manufacturers that provide common parts to many industries, common food sources for many types of animals
  • 4. • More computationally feasible for many data science problems than traditional approaches • Spatial regression vs. network science for change point detection • Time series methods vs. network science methods • Nice visual representation of data and algorithms • Many deep connections to mathematics • Topology • Geometry • Dynamic systems
  • 6. Case 1: Epidemic Spread Problem Predict and stop epidemic starting on a friendship network
  • 7. Case 1: Epidemic Spread Data Collected dataset of friendships Static relationships (no changes over time) Represents medical school friendships and veterans’ group friendships Theoretical disease spread to predict severity within network and strategies to prevent disease spread
  • 8. Case 1: Epidemic Spread Methods • SIR model • System of differential equations • Adapted for connectivity of network • Forman-Ricci curvature • Geometric measurement of centrality • Removal of highest-ranked vertex (highest risk for epidemic spread)
  • 9. Case 2: Stock Market Prediction Problem • American stock exchange crash forecasting • Change-point problem in time series analytics • Caveats of non-stationary data • Difficult to model time series data at scale
  • 10. Case 2: Stock Market Prediction Data • Apple, Alphabet, Nvidia, and Microsoft • 8/19/2004-4/1/2020 • Periodic trends of constant growth, crashes, and accelerated growth that sometimes overlap across stocks (and sometimes doesn’t!)
  • 11. Case 2: Stock Market Prediction Methods • Overlapping time windows • Thresholded correlation networks • Changes in Forman-Ricci curvature, betweenness centrality, PageRank centrality, and degree centrality to assess risk
  • 12. • Predicting millet price in markets across Burkina Faso • Impacted by supply chain and global trends (COVID 19, war in Ukraine…) • Spatiotemporal aspects • Computational cost of spatial regression models • Non-stationarity
  • 13. • Quarterly millet prices • Time period of 2015 (Quarter 2) to 2022 (Quarter 2) • 45 administrative provinces (averaged market prices)
  • 14. Case 3: Food Pricing Methods • Overlapping time windows • Local Moran statistic thresholding to create network • PageRank and Forman-Ricci curvature centrality to assess risk
  • 15. • Benefits of network science approaches • Computational feasibility • Easy visualizations • Interpretable results • Future directions • Spatiotemporal data applications • Temporal data applications • Scaling of problems
  • 16. Software Packages • Python • igraph • networkX • R • Igraph • Books • The Shape of Data