SlideShare a Scribd company logo
data modeling for apache
cassandra
with a sprinkle of C* background and some time series fun
dani traphagen
@dtrapezoid
quiz time - who is this?
but
how
do I
even
?
query driven methodology
conceptual data
model
access patterns
mapping
rules and
patterns
logical data
model
erd queries
diagram it
conceptual data modeling
• abstract view of the domain
• technology independent
• not specific to any database system
entity relationship diagram
• entity types
• relationship types
• attribute types
relationship keys
attribute key participation
• relationship attributes can make part
of the key
full conceptual model
query driven methodology
conceptual data
model
access patterns
mapping
rules and
patterns
logical data
model
erd queries
diagram it
application workflow
• each application has a workflow
• access patterns help us get how data
is accessed
• know your queries you will run before
you design your model
application workflow
query driven methodology
conceptual data
model
access patterns
mapping
rules and
patterns
logical data
model
erd queries
diagram it
mapping rules & patterns
• mapping rules ensure that a logical
data model is right
• each query has a corresponding table
• tables are designed to allow queries to
execute properly
• tables return data in the correct order
mapping rules
1. entities and
relationships
2. equality search
attributes
3. inequality search
attributes
4. ordering attributes
5. key attributes
entities and relationships
1. entity and relationship types map to
tables
2. entities and relationships map to
partitions or rows
3. partition may have data about one or
more entities and relationships
4. attributes are represented by
columns
entities and relationships
Each	relationship	becomes	a	row	in	the	table
Relationship	type	attributes	are	represented	by	columns
Queries	and	relationship	cardinality	affects	the	design	of	the	primary	key
• each relationship becomes a row in the table
• relationship type attributes are represented by columns
• queries and relationship cardinality affects the design
of the primary key
equality search attributes
• equality search attributes become
initial columns of a primary key
• querying on: title and type
inequality search attributes
• clustering columns follow partition
key columns in a primary key
• the column involved in an inequality
search must come after columns in
the primary key that are used in an
equality search
ordering attributes
1. ordering attributes become clustering columns
2. querying on: user_id = ? and
uploaded_timestamp > ?
3. ordering attributes : uploaded_timestamp (DESC)
key attributes
1. relationship type key attributes are included as
primary key columns
2. queries affect primary key design
query driven methodology
conceptual data
model
access patterns
mapping
rules and
patterns
logical data
model
erd queries
diagram it
logical data model
this
sucks
a real world example, w/time series data
This	is	using	simple	strategy,	but	don’t	do	that	in	prod	PLEASE!
Data Modeling for Apache Cassandra
Data Modeling for Apache Cassandra
Data Modeling for Apache Cassandra
Data Modeling for Apache Cassandra
Data Modeling for Apache Cassandra
time series bucketing
Data Modeling for Apache Cassandra
Data Modeling for Apache Cassandra
Data Modeling for Apache Cassandra
Data Modeling for Apache Cassandra
well dani this is all great and
everything…but just how do i get
started? hm? i mean i want to learn
cassandra and try it and everything
but it is just so daunting. i mean
you showed me all this stuff and i
am overwhelmed and it is sort of
making me insecure. what exactly
do i do next to learn more? how do i
really dig in and learn these
concepts? have you thought about
that? have you thought about what i
NEED?
HMMMM DANI, HAVE
YOU??!!!!!!!!!!11
Data Modeling for Apache Cassandra
& prepare to be wowed
cdm
https://github.com/riptano/cdm
cdm preview
namaste
but what does
it all mean?
also, we’re
hiring

More Related Content

What's hot (20)

PPT
Basic DBMS ppt
dangwalrajendra888
 
PPT
Object Oriented Analysis and Design
Haitham El-Ghareeb
 
PPTX
Dbms classification according to data models
ABDUL KHALIQ
 
DOCX
data replication
Hassanein Alwan
 
PPTX
Types of Database Models
Murassa Gillani
 
PPTX
Data models
KIRANPREET KAUR
 
PPT
Input output in linux
Dr. C.V. Suresh Babu
 
PPT
Data models
Usman Tariq
 
PPT
Planning for software quality assurance lecture 6
Abdul Basit
 
PPTX
Database recovery
Vritti Malhotra
 
PDF
Ddb 1.6-design issues
Esar Qasmi
 
PPTX
Chapter-1 Introduction to Database Management Systems
Kunal Anand
 
PPTX
Fundamentals of Data Modeling and Database Design by Dr. Kamal Gulati
Amity University | FMS - DU | IMT | Stratford University | KKMI International Institute | AIMA | DTU
 
PPTX
Architecture of data mining system
ramya marichamy
 
PPT
Introduction to SQL
Dr. Thippeswamy S.
 
PPTX
Database management system
krishna partiwala
 
PPTX
Data structure power point presentation
Anil Kumar Prajapati
 
PDF
Advance database systems (part 1)
Abdullah Khosa
 
Basic DBMS ppt
dangwalrajendra888
 
Object Oriented Analysis and Design
Haitham El-Ghareeb
 
Dbms classification according to data models
ABDUL KHALIQ
 
data replication
Hassanein Alwan
 
Types of Database Models
Murassa Gillani
 
Data models
KIRANPREET KAUR
 
Input output in linux
Dr. C.V. Suresh Babu
 
Data models
Usman Tariq
 
Planning for software quality assurance lecture 6
Abdul Basit
 
Database recovery
Vritti Malhotra
 
Ddb 1.6-design issues
Esar Qasmi
 
Chapter-1 Introduction to Database Management Systems
Kunal Anand
 
Fundamentals of Data Modeling and Database Design by Dr. Kamal Gulati
Amity University | FMS - DU | IMT | Stratford University | KKMI International Institute | AIMA | DTU
 
Architecture of data mining system
ramya marichamy
 
Introduction to SQL
Dr. Thippeswamy S.
 
Database management system
krishna partiwala
 
Data structure power point presentation
Anil Kumar Prajapati
 
Advance database systems (part 1)
Abdullah Khosa
 

Similar to Data Modeling for Apache Cassandra (20)

PDF
data-modeling-paper
Artem Chebotko
 
PPTX
Query driven development
João Paulo Leonidas Fernandes Dias da Silva
 
PDF
Using the Chebotko Method to Design Sound and Scalable Data Models for Apache...
Artem Chebotko
 
PDF
Cassandra
Lucian Neghina
 
PDF
Cassandra Data Modelling with CQL (OSCON 2015)
twentyideas
 
ODP
Cassandra Data Modelling
Knoldus Inc.
 
PDF
Cassandra Day Atlanta 2015: Data Modeling 101
DataStax Academy
 
PDF
Cassandra Day Chicago 2015: Apache Cassandra Data Modeling 101
DataStax Academy
 
PDF
Cassandra Day London 2015: Data Modeling 101
DataStax Academy
 
PDF
Introduction to data modeling with apache cassandra
Patrick McFadin
 
PPTX
Introduction to cassandra
Tarun Garg
 
PDF
The data model is dead, long live the data model
Patrick McFadin
 
PDF
Rigorous Cassandra Data Modeling for the Relational Data Architect
Artem Chebotko
 
PDF
DataStax: Rigorous Cassandra Data Modeling for the Relational Data Architect
DataStax Academy
 
PDF
Apache Cassandra & Data Modeling
Massimiliano Tomassi
 
PPTX
Apache Cassandra Developer Training Slide Deck
DataStax Academy
 
PPTX
Cassandra20141009
Brian Enochson
 
PDF
Introduction to Data Modeling with Apache Cassandra
Luke Tillman
 
DOCX
Cassandra data modelling best practices
Sandeep Sharma IIMK Smart City,IoT,Bigdata,Cloud,BI,DW
 
DOCX
llr+ cHApTEFt s Database Processing(2) Does this design e.docx
smile790243
 
data-modeling-paper
Artem Chebotko
 
Using the Chebotko Method to Design Sound and Scalable Data Models for Apache...
Artem Chebotko
 
Cassandra
Lucian Neghina
 
Cassandra Data Modelling with CQL (OSCON 2015)
twentyideas
 
Cassandra Data Modelling
Knoldus Inc.
 
Cassandra Day Atlanta 2015: Data Modeling 101
DataStax Academy
 
Cassandra Day Chicago 2015: Apache Cassandra Data Modeling 101
DataStax Academy
 
Cassandra Day London 2015: Data Modeling 101
DataStax Academy
 
Introduction to data modeling with apache cassandra
Patrick McFadin
 
Introduction to cassandra
Tarun Garg
 
The data model is dead, long live the data model
Patrick McFadin
 
Rigorous Cassandra Data Modeling for the Relational Data Architect
Artem Chebotko
 
DataStax: Rigorous Cassandra Data Modeling for the Relational Data Architect
DataStax Academy
 
Apache Cassandra & Data Modeling
Massimiliano Tomassi
 
Apache Cassandra Developer Training Slide Deck
DataStax Academy
 
Cassandra20141009
Brian Enochson
 
Introduction to Data Modeling with Apache Cassandra
Luke Tillman
 
Cassandra data modelling best practices
Sandeep Sharma IIMK Smart City,IoT,Bigdata,Cloud,BI,DW
 
llr+ cHApTEFt s Database Processing(2) Does this design e.docx
smile790243
 
Ad

More from DataStax Academy (20)

PDF
Forrester CXNYC 2017 - Delivering great real-time cx is a true craft
DataStax Academy
 
PPTX
Introduction to DataStax Enterprise Graph Database
DataStax Academy
 
PPTX
Introduction to DataStax Enterprise Advanced Replication with Apache Cassandra
DataStax Academy
 
PPTX
Cassandra on Docker @ Walmart Labs
DataStax Academy
 
PDF
Cassandra 3.0 Data Modeling
DataStax Academy
 
PPTX
Cassandra Adoption on Cisco UCS & Open stack
DataStax Academy
 
PDF
Coursera Cassandra Driver
DataStax Academy
 
PDF
Production Ready Cassandra
DataStax Academy
 
PDF
Cassandra @ Netflix: Monitoring C* at Scale, Gossip and Tickler & Python
DataStax Academy
 
PPTX
Cassandra @ Sony: The good, the bad, and the ugly part 1
DataStax Academy
 
PPTX
Cassandra @ Sony: The good, the bad, and the ugly part 2
DataStax Academy
 
PDF
Standing Up Your First Cluster
DataStax Academy
 
PDF
Real Time Analytics with Dse
DataStax Academy
 
PDF
Introduction to Data Modeling with Apache Cassandra
DataStax Academy
 
PDF
Cassandra Core Concepts
DataStax Academy
 
PPTX
Enabling Search in your Cassandra Application with DataStax Enterprise
DataStax Academy
 
PPTX
Bad Habits Die Hard
DataStax Academy
 
PDF
Advanced Data Modeling with Apache Cassandra
DataStax Academy
 
PDF
Advanced Cassandra
DataStax Academy
 
PDF
Apache Cassandra and Drivers
DataStax Academy
 
Forrester CXNYC 2017 - Delivering great real-time cx is a true craft
DataStax Academy
 
Introduction to DataStax Enterprise Graph Database
DataStax Academy
 
Introduction to DataStax Enterprise Advanced Replication with Apache Cassandra
DataStax Academy
 
Cassandra on Docker @ Walmart Labs
DataStax Academy
 
Cassandra 3.0 Data Modeling
DataStax Academy
 
Cassandra Adoption on Cisco UCS & Open stack
DataStax Academy
 
Coursera Cassandra Driver
DataStax Academy
 
Production Ready Cassandra
DataStax Academy
 
Cassandra @ Netflix: Monitoring C* at Scale, Gossip and Tickler & Python
DataStax Academy
 
Cassandra @ Sony: The good, the bad, and the ugly part 1
DataStax Academy
 
Cassandra @ Sony: The good, the bad, and the ugly part 2
DataStax Academy
 
Standing Up Your First Cluster
DataStax Academy
 
Real Time Analytics with Dse
DataStax Academy
 
Introduction to Data Modeling with Apache Cassandra
DataStax Academy
 
Cassandra Core Concepts
DataStax Academy
 
Enabling Search in your Cassandra Application with DataStax Enterprise
DataStax Academy
 
Bad Habits Die Hard
DataStax Academy
 
Advanced Data Modeling with Apache Cassandra
DataStax Academy
 
Advanced Cassandra
DataStax Academy
 
Apache Cassandra and Drivers
DataStax Academy
 
Ad

Recently uploaded (20)

PDF
Newgen 2022-Forrester Newgen TEI_13 05 2022-The-Total-Economic-Impact-Newgen-...
darshakparmar
 
PDF
LOOPS in C Programming Language - Technology
RishabhDwivedi43
 
DOCX
Python coding for beginners !! Start now!#
Rajni Bhardwaj Grover
 
PDF
Jak MŚP w Europie Środkowo-Wschodniej odnajdują się w świecie AI
dominikamizerska1
 
PDF
Building Real-Time Digital Twins with IBM Maximo & ArcGIS Indoors
Safe Software
 
PDF
Agentic AI lifecycle for Enterprise Hyper-Automation
Debmalya Biswas
 
PPTX
AUTOMATION AND ROBOTICS IN PHARMA INDUSTRY.pptx
sameeraaabegumm
 
PPTX
"Autonomy of LLM Agents: Current State and Future Prospects", Oles` Petriv
Fwdays
 
PDF
Smart Trailers 2025 Update with History and Overview
Paul Menig
 
PDF
Achieving Consistent and Reliable AI Code Generation - Medusa AI
medusaaico
 
PDF
[Newgen] NewgenONE Marvin Brochure 1.pdf
darshakparmar
 
PDF
Reverse Engineering of Security Products: Developing an Advanced Microsoft De...
nwbxhhcyjv
 
PPTX
Q2 FY26 Tableau User Group Leader Quarterly Call
lward7
 
PDF
Exolore The Essential AI Tools in 2025.pdf
Srinivasan M
 
PDF
CIFDAQ Market Wrap for the week of 4th July 2025
CIFDAQ
 
PPTX
The Project Compass - GDG on Campus MSIT
dscmsitkol
 
PDF
Go Concurrency Real-World Patterns, Pitfalls, and Playground Battles.pdf
Emily Achieng
 
PPTX
OpenID AuthZEN - Analyst Briefing July 2025
David Brossard
 
PDF
CIFDAQ Market Insights for July 7th 2025
CIFDAQ
 
PDF
"AI Transformation: Directions and Challenges", Pavlo Shaternik
Fwdays
 
Newgen 2022-Forrester Newgen TEI_13 05 2022-The-Total-Economic-Impact-Newgen-...
darshakparmar
 
LOOPS in C Programming Language - Technology
RishabhDwivedi43
 
Python coding for beginners !! Start now!#
Rajni Bhardwaj Grover
 
Jak MŚP w Europie Środkowo-Wschodniej odnajdują się w świecie AI
dominikamizerska1
 
Building Real-Time Digital Twins with IBM Maximo & ArcGIS Indoors
Safe Software
 
Agentic AI lifecycle for Enterprise Hyper-Automation
Debmalya Biswas
 
AUTOMATION AND ROBOTICS IN PHARMA INDUSTRY.pptx
sameeraaabegumm
 
"Autonomy of LLM Agents: Current State and Future Prospects", Oles` Petriv
Fwdays
 
Smart Trailers 2025 Update with History and Overview
Paul Menig
 
Achieving Consistent and Reliable AI Code Generation - Medusa AI
medusaaico
 
[Newgen] NewgenONE Marvin Brochure 1.pdf
darshakparmar
 
Reverse Engineering of Security Products: Developing an Advanced Microsoft De...
nwbxhhcyjv
 
Q2 FY26 Tableau User Group Leader Quarterly Call
lward7
 
Exolore The Essential AI Tools in 2025.pdf
Srinivasan M
 
CIFDAQ Market Wrap for the week of 4th July 2025
CIFDAQ
 
The Project Compass - GDG on Campus MSIT
dscmsitkol
 
Go Concurrency Real-World Patterns, Pitfalls, and Playground Battles.pdf
Emily Achieng
 
OpenID AuthZEN - Analyst Briefing July 2025
David Brossard
 
CIFDAQ Market Insights for July 7th 2025
CIFDAQ
 
"AI Transformation: Directions and Challenges", Pavlo Shaternik
Fwdays
 

Data Modeling for Apache Cassandra