SlideShare a Scribd company logo
100K times faster apps.
In Memory Grids
Prateek Jain
Agenda
• In Memory Grids
– 10,000 foot view.
– Present scenario
– Why
– Why now
• Use Cases
• Types of In Memory Grids
– Compute Grid
– Data Grid
• Reference Architecture
• Sample application demo
• Further Resources
• Questions & Feedback
In Memory Grids
• 10,000 foot view
Breaking your problem to solve it using multiple resources on network.
Using main memory instead of Disk to do file I/O.
BigData landscape
Traditional App Associated Challenges
RDBMS -- Used to run many analytics
systems
Performance ( Not Real time), Scaling,
Cost++
CEP -- Designed to correlate data in real
time
Scaling (often necessary to aggregate
events into a centralized source ), not
designed for historical data.
Hadoop -- Designed for batch analytics and
complex correlation
Not designed for Real time.
NoSQL -- Designed to handle large data
volumes at low cost
Processing capability: Sheer amount of
data can be challenging.
IMDG -- Fast for storing and processing
data
Storing vast amounts of information in-
memory doesn’t scale, in terms of both
system scaling and cost
Different problems, so are the solutions.
Why ?
• Speed matters
– Citi : 100ms == $1 M
– Google : 500ms == 20% traffic drop
• Disk up to 107 times slower than RAM.
In Memory Grids• Why now?
– Hardware, ability++ and cost--
• 1TB RAM & 48 core cluster (can hold full week tweets) ~ $40K
Data Growth, PB DRAM Cost, $
BigData tech. plannedData is growing exponentially 30% drop each 12-18 months
Use Cases
• Trading Systems
– Handle large volume of transactions
• Real time risk analytics
– Analysis of trading positions and risk
• Online gaming
– Online real-time backbone for gaming
• Geo Mapping
– Real-time geographical route and traffic information
• Bio Informatics
– Real-time DNA sequencing and matching
In Memory Compute Grid
(IMCG)
In Memory Grids
1. In Memory -- Compute Grid.
Compute Grids allow you to take a computation, optionally split it into multiple parts, and
execute them on different grid nodes in parallel.
Functionality
• Distributed Execution Models - map-reduce, Streaming
Processing & CEP, MPP, MPI style
• Distributed Execution Management Services – task
distribution, failover, load balancing, collision resolution,
job stealing, redundant mapping support, task scheduling,
asynchronous reduction, task checkpoints
• Distributed Deployment & Provisioning.
• Distributed Resources Management - Automatic discovery
In Memory Data Grid
(IMDG)
In Memory Grids
2. In Memory -- Data Grid. (aka, Distributed data caching )
Provides applications with ability to keep data in memory for high availability rather than
constantly fetching it from slower storage elsewhere, like RDBMS or shared file systems.
IMDG ?
• Several JVMs sharing in-memory partitioned data.
• Provides extremely low latency access to,
and high availability of, application data by keeping it in
memory and to do so in a highly parallelized way.
• Support most of the Big Data processing requirements.
Common Features
• Distributed maps
• Caching , Evictions
• Code execution (executor service, map-reduce)
• Listeners
• Queries (SQL like)
• Pluggable indexing
• Hibernate L 2 cache (optional)
• ACID Transactions
• MapStore (write-behind, write-through, read-through)
• Optimized Serialization
Common Features
• The same object your business logic is using can be kept in the data grid.
• No extra step of marshaling and un-marshaling.
• Embeddable (optional)
Reference Architecture
IMDG is not a
• NoSQL database
• In Memory Database (IMDB)
• How?
• Support for true distributed ACID transactions with highly optimized 2PC protocol implementation.
• Scalable Data Partitioning across a cluster including both partitioned or fully replicated scenarios
• Ability to work directly with application domain objects rather than with primitive types or “documents”
• Tight integration with In-Memory Compute Grid (IMCG)
• Pluggable segmentation (a.k.a. "brain split" problem) resolution
• Pluggable expiration policies
• Pluggable indexing support
Further Reading
• http://www.ventanaresearch.com/uploadedFiles/Content/Landing_Pages/Ventana_Research_Big_
Data_Benchmark_Research_Presentation.pdf
• http://wikibon.org/wiki/v/Data_in_DRAM_is_a_Flash_in_the_Pan#Data_in_Memory_Solutions_for
_Real-Time_High-Performance_Transaction_Analytics
• http://www.gridgain.com/book/book.html
• http://java.dzone.com/articles/compute-grids-vs-data-grids
• http://www.infoq.com/articles/in-memory-data-grids
• http://natishalom.typepad.com/nati_shaloms_blog/2011/07/real-time-analytics-for-big-data-an-
alternative-approach-to-facebooks-new-realtime-analytics-system.html
• https://del.sapient.resultspace.com/scm/gmtechip/POCs/gridgain_risk_analytics

More Related Content

What's hot (20)

PDF
Distributed applications using Hazelcast
Taras Matyashovsky
 
KEY
Infinspan: In-memory data grid meets NoSQL
Manik Surtani
 
PPT
MongoDB Sharding Webinar 2014
Dylan Tong
 
PPTX
Webinar: The Performance Challenge: Providing an Amazing Customer Experience ...
DataStax
 
PPTX
Spring Meetup Paris - Getting Distributed with Hazelcast and Spring
Emrah Kocaman
 
PPTX
Querying Druid in SQL with Superset
DataWorks Summit
 
PPTX
Maximizing performance via tuning and optimization
MariaDB plc
 
PPTX
سکوهای ابری و مدل های برنامه نویسی در ابر
datastack
 
PPTX
Welcome: MariaDB today and our vision for the future
MariaDB plc
 
PPTX
Hazelcast For Beginners (Paris JUG-1)
Emrah Kocaman
 
PDF
5 Postgres DBA Tips
EDB
 
PPTX
DataStax | DataStax Enterprise Advanced Replication (Brian Hess & Cliff Gilmo...
DataStax
 
PPTX
Microsoft Dryad
Colin Clark
 
PDF
GCP Data Engineer cheatsheet
Guang Xu
 
PPTX
Webinar | Introducing DataStax Enterprise 4.6
DataStax
 
PPTX
Big Data Technologies and Why They Matter To R Users
Adaryl "Bob" Wakefield, MBA
 
PDF
#GeodeSummit: Combining Stream Processing and In-Memory Data Grids for Near-R...
PivotalOpenSourceHub
 
PDF
Cortana Analytics Workshop: The "Big Data" of the Cortana Analytics Suite, Pa...
MSAdvAnalytics
 
PPTX
Leveraging ApsaraDB to Deploy Business Data on the Cloud
Oliver Theobald
 
PDF
What database
Regunath B
 
Distributed applications using Hazelcast
Taras Matyashovsky
 
Infinspan: In-memory data grid meets NoSQL
Manik Surtani
 
MongoDB Sharding Webinar 2014
Dylan Tong
 
Webinar: The Performance Challenge: Providing an Amazing Customer Experience ...
DataStax
 
Spring Meetup Paris - Getting Distributed with Hazelcast and Spring
Emrah Kocaman
 
Querying Druid in SQL with Superset
DataWorks Summit
 
Maximizing performance via tuning and optimization
MariaDB plc
 
سکوهای ابری و مدل های برنامه نویسی در ابر
datastack
 
Welcome: MariaDB today and our vision for the future
MariaDB plc
 
Hazelcast For Beginners (Paris JUG-1)
Emrah Kocaman
 
5 Postgres DBA Tips
EDB
 
DataStax | DataStax Enterprise Advanced Replication (Brian Hess & Cliff Gilmo...
DataStax
 
Microsoft Dryad
Colin Clark
 
GCP Data Engineer cheatsheet
Guang Xu
 
Webinar | Introducing DataStax Enterprise 4.6
DataStax
 
Big Data Technologies and Why They Matter To R Users
Adaryl "Bob" Wakefield, MBA
 
#GeodeSummit: Combining Stream Processing and In-Memory Data Grids for Near-R...
PivotalOpenSourceHub
 
Cortana Analytics Workshop: The "Big Data" of the Cortana Analytics Suite, Pa...
MSAdvAnalytics
 
Leveraging ApsaraDB to Deploy Business Data on the Cloud
Oliver Theobald
 
What database
Regunath B
 

Viewers also liked (8)

PDF
Data Grids vs Databases
Galder Zamarreño
 
PDF
Data Grids and Data Caching
Galder Zamarreño
 
PDF
Devoxx uk 2014 High performance in-memory Java with open source
David Brimley
 
PDF
Hazelcast Striim Hot Cache Presentation
Steve Wilkes
 
PPTX
Apache ignite Datagrid
Surinder Mehra
 
PDF
Fast, In-Memory SQL on Apache Cassandra with Apache Ignite (Rachel Pedreschi,...
DataStax
 
PDF
Hazelcast 소개
sangyun han
 
PDF
[오픈소스컨설팅]오픈소스 클라우드 개발플랫폼_및_Docker의_이해_v1
Ji-Woong Choi
 
Data Grids vs Databases
Galder Zamarreño
 
Data Grids and Data Caching
Galder Zamarreño
 
Devoxx uk 2014 High performance in-memory Java with open source
David Brimley
 
Hazelcast Striim Hot Cache Presentation
Steve Wilkes
 
Apache ignite Datagrid
Surinder Mehra
 
Fast, In-Memory SQL on Apache Cassandra with Apache Ignite (Rachel Pedreschi,...
DataStax
 
Hazelcast 소개
sangyun han
 
[오픈소스컨설팅]오픈소스 클라우드 개발플랫폼_및_Docker의_이해_v1
Ji-Woong Choi
 
Ad

Similar to In memory grids IMDG (20)

PDF
GPU Acceleration for Financial Services
Kinetica
 
PPTX
Cloud infrastructure. Google File System and MapReduce - Andrii Vozniuk
Andrii Vozniuk
 
PPTX
The elephantintheroom bigdataanalyticsinthecloud
Khazret Sapenov
 
PDF
Webinar: SQL for Machine Data?
Crate.io
 
PDF
Operational-Analytics
Niloy Mukherjee
 
ODP
Big data nyu
Edward Capriolo
 
PDF
OSMC 2009 | Implementing a large monitoring infrastructure with Nagios and Ga...
NETWAYS
 
PPTX
From Data to Services at the Speed of Business
Ali Hodroj
 
PPTX
In_Memory_Computing_Presentation asasdasmfdaksfkasfjaskfsafasfsa
prabhakarchary3
 
PDF
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
DATAVERSITY
 
PDF
Wolfgang Lehner Technische Universitat Dresden
InfinIT - Innovationsnetværket for it
 
PPTX
Manta Unleashed BigDataSG talk 2 July 2013
Christopher Hogue
 
PDF
An overview of modern scalable web development
Tung Nguyen
 
PDF
Fast data in times of crisis with GPU accelerated database QikkDB | Business ...
Matej Misik
 
PDF
25 snowflake
剑飞 陈
 
PDF
Primitive Pursuits: Slaying Latency with Low-Level Primitives and Instructions
ScyllaDB
 
PDF
Learning from google megastore (Part-1)
Schubert Zhang
 
PPTX
Introduction to Cloud computing and Big Data-Hadoop
Nagarjuna D.N
 
PPTX
London Cloud Computing Meetup: From GigaSpaces to the Cloud - a demonstration...
Skills Matter
 
PPTX
A Survey of Advanced Non-relational Database Systems: Approaches and Applicat...
Qian Lin
 
GPU Acceleration for Financial Services
Kinetica
 
Cloud infrastructure. Google File System and MapReduce - Andrii Vozniuk
Andrii Vozniuk
 
The elephantintheroom bigdataanalyticsinthecloud
Khazret Sapenov
 
Webinar: SQL for Machine Data?
Crate.io
 
Operational-Analytics
Niloy Mukherjee
 
Big data nyu
Edward Capriolo
 
OSMC 2009 | Implementing a large monitoring infrastructure with Nagios and Ga...
NETWAYS
 
From Data to Services at the Speed of Business
Ali Hodroj
 
In_Memory_Computing_Presentation asasdasmfdaksfkasfjaskfsafasfsa
prabhakarchary3
 
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
DATAVERSITY
 
Wolfgang Lehner Technische Universitat Dresden
InfinIT - Innovationsnetværket for it
 
Manta Unleashed BigDataSG talk 2 July 2013
Christopher Hogue
 
An overview of modern scalable web development
Tung Nguyen
 
Fast data in times of crisis with GPU accelerated database QikkDB | Business ...
Matej Misik
 
25 snowflake
剑飞 陈
 
Primitive Pursuits: Slaying Latency with Low-Level Primitives and Instructions
ScyllaDB
 
Learning from google megastore (Part-1)
Schubert Zhang
 
Introduction to Cloud computing and Big Data-Hadoop
Nagarjuna D.N
 
London Cloud Computing Meetup: From GigaSpaces to the Cloud - a demonstration...
Skills Matter
 
A Survey of Advanced Non-relational Database Systems: Approaches and Applicat...
Qian Lin
 
Ad

Recently uploaded (20)

PDF
Alpha Altcoin Setup : TIA - 19th July 2025
CIFDAQ
 
PDF
The Future of Artificial Intelligence (AI)
Mukul
 
PPTX
The Yotta x CloudStack Advantage: Scalable, India-First Cloud
ShapeBlue
 
PPTX
PCU Keynote at IEEE World Congress on Services 250710.pptx
Ramesh Jain
 
PDF
Economic Impact of Data Centres to the Malaysian Economy
flintglobalapac
 
PDF
How ETL Control Logic Keeps Your Pipelines Safe and Reliable.pdf
Stryv Solutions Pvt. Ltd.
 
PPTX
Agile Chennai 18-19 July 2025 | Emerging patterns in Agentic AI by Bharani Su...
AgileNetwork
 
PDF
Per Axbom: The spectacular lies of maps
Nexer Digital
 
PPTX
Earn Agentblazer Status with Slack Community Patna.pptx
SanjeetMishra29
 
PPTX
Simple and concise overview about Quantum computing..pptx
mughal641
 
PPTX
Applied-Statistics-Mastering-Data-Driven-Decisions.pptx
parmaryashparmaryash
 
PPTX
AI Code Generation Risks (Ramkumar Dilli, CIO, Myridius)
Priyanka Aash
 
PDF
OFFOFFBOX™ – A New Era for African Film | Startup Presentation
ambaicciwalkerbrian
 
PDF
The Future of Mobile Is Context-Aware—Are You Ready?
iProgrammer Solutions Private Limited
 
PDF
GDG Cloud Munich - Intro - Luiz Carneiro - #BuildWithAI - July - Abdel.pdf
Luiz Carneiro
 
PDF
NewMind AI Weekly Chronicles – July’25, Week III
NewMind AI
 
PDF
How Current Advanced Cyber Threats Transform Business Operation
Eryk Budi Pratama
 
PPTX
AVL ( audio, visuals or led ), technology.
Rajeshwri Panchal
 
PDF
Make GenAI investments go further with the Dell AI Factory
Principled Technologies
 
PDF
RAT Builders - How to Catch Them All [DeepSec 2024]
malmoeb
 
Alpha Altcoin Setup : TIA - 19th July 2025
CIFDAQ
 
The Future of Artificial Intelligence (AI)
Mukul
 
The Yotta x CloudStack Advantage: Scalable, India-First Cloud
ShapeBlue
 
PCU Keynote at IEEE World Congress on Services 250710.pptx
Ramesh Jain
 
Economic Impact of Data Centres to the Malaysian Economy
flintglobalapac
 
How ETL Control Logic Keeps Your Pipelines Safe and Reliable.pdf
Stryv Solutions Pvt. Ltd.
 
Agile Chennai 18-19 July 2025 | Emerging patterns in Agentic AI by Bharani Su...
AgileNetwork
 
Per Axbom: The spectacular lies of maps
Nexer Digital
 
Earn Agentblazer Status with Slack Community Patna.pptx
SanjeetMishra29
 
Simple and concise overview about Quantum computing..pptx
mughal641
 
Applied-Statistics-Mastering-Data-Driven-Decisions.pptx
parmaryashparmaryash
 
AI Code Generation Risks (Ramkumar Dilli, CIO, Myridius)
Priyanka Aash
 
OFFOFFBOX™ – A New Era for African Film | Startup Presentation
ambaicciwalkerbrian
 
The Future of Mobile Is Context-Aware—Are You Ready?
iProgrammer Solutions Private Limited
 
GDG Cloud Munich - Intro - Luiz Carneiro - #BuildWithAI - July - Abdel.pdf
Luiz Carneiro
 
NewMind AI Weekly Chronicles – July’25, Week III
NewMind AI
 
How Current Advanced Cyber Threats Transform Business Operation
Eryk Budi Pratama
 
AVL ( audio, visuals or led ), technology.
Rajeshwri Panchal
 
Make GenAI investments go further with the Dell AI Factory
Principled Technologies
 
RAT Builders - How to Catch Them All [DeepSec 2024]
malmoeb
 

In memory grids IMDG

  • 1. 100K times faster apps. In Memory Grids Prateek Jain
  • 2. Agenda • In Memory Grids – 10,000 foot view. – Present scenario – Why – Why now • Use Cases • Types of In Memory Grids – Compute Grid – Data Grid • Reference Architecture • Sample application demo • Further Resources • Questions & Feedback
  • 3. In Memory Grids • 10,000 foot view Breaking your problem to solve it using multiple resources on network. Using main memory instead of Disk to do file I/O.
  • 4. BigData landscape Traditional App Associated Challenges RDBMS -- Used to run many analytics systems Performance ( Not Real time), Scaling, Cost++ CEP -- Designed to correlate data in real time Scaling (often necessary to aggregate events into a centralized source ), not designed for historical data. Hadoop -- Designed for batch analytics and complex correlation Not designed for Real time. NoSQL -- Designed to handle large data volumes at low cost Processing capability: Sheer amount of data can be challenging. IMDG -- Fast for storing and processing data Storing vast amounts of information in- memory doesn’t scale, in terms of both system scaling and cost Different problems, so are the solutions.
  • 5. Why ? • Speed matters – Citi : 100ms == $1 M – Google : 500ms == 20% traffic drop • Disk up to 107 times slower than RAM.
  • 6. In Memory Grids• Why now? – Hardware, ability++ and cost-- • 1TB RAM & 48 core cluster (can hold full week tweets) ~ $40K Data Growth, PB DRAM Cost, $ BigData tech. plannedData is growing exponentially 30% drop each 12-18 months
  • 7. Use Cases • Trading Systems – Handle large volume of transactions • Real time risk analytics – Analysis of trading positions and risk • Online gaming – Online real-time backbone for gaming • Geo Mapping – Real-time geographical route and traffic information • Bio Informatics – Real-time DNA sequencing and matching
  • 8. In Memory Compute Grid (IMCG)
  • 9. In Memory Grids 1. In Memory -- Compute Grid. Compute Grids allow you to take a computation, optionally split it into multiple parts, and execute them on different grid nodes in parallel.
  • 10. Functionality • Distributed Execution Models - map-reduce, Streaming Processing & CEP, MPP, MPI style • Distributed Execution Management Services – task distribution, failover, load balancing, collision resolution, job stealing, redundant mapping support, task scheduling, asynchronous reduction, task checkpoints • Distributed Deployment & Provisioning. • Distributed Resources Management - Automatic discovery
  • 11. In Memory Data Grid (IMDG)
  • 12. In Memory Grids 2. In Memory -- Data Grid. (aka, Distributed data caching ) Provides applications with ability to keep data in memory for high availability rather than constantly fetching it from slower storage elsewhere, like RDBMS or shared file systems.
  • 13. IMDG ? • Several JVMs sharing in-memory partitioned data. • Provides extremely low latency access to, and high availability of, application data by keeping it in memory and to do so in a highly parallelized way. • Support most of the Big Data processing requirements.
  • 14. Common Features • Distributed maps • Caching , Evictions • Code execution (executor service, map-reduce) • Listeners • Queries (SQL like) • Pluggable indexing • Hibernate L 2 cache (optional) • ACID Transactions • MapStore (write-behind, write-through, read-through) • Optimized Serialization
  • 15. Common Features • The same object your business logic is using can be kept in the data grid. • No extra step of marshaling and un-marshaling. • Embeddable (optional)
  • 17. IMDG is not a • NoSQL database • In Memory Database (IMDB) • How? • Support for true distributed ACID transactions with highly optimized 2PC protocol implementation. • Scalable Data Partitioning across a cluster including both partitioned or fully replicated scenarios • Ability to work directly with application domain objects rather than with primitive types or “documents” • Tight integration with In-Memory Compute Grid (IMCG) • Pluggable segmentation (a.k.a. "brain split" problem) resolution • Pluggable expiration policies • Pluggable indexing support
  • 18. Further Reading • http://www.ventanaresearch.com/uploadedFiles/Content/Landing_Pages/Ventana_Research_Big_ Data_Benchmark_Research_Presentation.pdf • http://wikibon.org/wiki/v/Data_in_DRAM_is_a_Flash_in_the_Pan#Data_in_Memory_Solutions_for _Real-Time_High-Performance_Transaction_Analytics • http://www.gridgain.com/book/book.html • http://java.dzone.com/articles/compute-grids-vs-data-grids • http://www.infoq.com/articles/in-memory-data-grids • http://natishalom.typepad.com/nati_shaloms_blog/2011/07/real-time-analytics-for-big-data-an- alternative-approach-to-facebooks-new-realtime-analytics-system.html • https://del.sapient.resultspace.com/scm/gmtechip/POCs/gridgain_risk_analytics