SlideShare a Scribd company logo
4
Most read
5
Most read
7
Most read
IN-MEMORY BIG DATA
ANALYTICS
Supreeth MP
1st sem BDA
18/9/2017 1
Table of Contents:
1. Data is growing
2. What is In-Memory analytics?
3. Why In-Memory Now?
4. The landscape of disk-based and in-memory data management systems
5. In-Memory vsTraditional (on-disk) database management system
6. Optimization Aspects on In-Memory Data Management and Processing
7. Some questions on in-memory analytics
8. References
18/9/2017 2
Data is growing:
• Continuous flow of data
• Real-time, 24/7 streaming updates
• More than 2.5 quintillion bytes of data added daily
• Data is always available
• Democratization of data
• Main source for business decisions
• Shift to digital and STP
• Affordable technology
• Better and faster analytics
• Business Intelligence
• Cloud and subscription-based computing
18/9/2017 3
What is In-Memory analytics?
An in-memory analytics system basically is a database management system that
stores data entirely in main memory that is in the RAM.This contrasts to traditional
(on-disk) database systems, which are designed for data storage on persistent media
such as hardisk. Because working with data in memory is much faster than writing to
and reading from a file system.
In-memory is ideal when:
• Your database is too slow for interactive analytics
• You need to perform real-time data analytics
• You need to be offline and can't connect to your data live
18/9/2017 4
Why In-Memory Now?
• RAM is 200 times faster than disk storage and typically enables data access 50 to 100 times
quicker
• Memory storage capacity and bandwidth have been doubling roughly every three years,
while its price has been dropping by a factor of 10 every five years.
• Modern high-end servers now have multiple sockets, each of which can have tens or
hundreds of gigabytes of DRAM
• Growth of distributed systems
• The increasing adoption of 64-bit computer technology has made RAM more suitable for
use with large datasets.
• Database systems have been evolving over the last few decades.
18/9/2017 5
The landscape of disk-based and in-memory
data management systems:
18/9/2017 6
In-Memory vsTraditional (on-disk) database
management system:
18/9/2017 7
In-Memory vsTraditional (on-disk) database
management system:
Aspects DBDMS IMDBS
File I/O Carries File I/O burden No file I/O burden
Storage Usage Assumes storage is abundant Uses storage more efficiently
Algorithms Algorithm optimized for disk Algorithms optimized for memory
CPU Cycles More CPU cycles Less CPU cycles
Persistence Non-volatile Volatile
Lock Fine Locks Coarse Locks
18/9/2017 8
In-Memory vsTraditional (on-disk) database
management system:
18/9/2017 9
Optimization Aspects on In-Memory Data
Management and Processing:
Aspects Concerns Techniques
Index Cache consciousness, time/space
efficiency
Hash-based, tree-based
Data Layout Cache consciousness, space efficiency Columnar layout
Parallelism Linear scaling, partitioning Data-level, shared-memory scale-up and
shared-nothing scale out parallelism
Concurrency
Control
Overhead, correctness Coarse-grained locks
Query Processing Code locality, register temporal locality,
time efficiency
Coarse-grained stored procedures
Fault tolerance Durability, correlated failures, availability Checkpoints andTransaction logging
Data Overflow Locality, Paging strategy, hot/cold
classification
Anti-caching
18/9/2017 10
Some questions on in-memory analytics:
• What do companies need to think about as they take on an in-memory analytics path?
• What are some potential speed bumps in adopting in-memory analytics?
• What role do skills play here?
• If an in-memory database system boosts performance by holding all records in memory,
can’t we get the same result by creating a RAM disk and deploying a traditional database
there?
• Won’t an in-memory database require huge amounts of memory because database systems
are large?
• Isn’t the database just lost if there’s a system crash?
18/9/2017 11
References:
[1] In-Memory Big Data Management and Processing:A Survey, IEEETRANSACTIONSON
KNOWLEDGEAND DATA ENGINEERING JULY 2015
[2] Using In-MemoryAnalytics to Quickly Crunch Big Data by Lee Garber
[3] https://www.sas.com/en_us/insights/articles/big-data/in-memory-analytics-questions.html
[4] DataAnalytics using In-MemoryComputing: https://www.gridgain.com/
[5] How Computers Work: Disks And Secondary Storage:
http://homepage.cs.uri.edu/faculty/wolfe/book/Readings/Reading05.htm
[6] http://www.mcobject.com/in_memory_database
[7] In-Memory DatabaseComputing – Smarter way of data analysis:
http://www.xoriant.com/blog/big-data-analytics/memory-database-computing-faster-smarter-
analysis-big-data-world.html
[8] How Computers Work:The CPU and Memory:
http://homepage.cs.uri.edu/book/cpu_memory/cpu_memory.htm
18/9/2017 12
THANKYOU

More Related Content

What's hot (20)

PDF
10 Lessons Learned from Building Machine Learning Systems
Xavier Amatriain
 
PPT
20. Parallel Databases in DBMS
koolkampus
 
PPTX
What Is Apache Spark? | Introduction To Apache Spark | Apache Spark Tutorial ...
Simplilearn
 
PPTX
Mining Data Streams
SujaAldrin
 
PDF
HDFS Architecture
Jeff Hammerbacher
 
PPTX
Multidimensional schema of data warehouse
kunjan shah
 
PDF
Data Mining: Association Rules Basics
Benazir Income Support Program (BISP)
 
PDF
Big Data: Its Characteristics And Architecture Capabilities
Ashraf Uddin
 
PPTX
Distributed database
ReachLocal Services India
 
PPTX
Hadoop Architecture | HDFS Architecture | Hadoop Architecture Tutorial | HDFS...
Simplilearn
 
PPTX
Adbms 3 main characteristics of the database approach
Vaibhav Khanna
 
PPSX
A Seminar on NoSQL Databases.
Navdeep Charan
 
PPTX
Data warehousing
Vigneshwaar Ponnuswamy
 
PPTX
Classification techniques in data mining
Kamal Acharya
 
PPTX
Output primitives in Computer Graphics
Kamal Acharya
 
PDF
Lecture6 introduction to data streams
hktripathy
 
PPTX
NoSQL databases - An introduction
Pooyan Mehrparvar
 
PDF
Kappa vs Lambda Architectures and Technology Comparison
Kai Wähner
 
PPTX
Real time analytics
Leandro Totino Pereira
 
PPTX
lazy learners and other classication methods
rajshreemuthiah
 
10 Lessons Learned from Building Machine Learning Systems
Xavier Amatriain
 
20. Parallel Databases in DBMS
koolkampus
 
What Is Apache Spark? | Introduction To Apache Spark | Apache Spark Tutorial ...
Simplilearn
 
Mining Data Streams
SujaAldrin
 
HDFS Architecture
Jeff Hammerbacher
 
Multidimensional schema of data warehouse
kunjan shah
 
Data Mining: Association Rules Basics
Benazir Income Support Program (BISP)
 
Big Data: Its Characteristics And Architecture Capabilities
Ashraf Uddin
 
Distributed database
ReachLocal Services India
 
Hadoop Architecture | HDFS Architecture | Hadoop Architecture Tutorial | HDFS...
Simplilearn
 
Adbms 3 main characteristics of the database approach
Vaibhav Khanna
 
A Seminar on NoSQL Databases.
Navdeep Charan
 
Data warehousing
Vigneshwaar Ponnuswamy
 
Classification techniques in data mining
Kamal Acharya
 
Output primitives in Computer Graphics
Kamal Acharya
 
Lecture6 introduction to data streams
hktripathy
 
NoSQL databases - An introduction
Pooyan Mehrparvar
 
Kappa vs Lambda Architectures and Technology Comparison
Kai Wähner
 
Real time analytics
Leandro Totino Pereira
 
lazy learners and other classication methods
rajshreemuthiah
 

Similar to In-Memory Big Data Analytics (20)

PDF
In memory big data management and processing a survey
redpel dot com
 
PDF
IRJET- Improving Performance of Data Analytical Queries using In-Memory D...
IRJET Journal
 
PDF
How In Memory Computing Changes Everything
Debajit Banerjee
 
PPTX
Are your ready for in memory applications?
G2MCommunications
 
PDF
Capitalizing on the New Era of In-memory Computing
Infosys
 
PDF
The Intelligent Thing -- Using In-Memory for Big Data and Beyond
Inside Analysis
 
PPTX
In_Memory_Computing_Presentation asasdasmfdaksfkasfjaskfsafasfsa
prabhakarchary3
 
PPTX
Oracle Database in-Memory Overivew
Maria Colgan
 
PPTX
In memory computing
Gagan Reddy
 
PDF
ManMachine&Mathematics_Arup_Ray_Ext
Arup Ray
 
PDF
Making the Most of In-Memory: More than Speed
Inside Analysis
 
PPTX
In-Memory Computing Webcast. Market Predictions 2017
SingleStore
 
PPTX
In-Memory Computing: How, Why? and common Patterns
Srinath Perera
 
PDF
Sloupcové uložení dat a použití in-memory technologií u řešení Exadata
MarketingArrowECS_CZ
 
PDF
NoSQL in Practice with TIBCO: Real World Use Cases and Customer Success Stori...
Kai Wähner
 
PDF
Oracle12c Database in-memory Data Sheet
Oracle
 
PPTX
In memory cloud computing
Reno Varghese George
 
PDF
IMCSummit 2015 - Day 1 Developer Session - The Science and Engineering Behind...
In-Memory Computing Summit
 
PDF
How In-memory Computing Drives IT Simplification
SAP Technology
 
PDF
Meta scale kognitio hadoop webinar
Michael Hiskey
 
In memory big data management and processing a survey
redpel dot com
 
IRJET- Improving Performance of Data Analytical Queries using In-Memory D...
IRJET Journal
 
How In Memory Computing Changes Everything
Debajit Banerjee
 
Are your ready for in memory applications?
G2MCommunications
 
Capitalizing on the New Era of In-memory Computing
Infosys
 
The Intelligent Thing -- Using In-Memory for Big Data and Beyond
Inside Analysis
 
In_Memory_Computing_Presentation asasdasmfdaksfkasfjaskfsafasfsa
prabhakarchary3
 
Oracle Database in-Memory Overivew
Maria Colgan
 
In memory computing
Gagan Reddy
 
ManMachine&Mathematics_Arup_Ray_Ext
Arup Ray
 
Making the Most of In-Memory: More than Speed
Inside Analysis
 
In-Memory Computing Webcast. Market Predictions 2017
SingleStore
 
In-Memory Computing: How, Why? and common Patterns
Srinath Perera
 
Sloupcové uložení dat a použití in-memory technologií u řešení Exadata
MarketingArrowECS_CZ
 
NoSQL in Practice with TIBCO: Real World Use Cases and Customer Success Stori...
Kai Wähner
 
Oracle12c Database in-memory Data Sheet
Oracle
 
In memory cloud computing
Reno Varghese George
 
IMCSummit 2015 - Day 1 Developer Session - The Science and Engineering Behind...
In-Memory Computing Summit
 
How In-memory Computing Drives IT Simplification
SAP Technology
 
Meta scale kognitio hadoop webinar
Michael Hiskey
 
Ad

Recently uploaded (20)

PDF
OPPOTUS - Malaysias on Malaysia 1Q2025.pdf
Oppotus
 
PDF
apidays Helsinki & North 2025 - Monetizing AI APIs: The New API Economy, Alla...
apidays
 
PDF
Simplifying Document Processing with Docling for AI Applications.pdf
Tamanna
 
PDF
The European Business Wallet: Why It Matters and How It Powers the EUDI Ecosy...
Lal Chandran
 
PPTX
Exploring Multilingual Embeddings for Italian Semantic Search: A Pretrained a...
Sease
 
PPTX
apidays Singapore 2025 - From Data to Insights: Building AI-Powered Data APIs...
apidays
 
PPTX
apidays Munich 2025 - Building Telco-Aware Apps with Open Gateway APIs, Subhr...
apidays
 
PPTX
ER_Model_with_Diagrams_Presentation.pptx
dharaadhvaryu1992
 
PPTX
apidays Helsinki & North 2025 - Vero APIs - Experiences of API development in...
apidays
 
PPTX
apidays Singapore 2025 - The Quest for the Greenest LLM , Jean Philippe Ehre...
apidays
 
PPTX
SlideEgg_501298-Agentic AI.pptx agentic ai
530BYManoj
 
PPTX
ER_Model_Relationship_in_DBMS_Presentation.pptx
dharaadhvaryu1992
 
PDF
apidays Helsinki & North 2025 - How (not) to run a Graphql Stewardship Group,...
apidays
 
PPT
tuberculosiship-2106031cyyfuftufufufivifviviv
AkshaiRam
 
PPTX
Aict presentation on dpplppp sjdhfh.pptx
vabaso5932
 
PPT
AI Future trends and opportunities_oct7v1.ppt
SHIKHAKMEHTA
 
PPTX
Listify-Intelligent-Voice-to-Catalog-Agent.pptx
nareshkottees
 
PPTX
Numbers of a nation: how we estimate population statistics | Accessible slides
Office for National Statistics
 
PDF
Web Scraping with Google Gemini 2.0 .pdf
Tamanna
 
PDF
How to Connect Your On-Premises Site to AWS Using Site-to-Site VPN.pdf
Tamanna
 
OPPOTUS - Malaysias on Malaysia 1Q2025.pdf
Oppotus
 
apidays Helsinki & North 2025 - Monetizing AI APIs: The New API Economy, Alla...
apidays
 
Simplifying Document Processing with Docling for AI Applications.pdf
Tamanna
 
The European Business Wallet: Why It Matters and How It Powers the EUDI Ecosy...
Lal Chandran
 
Exploring Multilingual Embeddings for Italian Semantic Search: A Pretrained a...
Sease
 
apidays Singapore 2025 - From Data to Insights: Building AI-Powered Data APIs...
apidays
 
apidays Munich 2025 - Building Telco-Aware Apps with Open Gateway APIs, Subhr...
apidays
 
ER_Model_with_Diagrams_Presentation.pptx
dharaadhvaryu1992
 
apidays Helsinki & North 2025 - Vero APIs - Experiences of API development in...
apidays
 
apidays Singapore 2025 - The Quest for the Greenest LLM , Jean Philippe Ehre...
apidays
 
SlideEgg_501298-Agentic AI.pptx agentic ai
530BYManoj
 
ER_Model_Relationship_in_DBMS_Presentation.pptx
dharaadhvaryu1992
 
apidays Helsinki & North 2025 - How (not) to run a Graphql Stewardship Group,...
apidays
 
tuberculosiship-2106031cyyfuftufufufivifviviv
AkshaiRam
 
Aict presentation on dpplppp sjdhfh.pptx
vabaso5932
 
AI Future trends and opportunities_oct7v1.ppt
SHIKHAKMEHTA
 
Listify-Intelligent-Voice-to-Catalog-Agent.pptx
nareshkottees
 
Numbers of a nation: how we estimate population statistics | Accessible slides
Office for National Statistics
 
Web Scraping with Google Gemini 2.0 .pdf
Tamanna
 
How to Connect Your On-Premises Site to AWS Using Site-to-Site VPN.pdf
Tamanna
 
Ad

In-Memory Big Data Analytics

  • 1. IN-MEMORY BIG DATA ANALYTICS Supreeth MP 1st sem BDA 18/9/2017 1
  • 2. Table of Contents: 1. Data is growing 2. What is In-Memory analytics? 3. Why In-Memory Now? 4. The landscape of disk-based and in-memory data management systems 5. In-Memory vsTraditional (on-disk) database management system 6. Optimization Aspects on In-Memory Data Management and Processing 7. Some questions on in-memory analytics 8. References 18/9/2017 2
  • 3. Data is growing: • Continuous flow of data • Real-time, 24/7 streaming updates • More than 2.5 quintillion bytes of data added daily • Data is always available • Democratization of data • Main source for business decisions • Shift to digital and STP • Affordable technology • Better and faster analytics • Business Intelligence • Cloud and subscription-based computing 18/9/2017 3
  • 4. What is In-Memory analytics? An in-memory analytics system basically is a database management system that stores data entirely in main memory that is in the RAM.This contrasts to traditional (on-disk) database systems, which are designed for data storage on persistent media such as hardisk. Because working with data in memory is much faster than writing to and reading from a file system. In-memory is ideal when: • Your database is too slow for interactive analytics • You need to perform real-time data analytics • You need to be offline and can't connect to your data live 18/9/2017 4
  • 5. Why In-Memory Now? • RAM is 200 times faster than disk storage and typically enables data access 50 to 100 times quicker • Memory storage capacity and bandwidth have been doubling roughly every three years, while its price has been dropping by a factor of 10 every five years. • Modern high-end servers now have multiple sockets, each of which can have tens or hundreds of gigabytes of DRAM • Growth of distributed systems • The increasing adoption of 64-bit computer technology has made RAM more suitable for use with large datasets. • Database systems have been evolving over the last few decades. 18/9/2017 5
  • 6. The landscape of disk-based and in-memory data management systems: 18/9/2017 6
  • 7. In-Memory vsTraditional (on-disk) database management system: 18/9/2017 7
  • 8. In-Memory vsTraditional (on-disk) database management system: Aspects DBDMS IMDBS File I/O Carries File I/O burden No file I/O burden Storage Usage Assumes storage is abundant Uses storage more efficiently Algorithms Algorithm optimized for disk Algorithms optimized for memory CPU Cycles More CPU cycles Less CPU cycles Persistence Non-volatile Volatile Lock Fine Locks Coarse Locks 18/9/2017 8
  • 9. In-Memory vsTraditional (on-disk) database management system: 18/9/2017 9
  • 10. Optimization Aspects on In-Memory Data Management and Processing: Aspects Concerns Techniques Index Cache consciousness, time/space efficiency Hash-based, tree-based Data Layout Cache consciousness, space efficiency Columnar layout Parallelism Linear scaling, partitioning Data-level, shared-memory scale-up and shared-nothing scale out parallelism Concurrency Control Overhead, correctness Coarse-grained locks Query Processing Code locality, register temporal locality, time efficiency Coarse-grained stored procedures Fault tolerance Durability, correlated failures, availability Checkpoints andTransaction logging Data Overflow Locality, Paging strategy, hot/cold classification Anti-caching 18/9/2017 10
  • 11. Some questions on in-memory analytics: • What do companies need to think about as they take on an in-memory analytics path? • What are some potential speed bumps in adopting in-memory analytics? • What role do skills play here? • If an in-memory database system boosts performance by holding all records in memory, can’t we get the same result by creating a RAM disk and deploying a traditional database there? • Won’t an in-memory database require huge amounts of memory because database systems are large? • Isn’t the database just lost if there’s a system crash? 18/9/2017 11
  • 12. References: [1] In-Memory Big Data Management and Processing:A Survey, IEEETRANSACTIONSON KNOWLEDGEAND DATA ENGINEERING JULY 2015 [2] Using In-MemoryAnalytics to Quickly Crunch Big Data by Lee Garber [3] https://www.sas.com/en_us/insights/articles/big-data/in-memory-analytics-questions.html [4] DataAnalytics using In-MemoryComputing: https://www.gridgain.com/ [5] How Computers Work: Disks And Secondary Storage: http://homepage.cs.uri.edu/faculty/wolfe/book/Readings/Reading05.htm [6] http://www.mcobject.com/in_memory_database [7] In-Memory DatabaseComputing – Smarter way of data analysis: http://www.xoriant.com/blog/big-data-analytics/memory-database-computing-faster-smarter- analysis-big-data-world.html [8] How Computers Work:The CPU and Memory: http://homepage.cs.uri.edu/book/cpu_memory/cpu_memory.htm 18/9/2017 12