SlideShare a Scribd company logo
IN-MEMORY BIG DATA
ANALYTICS
Supreeth MP
1st sem BDA
18/9/2017 1
Table of Contents:
1. Data is growing
2. What is In-Memory analytics?
3. Why In-Memory Now?
4. The landscape of disk-based and in-memory data management systems
5. In-Memory vsTraditional (on-disk) database management system
6. Optimization Aspects on In-Memory Data Management and Processing
7. Some questions on in-memory analytics
8. References
18/9/2017 2
Data is growing:
• Continuous flow of data
• Real-time, 24/7 streaming updates
• More than 2.5 quintillion bytes of data added daily
• Data is always available
• Democratization of data
• Main source for business decisions
• Shift to digital and STP
• Affordable technology
• Better and faster analytics
• Business Intelligence
• Cloud and subscription-based computing
18/9/2017 3
What is In-Memory analytics?
An in-memory analytics system basically is a database management system that
stores data entirely in main memory that is in the RAM.This contrasts to traditional
(on-disk) database systems, which are designed for data storage on persistent media
such as hardisk. Because working with data in memory is much faster than writing to
and reading from a file system.
In-memory is ideal when:
• Your database is too slow for interactive analytics
• You need to perform real-time data analytics
• You need to be offline and can't connect to your data live
18/9/2017 4
Why In-Memory Now?
• RAM is 200 times faster than disk storage and typically enables data access 50 to 100 times
quicker
• Memory storage capacity and bandwidth have been doubling roughly every three years,
while its price has been dropping by a factor of 10 every five years.
• Modern high-end servers now have multiple sockets, each of which can have tens or
hundreds of gigabytes of DRAM
• Growth of distributed systems
• The increasing adoption of 64-bit computer technology has made RAM more suitable for
use with large datasets.
• Database systems have been evolving over the last few decades.
18/9/2017 5
The landscape of disk-based and in-memory
data management systems:
18/9/2017 6
In-Memory vsTraditional (on-disk) database
management system:
18/9/2017 7
In-Memory vsTraditional (on-disk) database
management system:
Aspects DBDMS IMDBS
File I/O Carries File I/O burden No file I/O burden
Storage Usage Assumes storage is abundant Uses storage more efficiently
Algorithms Algorithm optimized for disk Algorithms optimized for memory
CPU Cycles More CPU cycles Less CPU cycles
Persistence Non-volatile Volatile
Lock Fine Locks Coarse Locks
18/9/2017 8
In-Memory vsTraditional (on-disk) database
management system:
18/9/2017 9
Optimization Aspects on In-Memory Data
Management and Processing:
Aspects Concerns Techniques
Index Cache consciousness, time/space
efficiency
Hash-based, tree-based
Data Layout Cache consciousness, space efficiency Columnar layout
Parallelism Linear scaling, partitioning Data-level, shared-memory scale-up and
shared-nothing scale out parallelism
Concurrency
Control
Overhead, correctness Coarse-grained locks
Query Processing Code locality, register temporal locality,
time efficiency
Coarse-grained stored procedures
Fault tolerance Durability, correlated failures, availability Checkpoints andTransaction logging
Data Overflow Locality, Paging strategy, hot/cold
classification
Anti-caching
18/9/2017 10
Some questions on in-memory analytics:
• What do companies need to think about as they take on an in-memory analytics path?
• What are some potential speed bumps in adopting in-memory analytics?
• What role do skills play here?
• If an in-memory database system boosts performance by holding all records in memory,
can’t we get the same result by creating a RAM disk and deploying a traditional database
there?
• Won’t an in-memory database require huge amounts of memory because database systems
are large?
• Isn’t the database just lost if there’s a system crash?
18/9/2017 11
References:
[1] In-Memory Big Data Management and Processing:A Survey, IEEETRANSACTIONSON
KNOWLEDGEAND DATA ENGINEERING JULY 2015
[2] Using In-MemoryAnalytics to Quickly Crunch Big Data by Lee Garber
[3] https://www.sas.com/en_us/insights/articles/big-data/in-memory-analytics-questions.html
[4] DataAnalytics using In-MemoryComputing: https://www.gridgain.com/
[5] How Computers Work: Disks And Secondary Storage:
http://homepage.cs.uri.edu/faculty/wolfe/book/Readings/Reading05.htm
[6] http://www.mcobject.com/in_memory_database
[7] In-Memory DatabaseComputing – Smarter way of data analysis:
http://www.xoriant.com/blog/big-data-analytics/memory-database-computing-faster-smarter-
analysis-big-data-world.html
[8] How Computers Work:The CPU and Memory:
http://homepage.cs.uri.edu/book/cpu_memory/cpu_memory.htm
18/9/2017 12
THANKYOU
Ad

More Related Content

What's hot (20)

Big Data Ecosystem
Big Data EcosystemBig Data Ecosystem
Big Data Ecosystem
Lucian Neghina
 
Introduction to Apache Spark
Introduction to Apache SparkIntroduction to Apache Spark
Introduction to Apache Spark
Rahul Jain
 
NoSQL databases
NoSQL databasesNoSQL databases
NoSQL databases
Harri Kauhanen
 
Big Data Tutorial | What Is Big Data | Big Data Hadoop Tutorial For Beginners...
Big Data Tutorial | What Is Big Data | Big Data Hadoop Tutorial For Beginners...Big Data Tutorial | What Is Big Data | Big Data Hadoop Tutorial For Beginners...
Big Data Tutorial | What Is Big Data | Big Data Hadoop Tutorial For Beginners...
Simplilearn
 
Big Data Evolution
Big Data EvolutionBig Data Evolution
Big Data Evolution
itnewsafrica
 
Introduction to snowflake
Introduction to snowflakeIntroduction to snowflake
Introduction to snowflake
Sunil Gurav
 
Time to Talk about Data Mesh
Time to Talk about Data MeshTime to Talk about Data Mesh
Time to Talk about Data Mesh
LibbySchulze
 
The rise of “Big Data” on cloud computing
The rise of “Big Data” on cloud computingThe rise of “Big Data” on cloud computing
The rise of “Big Data” on cloud computing
Minhazul Arefin
 
Big data ppt
Big data pptBig data ppt
Big data ppt
Thirunavukkarasu Ps
 
Big Data Architecture
Big Data ArchitectureBig Data Architecture
Big Data Architecture
Guido Schmutz
 
Data pipelines from zero to solid
Data pipelines from zero to solidData pipelines from zero to solid
Data pipelines from zero to solid
Lars Albertsson
 
Row or Columnar Database
Row or Columnar DatabaseRow or Columnar Database
Row or Columnar Database
Biju Nair
 
Databricks on AWS.pptx
Databricks on AWS.pptxDatabricks on AWS.pptx
Databricks on AWS.pptx
Wasm1953
 
Introduction to Hadoop
Introduction to HadoopIntroduction to Hadoop
Introduction to Hadoop
Dr. C.V. Suresh Babu
 
Zero to Snowflake Presentation
Zero to Snowflake Presentation Zero to Snowflake Presentation
Zero to Snowflake Presentation
Brett VanderPlaats
 
introduction to NOSQL Database
introduction to NOSQL Databaseintroduction to NOSQL Database
introduction to NOSQL Database
nehabsairam
 
Data Lakehouse, Data Mesh, and Data Fabric (r2)
Data Lakehouse, Data Mesh, and Data Fabric (r2)Data Lakehouse, Data Mesh, and Data Fabric (r2)
Data Lakehouse, Data Mesh, and Data Fabric (r2)
James Serra
 
Introduction to Data Engineering
Introduction to Data EngineeringIntroduction to Data Engineering
Introduction to Data Engineering
Vivek Aanand Ganesan
 
Owning Your Own (Data) Lake House
Owning Your Own (Data) Lake HouseOwning Your Own (Data) Lake House
Owning Your Own (Data) Lake House
Data Con LA
 
Introduction to NOSQL databases
Introduction to NOSQL databasesIntroduction to NOSQL databases
Introduction to NOSQL databases
Ashwani Kumar
 
Introduction to Apache Spark
Introduction to Apache SparkIntroduction to Apache Spark
Introduction to Apache Spark
Rahul Jain
 
Big Data Tutorial | What Is Big Data | Big Data Hadoop Tutorial For Beginners...
Big Data Tutorial | What Is Big Data | Big Data Hadoop Tutorial For Beginners...Big Data Tutorial | What Is Big Data | Big Data Hadoop Tutorial For Beginners...
Big Data Tutorial | What Is Big Data | Big Data Hadoop Tutorial For Beginners...
Simplilearn
 
Big Data Evolution
Big Data EvolutionBig Data Evolution
Big Data Evolution
itnewsafrica
 
Introduction to snowflake
Introduction to snowflakeIntroduction to snowflake
Introduction to snowflake
Sunil Gurav
 
Time to Talk about Data Mesh
Time to Talk about Data MeshTime to Talk about Data Mesh
Time to Talk about Data Mesh
LibbySchulze
 
The rise of “Big Data” on cloud computing
The rise of “Big Data” on cloud computingThe rise of “Big Data” on cloud computing
The rise of “Big Data” on cloud computing
Minhazul Arefin
 
Big Data Architecture
Big Data ArchitectureBig Data Architecture
Big Data Architecture
Guido Schmutz
 
Data pipelines from zero to solid
Data pipelines from zero to solidData pipelines from zero to solid
Data pipelines from zero to solid
Lars Albertsson
 
Row or Columnar Database
Row or Columnar DatabaseRow or Columnar Database
Row or Columnar Database
Biju Nair
 
Databricks on AWS.pptx
Databricks on AWS.pptxDatabricks on AWS.pptx
Databricks on AWS.pptx
Wasm1953
 
Zero to Snowflake Presentation
Zero to Snowflake Presentation Zero to Snowflake Presentation
Zero to Snowflake Presentation
Brett VanderPlaats
 
introduction to NOSQL Database
introduction to NOSQL Databaseintroduction to NOSQL Database
introduction to NOSQL Database
nehabsairam
 
Data Lakehouse, Data Mesh, and Data Fabric (r2)
Data Lakehouse, Data Mesh, and Data Fabric (r2)Data Lakehouse, Data Mesh, and Data Fabric (r2)
Data Lakehouse, Data Mesh, and Data Fabric (r2)
James Serra
 
Owning Your Own (Data) Lake House
Owning Your Own (Data) Lake HouseOwning Your Own (Data) Lake House
Owning Your Own (Data) Lake House
Data Con LA
 
Introduction to NOSQL databases
Introduction to NOSQL databasesIntroduction to NOSQL databases
Introduction to NOSQL databases
Ashwani Kumar
 

Similar to In-Memory Big Data Analytics (20)

Capitalizing on the New Era of In-memory Computing
Capitalizing on the New Era of In-memory ComputingCapitalizing on the New Era of In-memory Computing
Capitalizing on the New Era of In-memory Computing
Infosys
 
IRJET- Improving Performance of Data Analytical Queries using In-Memory D...
IRJET-  	  Improving Performance of Data Analytical Queries using In-Memory D...IRJET-  	  Improving Performance of Data Analytical Queries using In-Memory D...
IRJET- Improving Performance of Data Analytical Queries using In-Memory D...
IRJET Journal
 
In Memory Data Grids, Demystified!
In Memory Data Grids, Demystified! In Memory Data Grids, Demystified!
In Memory Data Grids, Demystified!
Uri Cohen
 
Enterprise Storage Solutions for Overcoming Big Data and Analytics Challenges
Enterprise Storage Solutions for Overcoming Big Data and Analytics ChallengesEnterprise Storage Solutions for Overcoming Big Data and Analytics Challenges
Enterprise Storage Solutions for Overcoming Big Data and Analytics Challenges
INFINIDAT
 
Big in memory file system
Big in memory file systemBig in memory file system
Big in memory file system
Mahesh Gupta
 
Meta scale kognitio hadoop webinar
Meta scale kognitio hadoop webinarMeta scale kognitio hadoop webinar
Meta scale kognitio hadoop webinar
Michael Hiskey
 
In memory big data management and processing a survey
In memory big data management and processing a surveyIn memory big data management and processing a survey
In memory big data management and processing a survey
redpel dot com
 
Positioning IBM Flex System 16 Gb Fibre Channel Fabric for Storage-Intensive ...
Positioning IBM Flex System 16 Gb Fibre Channel Fabric for Storage-Intensive ...Positioning IBM Flex System 16 Gb Fibre Channel Fabric for Storage-Intensive ...
Positioning IBM Flex System 16 Gb Fibre Channel Fabric for Storage-Intensive ...
IBM India Smarter Computing
 
IBM Spectrum Scale Overview november 2015
IBM Spectrum Scale Overview november 2015IBM Spectrum Scale Overview november 2015
IBM Spectrum Scale Overview november 2015
Doug O'Flaherty
 
#FMS2018 NGD Systems Real World Results with #ComputationalStorage
#FMS2018 NGD Systems Real World Results with #ComputationalStorage#FMS2018 NGD Systems Real World Results with #ComputationalStorage
#FMS2018 NGD Systems Real World Results with #ComputationalStorage
Scott Shadley, MBA,PMC-III
 
E06WarehouseDesign.pptxkjhjkljhlkjhlkhlkj
E06WarehouseDesign.pptxkjhjkljhlkjhlkhlkjE06WarehouseDesign.pptxkjhjkljhlkjhlkhlkj
E06WarehouseDesign.pptxkjhjkljhlkjhlkhlkj
ElyesAljane1
 
Webinar: The Three New Requirements of Unstructured Data Protection
Webinar: The Three New Requirements of Unstructured Data ProtectionWebinar: The Three New Requirements of Unstructured Data Protection
Webinar: The Three New Requirements of Unstructured Data Protection
Storage Switzerland
 
E06WarehouseDesignissuesindatawarehousedesign.ppt
E06WarehouseDesignissuesindatawarehousedesign.pptE06WarehouseDesignissuesindatawarehousedesign.ppt
E06WarehouseDesignissuesindatawarehousedesign.ppt
deshpandeprajakta40
 
Best storage engine for MySQL
Best storage engine for MySQLBest storage engine for MySQL
Best storage engine for MySQL
tomflemingh2
 
Big data by Mithlesh sadh
Big data by Mithlesh sadhBig data by Mithlesh sadh
Big data by Mithlesh sadh
Mithlesh Sadh
 
Meta scale kognitio hadoop webinar
Meta scale kognitio hadoop webinarMeta scale kognitio hadoop webinar
Meta scale kognitio hadoop webinar
Kognitio
 
Using Apache Arrow, Calcite, and Parquet to Build a Relational Cache
Using Apache Arrow, Calcite, and Parquet to Build a Relational CacheUsing Apache Arrow, Calcite, and Parquet to Build a Relational Cache
Using Apache Arrow, Calcite, and Parquet to Build a Relational Cache
Dremio Corporation
 
#MFSummit2016 Operate: The race for space
#MFSummit2016 Operate: The race for space#MFSummit2016 Operate: The race for space
#MFSummit2016 Operate: The race for space
Micro Focus
 
Big Data Boom
Big Data BoomBig Data Boom
Big Data Boom
Syed Jahanzaib Bin Hassan - JBH Syed
 
MongoDB and In-Memory Computing
MongoDB and In-Memory ComputingMongoDB and In-Memory Computing
MongoDB and In-Memory Computing
Dylan Tong
 
Capitalizing on the New Era of In-memory Computing
Capitalizing on the New Era of In-memory ComputingCapitalizing on the New Era of In-memory Computing
Capitalizing on the New Era of In-memory Computing
Infosys
 
IRJET- Improving Performance of Data Analytical Queries using In-Memory D...
IRJET-  	  Improving Performance of Data Analytical Queries using In-Memory D...IRJET-  	  Improving Performance of Data Analytical Queries using In-Memory D...
IRJET- Improving Performance of Data Analytical Queries using In-Memory D...
IRJET Journal
 
In Memory Data Grids, Demystified!
In Memory Data Grids, Demystified! In Memory Data Grids, Demystified!
In Memory Data Grids, Demystified!
Uri Cohen
 
Enterprise Storage Solutions for Overcoming Big Data and Analytics Challenges
Enterprise Storage Solutions for Overcoming Big Data and Analytics ChallengesEnterprise Storage Solutions for Overcoming Big Data and Analytics Challenges
Enterprise Storage Solutions for Overcoming Big Data and Analytics Challenges
INFINIDAT
 
Big in memory file system
Big in memory file systemBig in memory file system
Big in memory file system
Mahesh Gupta
 
Meta scale kognitio hadoop webinar
Meta scale kognitio hadoop webinarMeta scale kognitio hadoop webinar
Meta scale kognitio hadoop webinar
Michael Hiskey
 
In memory big data management and processing a survey
In memory big data management and processing a surveyIn memory big data management and processing a survey
In memory big data management and processing a survey
redpel dot com
 
Positioning IBM Flex System 16 Gb Fibre Channel Fabric for Storage-Intensive ...
Positioning IBM Flex System 16 Gb Fibre Channel Fabric for Storage-Intensive ...Positioning IBM Flex System 16 Gb Fibre Channel Fabric for Storage-Intensive ...
Positioning IBM Flex System 16 Gb Fibre Channel Fabric for Storage-Intensive ...
IBM India Smarter Computing
 
IBM Spectrum Scale Overview november 2015
IBM Spectrum Scale Overview november 2015IBM Spectrum Scale Overview november 2015
IBM Spectrum Scale Overview november 2015
Doug O'Flaherty
 
#FMS2018 NGD Systems Real World Results with #ComputationalStorage
#FMS2018 NGD Systems Real World Results with #ComputationalStorage#FMS2018 NGD Systems Real World Results with #ComputationalStorage
#FMS2018 NGD Systems Real World Results with #ComputationalStorage
Scott Shadley, MBA,PMC-III
 
E06WarehouseDesign.pptxkjhjkljhlkjhlkhlkj
E06WarehouseDesign.pptxkjhjkljhlkjhlkhlkjE06WarehouseDesign.pptxkjhjkljhlkjhlkhlkj
E06WarehouseDesign.pptxkjhjkljhlkjhlkhlkj
ElyesAljane1
 
Webinar: The Three New Requirements of Unstructured Data Protection
Webinar: The Three New Requirements of Unstructured Data ProtectionWebinar: The Three New Requirements of Unstructured Data Protection
Webinar: The Three New Requirements of Unstructured Data Protection
Storage Switzerland
 
E06WarehouseDesignissuesindatawarehousedesign.ppt
E06WarehouseDesignissuesindatawarehousedesign.pptE06WarehouseDesignissuesindatawarehousedesign.ppt
E06WarehouseDesignissuesindatawarehousedesign.ppt
deshpandeprajakta40
 
Best storage engine for MySQL
Best storage engine for MySQLBest storage engine for MySQL
Best storage engine for MySQL
tomflemingh2
 
Big data by Mithlesh sadh
Big data by Mithlesh sadhBig data by Mithlesh sadh
Big data by Mithlesh sadh
Mithlesh Sadh
 
Meta scale kognitio hadoop webinar
Meta scale kognitio hadoop webinarMeta scale kognitio hadoop webinar
Meta scale kognitio hadoop webinar
Kognitio
 
Using Apache Arrow, Calcite, and Parquet to Build a Relational Cache
Using Apache Arrow, Calcite, and Parquet to Build a Relational CacheUsing Apache Arrow, Calcite, and Parquet to Build a Relational Cache
Using Apache Arrow, Calcite, and Parquet to Build a Relational Cache
Dremio Corporation
 
#MFSummit2016 Operate: The race for space
#MFSummit2016 Operate: The race for space#MFSummit2016 Operate: The race for space
#MFSummit2016 Operate: The race for space
Micro Focus
 
MongoDB and In-Memory Computing
MongoDB and In-Memory ComputingMongoDB and In-Memory Computing
MongoDB and In-Memory Computing
Dylan Tong
 
Ad

Recently uploaded (20)

Digilocker under workingProcess Flow.pptx
Digilocker  under workingProcess Flow.pptxDigilocker  under workingProcess Flow.pptx
Digilocker under workingProcess Flow.pptx
satnamsadguru491
 
FPET_Implementation_2_MA to 360 Engage Direct.pptx
FPET_Implementation_2_MA to 360 Engage Direct.pptxFPET_Implementation_2_MA to 360 Engage Direct.pptx
FPET_Implementation_2_MA to 360 Engage Direct.pptx
ssuser4ef83d
 
chapter3 Central Tendency statistics.ppt
chapter3 Central Tendency statistics.pptchapter3 Central Tendency statistics.ppt
chapter3 Central Tendency statistics.ppt
justinebandajbn
 
1. Briefing Session_SEED with Hon. Governor Assam - 27.10.pdf
1. Briefing Session_SEED with Hon. Governor Assam - 27.10.pdf1. Briefing Session_SEED with Hon. Governor Assam - 27.10.pdf
1. Briefing Session_SEED with Hon. Governor Assam - 27.10.pdf
Simran112433
 
md-presentHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHation.pptx
md-presentHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHation.pptxmd-presentHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHation.pptx
md-presentHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHation.pptx
fatimalazaar2004
 
Cleaned_Lecture 6666666_Simulation_I.pdf
Cleaned_Lecture 6666666_Simulation_I.pdfCleaned_Lecture 6666666_Simulation_I.pdf
Cleaned_Lecture 6666666_Simulation_I.pdf
alcinialbob1234
 
Thingyan is now a global treasure! See how people around the world are search...
Thingyan is now a global treasure! See how people around the world are search...Thingyan is now a global treasure! See how people around the world are search...
Thingyan is now a global treasure! See how people around the world are search...
Pixellion
 
VKS-Python-FIe Handling text CSV Binary.pptx
VKS-Python-FIe Handling text CSV Binary.pptxVKS-Python-FIe Handling text CSV Binary.pptx
VKS-Python-FIe Handling text CSV Binary.pptx
Vinod Srivastava
 
Molecular methods diagnostic and monitoring of infection - Repaired.pptx
Molecular methods diagnostic and monitoring of infection  -  Repaired.pptxMolecular methods diagnostic and monitoring of infection  -  Repaired.pptx
Molecular methods diagnostic and monitoring of infection - Repaired.pptx
7tzn7x5kky
 
Minions Want to eat presentacion muy linda
Minions Want to eat presentacion muy lindaMinions Want to eat presentacion muy linda
Minions Want to eat presentacion muy linda
CarlaAndradesSoler1
 
Ch3MCT24.pptx measure of central tendency
Ch3MCT24.pptx measure of central tendencyCh3MCT24.pptx measure of central tendency
Ch3MCT24.pptx measure of central tendency
ayeleasefa2
 
How to join illuminati Agent in uganda call+256776963507/0741506136
How to join illuminati Agent in uganda call+256776963507/0741506136How to join illuminati Agent in uganda call+256776963507/0741506136
How to join illuminati Agent in uganda call+256776963507/0741506136
illuminati Agent uganda call+256776963507/0741506136
 
Flip flop presenation-Presented By Mubahir khan.pptx
Flip flop presenation-Presented By Mubahir khan.pptxFlip flop presenation-Presented By Mubahir khan.pptx
Flip flop presenation-Presented By Mubahir khan.pptx
mubashirkhan45461
 
Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...
Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...
Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...
Abodahab
 
Defense Against LLM Scheming 2025_04_28.pptx
Defense Against LLM Scheming 2025_04_28.pptxDefense Against LLM Scheming 2025_04_28.pptx
Defense Against LLM Scheming 2025_04_28.pptx
Greg Makowski
 
Template_A3nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
Template_A3nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTemplate_A3nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
Template_A3nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
cegiver630
 
Ppt. Nikhil.pptxnshwuudgcudisisshvehsjks
Ppt. Nikhil.pptxnshwuudgcudisisshvehsjksPpt. Nikhil.pptxnshwuudgcudisisshvehsjks
Ppt. Nikhil.pptxnshwuudgcudisisshvehsjks
panchariyasahil
 
DPR_Expert_Recruitment_notice_Revised.pdf
DPR_Expert_Recruitment_notice_Revised.pdfDPR_Expert_Recruitment_notice_Revised.pdf
DPR_Expert_Recruitment_notice_Revised.pdf
inmishra17121973
 
chapter 4 Variability statistical research .pptx
chapter 4 Variability statistical research .pptxchapter 4 Variability statistical research .pptx
chapter 4 Variability statistical research .pptx
justinebandajbn
 
Calories_Prediction_using_Linear_Regression.pptx
Calories_Prediction_using_Linear_Regression.pptxCalories_Prediction_using_Linear_Regression.pptx
Calories_Prediction_using_Linear_Regression.pptx
TijiLMAHESHWARI
 
Digilocker under workingProcess Flow.pptx
Digilocker  under workingProcess Flow.pptxDigilocker  under workingProcess Flow.pptx
Digilocker under workingProcess Flow.pptx
satnamsadguru491
 
FPET_Implementation_2_MA to 360 Engage Direct.pptx
FPET_Implementation_2_MA to 360 Engage Direct.pptxFPET_Implementation_2_MA to 360 Engage Direct.pptx
FPET_Implementation_2_MA to 360 Engage Direct.pptx
ssuser4ef83d
 
chapter3 Central Tendency statistics.ppt
chapter3 Central Tendency statistics.pptchapter3 Central Tendency statistics.ppt
chapter3 Central Tendency statistics.ppt
justinebandajbn
 
1. Briefing Session_SEED with Hon. Governor Assam - 27.10.pdf
1. Briefing Session_SEED with Hon. Governor Assam - 27.10.pdf1. Briefing Session_SEED with Hon. Governor Assam - 27.10.pdf
1. Briefing Session_SEED with Hon. Governor Assam - 27.10.pdf
Simran112433
 
md-presentHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHation.pptx
md-presentHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHation.pptxmd-presentHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHation.pptx
md-presentHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHation.pptx
fatimalazaar2004
 
Cleaned_Lecture 6666666_Simulation_I.pdf
Cleaned_Lecture 6666666_Simulation_I.pdfCleaned_Lecture 6666666_Simulation_I.pdf
Cleaned_Lecture 6666666_Simulation_I.pdf
alcinialbob1234
 
Thingyan is now a global treasure! See how people around the world are search...
Thingyan is now a global treasure! See how people around the world are search...Thingyan is now a global treasure! See how people around the world are search...
Thingyan is now a global treasure! See how people around the world are search...
Pixellion
 
VKS-Python-FIe Handling text CSV Binary.pptx
VKS-Python-FIe Handling text CSV Binary.pptxVKS-Python-FIe Handling text CSV Binary.pptx
VKS-Python-FIe Handling text CSV Binary.pptx
Vinod Srivastava
 
Molecular methods diagnostic and monitoring of infection - Repaired.pptx
Molecular methods diagnostic and monitoring of infection  -  Repaired.pptxMolecular methods diagnostic and monitoring of infection  -  Repaired.pptx
Molecular methods diagnostic and monitoring of infection - Repaired.pptx
7tzn7x5kky
 
Minions Want to eat presentacion muy linda
Minions Want to eat presentacion muy lindaMinions Want to eat presentacion muy linda
Minions Want to eat presentacion muy linda
CarlaAndradesSoler1
 
Ch3MCT24.pptx measure of central tendency
Ch3MCT24.pptx measure of central tendencyCh3MCT24.pptx measure of central tendency
Ch3MCT24.pptx measure of central tendency
ayeleasefa2
 
Flip flop presenation-Presented By Mubahir khan.pptx
Flip flop presenation-Presented By Mubahir khan.pptxFlip flop presenation-Presented By Mubahir khan.pptx
Flip flop presenation-Presented By Mubahir khan.pptx
mubashirkhan45461
 
Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...
Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...
Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...
Abodahab
 
Defense Against LLM Scheming 2025_04_28.pptx
Defense Against LLM Scheming 2025_04_28.pptxDefense Against LLM Scheming 2025_04_28.pptx
Defense Against LLM Scheming 2025_04_28.pptx
Greg Makowski
 
Template_A3nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
Template_A3nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTemplate_A3nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
Template_A3nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
cegiver630
 
Ppt. Nikhil.pptxnshwuudgcudisisshvehsjks
Ppt. Nikhil.pptxnshwuudgcudisisshvehsjksPpt. Nikhil.pptxnshwuudgcudisisshvehsjks
Ppt. Nikhil.pptxnshwuudgcudisisshvehsjks
panchariyasahil
 
DPR_Expert_Recruitment_notice_Revised.pdf
DPR_Expert_Recruitment_notice_Revised.pdfDPR_Expert_Recruitment_notice_Revised.pdf
DPR_Expert_Recruitment_notice_Revised.pdf
inmishra17121973
 
chapter 4 Variability statistical research .pptx
chapter 4 Variability statistical research .pptxchapter 4 Variability statistical research .pptx
chapter 4 Variability statistical research .pptx
justinebandajbn
 
Calories_Prediction_using_Linear_Regression.pptx
Calories_Prediction_using_Linear_Regression.pptxCalories_Prediction_using_Linear_Regression.pptx
Calories_Prediction_using_Linear_Regression.pptx
TijiLMAHESHWARI
 
Ad

In-Memory Big Data Analytics

  • 1. IN-MEMORY BIG DATA ANALYTICS Supreeth MP 1st sem BDA 18/9/2017 1
  • 2. Table of Contents: 1. Data is growing 2. What is In-Memory analytics? 3. Why In-Memory Now? 4. The landscape of disk-based and in-memory data management systems 5. In-Memory vsTraditional (on-disk) database management system 6. Optimization Aspects on In-Memory Data Management and Processing 7. Some questions on in-memory analytics 8. References 18/9/2017 2
  • 3. Data is growing: • Continuous flow of data • Real-time, 24/7 streaming updates • More than 2.5 quintillion bytes of data added daily • Data is always available • Democratization of data • Main source for business decisions • Shift to digital and STP • Affordable technology • Better and faster analytics • Business Intelligence • Cloud and subscription-based computing 18/9/2017 3
  • 4. What is In-Memory analytics? An in-memory analytics system basically is a database management system that stores data entirely in main memory that is in the RAM.This contrasts to traditional (on-disk) database systems, which are designed for data storage on persistent media such as hardisk. Because working with data in memory is much faster than writing to and reading from a file system. In-memory is ideal when: • Your database is too slow for interactive analytics • You need to perform real-time data analytics • You need to be offline and can't connect to your data live 18/9/2017 4
  • 5. Why In-Memory Now? • RAM is 200 times faster than disk storage and typically enables data access 50 to 100 times quicker • Memory storage capacity and bandwidth have been doubling roughly every three years, while its price has been dropping by a factor of 10 every five years. • Modern high-end servers now have multiple sockets, each of which can have tens or hundreds of gigabytes of DRAM • Growth of distributed systems • The increasing adoption of 64-bit computer technology has made RAM more suitable for use with large datasets. • Database systems have been evolving over the last few decades. 18/9/2017 5
  • 6. The landscape of disk-based and in-memory data management systems: 18/9/2017 6
  • 7. In-Memory vsTraditional (on-disk) database management system: 18/9/2017 7
  • 8. In-Memory vsTraditional (on-disk) database management system: Aspects DBDMS IMDBS File I/O Carries File I/O burden No file I/O burden Storage Usage Assumes storage is abundant Uses storage more efficiently Algorithms Algorithm optimized for disk Algorithms optimized for memory CPU Cycles More CPU cycles Less CPU cycles Persistence Non-volatile Volatile Lock Fine Locks Coarse Locks 18/9/2017 8
  • 9. In-Memory vsTraditional (on-disk) database management system: 18/9/2017 9
  • 10. Optimization Aspects on In-Memory Data Management and Processing: Aspects Concerns Techniques Index Cache consciousness, time/space efficiency Hash-based, tree-based Data Layout Cache consciousness, space efficiency Columnar layout Parallelism Linear scaling, partitioning Data-level, shared-memory scale-up and shared-nothing scale out parallelism Concurrency Control Overhead, correctness Coarse-grained locks Query Processing Code locality, register temporal locality, time efficiency Coarse-grained stored procedures Fault tolerance Durability, correlated failures, availability Checkpoints andTransaction logging Data Overflow Locality, Paging strategy, hot/cold classification Anti-caching 18/9/2017 10
  • 11. Some questions on in-memory analytics: • What do companies need to think about as they take on an in-memory analytics path? • What are some potential speed bumps in adopting in-memory analytics? • What role do skills play here? • If an in-memory database system boosts performance by holding all records in memory, can’t we get the same result by creating a RAM disk and deploying a traditional database there? • Won’t an in-memory database require huge amounts of memory because database systems are large? • Isn’t the database just lost if there’s a system crash? 18/9/2017 11
  • 12. References: [1] In-Memory Big Data Management and Processing:A Survey, IEEETRANSACTIONSON KNOWLEDGEAND DATA ENGINEERING JULY 2015 [2] Using In-MemoryAnalytics to Quickly Crunch Big Data by Lee Garber [3] https://www.sas.com/en_us/insights/articles/big-data/in-memory-analytics-questions.html [4] DataAnalytics using In-MemoryComputing: https://www.gridgain.com/ [5] How Computers Work: Disks And Secondary Storage: http://homepage.cs.uri.edu/faculty/wolfe/book/Readings/Reading05.htm [6] http://www.mcobject.com/in_memory_database [7] In-Memory DatabaseComputing – Smarter way of data analysis: http://www.xoriant.com/blog/big-data-analytics/memory-database-computing-faster-smarter- analysis-big-data-world.html [8] How Computers Work:The CPU and Memory: http://homepage.cs.uri.edu/book/cpu_memory/cpu_memory.htm 18/9/2017 12