Abhishek Manoj Sharma
Database vs Data Warehouse
Database | Data Warehouse
Used for Online Transactional Processing (OLTP) | Used for Online Analytical Processing (OLAP)
Entity-relationship modelling techniques are used for RDBMS design | Data modelling techniques are used for data warehouse design
Optimized for write operations | Optimized for read operations
Performance is low for analytical queries | Performance is high for analytical queries
Minimal to no downtime expected | Can have scheduled downtime for data warehouse refresh
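The write-versus-read contrast in the table can be illustrated with a small sketch. It uses Python's built-in sqlite3 module purely for convenience; the table name, columns, and values are made up for illustration and are not from the slides.

```python
import sqlite3

# In-memory database; the schema below is hypothetical.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE sales (id INTEGER PRIMARY KEY, region TEXT, amount REAL)"
)

# OLTP-style work: many small writes, each touching a single row.
orders = [("North", 120.0), ("South", 80.0), ("North", 50.0), ("East", 200.0)]
for region, amount in orders:
    conn.execute(
        "INSERT INTO sales (region, amount) VALUES (?, ?)", (region, amount)
    )
conn.commit()

# OLAP-style work: one read-only query that scans and aggregates many rows.
totals = conn.execute(
    "SELECT region, SUM(amount) FROM sales GROUP BY region ORDER BY region"
).fetchall()
print(totals)
conn.close()
```

A single engine handles both workloads here only because the data is tiny; the table above is about which workload each kind of system is tuned for.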
Data Warehouse
• A decision support database maintained separately from the transactional or operational database
• Contains huge amounts of historic data
• Single and consistent store of data from multiple sources
• Contains transformed informational data
• Structured for complex queries and data analysis
Data Warehousing
The process of transforming data into information and making it available to users in a timely and efficient manner.
Why Data Warehousing
• Structured to increase the performance of analytical tasks
• Separates analytical from operational workloads, limiting the risk that data updates lock out users
• Faster and more flexible reporting for business intelligence
• Understanding how well the business is performing
• Example: Sales information
Data Warehousing – Structure
On-Line Analytical Processing (OLAP)
[Figure: OLAP cube]
• OLAP is a mechanism to extract and view data from different points of view.
• It allows users to analyse database information from multiple sources simultaneously.
• Stores data along multiple dimensions.
OLAP Cube
• A data structure that allows fast analysis of data.
• Consists of numeric facts called measures, which are categorized by dimensions.
OLAP Cube Operations:
• Drill down
• Roll up
• Slice
• Dice
• Pivot
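The cube operations listed above can be sketched on a tiny in-memory fact table. This is a minimal illustration, not how an OLAP engine stores a cube; the dimensions (year, quarter, region, product), the measure (sales amount), the helper names, and all values are assumptions made for the example.

```python
from collections import defaultdict

# Each fact row: four dimension values followed by one measure (sales amount).
DIMS = ("year", "quarter", "region", "product")
facts = [
    (2023, "Q1", "North", "Laptop", 100),
    (2023, "Q1", "South", "Laptop", 80),
    (2023, "Q2", "North", "Phone", 60),
    (2024, "Q1", "North", "Laptop", 120),
    (2024, "Q1", "South", "Phone", 40),
]

def aggregate(rows, by):
    """Sum the measure, grouped by the named dimensions."""
    idx = [DIMS.index(d) for d in by]
    out = defaultdict(int)
    for row in rows:
        out[tuple(row[i] for i in idx)] += row[-1]
    return dict(out)

def slice_rows(rows, dim, value):
    """Slice: fix one dimension to a single value."""
    i = DIMS.index(dim)
    return [r for r in rows if r[i] == value]

def dice_rows(rows, **allowed):
    """Dice: restrict several dimensions to sets of allowed values."""
    return [
        r for r in rows
        if all(r[DIMS.index(d)] in vals for d, vals in allowed.items())
    ]

# Roll up: aggregate to a coarser level (year only).
roll_up = aggregate(facts, ["year"])
# Drill down: move to a finer level (year and quarter).
drill_down = aggregate(facts, ["year", "quarter"])
# Slice and dice select sub-cubes.
north = slice_rows(facts, "region", "North")
laptops_2023 = dice_rows(facts, year={2023}, product={"Laptop"})
# Pivot: rearrange the totals as a region-by-year table.
pivot = {}
for (region, year), total in aggregate(facts, ["region", "year"]).items():
    pivot.setdefault(region, {})[year] = total

print(roll_up)
print(pivot)
```

Roll up and drill down are the same aggregation at different levels of detail, which is why a single aggregate helper serves both.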
Parallel Execution in Data Warehousing
Why:
• For queries requiring large table scans or joins
• For creation of large indexes
• For creation of large tables (including materialized views)
• For bulk operations like bulk inserts, merges, or deletes
Parallel Execution in Data Warehousing
When to implement parallel processing:
• Sufficient I/O bandwidth
• Underutilized or intermittently used CPUs
• Sufficient memory to support intensive tasks like sorts, hashing, etc.
When not to implement parallel processing:
• Queries are generally very short and simple
• CPU, memory, or I/O is already heavily utilized
Degree of Parallelism (DOP)
• Defined as the number of parallel execution servers associated with a single operation.
Managing DOP in Oracle Database Systems:
• Adaptive multi-user algorithm (default): reduces the DOP as system load increases.
• User profile management: sets a limit for each user within a security domain.
• Database Resource Manager: allocates resources based on user groups.
How Parallelism Works
• Parallel execution divides the work of a SQL statement into multiple small units called granules.
• Each granule is processed by one parallel execution server at a time.
• On instance startup, Oracle starts the number of parallel execution servers set by the PARALLEL_MIN_SERVERS initialization parameter.
• When more servers are needed, Oracle adds them to the pool on demand, up to the limit set by PARALLEL_MAX_SERVERS.
How Parallelism Works
• The session's server process acts as the query coordinator.
• The query coordinator distributes the work among the parallel execution servers.
• The execution servers scan the tables, process their share of the rows, and send the results back to the coordinator, which combines them into the final result.
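The coordinator and granule pattern described above can be sketched in a few lines. This is an illustration of the general idea only, not Oracle's implementation: the function names are hypothetical, a Python list stands in for a table, and threads stand in for parallel execution servers (in CPython they show the structure of the work rather than a real CPU speedup).

```python
from concurrent.futures import ThreadPoolExecutor

def make_granules(n_rows, granule_size):
    """Split row positions 0..n_rows-1 into contiguous ranges (granules)."""
    return [
        range(start, min(start + granule_size, n_rows))
        for start in range(0, n_rows, granule_size)
    ]

def scan_granule(table, granule):
    """Worker: scan one granule and return a partial aggregate."""
    return sum(table[i] for i in granule)

def parallel_sum(table, dop, granule_size):
    """Coordinator: hand granules to `dop` workers, then combine the partials."""
    granules = make_granules(len(table), granule_size)
    with ThreadPoolExecutor(max_workers=dop) as pool:
        partials = pool.map(lambda g: scan_granule(table, g), granules)
        return sum(partials)

table = list(range(1, 1001))           # stand-in for a table column
result = parallel_sum(table, dop=4, granule_size=100)
print(result)                          # 500500
```

There are more granules (10) than workers (4), so a worker that finishes early simply picks up the next granule; that is the reason for splitting the work more finely than the degree of parallelism.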
Automatic Degree of Parallelism
• The query optimizer determines the fastest possible plan.
• It parses the query, estimates the execution cost, and derives the DOP from that estimate.
• The cheapest plan may be serial execution, so serial execution remains an option.
• The PARALLEL_MIN_TIME_THRESHOLD parameter controls this: a statement is considered for parallel execution only if its estimated serial execution time exceeds the threshold.
• The default threshold is 10 seconds.
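The threshold decision above can be sketched as a small function. Only the 10-second threshold comes from the slides; the way the DOP is derived from the estimate, and the names seconds_per_server and degree_limit, are hypothetical choices for this sketch and are not Oracle's formula.

```python
import math

def choose_dop(est_serial_seconds, min_time_threshold=10.0,
               seconds_per_server=5.0, degree_limit=8):
    """Return 1 for serial execution, or a capped degree of parallelism.

    seconds_per_server and degree_limit are made-up tuning knobs for this
    sketch; they do not correspond to Oracle's internal cost model.
    """
    # Short statements stay serial: parallel startup cost would not pay off.
    if est_serial_seconds <= min_time_threshold:
        return 1
    # Hypothetical rule: aim for roughly seconds_per_server of work each.
    wanted = math.ceil(est_serial_seconds / seconds_per_server)
    return max(2, min(wanted, degree_limit))

print(choose_dop(3.0))     # 1: below the threshold, runs serially
print(choose_dop(18.83))   # 4
print(choose_dop(600.0))   # 8: capped by degree_limit
```

The cap matters as much as the threshold: without it, one long statement could claim every available server.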
Automatic Degree of Parallelism
Parallelism In Action
Schema: SALES HISTORY (available on livesql.oracle.com)
No. of records:
SALES: 918,843
CUSTOMERS: 55,500
Execution Time: 18.83 seconds
Parallelism In Action
Schema: SALES HISTORY (available on livesql.oracle.com)
No. of records:
SALES: 918,843
CUSTOMERS: 55,500
Execution Time: 5.60 seconds
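The two reported execution times can be compared directly. The extracted slide text does not say which settings produced which time, so the sketch below only computes the ratio between the slower and the faster run; the variable names are illustrative.

```python
slower_seconds = 18.83   # first reported execution time
faster_seconds = 5.60    # second reported execution time

# Ratio of the two elapsed times, and the relative reduction in elapsed time.
speedup = slower_seconds / faster_seconds
reduction_pct = 100 * (1 - faster_seconds / slower_seconds)

print(f"{speedup:.2f}x faster, {reduction_pct:.1f}% less elapsed time")
# prints: 3.36x faster, 70.3% less elapsed time
```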
Parallelism In Action
Parallelism - Performance
Parallel processing in data warehousing and big data