
INSTITUTE: CHANDIGARH UNIVERSITY

DEPARTMENT: UIC
MCA
Business Analytics
23CAH-701

DISCOVER . LEARN . EMPOWER


Datasets
• Datasets are curated tables of data that can be reused across multiple
reports. They are created by writing a SQL query and turning its
results into a reusable asset, which can then be shared across your
organization. Multiple reports can be built from the initial query,
which can be set to refresh on a schedule; reports created from
Datasets consume the fresh data as it becomes available, keeping the
reporting accurate over time.
• The data in a Dataset is cached in Helix, which enables more efficient
data usage and improved performance for reports created from
Datasets.
Key benefits of Datasets:
• Centralize logic and data quality: Datasets can power multiple reports, so analysts
write or update one query and the change cascades across every report built on it.
• Manage data stack complexities: Datasets introduce a new way data moves through
Mode, creating a middle governance layer that can centralize logic and make scaling
easier.
• Improve efficiency and performance: Because data is cached in Helix, each report
refresh that doesn't have to hit the data warehouse gains incremental performance.
• Cost savings: Datasets sit between reports and warehouses, enabling more efficient
data usage and controlled warehouse hits.
• Confident self-service access: Datasets can serve as an approved source that teams
within an organization use to confidently build reports without writing any code,
knowing the dataset has been published by the data team.
• Data accessibility: Datasets can be organized in collections and browsed when creating
reports. Datasets are subject to permissions just like reports.
Manipulate Large Data Sets
When working with large data sets in SQL, it is important to use
efficient techniques to manipulate the data. Here are some strategies:
1. Use Proper Indexing: Indexes improve query performance by
allowing the database to quickly locate and retrieve the required
data. Create indexes on columns frequently used in search
conditions, joins, and sorting operations. Analyze query execution
plans and add or adjust indexes based on the observed query and
data access patterns.
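As a minimal sketch of this idea using Python's built-in sqlite3 module (the `orders` table and column names are illustrative, not from the lecture), you can compare the query plan before and after adding an index on a filtered column:

```python
import sqlite3

# In-memory database with a hypothetical "orders" table.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (id INTEGER PRIMARY KEY, customer_id INTEGER, total REAL)")
conn.executemany("INSERT INTO orders (customer_id, total) VALUES (?, ?)",
                 [(i % 100, i * 1.5) for i in range(1000)])

query = "SELECT * FROM orders WHERE customer_id = 42"

# Without an index, the filter needs a full table scan.
before = conn.execute("EXPLAIN QUERY PLAN " + query).fetchone()[-1]

# Index the column used in the search condition, then recheck the plan.
conn.execute("CREATE INDEX idx_orders_customer ON orders (customer_id)")
after = conn.execute("EXPLAIN QUERY PLAN " + query).fetchone()[-1]

print(before)  # e.g. "SCAN orders"
print(after)   # e.g. "SEARCH orders USING INDEX idx_orders_customer (customer_id=?)"
```

The exact plan text varies by SQLite version, but the shift from a full scan to an index search is the point.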
2. Filter and Subset Data: When dealing with large data sets, it is
often better to retrieve only the necessary subset of data instead
of processing the entire dataset. Use the WHERE clause in SELECT
statements to apply conditions and return only the relevant rows.
This reduces the amount of data being processed and improves
query performance.
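A small sqlite3 sketch of filtering with WHERE (the `events` table and its columns are made up for the example):

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE events (id INTEGER, category TEXT, value INTEGER)")
conn.executemany("INSERT INTO events VALUES (?, ?, ?)",
                 [(i, "error" if i % 10 == 0 else "info", i) for i in range(1000)])

# Pull only the rows the analysis needs instead of the whole table.
rows = conn.execute(
    "SELECT id, value FROM events WHERE category = ? AND value >= ?",
    ("error", 500),
).fetchall()

print(len(rows))  # 50 matching rows out of 1000
```

Pushing the filter into the WHERE clause lets the database (and any index) do the work, rather than transferring all 1000 rows to the client.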
3. Use Pagination or Limiting Techniques: Instead of retrieving the
entire result set at once, use pagination or limiting techniques to
retrieve data in smaller chunks. This involves retrieving a subset of rows
using keywords like LIMIT, OFFSET, or the equivalent syntax supported
by your database system. By retrieving data in smaller batches, you can
reduce memory consumption and improve query performance.
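The LIMIT/OFFSET pattern above can be sketched as a paging loop in sqlite3 (the `items` table is illustrative):

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE items (id INTEGER PRIMARY KEY, name TEXT)")
conn.executemany("INSERT INTO items (name) VALUES (?)", [(f"item-{i}",) for i in range(95)])

PAGE_SIZE = 20

def fetch_page(page):
    # OFFSET pagination: simple, though the database still skips the offset rows.
    return conn.execute(
        "SELECT id, name FROM items ORDER BY id LIMIT ? OFFSET ?",
        (PAGE_SIZE, page * PAGE_SIZE),
    ).fetchall()

pages = []
page = 0
while True:
    batch = fetch_page(page)
    if not batch:
        break
    pages.append(batch)
    page += 1

print(len(pages))      # 5 pages: 20 + 20 + 20 + 20 + 15 rows
print(len(pages[-1]))  # 15
```

For very deep pages, keyset pagination (WHERE id > last_seen_id ... LIMIT n) avoids the cost of skipping offset rows, since OFFSET still reads and discards them.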
4. Optimize Joins: When joining tables, ensure that the join conditions
are well-defined and appropriate indexes are in place. Consider using
appropriate join types (INNER JOIN, LEFT JOIN, etc.) based on the
relationship between the tables and the desired output. Avoid
unnecessary or redundant joins that can result in excessive data
processing.
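The choice between join types can be illustrated with two tiny tables in sqlite3 (table contents invented for the example):

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT);
CREATE TABLE orders (id INTEGER PRIMARY KEY, customer_id INTEGER, total REAL);
INSERT INTO customers VALUES (1, 'Asha'), (2, 'Bala'), (3, 'Chen');
INSERT INTO orders VALUES (10, 1, 99.0), (11, 1, 25.0), (12, 2, 40.0);
""")

# INNER JOIN keeps only customers with at least one matching order.
inner = conn.execute("""
    SELECT c.name, o.total FROM customers c
    INNER JOIN orders o ON o.customer_id = c.id
""").fetchall()

# LEFT JOIN keeps every customer, with NULL order columns where none match.
left = conn.execute("""
    SELECT c.name, o.total FROM customers c
    LEFT JOIN orders o ON o.customer_id = c.id
""").fetchall()

print(len(inner))  # 3 matched rows
print(len(left))   # 4 rows: Chen appears with total = None
```

Picking the join type that matches the desired output avoids both silently dropped rows (a LEFT JOIN written as INNER) and extra NULL-padded rows the query then has to filter back out.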
5. Aggregate and Summarize Data: Instead of processing every
individual row, consider aggregating and summarizing the data using
GROUP BY, SUM, COUNT, and other aggregate functions. This helps
reduce the amount of data being processed and provides a more
concise view of the information.
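Aggregation with GROUP BY can be sketched in sqlite3 (the `sales` table is illustrative): five detail rows collapse into one summary row per group.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (region TEXT, amount REAL)")
conn.executemany("INSERT INTO sales VALUES (?, ?)",
                 [("north", 10.0), ("north", 20.0),
                  ("south", 5.0), ("south", 5.0), ("south", 15.0)])

# One summary row per region instead of every individual sale.
summary = conn.execute("""
    SELECT region, COUNT(*) AS n, SUM(amount) AS total
    FROM sales
    GROUP BY region
    ORDER BY region
""").fetchall()

print(summary)  # [('north', 2, 30.0), ('south', 3, 25.0)]
```

Doing the aggregation in the database means only the summary rows travel to the client, which matters far more when the detail table has millions of rows rather than five.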
6. Partitioning and Parallel Processing: Some database systems
support data partitioning, which involves splitting large tables into smaller,
more manageable pieces based on specific criteria (such as ranges or
hash values). Partitioning allows for parallel processing of data,
distributing the workload across multiple resources and improving query
performance.
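Partitioning syntax is database-specific (e.g. declarative range or hash partitioning in some systems), but the underlying idea can be sketched in plain Python: hash-partition the rows by key, then let independent workers each scan only their own partition. All names here are invented for the illustration.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical row set; in a real system these would live in partitioned tables.
rows = [{"id": i, "value": i % 7} for i in range(1000)]
NUM_PARTITIONS = 4

# Hash partitioning: assign each row to a bucket by hashing its key.
partitions = [[] for _ in range(NUM_PARTITIONS)]
for row in rows:
    partitions[hash(row["id"]) % NUM_PARTITIONS].append(row)

def partition_total(part):
    # Each worker scans only its own partition.
    return sum(r["value"] for r in part)

# Process the partitions in parallel and combine the partial results.
with ThreadPoolExecutor(max_workers=NUM_PARTITIONS) as pool:
    totals = list(pool.map(partition_total, partitions))

print(sum(totals))  # equals the total computed over all rows at once
```

A real database does the same split-scan-combine internally, so a query over a partitioned table can also prune partitions entirely when the WHERE clause excludes their key range.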
7. Consider Batch Processing: If applicable, consider performing
operations on the data in batches rather than processing the entire
dataset at once. This can be useful for tasks such as updates, deletions, or
inserts. Breaking the data into smaller batches can help manage resources
more effectively and allow for easier error handling and recovery.
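Batch processing can be sketched with sqlite3: insert 10,000 rows in fixed-size batches, committing one transaction per batch (the `logs` table and batch size are illustrative).

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE logs (id INTEGER PRIMARY KEY, message TEXT)")

records = [(f"line {i}",) for i in range(10_000)]
BATCH_SIZE = 1_000

# One transaction per batch keeps memory and lock time bounded,
# and a failure rolls back only the current batch, not all the work.
for start in range(0, len(records), BATCH_SIZE):
    batch = records[start:start + BATCH_SIZE]
    with conn:  # commits the batch, or rolls it back on error
        conn.executemany("INSERT INTO logs (message) VALUES (?)", batch)

count = conn.execute("SELECT COUNT(*) FROM logs").fetchone()[0]
print(count)  # 10000
```

The same chunked loop applies to UPDATE and DELETE work: operating on bounded slices (e.g. by id range) keeps each transaction small and makes a retry after failure straightforward.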
8. Optimize Query Performance: Analyze and optimize your SQL
queries to ensure they are written efficiently. Use appropriate join
conditions, avoid unnecessary subqueries or redundant calculations, and
ensure that your queries are using the best execution plan available.
Regularly review and analyze query performance using database-specific
tools or EXPLAIN/EXPLAIN PLAN statements to identify areas for
improvement.
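As a concrete example of reading an execution plan, sqlite3 supports the EXPLAIN QUERY PLAN statement mentioned above (the `users`/`logins` tables and index are invented for this sketch):

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE users (id INTEGER PRIMARY KEY, country TEXT);
CREATE TABLE logins (user_id INTEGER, ts TEXT);
CREATE INDEX idx_logins_user ON logins (user_id);
""")

query = """
    SELECT u.id, COUNT(l.user_id)
    FROM users u LEFT JOIN logins l ON l.user_id = u.id
    WHERE u.country = 'IN'
    GROUP BY u.id
"""

# EXPLAIN QUERY PLAN reports how SQLite intends to execute the query:
# which tables it scans and which indexes it uses for lookups.
details = [row[-1] for row in conn.execute("EXPLAIN QUERY PLAN " + query)]
for d in details:
    print(d)
# Typical output shows a scan of users and an index lookup on logins.
```

Spotting an unexpected full scan in this output is the usual cue to add an index or restructure the query; other systems expose the same information via EXPLAIN or EXPLAIN PLAN.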
THANK YOU
