Using SQL-MapReduce for Advanced Analytical Queries
by Rick F. van der Lans
R20/Consultancy BV

What Did the Users Want?
Diagram labels: BI reports, production databases

But What Did We Create?
Diagram labels: production database, ODS, data warehouse, data mart, cube

Problems with Current DW Platforms
45%  Poor query response
40%  Can’t support advanced analytics
39%  Inadequate data load speed
37%  Can’t scale to large data volumes
33%  Cost of scaling up is too expensive
29%  Poorly suited to real-time or on demand workloads
23%  Current platform is a legacy we must phase out
23%  Can’t support data modeling we need
21%  We need platform that supports mixed workloads
20%  Can’t support large concurrent user count
19%  Inadequate high availability
16%  Inadequate support for in-memory processing
16%  Inadequate support for web services and SOA
15%  Current platform is 32-bit, and we need 64-bit
14%  Current platform is SMP, and we need MPP
13%  We need platform better suited to cloud or virtualization
11%  Can’t secure the data properly
 4%  Other
 3%  No problems
Source: P. Russom, ‘Next Generation Data Warehouse Platforms’, TDWI Best Practices Report, fourth quarter 2009.

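Need for More Powerful Data Warehouse Platforms
Chart: plans to replace the current DW platform, by year.
Categories: 2009, 2010, 2011, 2012, 2013, 2014, 2015 or later, no plans to replace
Values (in transcript order, not matched to categories): 49%, 8%, 20%, 12%, 8%, 1%, 1%, 3%
Source: P. Russom, ‘Next Generation Data Warehouse Platforms’, TDWI Best Practices Report, fourth quarter 2009.
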
New Forms of Analytics
Advanced Analytics
Operational Analytics
Deep Analytics
Self-Service Analytics
Complex Analytics
Automated Analytics

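Positioning of Advanced Analytics
Quadrant diagram. Axes: complexity of analytical queries (low to high) and database size (low to high).
High complexity, small to medium size databases: complex queries
High complexity, large to ultra large databases: advanced analytics
Low complexity, small to medium size databases: simple queries
Low complexity, large to ultra large databases: simple queries
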
Parallelization of SQL
    SELECT   *
    FROM     CUSTOMERS
    WHERE    LOCATION = 'New York'
Diagram: a database server with one master and three workers.

How Easy Is Parallelizing SQL Queries? (1)
Example 1:
    SELECT   ID, SALES_DATE, PRICE
    FROM     SALES_RECORDS
    WHERE    PRICE > 100
Example 2:
    SELECT   REGION_ID, SUM(PRICE)
    FROM     SALES_RECORDS
    WHERE    PRICE > 100
    GROUP BY REGION_ID

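Why Examples 1 and 2 are easy to parallelize can be sketched in Python. This is a minimal illustration, not how any particular database server works: the sample rows, the three partitions and all function names are assumptions. The filter of Example 1 runs independently on every worker, and the SUM of Example 2 can be computed as partial sums per worker that the master adds up.

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor

# Hypothetical SALES_RECORDS rows (id, region_id, price), spread over three workers.
PARTITIONS = [
    [(1, "EU", 150), (2, "US", 90), (3, "EU", 120)],
    [(4, "US", 300), (5, "EU", 80)],
    [(6, "US", 101), (7, "ASIA", 500)],
]


def worker_filter(rows):
    """Example 1: each worker applies WHERE PRICE > 100 to its own rows."""
    return [row for row in rows if row[2] > 100]


def worker_partial_sums(rows):
    """Example 2, worker side: filter, then SUM(PRICE) per REGION_ID locally."""
    sums = defaultdict(int)
    for _id, region_id, price in worker_filter(rows):
        sums[region_id] += price
    return dict(sums)


def master_merge(partials):
    """Example 2, master side: add up the partial sums of all workers."""
    totals = defaultdict(int)
    for partial in partials:
        for region_id, subtotal in partial.items():
            totals[region_id] += subtotal
    return dict(totals)


def run_example_1():
    # The master only has to concatenate the filtered chunks.
    with ThreadPoolExecutor(max_workers=3) as pool:
        chunks = list(pool.map(worker_filter, PARTITIONS))
    return sorted(row for chunk in chunks for row in chunk)


def run_example_2():
    # The master merges one small dictionary per worker.
    with ThreadPoolExecutor(max_workers=3) as pool:
        partials = list(pool.map(worker_partial_sums, PARTITIONS))
    return master_merge(partials)
```

Both queries need no communication between workers; only the final merge happens on the master.
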
How Easy Is Parallelizing SQL Queries? (2)
Example 3: Get all the flights to London for which another flight exists to London that leaves within an hour on the same day.
    SELECT   *
    FROM     DEPARTURES AS D1
    WHERE    DESTINATION = 'London'
    AND      DEPARTURE_TIME + 60 MINUTES >=
            (SELECT   MIN(DEPARTURE_TIME)
             FROM     DEPARTURES AS D2
             WHERE    DESTINATION = 'London'
             AND      D2.DEPARTURE_TIME > D1.DEPARTURE_TIME
             AND      D2.DEPARTURE_DAY = D1.DEPARTURE_DAY)
    ORDER BY DEPARTURE_TIME

How Easy Is Parallelizing SQL Queries? (3)
Example 4: Market basket analysis:
    SELECT   A.PROD_DESC AS ITEM1,
             B.PROD_DESC AS ITEM2,
             C.PROD_DESC AS ITEM3,
             COUNT(*) AS CNT
    FROM    (SELECT SF.STORE_ID, SF.REG_ID, SF.TRAN_NO, SF.ITEM_ID, SF.DT, PD.PROD_DESC, PD.PRICE
             FROM   SALES_FACT SF
                    INNER JOIN PRODUCT_DIM PD
                    ON SF.ITEM_ID = PD.ITEM_ID) AS A,
            (SELECT SF.STORE_ID, SF.REG_ID, SF.TRAN_NO, SF.ITEM_ID, SF.DT, PD.PROD_DESC, PD.PRICE
             FROM   SALES_FACT SF
                    INNER JOIN PRODUCT_DIM PD
                    ON SF.ITEM_ID = PD.ITEM_ID) AS B,
            (SELECT SF.STORE_ID, SF.REG_ID, SF.TRAN_NO, SF.ITEM_ID, SF.DT, PD.PROD_DESC, PD.PRICE
             FROM   SALES_FACT SF
                    INNER JOIN PRODUCT_DIM PD
                    ON SF.ITEM_ID = PD.ITEM_ID) AS C
    WHERE    A.STORE_ID = B.STORE_ID AND B.STORE_ID = C.STORE_ID AND A.STORE_ID = C.STORE_ID
    AND      A.REG_ID = B.REG_ID AND B.REG_ID = C.REG_ID AND A.REG_ID = C.REG_ID
    AND      A.TRAN_NO = B.TRAN_NO AND B.TRAN_NO = C.TRAN_NO AND A.TRAN_NO = C.TRAN_NO
    AND      A.DT = B.DT AND B.DT = C.DT AND A.DT = C.DT
    AND      A.ITEM_ID <> B.ITEM_ID AND A.ITEM_ID <> C.ITEM_ID AND B.ITEM_ID <> C.ITEM_ID
    GROUP BY A.PROD_DESC, B.PROD_DESC, C.PROD_DESC
    HAVING   COUNT(*) > 1000
    ORDER BY COUNT(*) DESC;

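The logic behind Example 4 is small once it is written procedurally, which is part of why the three-way self-join is hard for an optimizer to parallelize well. The Python sketch below is an illustration under stated assumptions: the sample rows and the name count_item_triples are hypothetical, products are identified by description only, and each set of three items is counted once per transaction (the SQL version counts every ordering of the three items separately).

```python
from collections import Counter
from itertools import combinations

# Hypothetical joined sales rows: (store_id, reg_id, tran_no, dt, prod_desc).
SALES = [
    (1, 1, 100, "2010-05-01", "bread"),
    (1, 1, 100, "2010-05-01", "milk"),
    (1, 1, 100, "2010-05-01", "eggs"),
    (1, 2, 101, "2010-05-01", "bread"),
    (1, 2, 101, "2010-05-01", "milk"),
    (1, 2, 101, "2010-05-01", "eggs"),
    (1, 2, 101, "2010-05-01", "beer"),
    (2, 1, 102, "2010-05-01", "bread"),
    (2, 1, 102, "2010-05-01", "milk"),
]


def count_item_triples(rows, threshold):
    """Count how often three different items occur in the same transaction.

    A transaction is identified by (store_id, reg_id, tran_no, dt), the same
    columns the SQL query equates between A, B and C. Only triples that occur
    more than `threshold` times are returned, most frequent first.
    """
    baskets = {}
    for store_id, reg_id, tran_no, dt, prod_desc in rows:
        baskets.setdefault((store_id, reg_id, tran_no, dt), set()).add(prod_desc)

    counts = Counter()
    for items in baskets.values():
        # Sorting makes (bread, eggs, milk) and (milk, bread, eggs) the same key.
        for triple in combinations(sorted(items), 3):
            counts[triple] += 1

    return [(triple, n) for triple, n in counts.most_common() if n > threshold]
```

Because every basket is processed on its own, the baskets could be spread over workers by transaction key and the counters merged afterwards.
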
Declarativeness and Storage Independency
Declarativeness: The developer only has to program what has to be done, not how it should be done.
Storage independency: The language should hide how data is physically stored and how it is accessed.

Advantages of Two Properties
Productivity increase: less code has to be written
Maintainability: less code means having to maintain less code
Flexibility: changes to the storage layer can be made without the need to change the SQL code in the reports

Different Types of SQL Functions
Built-in or User-defined
    SELECT   FLIGHT, TRUNCATE(DEPARTURE_TIME, MINUTES)
    FROM     DEPARTURES AS D1
    WHERE    BANK_HOLIDAY(DEPARTURE_TIME) = 1
Scalar or Table
    SELECT   AVG(DURATION)
    FROM     LAST_FIVE_ROWS(DEPARTURES)
Pure SQL, Procedural, or External
Simple or Complex

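A user-defined, scalar, external function such as BANK_HOLIDAY can be tried out with Python's standard sqlite3 module, which lets a Python function be registered under an SQL name. This is only a stand-in to show the idea: SQLite is not the platform discussed in these slides, and the holiday dates, table layout and function bodies below are assumptions.

```python
import sqlite3

# Hypothetical list of bank holidays, as ISO dates.
BANK_HOLIDAYS = {"2010-12-25", "2011-01-01"}


def bank_holiday(departure_time):
    """Scalar function: one value in, one value out (1 = bank holiday, 0 = not)."""
    return 1 if departure_time[:10] in BANK_HOLIDAYS else 0


def flights_on_bank_holidays(rows):
    """Load (flight, departure_time) rows and query them through the function."""
    conn = sqlite3.connect(":memory:")
    # Register the Python function so that SQL statements can call BANK_HOLIDAY(x).
    conn.create_function("BANK_HOLIDAY", 1, bank_holiday)
    conn.execute("CREATE TABLE DEPARTURES (FLIGHT TEXT, DEPARTURE_TIME TEXT)")
    conn.executemany("INSERT INTO DEPARTURES VALUES (?, ?)", rows)
    cursor = conn.execute(
        "SELECT FLIGHT FROM DEPARTURES "
        "WHERE BANK_HOLIDAY(DEPARTURE_TIME) = 1 "
        "ORDER BY FLIGHT"
    )
    flights = [row[0] for row in cursor]
    conn.close()
    return flights


SAMPLE_DEPARTURES = [
    ("BA1", "2010-12-25 09:00"),
    ("KL2", "2010-12-26 10:00"),
    ("AF3", "2011-01-01 07:30"),
]
```

The SQL statement stays declarative; the procedural logic lives inside the registered function.
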
MapReduce
MapReduce is a programming model introduced by Google
Aimed at processing requests on large data sets where the processing can be distributed over a large number of nodes using parallel capabilities
Two steps: Map and Reduce
    Map is like Select
    Reduce is like Group-by

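The two steps can be sketched in a few lines of Python. This is a single-process illustration of the programming model only, not Google's implementation; the sample rows and the names mapper, reducer and map_reduce are assumptions. It counts flights per destination, the equivalent of a SELECT with GROUP BY and COUNT(*).

```python
from collections import defaultdict

# Hypothetical DEPARTURES rows: (flight, destination).
DEPARTURES = [
    ("BA1", "London"), ("KL2", "London"), ("UA5", "Paris"),
    ("AF3", "London"), ("DL6", "Paris"), ("LH4", "Rome"),
]


def mapper(row):
    """Map step: emit (key, value) pairs for one input row, like a SELECT list."""
    _flight, destination = row
    yield destination, 1


def reducer(key, values):
    """Reduce step: collapse all values of one key, like an aggregate per group."""
    return sum(values)


def map_reduce(rows, mapper, reducer):
    # Map: every row is handled independently, so rows could live on any node.
    pairs = [pair for row in rows for pair in mapper(row)]
    # Shuffle: bring all values that share a key together.
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    # Reduce: every key is handled independently, so keys could be spread over nodes.
    return {key: reducer(key, values) for key, values in groups.items()}


FLIGHTS_PER_DESTINATION = map_reduce(DEPARTURES, mapper, reducer)
```
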
Aster Data’s SQL-MapReduce (1)
SQL-MR is a set of built-in and user-defined external table functions
Example:
    SELECT   *
    FROM     GET_NEXT_FLIGHT_1HR
             (ON DEPARTURES PARTITION BY DESTINATION)
    WHERE    DESTINATION = 'London'
    ORDER BY DEPARTURE_TIME
All the SQL-MR function processing is parallelized
Including complex group-by operations and time-series analytics

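What a partitioned table function such as GET_NEXT_FLIGHT_1HR might do internally can be sketched in Python. The slides do not show the function body, so everything below is an assumption made for illustration: the row layout, the helper names, and the rule itself, which is taken from Example 3 (a flight qualifies when the next later flight to the same destination on the same day leaves within 60 minutes). This is not the Aster Data API.

```python
from collections import defaultdict

# Hypothetical DEPARTURES rows: (flight, destination, departure_day, departure_minute),
# where departure_minute counts minutes since midnight.
DEPARTURES = [
    ("BA1", "London", "2010-05-01", 600),
    ("KL2", "London", "2010-05-01", 640),
    ("AF3", "London", "2010-05-01", 800),
    ("LH4", "London", "2010-05-02", 610),
    ("UA5", "Paris", "2010-05-01", 600),
    ("DL6", "Paris", "2010-05-01", 630),
]


def partition_by_destination(rows):
    """PARTITION BY DESTINATION: all rows of one destination end up together."""
    partitions = defaultdict(list)
    for row in rows:
        partitions[row[1]].append(row)
    return partitions


def next_flight_within_hour(partition):
    """Process one partition: keep flights followed within 60 minutes on the same day."""
    ordered = sorted(partition, key=lambda row: (row[2], row[3]))
    result = []
    for index, current in enumerate(ordered):
        for following in ordered[index + 1:]:
            if following[2] != current[2]:
                break  # next day reached, no later flight on the same day
            if following[3] > current[3]:
                # First strictly later flight: the MIN(DEPARTURE_TIME) of Example 3.
                if following[3] - current[3] <= 60:
                    result.append(current)
                break
    return result


def get_next_flight_1hr(rows, destination):
    # Each partition is independent, so partitions could run on different workers.
    output = []
    for partition in partition_by_destination(rows).values():
        output.extend(next_flight_within_hour(partition))
    # Outer query: WHERE DESTINATION = ... ORDER BY DEPARTURE_TIME.
    selected = [row for row in output if row[1] == destination]
    return sorted(selected, key=lambda row: (row[2], row[3]))
```

Partitioning by destination is what makes the work splittable: no partition ever needs rows from another one.
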
Aster Data’s SQL-MapReduce (2)
An SQL-MR function can contain the most complex analytical logic
Programmers of SQL-MR functions don’t need to learn a new language; Java, C++, Python, and many more can be used
The SQL statements invoking SQL-MR functions are still declarative and storage-independent
    The functions themselves are not
Usable by any BI tool supporting SQL

Supported Built-in Functions
SQL-MR: Technical Advantages / Technical Disadvantages
Parallelization of complex operations
Simplification of queries
Efficiency of low-level programming language
Efficient data access
Predictable query performance
Linear scalability
Built-in functions
Cyber Awareness overview for 2025 month of securityCyber Awareness overview for 2025 month of security
Cyber Awareness overview for 2025 month of security
riccardosl1
 
HCL Nomad Web – Best Practices und Verwaltung von Multiuser-Umgebungen
HCL Nomad Web – Best Practices und Verwaltung von Multiuser-UmgebungenHCL Nomad Web – Best Practices und Verwaltung von Multiuser-Umgebungen
HCL Nomad Web – Best Practices und Verwaltung von Multiuser-Umgebungen
panagenda
 
tecnologias de las primeras civilizaciones.pdf
tecnologias de las primeras civilizaciones.pdftecnologias de las primeras civilizaciones.pdf
tecnologias de las primeras civilizaciones.pdf
fjgm517
 
MINDCTI revenue release Quarter 1 2025 PR
MINDCTI revenue release Quarter 1 2025 PRMINDCTI revenue release Quarter 1 2025 PR
MINDCTI revenue release Quarter 1 2025 PR
MIND CTI
 
Increasing Retail Store Efficiency How can Planograms Save Time and Money.pptx
Increasing Retail Store Efficiency How can Planograms Save Time and Money.pptxIncreasing Retail Store Efficiency How can Planograms Save Time and Money.pptx
Increasing Retail Store Efficiency How can Planograms Save Time and Money.pptx
Anoop Ashok
 
Procurement Insights Cost To Value Guide.pptx
Procurement Insights Cost To Value Guide.pptxProcurement Insights Cost To Value Guide.pptx
Procurement Insights Cost To Value Guide.pptx
Jon Hansen
 
Noah Loul Shares 5 Steps to Implement AI Agents for Maximum Business Efficien...
Noah Loul Shares 5 Steps to Implement AI Agents for Maximum Business Efficien...Noah Loul Shares 5 Steps to Implement AI Agents for Maximum Business Efficien...
Noah Loul Shares 5 Steps to Implement AI Agents for Maximum Business Efficien...
Noah Loul
 
Build 3D Animated Safety Induction - Tech EHS
Build 3D Animated Safety Induction - Tech EHSBuild 3D Animated Safety Induction - Tech EHS
Build 3D Animated Safety Induction - Tech EHS
TECH EHS Solution
 
Technology Trends in 2025: AI and Big Data Analytics
Technology Trends in 2025: AI and Big Data AnalyticsTechnology Trends in 2025: AI and Big Data Analytics
Technology Trends in 2025: AI and Big Data Analytics
InData Labs
 
Rusty Waters: Elevating Lakehouses Beyond Spark
Rusty Waters: Elevating Lakehouses Beyond SparkRusty Waters: Elevating Lakehouses Beyond Spark
Rusty Waters: Elevating Lakehouses Beyond Spark
carlyakerly1
 
Role of Data Annotation Services in AI-Powered Manufacturing
Role of Data Annotation Services in AI-Powered ManufacturingRole of Data Annotation Services in AI-Powered Manufacturing
Role of Data Annotation Services in AI-Powered Manufacturing
Andrew Leo
 
The Evolution of Meme Coins A New Era for Digital Currency ppt.pdf
The Evolution of Meme Coins A New Era for Digital Currency ppt.pdfThe Evolution of Meme Coins A New Era for Digital Currency ppt.pdf
The Evolution of Meme Coins A New Era for Digital Currency ppt.pdf
Abi john
 
Are Cloud PBX Providers in India Reliable for Small Businesses (1).pdf
Are Cloud PBX Providers in India Reliable for Small Businesses (1).pdfAre Cloud PBX Providers in India Reliable for Small Businesses (1).pdf
Are Cloud PBX Providers in India Reliable for Small Businesses (1).pdf
Telecoms Supermarket
 
ThousandEyes Partner Innovation Updates for May 2025
ThousandEyes Partner Innovation Updates for May 2025ThousandEyes Partner Innovation Updates for May 2025
ThousandEyes Partner Innovation Updates for May 2025
ThousandEyes
 

Using SQL-MapReduce for Advanced Analytics

  • 1. Using SQL-MapReduce for Advanced Analytical Queries, by Rick F. van der Lans, R20/Consultancy BV
  • 2. What Did the Users Want? [Diagram: BI reports, production databases]
  • 3. But What Did We Create? [Diagram: production database, ODS, data warehouse, data mart, cube]
  • 4. Problems with Current DW Platforms
    45%  Poor query response
    40%  Can’t support advanced analytics
    39%  Inadequate data load speed
    37%  Can’t scale to large data volumes
    33%  Cost of scaling up is too expensive
    29%  Poorly suited to real-time or on demand workloads
    23%  Current platform is a legacy we must phase out
    23%  Can’t support data modeling we need
    21%  We need platform that supports mixed workloads
    20%  Can’t support large concurrent user count
    19%  Inadequate high availability
    16%  Inadequate support for in-memory processing
    16%  Inadequate support for web services and SOA
    15%  Current platform is 32-bit, and we need 64-bit
    14%  Current platform is SMP, and we need MPP
    13%  We need platform better suited to cloud or virtualization
    11%  Can’t secure the data properly
     4%  Other
     3%  No problems
    Source: P. Russom, ‘Next Generation Data Warehouse Platforms’, TDWI Best Practices Report, fourth quarter 2009.
  • 5. Need for More Powerful Data Warehouse Platforms
    [Chart: planned replacement of the current DW platform. Categories: 2009, 2010, 2011, 2012, 2013, 2014, 2015 or later, no plans to replace. Values in extracted order: 49%, 8%, 20%, 12%, 8%, 1%, 1%, 3%.]
    Source: P. Russom, ‘Next Generation Data Warehouse Platforms’, TDWI Best Practices Report, fourth quarter 2009.
  • 6. New Forms of Analytics: Advanced Analytics, Operational Analytics, Deep Analytics, Self-Service Analytics, Complex Analytics, Automated Analytics
  • 7. Positioning of Advanced Analytics
    [Quadrant chart: complexity of analytical queries (low to high) against database size (low to high). Labels: complex queries on small to medium size databases; advanced analytics; simple queries on small to medium size databases; simple queries on large to ultra large databases.]
  • 8. Parallelization of SQL
    [Diagram: database server with one master and three workers]
      SELECT   *
      FROM     CUSTOMERS
      WHERE    LOCATION = 'New York'
  • 9. How Easy Is Parallelizing SQL Queries? (1)
    Example 1:
      SELECT   ID, SALES_DATE, PRICE
      FROM     SALES_RECORDS
      WHERE    PRICE > 100
    Example 2:
      SELECT   REGION_ID, SUM(PRICE)
      FROM     SALES_RECORDS
      WHERE    PRICE > 100
      GROUP BY REGION_ID
  • 10. How Easy Is Parallelizing SQL Queries? (2)
    Example 3: Get all the flights to London for which another flight exists to London that leaves within an hour on the same day.
      SELECT   *
      FROM     DEPARTURES AS D1
      WHERE    DESTINATION = 'London'
      AND      DEPARTURE_TIME + 60 MINUTES >=
               (SELECT MIN(DEPARTURE_TIME)
                FROM   DEPARTURES AS D2
                WHERE  DESTINATION = 'London'
                AND    D2.DEPARTURE_TIME > D1.DEPARTURE_TIME
                AND    D2.DEPARTURE_DAY = D1.DEPARTURE_DAY)
      ORDER BY DEPARTURE_TIME
  • 11. How Easy Is Parallelizing SQL Queries? (3)
    Example 4: Market basket analysis:
      SELECT A.PROD_DESC AS ITEM1, B.PROD_DESC AS ITEM2, C.PROD_DESC AS ITEM3, COUNT(*) AS CNT
      FROM
        (SELECT SF.STORE_ID, SF.REG_ID, SF.TRAN_NO, SF.ITEM_ID, SF.DT, PD.PROD_DESC, PD.PRICE
         FROM SALES_FACT SF
         INNER JOIN PRODUCT_DIM PD
         ON SF.ITEM_ID = PD.ITEM_ID) AS A,
        (SELECT SF.STORE_ID, SF.REG_ID, SF.TRAN_NO, SF.ITEM_ID, SF.DT, PD.PROD_DESC, PD.PRICE
         FROM SALES_FACT SF
         INNER JOIN PRODUCT_DIM PD
         ON SF.ITEM_ID = PD.ITEM_ID) AS B,
        (SELECT SF.STORE_ID, SF.REG_ID, SF.TRAN_NO, SF.ITEM_ID, SF.DT, PD.PROD_DESC, PD.PRICE
         FROM SALES_FACT SF
         INNER JOIN PRODUCT_DIM PD
         ON SF.ITEM_ID = PD.ITEM_ID) AS C
      WHERE A.STORE_ID = B.STORE_ID AND B.STORE_ID = C.STORE_ID AND A.STORE_ID = C.STORE_ID
      AND   A.REG_ID = B.REG_ID AND B.REG_ID = C.REG_ID AND A.REG_ID = C.REG_ID
      AND   A.TRAN_NO = B.TRAN_NO AND B.TRAN_NO = C.TRAN_NO AND A.TRAN_NO = C.TRAN_NO
      AND   A.DT = B.DT AND B.DT = C.DT AND A.DT = C.DT
      AND   A.ITEM_ID <> B.ITEM_ID AND A.ITEM_ID <> C.ITEM_ID AND B.ITEM_ID <> C.ITEM_ID
      GROUP BY A.PROD_DESC, B.PROD_DESC, C.PROD_DESC
      HAVING COUNT(*) > 1000
      ORDER BY COUNT(*) DESC;
  • 12. Declarativeness and Storage Independency
    Declarativeness: The developer has only to program what has to be done, and not how it should be done.
    Storage independency: The language should hide how data is physically stored and how it is accessed.
  • 13. Advantages of Two Properties
    Productivity increase: less code has to be written
    Maintainability: less code means having to maintain less code
    Flexibility: changes to the storage layer can be made without the need to change the SQL code in the reports
  • 14. Different Types of SQL Functions
    Built-in or User-defined
      SELECT FLIGHT, TRUNCATE(DEPARTURE_TIME, MINUTES)
      FROM   DEPARTURES AS D1
      WHERE  BANK_HOLIDAY(DEPARTURE_TIME) = 1
    Scalar or Table
      SELECT AVG(DURATION)
      FROM   LAST_FIVE_ROWS(DEPARTURES)
    Pure SQL, Procedural, or External
    Simple or Complex
  • 15. MapReduce
    MapReduce is a programming model introduced by Google
    Aimed at processing requests on large data sets where the processing can be distributed over a high number of nodes using parallel capabilities
    Two steps: Map and Reduce
    Map is like Select
    Reduce is like Group-by
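    The analogy on this slide (Map is like Select, Reduce is like Group-by) can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not Google's or Aster Data's implementation: the sample rows, the function names `map_phase`, `shuffle`, `reduce_phase`, and `run`, and the split into two partitions are all hypothetical. It mirrors Example 2 of slide 9 (sum of prices above 100 per region).

```python
from collections import defaultdict

# Hypothetical sample rows: (region_id, price).
SALES_RECORDS = [
    (1, 150.0), (1, 80.0), (2, 120.0), (2, 300.0), (3, 90.0),
]


def map_phase(rows):
    """Like SELECT ... WHERE: filter rows and emit (key, value) pairs."""
    for region_id, price in rows:
        if price > 100:
            yield region_id, price


def shuffle(pairs):
    """Collect the emitted values per key before the reduce step."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(groups):
    """Like GROUP BY with SUM: collapse the values of each key into one result."""
    return {key: sum(values) for key, values in groups.items()}


def run(partitions):
    # Each partition could be mapped on a different node; here they are
    # processed one after the other to keep the sketch self-contained.
    pairs = [pair for part in partitions for pair in map_phase(part)]
    return reduce_phase(shuffle(pairs))


# Two partitions stand in for two worker nodes.
result = run([SALES_RECORDS[:3], SALES_RECORDS[3:]])
print(result)
```

    Because the map step looks at one row at a time, the rows can be split over any number of nodes; only the reduce step needs all values of a key in one place.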
  • 16. Aster Data’s SQL-MapReduce (1)
    SQL-MR is a set of built-in and user-defined external table functions
    Example:
      SELECT   *
      FROM     GET_NEXT_FLIGHT_1HR (ON DEPARTURES PARTITION BY DESTINATION)
      WHERE    DESTINATION = 'London'
      ORDER BY DEPARTURE_TIME
    All the SQL-MR function processing is parallelized
    Including complex group-by operations and time-series analytics
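    The slides do not show the body of GET_NEXT_FLIGHT_1HR. As a rough sketch of what such a partitioned table function might do, the Python below applies the logic of Example 3 (slide 10) to one partition at a time. Everything here is an assumption made for illustration: the row layout, the helper `partition_by`, the function name `next_flight_within_1hr`, and the sample data. It is not the Aster Data API.

```python
from bisect import bisect_right
from collections import defaultdict
from datetime import datetime, timedelta


def partition_by(rows, key):
    """Group rows by one column, as PARTITION BY would before calling the function."""
    groups = defaultdict(list)
    for row in rows:
        groups[row[key]].append(row)
    return groups


def next_flight_within_1hr(partition):
    """Return the flights followed by a later flight within 60 minutes on the same day.

    The function only sees one partition (one destination), so partitions
    can be handled independently, for example on different workers.
    """
    rows = sorted(partition, key=lambda r: r["departure_time"])
    times = [r["departure_time"] for r in rows]
    selected = []
    for row in rows:
        t = row["departure_time"]
        i = bisect_right(times, t)  # index of the first strictly later departure
        if i < len(times):
            nxt = times[i]
            if nxt.date() == t.date() and nxt - t <= timedelta(minutes=60):
                selected.append(row)
    return selected


# Hypothetical sample data.
DEPARTURES = [
    {"flight": "A1", "destination": "London", "departure_time": datetime(2010, 5, 1, 9, 0)},
    {"flight": "A2", "destination": "London", "departure_time": datetime(2010, 5, 1, 9, 45)},
    {"flight": "A3", "destination": "London", "departure_time": datetime(2010, 5, 1, 12, 0)},
    {"flight": "B1", "destination": "Paris", "departure_time": datetime(2010, 5, 1, 9, 30)},
    {"flight": "A4", "destination": "London", "departure_time": datetime(2010, 5, 1, 23, 30)},
    {"flight": "A5", "destination": "London", "departure_time": datetime(2010, 5, 2, 0, 10)},
]

results = []
for destination, part in partition_by(DEPARTURES, "destination").items():
    results.extend(next_flight_within_1hr(part))

# Equivalent of WHERE DESTINATION = 'London' ORDER BY DEPARTURE_TIME.
london = sorted(
    (r for r in results if r["destination"] == "London"),
    key=lambda r: r["departure_time"],
)
flights = [r["flight"] for r in london]
print(flights)
```

    Sorting each partition once and looking only at the next departure replaces the correlated subquery of Example 3, which is the kind of procedural shortcut the slide attributes to SQL-MR functions.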
  • 17. Aster Data’s SQL-MapReduce (2)
    An SQL-MR function can contain the most complex analytical logic
    Programmers of SQL don’t need to learn a new language; for the functions, Java, C++, Python, and many more can be used
    The SQL statements invoking SQL-MR functions are still declarative and storage-independent
    The functions themselves are not
    Usable by any BI tools supporting SQL
  • 19. SQL-MR Technical Advantages and Technical Disadvantages
    Technical advantages:
      Parallelization of complex operations
      Efficiency of low-level programming language
      Nesting of the functions
    Technical disadvantages:
      Small group of developers have to learn a new language (possibly)
      Low-level language is not declarative
      Non-portable functions
  • 30. Market Basket Analysis using SQL-MR
      SELECT PROD_DESC1, PROD_DESC2, PROD_DESC3, COUNT(*) AS CNT
      FROM BASKET_GENERATOR(
             ON (SELECT SF.STORE_ID, SF.REG_ID, SF.TRAN_NO, SF.ITEM_ID, SF.DT, PD.PROD_DESC, PD.PRICE
                 FROM SALES_FACT SF
                 INNER JOIN PRODUCT_DIM PD
                 ON SF.ITEM_ID = PD.ITEM_ID) AS TRANSACTIONS
             PARTITION BY STORE_ID, REG_ID, TRAN_NO, DT
             BASKET_ITEM('PROD_DESC')
             BASKET_SIZE('3')
           )
      GROUP BY PROD_DESC1, PROD_DESC2, PROD_DESC3
      HAVING COUNT(*) > 1000
      ORDER BY COUNT(*) DESC;
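    To make the idea behind a basket generator concrete, here is a minimal Python sketch of the per-transaction logic, assuming rows of the form (store_id, reg_id, tran_no, dt, prod_desc). The function names `basket_generator` and `frequent_baskets`, the sample rows, and the choice to emit each unordered combination once are assumptions for illustration; the slides do not describe how the real BASKET_GENERATOR function is implemented.

```python
from collections import Counter, defaultdict
from itertools import combinations


def basket_generator(rows, basket_size=3):
    """Yield every combination of basket_size distinct products per transaction.

    A transaction is identified by (store_id, reg_id, tran_no, dt), which
    corresponds to the PARTITION BY clause of the query above.
    """
    baskets = defaultdict(set)
    for store_id, reg_id, tran_no, dt, prod_desc in rows:
        baskets[(store_id, reg_id, tran_no, dt)].add(prod_desc)
    for items in baskets.values():
        # Sorting makes the same products always produce the same tuple.
        yield from combinations(sorted(items), basket_size)


def frequent_baskets(rows, basket_size=3, min_count=1000):
    """Equivalent of GROUP BY ... HAVING COUNT(*) > min_count ORDER BY COUNT(*) DESC."""
    counts = Counter(basket_generator(rows, basket_size))
    return [(basket, n) for basket, n in counts.most_common() if n > min_count]


# Hypothetical sample rows: (store_id, reg_id, tran_no, dt, prod_desc).
ROWS = [
    (1, 1, 100, "2010-05-01", "bread"),
    (1, 1, 100, "2010-05-01", "milk"),
    (1, 1, 100, "2010-05-01", "eggs"),
    (1, 1, 100, "2010-05-01", "jam"),
    (1, 1, 101, "2010-05-01", "bread"),
    (1, 1, 101, "2010-05-01", "milk"),
    (1, 1, 101, "2010-05-01", "eggs"),
    (2, 1, 100, "2010-05-01", "bread"),
    (2, 1, 100, "2010-05-01", "milk"),
]

top = frequent_baskets(ROWS, basket_size=3, min_count=1)
print(top)
```

    Each transaction is processed on its own, so the three-way self-join of Example 4 is replaced by a single pass over the rows of one partition. Note that the self-join counts every ordering of the same three products separately, whereas this sketch counts each combination once.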
  • 31. Business Advantages of SQL-MR
    Simplification of architecture
    Deep analytics
    Complex analytics
    Operational analytics
    Self-service analytics
    No forbidden queries
  • 33. Conclusions
    The analytical and reporting demands are increasing
    Most environments already have problems with performance
    The marriage of SQL and MapReduce offers an enormous potential
    Parallelizing the processing of analytical logic
  • 35. Questions & Answers
    Rick van der Lans, R20 Consultancy; website: http://www.r20.nl
    Stephanie McReynolds, Director of Product Marketing, Aster Data
    More information on Aster Data: http://www.asterdata.com