SlideShare a Scribd company logo
Geekier analytics for
SaaS data
Sumit Sarkar
Chief Data Evangelist
Progress DataDirect
@SAsInSumit
www.linkedin.com/in/meetsumit
© 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.2
Agenda
§  Demand for geekier analytics
§  In depth: SaaS marketing data
lake
§  Our guidance
© 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.3
Demand for geekier analytics
© 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.4
As Lines of Business Control Technology Select,
SaaS Adoption Continues to Accelerate
http://www.forbes.com/sites/louiscolumbus/2015/01/24/roundup-of-cloud-computing-forecasts-and-market-estimates-2015/#192c1957740c
© 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.5
Are SaaS apps getting left behind the analytics revolution?
Current state Opportunity
Data silos in each SaaS App
behind process APIs
Open access to $200 billion analytics
market with standard interfaces for
analytics
Embedded BI provides limited
analytics options
Engage data skills and tools in IT for
richer insights
?
© 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.6
What are geekier analytics?
Business Intelligence Data Integration
ODBC, JDBC
or OData
?
© 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.7
What are geekier analytics?
JDBC
?
© 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.8
Who is getting geeky today with analytics connectivity?
© 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.9
Who is getting geeky today with analytics connectivity?
© 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.10
Who is getting geeky today with analytics connectivity?
© 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.11
In depth: SaaS marketing data lake
© 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.12
A data lake is a large-scale storage repository and
processing engine. A data lake provides "massive
storage for any kind of data, enormous processing
power and the ability to handle virtually limitless
concurrent tasks or jobs”
- SAS Institute
What is a Marketing Data Lake?
© 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.13
Benefits of a Marketing Data Lake?
Some of the benefits of a data lake include:
§  Store data in all shapes and sizes
§  Flexible analytics with “schema on read”
§  Query data using SQL or big data
programming frameworks
§  Eliminate data silos
© 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.14
Why Marketing Data?
§  CMOs will outspend CIOs on technology by 2017
(Gartner)
§  Oracle spent $3B on a martech aquisition spree to
gain CMO mindshare.
§  Expect more collaboration between CMO and CIO
(CIO.com)
§  Marketing Data Warehouse/Lake Webinars ~750
registrations (Progress)
© 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.15
It’s easy to forget that it’s still about solving real business problems.
Relevant data
Transaction / behavior history
Manage
Data
Perform
Analytics
Drive
Decisions
Insights
continuous feedback loop
Appropriate
data sources
Answers to
business questions
Strategy (Thinking) Moves Right to Left
Implementation Moves Left to Right
Before you think data, think decisions!
© 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.16
Our marketing data is almost all in the cloud
And it’s almost all complex, stream data – which means APIs that only
give aggregations aren’t too useful
© 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.17
How to ingest data directly from SaaS applications into HDFS
© 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.18
How to ingest data directly from SaaS applications into Spark
© 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.19
JDBC access to SaaS data
Progress DataDirect
JDBC Connector
Schema Manager
Apache Sqoop
Salesforce.com
Schema
User Defined
Schema
Driver uses
§  SOAP API
§  Bulk API
§  Metadata API
© 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.20
Geek Speak
$ sqoop help import
usage: sqoop import [GENERIC-ARGS] [TOOL-ARGS]
Common arguments:
--connect <jdbc-uri> Specify JDBC connect string
--connect-manager <jdbc-uri> Specify connection manager class to use
--driver <class-name> Manually specify JDBC driver class to use
--hadoop-mapred-home <dir>+ Override $HADOOP_MAPRED_HOME
--help Print usage instructions
-P Read password from console
--password <password> Set authentication password
--username <username> Set authentication username
--verbose Print more information while working
--hadoop-home <dir>+ Deprecated. Override $HADOOP_HOME
© 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.21
Geek Speak
val dataframe_salesforce =
sqlContext.read.format("jdbc").option("url","jdbc:datadirect:sforce://
login.salesforce.com;").option("driver","com.ddtek.jdbc.sforce.SForceDriver").
option("dbtable","SFORCE.<table_name>").option("user","<Username>").option("p
assword","<password>").option("securitytoken","<security_token>").load()
dataframe_salesforce.registerTempTable("account")
dataframe_salesforce.sqlContext.sql("select * from
account").collect.foreach(println)
© 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.22
Our guidance
© 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.23
Data in SaaS Applications is Siloed, Protected by Proprietary APIs Designed
for Process Integration, not Data Integration
© 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.24
Partner Summit registration report from SFDC
© 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.25
Each SaaS API is vastly different
§  Get OData, JDBC, ODBC interfaces on top of any API
Data Source API
Marketo Web Services API (REST/SOAP)
Bulk and non-Bulk APIs
No query language
Oracle Service Cloud Web Services APIs (REST/SOAP)
ROQL
Google Analytics Hypercube (query limits of 10 metrics grouped by
max of 7 dimensions)
Salesforce SOAP, BULK, Metadata APIs
SOQL
© 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.26
#DontBeJeff
§  http://prgress.co/dontbejeff
© 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.27
Detail is important because this digital data is true big data
The
relationship
between
events is
critical
© 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.28
We’re almost never solving for one problem with a big data system
Reporting Analytics
Summarized
Data
Segmented
Data
Detail
Data
We can’t just aggregate / We can’t not aggregate
Dashboarding
Campaign
Optimization
Customer
Drill-down
Attribution, CLTV,
Experience,
Personalization
Targeting
Forecasting
© 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.29
Expose variety of detailed data using standard interfaces
SaaS vendors
•  Expose bulk, non-bulk and analytics data model
•  Leverage standard interfaces for SQL and REST
•  Direct secure database access option
SaaS data consumers
•  Support same standard interfaces
•  Turn to trusted data connectivity partners or dedicate
significant headcount
© 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.30
Other SQL to SaaS platforms
© 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.31
Progress DataDirect
Embed SaaS Connectors into the Data Access Layer
© 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.32
Q&A with R&D
§  How do you handle the varying quality of services across
SaaS APIs?
§  With analytics style connectivity, are SaaS vendors concerned
about scalability against large extracts from multiple tenants?
§  Which SaaS vendors are easier to work with?
© 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.33
Ingest data across 200+ data sources (beyond marketing data sources)
Big Data/NoSQL
§  Apache Hadoop Hive
§  Cloudera
§  Hortonworks
§  Pivotal HD
§  MapR
§  EMR
§  Pivotal HAWQ
§  Cloudera Impala
§  MongoDB
§  Spark SQL
§  Cassandra
§  SAP HANA
Data Warehouses
§  Amazon Redshift
§  SAP Sybase IQ
§  Teradata
§  Pivotal Greenplum
Relational
§  Oracle DB
§  Microsoft SQL Server
§  IBM DB2
§  MySQL
§  PostgreSQL
§  IBM Informix
§  SAP Sybase
§  Pervasive SQL
§  Progress OpenEdge
§  Progress Rollbase
SaaS/Cloud
§  Salesforce.com
§  Database.com
§  FinancialForce
§  Veeva CRM
§  ServiceMAX
§  Any Force.com App
§  Hubspot
§  Marketo
§  Microsoft Dynamics CRM
§  Microsoft SQL Azure
§  Oracle Eloqua
§  Oracle Service Cloud
§  Google Analytics
EDI/XML/Text
§  EDIFACT
§  EDIG@S
§  EANCOM
§  X12
§  IATA
§  Healthcare EDI: X12, HIPAA,
ICD-10, HL7
§  Custom EDI
§  Flat files: CSV, TSV, dBase,
Clipper, Foxpro, Paradox
§  Text Files
Any
§  SDK
§  SequeLink Socket Server
§  Customer Engineering
Geekier Analytics for SaaS data

More Related Content

What's hot (20)

PPTX
Data APIs Don't Discriminate [API World Stage Talk]
Sumit Sarkar
 
PPTX
Hybrid Data Pipeline for SQL and REST
Sumit Sarkar
 
PPTX
Welcome to the Era of Open Analytics
Sumit Sarkar
 
PPTX
Journey to SAS Analytics Grid with SAS, R, Python
Sumit Sarkar
 
PPTX
SQL vs SOQL for Salesforce Analytics
Sumit Sarkar
 
PPTX
Firewall friendly pipeline for secure data access
Sumit Sarkar
 
PPTX
Deliver Secure SQL Access for Enterprise APIs - August 29 2017
Nishanth Kadiyala
 
PPTX
OData and the future of business objects universes
Sumit Sarkar
 
PPTX
Salesforce Connect External Object Reports
Sumit Sarkar
 
PPTX
Salesforce External Objects for Big Data
Sumit Sarkar
 
PDF
Pre-Con Ed: Explore What’s New in CA Performance Management 3.0
CA Technologies
 
PDF
OOW16 - Planning Your Upgrade to Oracle E-Business Suite 12.2 [CON1423]
vasuballa
 
PDF
Kendo UI workshop introduction - PUG Baltic Annual Conference 2017
Alen Leit
 
PDF
Pre-Con Ed: Using SQL to Access Your CA IDMS Databases
CA Technologies
 
PDF
Progress OE Roadmap and Vision - PUG Baltic Annual Conference 2017
Alen Leit
 
PDF
Virtualized Oracle Real Application Clusters (RAC) - Containers and VMs for RAC
Markus Michalewicz
 
PDF
Progress application server for openedge best practices - PUG Baltic Annual C...
Alen Leit
 
PDF
Oracle strategies for a modern business
Moldova ICT Summit
 
PDF
Progress core product group strategy and roadmap - PUG Baltic Annual Conferen...
Alen Leit
 
PDF
Pre-Con Ed: Implement a Modern, Advanced Network Monitoring Solution by Integ...
CA Technologies
 
Data APIs Don't Discriminate [API World Stage Talk]
Sumit Sarkar
 
Hybrid Data Pipeline for SQL and REST
Sumit Sarkar
 
Welcome to the Era of Open Analytics
Sumit Sarkar
 
Journey to SAS Analytics Grid with SAS, R, Python
Sumit Sarkar
 
SQL vs SOQL for Salesforce Analytics
Sumit Sarkar
 
Firewall friendly pipeline for secure data access
Sumit Sarkar
 
Deliver Secure SQL Access for Enterprise APIs - August 29 2017
Nishanth Kadiyala
 
OData and the future of business objects universes
Sumit Sarkar
 
Salesforce Connect External Object Reports
Sumit Sarkar
 
Salesforce External Objects for Big Data
Sumit Sarkar
 
Pre-Con Ed: Explore What’s New in CA Performance Management 3.0
CA Technologies
 
OOW16 - Planning Your Upgrade to Oracle E-Business Suite 12.2 [CON1423]
vasuballa
 
Kendo UI workshop introduction - PUG Baltic Annual Conference 2017
Alen Leit
 
Pre-Con Ed: Using SQL to Access Your CA IDMS Databases
CA Technologies
 
Progress OE Roadmap and Vision - PUG Baltic Annual Conference 2017
Alen Leit
 
Virtualized Oracle Real Application Clusters (RAC) - Containers and VMs for RAC
Markus Michalewicz
 
Progress application server for openedge best practices - PUG Baltic Annual C...
Alen Leit
 
Oracle strategies for a modern business
Moldova ICT Summit
 
Progress core product group strategy and roadmap - PUG Baltic Annual Conferen...
Alen Leit
 
Pre-Con Ed: Implement a Modern, Advanced Network Monitoring Solution by Integ...
CA Technologies
 

Viewers also liked (20)

PPT
Rombus
andresta
 
PPT
Yahoo媒体化运作分析
Steven Lin
 
PPTX
3 Ways to Simplify your Mobile Apps
Progress
 
PDF
B2B marketing analytics-report
Progress
 
PPT
Lewis Diagram
kitcoffeen
 
PPTX
Webstock 2011
Simon Gianoutsos
 
PDF
Alumni Presentation - Homecoming 2007
aepitaudelta
 
PPT
Running a Gaming Program When You Can't Tell the Difference Between an Xbox a...
joshwEVPL
 
PPT
Homework 03
GRX
 
PPT
Health Related CMC Backgrounds
guestf01d8a
 
ODP
中文Project
makiyo
 
PPT
What Are Dreams
guest9aec79
 
PPT
A,E,J &J Presentation
guest1b1543
 
PPT
Newton’S Laws Practice
kitcoffeen
 
PPTX
Does PaaS Pay Off?
Progress
 
PPTX
Progress Pacific: Contemporary App Development
Progress
 
PPS
Sioux Hot-or-Not: The future of Linux (Alan Cox)
siouxhotornot
 
PPT
Mobile Phone To Computer
Ian Hay
 
PPTX
Creative visiting
Mike Sharples
 
PPT
Quantum Pres
kitcoffeen
 
Rombus
andresta
 
Yahoo媒体化运作分析
Steven Lin
 
3 Ways to Simplify your Mobile Apps
Progress
 
B2B marketing analytics-report
Progress
 
Lewis Diagram
kitcoffeen
 
Webstock 2011
Simon Gianoutsos
 
Alumni Presentation - Homecoming 2007
aepitaudelta
 
Running a Gaming Program When You Can't Tell the Difference Between an Xbox a...
joshwEVPL
 
Homework 03
GRX
 
Health Related CMC Backgrounds
guestf01d8a
 
中文Project
makiyo
 
What Are Dreams
guest9aec79
 
A,E,J &J Presentation
guest1b1543
 
Newton’S Laws Practice
kitcoffeen
 
Does PaaS Pay Off?
Progress
 
Progress Pacific: Contemporary App Development
Progress
 
Sioux Hot-or-Not: The future of Linux (Alan Cox)
siouxhotornot
 
Mobile Phone To Computer
Ian Hay
 
Creative visiting
Mike Sharples
 
Quantum Pres
kitcoffeen
 
Ad

Similar to Geekier Analytics for SaaS data (20)

PDF
Best Practices for Salesforce Data Access
Salesforce Developers
 
PPTX
Building a marketing data lake
Sumit Sarkar
 
PDF
Big Data LDN 2018: DATA APIS DON’T DISCRIMINATE
Matt Stubbs
 
PPTX
OData External Data Integration Strategies for SaaS
Sumit Sarkar
 
PDF
PROG_UntoldStory ISV eBook_0706c FINAL
SolarWinds MSP
 
PDF
Big Data LDN 2018: 2018 DATA TRENDS: RESULTS FROM FIFTH ANNUAL DATA AND ANALY...
Matt Stubbs
 
PPTX
Modern REST APIs for Enterprise Databases - OData
Nishanth Kadiyala
 
PDF
Data APIs as a Foundation for Systems of Engagement
Victor Olex
 
PPTX
Planning your move to the cloud: SaaS Enablement and User Experience (Oracle ...
Lucas Jellema
 
PPTX
How to Capitalize on Big Data with Oracle Analytics Cloud
Perficient, Inc.
 
PPTX
APIs in Enterprise
Victor Olex
 
PPTX
SaaS Enablement - Realizing the true Potential of Standard Applications (a ca...
Lucas Jellema
 
PDF
Real-time SQL Access for Your Salesforce.com Data
Salesforce Developers
 
PPTX
SaaS BI
Riola Fernandes
 
PPTX
Journey to the Cloud: What I Wish I Knew Before I Started
Datavail
 
PDF
Slides: Success Stories for Data-to-Cloud
DATAVERSITY
 
PDF
BAR360 open data platform presentation at DAMA, Sydney
Sai Paravastu
 
PDF
Self Service Analytics and a Modern Data Architecture with Data Virtualizatio...
Denodo
 
PPTX
Technology Trends in 2013-2014
KMS Technology
 
PDF
Book for SOA framework Solution for Data Services
Digendra Vir Singh (DV)
 
Best Practices for Salesforce Data Access
Salesforce Developers
 
Building a marketing data lake
Sumit Sarkar
 
Big Data LDN 2018: DATA APIS DON’T DISCRIMINATE
Matt Stubbs
 
OData External Data Integration Strategies for SaaS
Sumit Sarkar
 
PROG_UntoldStory ISV eBook_0706c FINAL
SolarWinds MSP
 
Big Data LDN 2018: 2018 DATA TRENDS: RESULTS FROM FIFTH ANNUAL DATA AND ANALY...
Matt Stubbs
 
Modern REST APIs for Enterprise Databases - OData
Nishanth Kadiyala
 
Data APIs as a Foundation for Systems of Engagement
Victor Olex
 
Planning your move to the cloud: SaaS Enablement and User Experience (Oracle ...
Lucas Jellema
 
How to Capitalize on Big Data with Oracle Analytics Cloud
Perficient, Inc.
 
APIs in Enterprise
Victor Olex
 
SaaS Enablement - Realizing the true Potential of Standard Applications (a ca...
Lucas Jellema
 
Real-time SQL Access for Your Salesforce.com Data
Salesforce Developers
 
Journey to the Cloud: What I Wish I Knew Before I Started
Datavail
 
Slides: Success Stories for Data-to-Cloud
DATAVERSITY
 
BAR360 open data platform presentation at DAMA, Sydney
Sai Paravastu
 
Self Service Analytics and a Modern Data Architecture with Data Virtualizatio...
Denodo
 
Technology Trends in 2013-2014
KMS Technology
 
Book for SOA framework Solution for Data Services
Digendra Vir Singh (DV)
 
Ad

More from Progress (19)

PDF
Ship Quickly, Ship Quality: The Developer’s Quest (Infographic)
Progress
 
PDF
Database Technology Trends 2016 – Survey Results
Progress
 
PDF
Top 10 innovative IoT connected devices
Progress
 
PPTX
Top SaaS App Challenges: Which One Is Yours?
Progress
 
PPTX
SQL Connectivity in a MongoDB World
Progress
 
PPTX
Ignite Your Big Data With a Spark!
Progress
 
PPTX
Bridge the App Gap: Crossing the Chasm Between IT and Business
Progress
 
PDF
3 Simple Ways to Simplify Your Mobile Apps
Progress
 
PDF
Why Should You Join The Mobile Revolution?
Progress
 
PPT
PaaS for App Dev and Deployment
Progress
 
PPTX
How OData Opens Your Data To Enterprise Mobile Applications
Progress
 
PDF
Progress Rollbase: Building Powerful Applications One Block at a Time
Progress
 
PPTX
Creating Stunning Enterprise Apps for Both Web and Mobile
Progress
 
PDF
With Progress Pacific, The RAD Race Has Already Been Won!
Progress
 
PPTX
Build Powerful Apps Fast with Progress Rollbase
Progress
 
PDF
Does PaaS Pay Off?
Progress
 
PPTX
Rollbase Mobile Tech Tips
Progress
 
PPTX
A Crash Course in Rapid Application Development
Progress
 
PPTX
2014 Developers' Choice Awards Reveal Database Trends
Progress
 
Ship Quickly, Ship Quality: The Developer’s Quest (Infographic)
Progress
 
Database Technology Trends 2016 – Survey Results
Progress
 
Top 10 innovative IoT connected devices
Progress
 
Top SaaS App Challenges: Which One Is Yours?
Progress
 
SQL Connectivity in a MongoDB World
Progress
 
Ignite Your Big Data With a Spark!
Progress
 
Bridge the App Gap: Crossing the Chasm Between IT and Business
Progress
 
3 Simple Ways to Simplify Your Mobile Apps
Progress
 
Why Should You Join The Mobile Revolution?
Progress
 
PaaS for App Dev and Deployment
Progress
 
How OData Opens Your Data To Enterprise Mobile Applications
Progress
 
Progress Rollbase: Building Powerful Applications One Block at a Time
Progress
 
Creating Stunning Enterprise Apps for Both Web and Mobile
Progress
 
With Progress Pacific, The RAD Race Has Already Been Won!
Progress
 
Build Powerful Apps Fast with Progress Rollbase
Progress
 
Does PaaS Pay Off?
Progress
 
Rollbase Mobile Tech Tips
Progress
 
A Crash Course in Rapid Application Development
Progress
 
2014 Developers' Choice Awards Reveal Database Trends
Progress
 

Recently uploaded (20)

PDF
Agentic AI: The Autonomous Upgrade Your AI Stack Didn’t Know It Needed
Amnic
 
PDF
ANÁLISIS DE COSTO- PAUCAR RIVERA NEISY.pdf
neisypaucarr
 
PDF
Gregory Felber - A Dedicated Researcher
Gregory Felber
 
DOCX
Apply for a Canada Permanent Resident Visa in Delhi with Expert Guidance.docx
WVP International
 
PDF
The New Zealand Business Sales Report 2025_compressed.pdf
charlie630518
 
PPTX
How to Build and Optimize a GA4 Conversion Funnel.pptx
Orage Technologies
 
PDF
Infrastructure and geopolitics.AM.ENG.docx.pdf
Andrea Mennillo
 
PDF
Alan Stalcup - Principal Of GVA Real Estate Investments
Alan Stalcup
 
PDF
Gregory Felber - An Accomplished Underwater Marine Biologist
Gregory Felber
 
PDF
NewBase 24 July 2025 Energy News issue - 1805 by Khaled Al Awadi._compressed...
Khaled Al Awadi
 
PPTX
Appreciations - July 25.pptxsdsdsddddddsssss
anushavnayak
 
PDF
Rise of the Build Division, Conquering the Curve.pdf
Brij Consulting, LLC
 
PDF
GenAI for Risk Management: Refresher for the Boards and Executives
Alexei Sidorenko, CRMP
 
PPTX
PUBLIC RELATIONS N6 slides (4).pptx poin
chernae08
 
PPTX
Marketing Presentations - How to do them
Jaime Ernesto Portillo De la O
 
PDF
Retinal Disorder Treatment Market 2030: The Impact of Advanced Diagnostics an...
Kumar Satyam
 
DOCX
India's Emerging Global Leadership in Sustainable Energy Production The Rise ...
Insolation Energy
 
PDF
New Royals Distribution Plan Presentation
ksherwin
 
PDF
Shop Beautiful Globes of the World for Home & Office.pdf
largeglobes2
 
PPTX
Piper 2025 Financial Year Shareholder Presentation
Piper Industries
 
Agentic AI: The Autonomous Upgrade Your AI Stack Didn’t Know It Needed
Amnic
 
ANÁLISIS DE COSTO- PAUCAR RIVERA NEISY.pdf
neisypaucarr
 
Gregory Felber - A Dedicated Researcher
Gregory Felber
 
Apply for a Canada Permanent Resident Visa in Delhi with Expert Guidance.docx
WVP International
 
The New Zealand Business Sales Report 2025_compressed.pdf
charlie630518
 
How to Build and Optimize a GA4 Conversion Funnel.pptx
Orage Technologies
 
Infrastructure and geopolitics.AM.ENG.docx.pdf
Andrea Mennillo
 
Alan Stalcup - Principal Of GVA Real Estate Investments
Alan Stalcup
 
Gregory Felber - An Accomplished Underwater Marine Biologist
Gregory Felber
 
NewBase 24 July 2025 Energy News issue - 1805 by Khaled Al Awadi._compressed...
Khaled Al Awadi
 
Appreciations - July 25.pptxsdsdsddddddsssss
anushavnayak
 
Rise of the Build Division, Conquering the Curve.pdf
Brij Consulting, LLC
 
GenAI for Risk Management: Refresher for the Boards and Executives
Alexei Sidorenko, CRMP
 
PUBLIC RELATIONS N6 slides (4).pptx poin
chernae08
 
Marketing Presentations - How to do them
Jaime Ernesto Portillo De la O
 
Retinal Disorder Treatment Market 2030: The Impact of Advanced Diagnostics an...
Kumar Satyam
 
India's Emerging Global Leadership in Sustainable Energy Production The Rise ...
Insolation Energy
 
New Royals Distribution Plan Presentation
ksherwin
 
Shop Beautiful Globes of the World for Home & Office.pdf
largeglobes2
 
Piper 2025 Financial Year Shareholder Presentation
Piper Industries
 

Geekier Analytics for SaaS data

  • 1. Geekier analytics for SaaS data Sumit Sarkar Chief Data Evangelist Progress DataDirect @SAsInSumit www.linkedin.com/in/meetsumit
  • 2. © 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.2 Agenda §  Demand for geekier analytics §  In depth: SaaS marketing data lake §  Our guidance
  • 3. © 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.3 Demand for geekier analytics
  • 4. © 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.4 As Lines of Business Control Technology Select, SaaS Adoption Continues to Accelerate http://www.forbes.com/sites/louiscolumbus/2015/01/24/roundup-of-cloud-computing-forecasts-and-market-estimates-2015/#192c1957740c
  • 5. © 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.5 Are SaaS apps getting left behind the analytics revolution? Current state Opportunity Data silos in each SaaS App behind process APIs Open access to $200 billion analytics market with standard interfaces for analytics Embedded BI provides limited analytics options Engage data skills and tools in IT for richer insights ?
  • 6. © 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.6 What are geekier analytics? Business Intelligence Data Integration ODBC, JDBC or OData ?
  • 7. © 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.7 What are geekier analytics? JDBC ?
  • 8. © 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.8 Who is getting geeky today with analytics connectivity?
  • 9. © 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.9 Who is getting geeky today with analytics connectivity?
  • 10. © 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.10 Who is getting geeky today with analytics connectivity?
  • 11. © 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.11 In depth: SaaS marketing data lake
  • 12. © 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.12 A data lake is a large-scale storage repository and processing engine. A data lake provides "massive storage for any kind of data, enormous processing power and the ability to handle virtually limitless concurrent tasks or jobs” - SAS Institute What is a Marketing Data Lake?
  • 13. © 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.13 Benefits of a Marketing Data Lake? Some of the benefits of a data lake include: §  Store data in all shapes and sizes §  Flexible analytics with “schema on read” §  Query data using SQL or big data programming frameworks §  Eliminate data silos
  • 14. © 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.14 Why Marketing Data? §  CMOs will outspend CIOs on technology by 2017 (Gartner) §  Oracle spent $3B on a martech aquisition spree to gain CMO mindshare. §  Expect more collaboration between CMO and CIO (CIO.com) §  Marketing Data Warehouse/Lake Webinars ~750 registrations (Progress)
  • 15. © 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.15 It’s easy to forget that it’s still about solving real business problems. Relevant data Transaction / behavior history Manage Data Perform Analytics Drive Decisions Insights continuous feedback loop Appropriate data sources Answers to business questions Strategy (Thinking) Moves Right to Left Implementation Moves Left to Right Before you think data, think decisions!
  • 16. © 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.16 Our marketing data is almost all in the cloud And it’s almost all complex, stream data – which means APIs that only give aggregations aren’t too useful
  • 17. © 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.17 How to ingest data directly from SaaS applications into HDFS
  • 18. © 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.18 How to ingest data directly from SaaS applications into Spark
  • 19. © 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.19 JDBC access to SaaS data Progress DataDirect JDBC Connector Schema Manager Apache Sqoop Salesforce.com Schema User Defined Schema Driver uses §  SOAP API §  Bulk API §  Metadata API
  • 20. © 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.20 Geek Speak $ sqoop help import usage: sqoop import [GENERIC-ARGS] [TOOL-ARGS] Common arguments: --connect <jdbc-uri> Specify JDBC connect string --connect-manager <jdbc-uri> Specify connection manager class to use --driver <class-name> Manually specify JDBC driver class to use --hadoop-mapred-home <dir>+ Override $HADOOP_MAPRED_HOME --help Print usage instructions -P Read password from console --password <password> Set authentication password --username <username> Set authentication username --verbose Print more information while working --hadoop-home <dir>+ Deprecated. Override $HADOOP_HOME
  • 21. © 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.21 Geek Speak val dataframe_salesforce = sqlContext.read.format("jdbc").option("url","jdbc:datadirect:sforce:// login.salesforce.com;").option("driver","com.ddtek.jdbc.sforce.SForceDriver"). option("dbtable","SFORCE.<table_name>").option("user","<Username>").option("p assword","<password>").option("securitytoken","<security_token>").load() dataframe_salesforce.registerTempTable("account") dataframe_salesforce.sqlContext.sql("select * from account").collect.foreach(println)
  • 22. © 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.22 Our guidance
  • 23. © 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.23 Data in SaaS Applications is Siloed, Protected by Proprietary APIs Designed for Process Integration, not Data Integration
  • 24. © 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.24 Partner Summit registration report from SFDC
  • 25. © 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.25 Each SaaS API is vastly different §  Get OData, JDBC, ODBC interfaces on top of any API Data Source API Marketo Web Services API (REST/SOAP) Bulk and non-Bulk APIs No query language Oracle Service Cloud Web Services APIs (REST/SOAP) ROQL Google Analytics Hypercube (query limits of 10 metrics grouped by max of 7 dimensions) Salesforce SOAP, BULK, Metadata APIs SOQL
  • 26. © 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.26 #DontBeJeff §  http://prgress.co/dontbejeff
  • 27. © 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.27 Detail is important because this digital data is true big data The relationship between events is critical
  • 28. © 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.28 We’re almost never solving for one problem with a big data system Reporting Analytics Summarized Data Segmented Data Detail Data We can’t just aggregate / We can’t not aggregate Dashboarding Campaign Optimization Customer Drill-down Attribution, CLTV, Experience, Personalization Targeting Forecasting
  • 29. © 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.29 Expose variety of detailed data using standard interfaces SaaS vendors •  Expose bulk, non-bulk and analytics data model •  Leverage standard interfaces for SQL and REST •  Direct secure database access option SaaS data consumers •  Support same standard interfaces •  Turn to trusted data connectivity partners or dedicate significant headcount
  • 30. © 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.30 Other SQL to SaaS platforms
  • 31. © 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.31 Progress DataDirect Embed SaaS Connectors into the Data Access Layer
  • 32. © 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.32 Q&A with R&D §  How do you handle the varying quality of services across SaaS APIs? §  With analytics style connectivity, are SaaS vendors concerned about scalability against large extracts from multiple tenants? §  Which SaaS vendors are easier to work with?
  • 33. © 2016 Progress Software Corporation and/or its subsidiaries or affiliates. All rights reserved.33 Ingest data across 200+ data sources (beyond marketing data sources) Big Data/NoSQL §  Apache Hadoop Hive §  Cloudera §  Hortonworks §  Pivotal HD §  MapR §  EMR §  Pivotal HAWQ §  Cloudera Impala §  MongoDB §  Spark SQL §  Cassandra §  SAP HANA Data Warehouses §  Amazon Redshift §  SAP Sybase IQ §  Teradata §  Pivotal Greenplum Relational §  Oracle DB §  Microsoft SQL Server §  IBM DB2 §  MySQL §  PostgreSQL §  IBM Informix §  SAP Sybase §  Pervasive SQL §  Progress OpenEdge §  Progress Rollbase SaaS/Cloud §  Salesforce.com §  Database.com §  FinancialForce §  Veeva CRM §  ServiceMAX §  Any Force.com App §  Hubspot §  Marketo §  Microsoft Dynamics CRM §  Microsoft SQL Azure §  Oracle Eloqua §  Oracle Service Cloud §  Google Analytics EDI/XML/Text §  EDIFACT §  EDIG@S §  EANCOM §  X12 §  IATA §  Healthcare EDI: X12, HIPAA, ICD-10, HL7 §  Custom EDI §  Flat files: CSV, TSV, dBase, Clipper, Foxpro, Paradox §  Text Files Any §  SDK §  SequeLink Socket Server §  Customer Engineering