SlideShare a Scribd company logo
Azure Stream Analytics
AZURE STREAM
ANALYTICS
DAVIDE MAURI
@mauridb
dmauri@solidq.com
• Microsoft SQL Server MVP
• Works with SQL Server from 6.5, on BI from 2003
• Specialized in Data Solution Architecture, Database Design,
Performance Tuning, High-Performance Data Warehousing, BI, Big
Data
• President of UGISS (Italian SQL Server UG)
• Regular Speaker @ SQL Server events
• Consulting & Training, Mentor @ SolidQ
• E-mail: dmauri@solidq.com
• Twitter: @mauridb
• Blog: http://sqlblog.com/blogs/davide_mauri
Davide Mauri
• COMPLEX EVENT PROCESSING
• LAMBDA ARCHITECTURE
• AZURE STREAM ANALYTICS
• DATA INGESTION
• AZURE STREAM ANALYTICS QUERY LANGUAGE
• ADVANCED FEATURES
• ADDITIONAL RESOURCES
COMPLEX EVENT PROCESSING
•Event processing is a method of tracking and analyzing
(processing) streams of information (data) about things that
happen (events)
•Complex event processing, or CEP, is event processing that
combines data from multiple sources to infer events or
patterns that suggest more complicated circumstances.
• Start to appear in 1990
• Goal: identify meaningful events (such as opportunities or
threats) and respond to them as quickly as possible
EVENT PROCESSING USE CASES
• Network monitoring
• Intelligence and surveillance
• Risk management
• E-commerce
• Fraud detection
• Smart order routing
• Transaction cost analysis
• Pricing and analytics
• Market data management
• Algorithmic trading
• Data warehouse augmentation
REAL TIME USE CASES
http://www.digital4.biz
LAMBDA ARCHITECTURE
Generic, scalable and fault-tolerant data processing architecture […]
in which low-latency reads and updates are required.
http://lambda-architecture.net/
HADOOP BUT NOT ONLY THAT!
•Apache Hadoop Ecosystem is the typical solution nowadays
• “Mature” Option
• Flume (optional collector and streaming data movement system)
• Kafka (distributed messaging system)
• Storm (distributed real-time computation system)
• “Innovative” Option
• Spark + Spark Streaming
•Very powerful, but very complex
WHY AZURE?
•Due to the high scalability and computing power that a
streaming solution may require, the cloud is a perfect
environment for it
•Very cheap and Very Simple to start a project
•Very well integrated with all other Azure offerings
• From Monitoring to Power BI
STREAM ANALYTICS
•Real-Time (somehow) complex event processing engine
•Enables real-time event processing in a very simple and
cheap way
• SQL-Like language
• Temporal Semantic Support
• (but different from SQL Server 2016)
•Azure Only at present time
STREAM ANALYTICS
•Platform-as-a-Service
• Can handle millions of events per second
• Based on the REEF project (now Apache incubated)
•Main objects: Job, Query, Functions, Input & Outputs
• Totally manageable from a REST interface
•“Streaming Units” is the base concept to manage
performance, scalability and costs
• Roughly 1 Streaming Units = 1 MB/Sec of throughput
DATA INGESTION
•Inputs for Stream Analytics
• Streaming Sources (“Data in motion”)
• JSON, CSV or AVRO
• Reference Data (“Data at rest”)
• JSON or CSV
• Blob Store (max 50MB)
•Streaming Sources
• Event Hubs
• IoT Hub
DEMO
STREAM ANALYTICS QUERY ENGINE
•Take date from one or more input
•Send resulting data to one or more output
•Support most common data types:
• bigint, float, unicode strings, datetime
• key-value pairs
• arrays
STREAM ANALYTICS QUERY LANGUAGE
•Stream Analytics Query Language Reference
• https://msdn.microsoft.com/library/azure/dn834998.aspx
•Subset of T-SQL
•With specific temporal extension
• Time values to be used can be set using TIMESTAMP BY directive
STREAM ANALYTICS QUERY LANGUAGE
DML Statements
• SELECT
• FROM
• WHERE
• GROUP BY
• HAVING
• CASE
• JOIN
• UNION
Windowing Extensions
• Tumbling Window
• Hopping Window
• Sliding Window
• Duration
Aggregate Functions
• SUM
• COUNT
• AVG
• MIN
• MAX
Scaling Functions
• WITH
• PARTITION BY
Date and Time Functions
• DATENAME
• DATEPART
• DAY
• MONTH
• YEAR
• DATETIMEFROMPARTS
• DATEDIFF
• DATADD
String Functions
• LEN
• CONCAT
• CHARINDEX
• SUBSTRING
• PATINDEX
Statistical Functions
• VAR
• VARP
• STDEV
• STDEVP
ADVANCED FEATURES
•Partitioning Support
• Specially useful for high scalability
•CTE-Like constructs that also helps scaling out
•Temporal aggregations
• Tumbling, Hopping and Sliding Windows
•Join between input streams
DEMO
STREAM ANALYTICS AND MACHINE LEARNING
•Apply AzureML model to streaming data
•Sample use-cases
• Fraud Detection
• Product Recommendation
• Customer Sentiment Analysis
•Right now in preview and available only through the “old”
portal
• https://manage.windowsazure.com/
DEMO
STREAM ANALYTICS ALTERNATIVE (ON AZURE)
•Apache Storm
•IaaS and not PaaS
•Much more complex to manage and develop…but much
more powerful
• https://azure.microsoft.com/en-
us/documentation/articles/stream-analytics-comparison-storm/
STREAM ANALYTICS ON-PREMISES?
•Apache Hadoop Ecosystem
• Flume / Kafka / Storm
•StreamInsight
• CEP solution part of the SQL Server Platform
•EventStore
• Javascript OpenSource CEP
•None of them has native temporal extension
ADDITIONAL RESOURCES
•Online Documentation
•Stream Analytics Reference Architecture
•Lambda Architecture
•GitHub Repository
QUESTIONS & ANSWERS
Azure Stream Analytics
TO DO LIST
Date il vostro feedback: http://aka.ms/deveval

Seguite www.azurecommunity.it
Riguardate i video su Channel 9

More Related Content

What's hot (20)

PPTX
R in Power BI
Eric Bragas
 
PPTX
Super charged prototyping
Michael Stephenson
 
PDF
Part 3 - Modern Data Warehouse with Azure Synapse
Nilesh Gule
 
PPTX
Configuration in azure done right
Rick van den Bosch
 
PPTX
Migrate a successful transactional database to azure
Ike Ellis
 
PPTX
Analyzing StackExchange data with Azure Data Lake
BizTalk360
 
PPTX
Blockchain for the DBA and Data Professional
Karen Lopez
 
PPTX
Democratizing Data Science in the Enterprise
Jesus Rodriguez
 
PPTX
BTUG - Dec 2014 - Hybrid Connectivity Options
Michael Stephenson
 
PPTX
Modern ETL: Azure Data Factory, Data Lake, and SQL Database
Eric Bragas
 
PPTX
Data modeling trends for Analytics
Ike Ellis
 
PPTX
Microsoft Azure News - Nov 2016
Daniel Toomey
 
PPTX
A lap around microsofts business intelligence platform
Ike Ellis
 
PPT
Business Intelligence with SQL Server
Peter Gfader
 
PPTX
Gateways to Power BI, Connect PowerBI.com to your On-Prem Data
Jean-Pierre Riehl
 
PDF
RightScale Webinar: Get Top Performance for Your Games
RightScale
 
PPTX
Tableau API
Dmitry Anoshin
 
PPTX
How does Microsoft solve Big Data?
James Serra
 
PPTX
Develop scalable analytical solutions with Azure Data Factory & Azure SQL Dat...
Microsoft Tech Community
 
PDF
Building Streaming And Fast Data Applications With Spark, Mesos, Akka, Cassan...
Lightbend
 
R in Power BI
Eric Bragas
 
Super charged prototyping
Michael Stephenson
 
Part 3 - Modern Data Warehouse with Azure Synapse
Nilesh Gule
 
Configuration in azure done right
Rick van den Bosch
 
Migrate a successful transactional database to azure
Ike Ellis
 
Analyzing StackExchange data with Azure Data Lake
BizTalk360
 
Blockchain for the DBA and Data Professional
Karen Lopez
 
Democratizing Data Science in the Enterprise
Jesus Rodriguez
 
BTUG - Dec 2014 - Hybrid Connectivity Options
Michael Stephenson
 
Modern ETL: Azure Data Factory, Data Lake, and SQL Database
Eric Bragas
 
Data modeling trends for Analytics
Ike Ellis
 
Microsoft Azure News - Nov 2016
Daniel Toomey
 
A lap around microsofts business intelligence platform
Ike Ellis
 
Business Intelligence with SQL Server
Peter Gfader
 
Gateways to Power BI, Connect PowerBI.com to your On-Prem Data
Jean-Pierre Riehl
 
RightScale Webinar: Get Top Performance for Your Games
RightScale
 
Tableau API
Dmitry Anoshin
 
How does Microsoft solve Big Data?
James Serra
 
Develop scalable analytical solutions with Azure Data Factory & Azure SQL Dat...
Microsoft Tech Community
 
Building Streaming And Fast Data Applications With Spark, Mesos, Akka, Cassan...
Lightbend
 

Viewers also liked (20)

PPTX
Real-Time Event & Stream Processing on MS Azure
Khalid Salama
 
PPTX
SQL Server 2016 Temporal Tables
Davide Mauri
 
PPTX
SQL Server 2016 What's New For Developers
Davide Mauri
 
PPTX
Azure Stream Analytics : Analyse Data in Motion
Ruhani Arora
 
PPTX
Azure ML: from basic to integration with custom applications
Davide Mauri
 
PPTX
SQL Server 2016 JSON
Davide Mauri
 
PPTX
Data juice
Davide Mauri
 
PPTX
Azure Machine Learning
Davide Mauri
 
PPTX
Dashboarding with Microsoft: Datazen & Power BI
Davide Mauri
 
PPTX
Azure api app métricas com application insights
Nicolas Takashi
 
PDF
Enterprise Data Workflows with Cascading and Windows Azure HDInsight
Paco Nathan
 
PPTX
Big data streaming with Apache Spark on Azure
Willem Meints
 
PPTX
Azure IOT
Maik van der Gaag
 
PPTX
SQLSaturday #230 - Introduction to Microsoft Big Data (Part 1)
Sascha Dittmann
 
PDF
Fraud Detection using Hadoop
hadooparchbook
 
PPTX
Belgian Windows Server 2012 Launch windows azure insights for the enterprise ...
Mike Martin
 
PPTX
Microsoft NYC 14
SwitchPitch
 
PPTX
Go Serverless with Azure Functions
Jim O'Neil
 
PPTX
2016-08-25 TechExeter - going serverless with Azure
Steve Lee
 
PPTX
Software scope
Shubham Dubey
 
Real-Time Event & Stream Processing on MS Azure
Khalid Salama
 
SQL Server 2016 Temporal Tables
Davide Mauri
 
SQL Server 2016 What's New For Developers
Davide Mauri
 
Azure Stream Analytics : Analyse Data in Motion
Ruhani Arora
 
Azure ML: from basic to integration with custom applications
Davide Mauri
 
SQL Server 2016 JSON
Davide Mauri
 
Data juice
Davide Mauri
 
Azure Machine Learning
Davide Mauri
 
Dashboarding with Microsoft: Datazen & Power BI
Davide Mauri
 
Azure api app métricas com application insights
Nicolas Takashi
 
Enterprise Data Workflows with Cascading and Windows Azure HDInsight
Paco Nathan
 
Big data streaming with Apache Spark on Azure
Willem Meints
 
SQLSaturday #230 - Introduction to Microsoft Big Data (Part 1)
Sascha Dittmann
 
Fraud Detection using Hadoop
hadooparchbook
 
Belgian Windows Server 2012 Launch windows azure insights for the enterprise ...
Mike Martin
 
Microsoft NYC 14
SwitchPitch
 
Go Serverless with Azure Functions
Jim O'Neil
 
2016-08-25 TechExeter - going serverless with Azure
Steve Lee
 
Software scope
Shubham Dubey
 
Ad

Similar to Azure Stream Analytics (20)

PPTX
Azure stream analytics by Nico Jacobs
ITProceed
 
PDF
Introduction to Streaming Analytics
Guido Schmutz
 
PPTX
Azure Stream Analytics
Marco Parenzan
 
PDF
Azure Stream Analytics Project : On-demand real-time analytics
Lamprini Koutsokera
 
PDF
Azure Stream Analytics Project: On-demand real-time analytics
Stratos Gounidellis
 
PDF
Unbundling the Modern Streaming Stack With Dunith Dhanushka | Current 2022
HostedbyConfluent
 
PDF
Introduction to Stream Processing
Guido Schmutz
 
PDF
[WSO2Con EU 2018] Streaming SQL in the Real World
WSO2
 
PPTX
Introduction to Azure Stream Analytics
Slava Kokaev
 
PPTX
Azure Stream Analytics
Marco Parenzan
 
PPTX
Shikha fdp 62_14july2017
Dr. Shikha Mehta
 
PDF
Introduction to Stream Processing
Guido Schmutz
 
PDF
Introduction to Stream Processing
Guido Schmutz
 
PDF
Streaming Analytics and Internet of Things - Geesara Prathap
WithTheBest
 
PDF
[WSO2Con EU 2018] The Rise of Streaming SQL
WSO2
 
PDF
1 Introduction to Microsoft data platform analytics for release
Jen Stirrup
 
PDF
Streaming analytics state of the art
Stavros Kontopoulos
 
PDF
The State of Stream Processing
confluent
 
PDF
Streaming Visualization
Guido Schmutz
 
PPTX
Stream Analytics in the Enterprise
Jesus Rodriguez
 
Azure stream analytics by Nico Jacobs
ITProceed
 
Introduction to Streaming Analytics
Guido Schmutz
 
Azure Stream Analytics
Marco Parenzan
 
Azure Stream Analytics Project : On-demand real-time analytics
Lamprini Koutsokera
 
Azure Stream Analytics Project: On-demand real-time analytics
Stratos Gounidellis
 
Unbundling the Modern Streaming Stack With Dunith Dhanushka | Current 2022
HostedbyConfluent
 
Introduction to Stream Processing
Guido Schmutz
 
[WSO2Con EU 2018] Streaming SQL in the Real World
WSO2
 
Introduction to Azure Stream Analytics
Slava Kokaev
 
Azure Stream Analytics
Marco Parenzan
 
Shikha fdp 62_14july2017
Dr. Shikha Mehta
 
Introduction to Stream Processing
Guido Schmutz
 
Introduction to Stream Processing
Guido Schmutz
 
Streaming Analytics and Internet of Things - Geesara Prathap
WithTheBest
 
[WSO2Con EU 2018] The Rise of Streaming SQL
WSO2
 
1 Introduction to Microsoft data platform analytics for release
Jen Stirrup
 
Streaming analytics state of the art
Stavros Kontopoulos
 
The State of Stream Processing
confluent
 
Streaming Visualization
Guido Schmutz
 
Stream Analytics in the Enterprise
Jesus Rodriguez
 
Ad

More from Davide Mauri (20)

PPTX
Azure serverless Full-Stack kickstart
Davide Mauri
 
PPTX
Agile Data Warehousing
Davide Mauri
 
PPTX
Dapper: the microORM that will change your life
Davide Mauri
 
PPTX
When indexes are not enough
Davide Mauri
 
PPTX
Building a Real-Time IoT monitoring application with Azure
Davide Mauri
 
PPTX
SSIS Monitoring Deep Dive
Davide Mauri
 
PPTX
Azure SQL & SQL Server 2016 JSON
Davide Mauri
 
PPTX
SQL Server & SQL Azure Temporal Tables - V2
Davide Mauri
 
PPTX
SSIS Monitoring Deep Dive
Davide Mauri
 
PPTX
AzureML - Creating and Using Machine Learning Solutions (Italian)
Davide Mauri
 
PPTX
Datarace: IoT e Big Data (Italian)
Davide Mauri
 
PPTX
Azure Machine Learning (Italian)
Davide Mauri
 
PPTX
Back to the roots - SQL Server Indexing
Davide Mauri
 
PPTX
Schema less table & dynamic schema
Davide Mauri
 
PPTX
Iris Multi-Class Classifier with Azure ML
Davide Mauri
 
PPTX
BIML: BI to the next level
Davide Mauri
 
PPTX
Agile Data Warehousing
Davide Mauri
 
PPTX
Data Science Overview
Davide Mauri
 
PPTX
Delayed durability
Davide Mauri
 
PPTX
Hekaton: In-memory tables
Davide Mauri
 
Azure serverless Full-Stack kickstart
Davide Mauri
 
Agile Data Warehousing
Davide Mauri
 
Dapper: the microORM that will change your life
Davide Mauri
 
When indexes are not enough
Davide Mauri
 
Building a Real-Time IoT monitoring application with Azure
Davide Mauri
 
SSIS Monitoring Deep Dive
Davide Mauri
 
Azure SQL & SQL Server 2016 JSON
Davide Mauri
 
SQL Server & SQL Azure Temporal Tables - V2
Davide Mauri
 
SSIS Monitoring Deep Dive
Davide Mauri
 
AzureML - Creating and Using Machine Learning Solutions (Italian)
Davide Mauri
 
Datarace: IoT e Big Data (Italian)
Davide Mauri
 
Azure Machine Learning (Italian)
Davide Mauri
 
Back to the roots - SQL Server Indexing
Davide Mauri
 
Schema less table & dynamic schema
Davide Mauri
 
Iris Multi-Class Classifier with Azure ML
Davide Mauri
 
BIML: BI to the next level
Davide Mauri
 
Agile Data Warehousing
Davide Mauri
 
Data Science Overview
Davide Mauri
 
Delayed durability
Davide Mauri
 
Hekaton: In-memory tables
Davide Mauri
 

Recently uploaded (20)

PDF
JavaScript - Good or Bad? Tips for Google Tag Manager
📊 Markus Baersch
 
PDF
Copia de Strategic Roadmap Infographics by Slidesgo.pptx (1).pdf
ssuserd4c6911
 
PDF
R Cookbook - Processing and Manipulating Geological spatial data with R.pdf
OtnielSimopiaref2
 
PDF
WEF_Future_of_Global_Fintech_Second_Edition_2025.pdf
AproximacionAlFuturo
 
PDF
How to Connect Your On-Premises Site to AWS Using Site-to-Site VPN.pdf
Tamanna
 
PPTX
ER_Model_Relationship_in_DBMS_Presentation.pptx
dharaadhvaryu1992
 
PPT
Growth of Public Expendituuure_55423.ppt
NavyaDeora
 
PPTX
Numbers of a nation: how we estimate population statistics | Accessible slides
Office for National Statistics
 
PDF
MusicVideoProjectRubric Animation production music video.pdf
ALBERTIANCASUGA
 
PPTX
apidays Singapore 2025 - Designing for Change, Julie Schiller (Google)
apidays
 
PDF
apidays Helsinki & North 2025 - APIs in the healthcare sector: hospitals inte...
apidays
 
PDF
Avatar for apidays apidays PRO June 07, 2025 0 5 apidays Helsinki & North 2...
apidays
 
PPT
Data base management system Transactions.ppt
gandhamcharan2006
 
PPTX
apidays Munich 2025 - Building Telco-Aware Apps with Open Gateway APIs, Subhr...
apidays
 
PDF
Building Production-Ready AI Agents with LangGraph.pdf
Tamanna
 
PDF
Merits and Demerits of DBMS over File System & 3-Tier Architecture in DBMS
MD RIZWAN MOLLA
 
PDF
apidays Helsinki & North 2025 - How (not) to run a Graphql Stewardship Group,...
apidays
 
PDF
apidays Helsinki & North 2025 - API-Powered Journeys: Mobility in an API-Driv...
apidays
 
PPTX
b6057ea5-8e8c-4415-90c0-ed8e9666ffcd.pptx
Anees487379
 
PDF
Choosing the Right Database for Indexing.pdf
Tamanna
 
JavaScript - Good or Bad? Tips for Google Tag Manager
📊 Markus Baersch
 
Copia de Strategic Roadmap Infographics by Slidesgo.pptx (1).pdf
ssuserd4c6911
 
R Cookbook - Processing and Manipulating Geological spatial data with R.pdf
OtnielSimopiaref2
 
WEF_Future_of_Global_Fintech_Second_Edition_2025.pdf
AproximacionAlFuturo
 
How to Connect Your On-Premises Site to AWS Using Site-to-Site VPN.pdf
Tamanna
 
ER_Model_Relationship_in_DBMS_Presentation.pptx
dharaadhvaryu1992
 
Growth of Public Expendituuure_55423.ppt
NavyaDeora
 
Numbers of a nation: how we estimate population statistics | Accessible slides
Office for National Statistics
 
MusicVideoProjectRubric Animation production music video.pdf
ALBERTIANCASUGA
 
apidays Singapore 2025 - Designing for Change, Julie Schiller (Google)
apidays
 
apidays Helsinki & North 2025 - APIs in the healthcare sector: hospitals inte...
apidays
 
Avatar for apidays apidays PRO June 07, 2025 0 5 apidays Helsinki & North 2...
apidays
 
Data base management system Transactions.ppt
gandhamcharan2006
 
apidays Munich 2025 - Building Telco-Aware Apps with Open Gateway APIs, Subhr...
apidays
 
Building Production-Ready AI Agents with LangGraph.pdf
Tamanna
 
Merits and Demerits of DBMS over File System & 3-Tier Architecture in DBMS
MD RIZWAN MOLLA
 
apidays Helsinki & North 2025 - How (not) to run a Graphql Stewardship Group,...
apidays
 
apidays Helsinki & North 2025 - API-Powered Journeys: Mobility in an API-Driv...
apidays
 
b6057ea5-8e8c-4415-90c0-ed8e9666ffcd.pptx
Anees487379
 
Choosing the Right Database for Indexing.pdf
Tamanna
 

Azure Stream Analytics

  • 3. • Microsoft SQL Server MVP • Works with SQL Server from 6.5, on BI from 2003 • Specialized in Data Solution Architecture, Database Design, Performance Tuning, High-Performance Data Warehousing, BI, Big Data • President of UGISS (Italian SQL Server UG) • Regular Speaker @ SQL Server events • Consulting & Training, Mentor @ SolidQ • E-mail: [email protected] • Twitter: @mauridb • Blog: http://sqlblog.com/blogs/davide_mauri Davide Mauri
  • 4. • COMPLEX EVENT PROCESSING • LAMBDA ARCHITECTURE • AZURE STREAM ANALYTICS • DATA INGESTION • AZURE STREAM ANALYTICS QUERY LANGUAGE • ADVANCED FEATURES • ADDITIONAL RESOURCES
  • 5. COMPLEX EVENT PROCESSING •Event processing is a method of tracking and analyzing (processing) streams of information (data) about things that happen (events) •Complex event processing, or CEP, is event processing that combines data from multiple sources to infer events or patterns that suggest more complicated circumstances. • Start to appear in 1990 • Goal: identify meaningful events (such as opportunities or threats) and respond to them as quickly as possible
  • 6. EVENT PROCESSING USE CASES • Network monitoring • Intelligence and surveillance • Risk management • E-commerce • Fraud detection • Smart order routing • Transaction cost analysis • Pricing and analytics • Market data management • Algorithmic trading • Data warehouse augmentation
  • 7. REAL TIME USE CASES http://www.digital4.biz
  • 8. LAMBDA ARCHITECTURE Generic, scalable and fault-tolerant data processing architecture […] in which low-latency reads and updates are required. http://lambda-architecture.net/
  • 9. HADOOP BUT NOT ONLY THAT! •Apache Hadoop Ecosystem is the typical solution nowadays • “Mature” Option • Flume (optional collector and streaming data movement system) • Kafka (distributed messaging system) • Storm (distributed real-time computation system) • “Innovative” Option • Spark + Spark Streaming •Very powerful, but very complex
  • 10. WHY AZURE? •Due to the high scalability and computing power that a streaming solution may require, the cloud is a perfect environment for it •Very cheap and Very Simple to start a project •Very well integrated with all other Azure offerings • From Monitoring to Power BI
  • 11. STREAM ANALYTICS •Real-Time (somehow) complex event processing engine •Enables real-time event processing in a very simple and cheap way • SQL-Like language • Temporal Semantic Support • (but different from SQL Server 2016) •Azure Only at present time
  • 12. STREAM ANALYTICS •Platform-as-a-Service • Can handle millions of events per second • Based on the REEF project (now Apache incubated) •Main objects: Job, Query, Functions, Input & Outputs • Totally manageable from a REST interface •“Streaming Units” is the base concept to manage performance, scalability and costs • Roughly 1 Streaming Units = 1 MB/Sec of throughput
  • 13. DATA INGESTION •Inputs for Stream Analytics • Streaming Sources (“Data in motion”) • JSON, CSV or AVRO • Reference Data (“Data at rest”) • JSON or CSV • Blob Store (max 50MB) •Streaming Sources • Event Hubs • IoT Hub
  • 14. DEMO
  • 15. STREAM ANALYTICS QUERY ENGINE •Take date from one or more input •Send resulting data to one or more output •Support most common data types: • bigint, float, unicode strings, datetime • key-value pairs • arrays
  • 16. STREAM ANALYTICS QUERY LANGUAGE •Stream Analytics Query Language Reference • https://msdn.microsoft.com/library/azure/dn834998.aspx •Subset of T-SQL •With specific temporal extension • Time values to be used can be set using TIMESTAMP BY directive
  • 17. STREAM ANALYTICS QUERY LANGUAGE DML Statements • SELECT • FROM • WHERE • GROUP BY • HAVING • CASE • JOIN • UNION Windowing Extensions • Tumbling Window • Hopping Window • Sliding Window • Duration Aggregate Functions • SUM • COUNT • AVG • MIN • MAX Scaling Functions • WITH • PARTITION BY Date and Time Functions • DATENAME • DATEPART • DAY • MONTH • YEAR • DATETIMEFROMPARTS • DATEDIFF • DATADD String Functions • LEN • CONCAT • CHARINDEX • SUBSTRING • PATINDEX Statistical Functions • VAR • VARP • STDEV • STDEVP
  • 18. ADVANCED FEATURES •Partitioning Support • Specially useful for high scalability •CTE-Like constructs that also helps scaling out •Temporal aggregations • Tumbling, Hopping and Sliding Windows •Join between input streams
  • 19. DEMO
  • 20. STREAM ANALYTICS AND MACHINE LEARNING •Apply AzureML model to streaming data •Sample use-cases • Fraud Detection • Product Recommendation • Customer Sentiment Analysis •Right now in preview and available only through the “old” portal • https://manage.windowsazure.com/
  • 21. DEMO
  • 22. STREAM ANALYTICS ALTERNATIVE (ON AZURE) •Apache Storm •IaaS and not PaaS •Much more complex to manage and develop…but much more powerful • https://azure.microsoft.com/en- us/documentation/articles/stream-analytics-comparison-storm/
  • 23. STREAM ANALYTICS ON-PREMISES? •Apache Hadoop Ecosystem • Flume / Kafka / Storm •StreamInsight • CEP solution part of the SQL Server Platform •EventStore • Javascript OpenSource CEP •None of them has native temporal extension
  • 24. ADDITIONAL RESOURCES •Online Documentation •Stream Analytics Reference Architecture •Lambda Architecture •GitHub Repository
  • 27. TO DO LIST Date il vostro feedback: http://aka.ms/deveval  Seguite www.azurecommunity.it Riguardate i video su Channel 9

Editor's Notes

  • #7: Ref: http://www.infoq.com/articles/stream-processing-hadoop
  • #13: https://azure.microsoft.com/en-us/documentation/articles/stream-analytics-scale-jobs/
  • #15: Simple Setup of Event Hubs, Source and Destination
  • #20: Full Demo: Stream + Reference Data Windows Functions Tumbling Hopping Sliding
  • #21: Customer Sentiment Analysis: now that companies are offering support also via Twitter this becomes more and more important