Lift SSIS package to Azure Data Factory V2Manjeet Singh
Manjeet Singh gives a presentation on lifting SSIS packages to Azure using Data Factory v2. He discusses how the Integration Runtime in ADF v2 allows existing on-premises SSIS packages to be lifted to the cloud. He demonstrates deploying a SSIS package to an Azure SQL database, running it using SQL Server Management Studio and Azure Data Factory pipelines, and provides tips on using the SSIS Integration Runtime.
This document discusses migrating SSIS packages to the cloud using the Azure-SSIS Integration Runtime (IR). It describes what the Azure-SSIS IR is, when it makes sense to migrate packages to it, and how to set up the Azure-SSIS IR. Setting up the IR involves choosing an Azure SQL database or managed instance for the SSIS catalog, configuring connections, deploying SSIS projects, and scheduling packages. Custom setups are also possible by loading external DLLs. Typical data flows in Azure Data Factory are then discussed for lifting and shifting SSIS packages to the cloud.
Virtual Global Azure 2020 - Azure MonitorPedro Sousa
This presentation was given at Global Azure 2020 Lisbon, about Azure Monitor.
This session focused on:
- steps of the Monitoring Lifecycle;
- Conceptual Architecture of Azure Monitoring;
- Data Collection & Onboarding;
- Metrics & Logs;
- Demos.
Recordings for the event sessions will be available soon.
Azure saturday Pordenone 2019 - ML.NET model lifecycle with azure devopsMarco Zamana
This document discusses integrating machine learning model lifecycles into DevOps workflows. It describes how an application lifecycle can evolve to include ML model generation, training, testing, evaluation, and automatic deployment. It provides an example of a simple ML.NET application for binary classification and discusses expanding the pipeline to include model building, testing, and deployment. Finally, it discusses improvements like dataset versioning, using databases for training data, different DevOps scenarios, model versioning, and integrating with Azure ML and MLFlow.
- Azure Data Lake makes big data easy to manage, debug, and optimize through services like Azure Data Lake Store and Azure Data Lake Analytics.
- Azure Data Lake Store provides a hyper-scale data lake that allows storing any data in its native format at unlimited scale. Azure Data Lake Analytics allows running distributed queries and analytics jobs on data stored in Data Lake Store.
- Azure Data Lake is based on open source technologies like Apache Hadoop, YARN, and provides a managed service with auto-scaling and a pay-per-use model through the Azure portal and tools like Visual Studio.
Monitoring real-life Azure applications: When to use what and whyKarl Ots
Slides from my presentation at Intelligent Cloud Conf on 29.5.2018 in Copenhagen
Modern applications leverage a variety of services, and often span across on premises, IaaS, PaaS and SaaS. Monitoring these environments is different from traditional systems. We have more and more data available from the platform with the likes of ARM Activity Logs, Azure Monitor, Log Analytics and Application Insights.
With a massive amount of signal and noise being generated in all these systems, how do we get our arms around what is happening? Is my application impacted in an ongoing Azure outage? Are my integrations intact? Which services from Azure should I use to monitor my application end-to-end? Come and hear how to answer these questions. After the session, you’ll have deeper understanding of end-to-end monitoring techniques in Azure solutions and know which services to choose for which scenario.
.
This document discusses job scheduling, SQL Database, and pricing on the Azure PaaS. It describes how to create scheduled web jobs using the Azure scheduler portal by setting the job type, schedule, and action. It also discusses monitoring web jobs, DTUs and eDTUs in SQL Database, and how to determine the number needed. The document provides an overview of migration from Oracle and SQL Server databases to Azure SQL Database using tools like SSMA and SqlPackage.exe.
Building and deploying an analytic service on Cloud is a challenge. A bigger challenge is to maintain the service. In a world where users are gravitating towards a model where cluster instances are to provisioned on the fly, in order for these to be used for analytics or other purposes, and then to have these cluster instances shut down when the jobs get done, the relevance of containers and container orchestration is more important than ever. In short Customers are looking for Serverless Spark Clusters. The Intent of this presentation is to share what is Serverless Spark and what are the benefits of running Spark in serverless manner.
Azure Monitor provides centralized monitoring of Azure resources and applications. It collects metrics, logs, and application performance monitoring data from Azure resources, the Azure platform, and on-premises sources. It provides visibility into resource performance and usage, enables alerting and automation of responses to issues. Azure Monitor features include dashboards for visualizing data, log analytics for querying and analyzing logs, and integration with other Azure services for additional monitoring capabilities like Application Insights.
Let's meet and talk about Microsoft Azure PaaS offerings. The PaaS layer provides many scalable and globally deployed services completely manged by Microsoft that allow developer to focus on specific business requirements and to leave the infrastructure bits to the cloud provider. We will underline the differences between Virtual Machines, Cloud Services and Azure Web Apps on the compute layer. Later we will compare SQL Server and Azure SQL.
Then we will focus on Data Storage and Data Analytics services that gives incredible power to developers and data professionals.
Most of the examples we cover are platform agnostic so people from any programming background are welcome to join and share their unique experience. Microsoft Azure is getting more open and open source friendly with every new day!
Come and join us to learn more about Microsoft Azure and enjoy your journey with the public cloud!
Intelligent Cloud Conference 2018 - Next Generation of Data Integration with ...Tom Kerkhove
Azure Data Factory is a hybrid data integration service in Azure that allows you to create, manage & operate data pipelines in Azure. It is a serverless orchestrator that allows you to create data pipelines to either move, transform, load data; a fully managed Extract, Transform, Load (ETL) & Extract, Load, Transform (ELT) service if you will.
In this talk I'll cover the basics of Azure Data Factory and show you how you can create, manage & operate data pipelines.
This document summarizes a presentation about mastering Azure Monitor. It introduces Azure Monitor and its components, including metrics, logs, dashboards, alerts, and workbooks. It provides a brief history of how Azure Monitor was developed. It also explains the different data sources that can be monitored like the Azure platform, Application Insights, and Log Analytics. The presentation encourages attendees to navigate the "maze" of Azure Monitor and provides resources to help learn more, including an upcoming virtual event and blog post series on monitoring.
This document discusses Amazon Relational Database Service (RDS) and Aurora Serverless on AWS. It provides an overview of RDS features including managed database services, scalability, redundancy, backup and support for MySQL, PostgreSQL, Oracle, SQL Server and Aurora. Aurora provides additional performance and fault tolerance compared to RDS. The document also mentions DynamoDB for NoSQL databases and announcements from AWS Reinvent 2017 including DynamoDB Global Tables, RDS Aurora Multi-Master and Inter Region VPC Peering. It notes that while Aurora Serverless provides scalability, there are limits and full compatibility with PostgreSQL may be delayed.
Mining public datasets using opensource tools: Zeppelin, Spark and Jujuseoul_engineer
There are plenty of public datasets out there available and the number is growing. Few recent and most useful of BigData ecosystem tools are showcased: Apache Zeppelin (incubating), Apache Spark and Juju.
This document discusses Azure Pipelines and common misconceptions about it. It notes that Azure Pipelines can be used for both cloud and on-premises workloads, not just Microsoft technologies, and that maintaining agents is simplified. The document traces the history of Azure Pipelines and its predecessors. It promotes the benefits of defining pipelines in YAML, including storing them in source control, easy copying between repos, and support in Visual Studio Code. Future improvements may include multi-stage pipelines and releasing directly to environments using YAML.
Sergii Bielskyi "Using Kafka and Azure Event hub together for streaming Big d...Lviv Startup Club
Using Kafka and Azure Event hub together for streaming Big data
- Azure Event Hub is a managed streaming data ingestion service that can be used with Kafka. It provides integration with other Azure services and auto-scaling.
- Kafka can be deployed on-premises or on Azure. When deployed on Azure, it uses managed disks for storage. When integrated with Event Hubs, Kafka clients can publish/subscribe to Event Hubs namespaces.
- Event Hubs and Kafka both can be used for messaging, activity tracking, data aggregation, and transformation through stream processing of big data streams.
Must Know Azure Kubernetes Best Practices And Features For Better Resiliency ...CodeOps Technologies LLP
Running day-1 Ops on your Kubernetes is somewhat easy, but it is quite daunting to manage day two challenges. Learn about AKS best practices for your cloud-native applications so that you can avoid blow up your workloads.
Cloud-Based Event Stream Processing Architectures and Patterns with Apache Ka...HostedbyConfluent
The Apache Kafka ecosystem is very rich with components and pieces that make for designing and implementing secure, efficient, fault-tolerant and scalable event stream processing (ESP) systems. Using real-world examples, this talk covers why Apache Kafka is an excellent choice for cloud-native and hybrid architectures, how to go about designing, implementing and maintaining ESP systems, best practices and patterns for migrating to the cloud or hybrid configurations, when to go with PaaS or IaaS, what options are available for running Kafka in cloud or hybrid environments and what you need to build and maintain successful ESP systems that are secure, performant, reliable, highly-available and scalable.
Event driven workloads on Kubernetes with KEDANilesh Gule
Slide deck of the presentation done at the Pune User Group on 27th February 2021. Demonstrate how Kubernetes based event driven autoscaling (KEDA) can be used with RabbitMQ as the event source.
This document discusses how to architect cloud applications using Azure Functions to save costs up to 90%. It introduces Azure Functions and describes how it allows running small pieces of code in the cloud without needing dedicated servers. It covers types of triggers for Functions and pricing plans. It also discusses using Durable Functions to manage state and common patterns like function chaining. It demonstrates how costs can be reduced compared to always-on web apps by paying per use and scaling on demand with Functions.
Cloud solutions could not be best solution if it is not chosen. One factor businesses deviates from cloud solutions is unawareness of getting best out of cloud solutions with increasing efficiency.
This presentation addresses gaps between discussion had at the global azure bootcamp New Jersey.
Azure Automation delivers cloud-based automation, operating system updates, and configuration service that supports consistent management across your Azure and non-Azure environments. It includes process automation, configuration management, update management, shared capabilities, and heterogeneous features.
Monitor Azure HDInsight with Azure Log AnalyticsAshish Thapliyal
This document lists various tools and services for monitoring an HDInsight Hadoop cluster deployed on Azure. It includes tools for monitoring application health and status, Yarn and Tez UI, Grafana for metrics, Spark history server, and HBase UI. It also describes using the Operations Management Suite (OMS) agent to collect logs and metrics from HDInsight nodes and services to analyze in Log Analytics.
Azure satpn19 time series analytics with azure adxRiccardo Zamana
The document discusses Azure Data Explorer (ADX), a fully managed data analytics service for real-time analysis on large volumes of data. It provides an overview of ADX, describing its key features such as fast query performance, optimized ingestion for streaming data, and its ability to enable data exploration. Examples of typical use cases for ADX including telemetry analytics and providing a backend for multi-tenant SaaS solutions are also presented. The document then dives into various ADX concepts like clusters, databases, ingestion techniques, supported data formats, and language examples to help users get started with the service.
Azure Databricks - An Introduction 2019 Roadshow.pptxpascalsegoul
Structure proposée du PowerPoint
1. Introduction au contexte
Objectif métier
Pourquoi Snowflake ?
Pourquoi Data Vault ?
2. Architecture cible
Schéma simplifié : zone RAW → Data Vault → Data Marts
Description des schémas : RAW, DV, DM
3. Données sources
Exemple : fichier CSV de commandes (client, produit, date, montant, etc.)
Structure des fichiers
4. Zone de staging (RAW)
CREATE STAGE
COPY INTO → vers table RAW
Screenshot du script SQL + résultat
5. Création des HUBs
HUB_CLIENT, HUB_PRODUIT…
Définition métier
Script SQL avec INSERT DISTINCT
6. Création des LINKS
LINK_COMMANDE (Client ↔ Produit ↔ Date)
Structure avec clés techniques
Script SQL + logique métier
7. Création des SATELLITES
SAT_CLIENT_DETAILS, SAT_PRODUIT_DETAILS…
Historisation avec LOAD_DATE, END_DATE, HASH_DIFF
Script SQL (MERGE ou INSERT conditionnel)
8. Orchestration
Exemple de flux via dbt ou Airflow (ou simplement séquence SQL)
Screenshot modèle YAML dbt ou DAG Airflow
9. Création des vues métiers (DM)
Vue agrégée des ventes mensuelles
SELECT complexe sur HUB + LINK + SAT
Screenshot ou exemple de résultat
10. Visualisation
Connexion à Power BI / Tableau
Screenshot d’un graphique simple basé sur une vue DM
11. Conclusion et bénéfices
Fiabilité, auditabilité, versioning, historique
Adapté aux environnements de production
Building and deploying an analytic service on Cloud is a challenge. A bigger challenge is to maintain the service. In a world where users are gravitating towards a model where cluster instances are to provisioned on the fly, in order for these to be used for analytics or other purposes, and then to have these cluster instances shut down when the jobs get done, the relevance of containers and container orchestration is more important than ever. In short Customers are looking for Serverless Spark Clusters. The Intent of this presentation is to share what is Serverless Spark and what are the benefits of running Spark in serverless manner.
Azure Monitor provides centralized monitoring of Azure resources and applications. It collects metrics, logs, and application performance monitoring data from Azure resources, the Azure platform, and on-premises sources. It provides visibility into resource performance and usage, enables alerting and automation of responses to issues. Azure Monitor features include dashboards for visualizing data, log analytics for querying and analyzing logs, and integration with other Azure services for additional monitoring capabilities like Application Insights.
Let's meet and talk about Microsoft Azure PaaS offerings. The PaaS layer provides many scalable and globally deployed services completely manged by Microsoft that allow developer to focus on specific business requirements and to leave the infrastructure bits to the cloud provider. We will underline the differences between Virtual Machines, Cloud Services and Azure Web Apps on the compute layer. Later we will compare SQL Server and Azure SQL.
Then we will focus on Data Storage and Data Analytics services that gives incredible power to developers and data professionals.
Most of the examples we cover are platform agnostic so people from any programming background are welcome to join and share their unique experience. Microsoft Azure is getting more open and open source friendly with every new day!
Come and join us to learn more about Microsoft Azure and enjoy your journey with the public cloud!
Intelligent Cloud Conference 2018 - Next Generation of Data Integration with ...Tom Kerkhove
Azure Data Factory is a hybrid data integration service in Azure that allows you to create, manage & operate data pipelines in Azure. It is a serverless orchestrator that allows you to create data pipelines to either move, transform, load data; a fully managed Extract, Transform, Load (ETL) & Extract, Load, Transform (ELT) service if you will.
In this talk I'll cover the basics of Azure Data Factory and show you how you can create, manage & operate data pipelines.
This document summarizes a presentation about mastering Azure Monitor. It introduces Azure Monitor and its components, including metrics, logs, dashboards, alerts, and workbooks. It provides a brief history of how Azure Monitor was developed. It also explains the different data sources that can be monitored like the Azure platform, Application Insights, and Log Analytics. The presentation encourages attendees to navigate the "maze" of Azure Monitor and provides resources to help learn more, including an upcoming virtual event and blog post series on monitoring.
This document discusses Amazon Relational Database Service (RDS) and Aurora Serverless on AWS. It provides an overview of RDS features including managed database services, scalability, redundancy, backup and support for MySQL, PostgreSQL, Oracle, SQL Server and Aurora. Aurora provides additional performance and fault tolerance compared to RDS. The document also mentions DynamoDB for NoSQL databases and announcements from AWS Reinvent 2017 including DynamoDB Global Tables, RDS Aurora Multi-Master and Inter Region VPC Peering. It notes that while Aurora Serverless provides scalability, there are limits and full compatibility with PostgreSQL may be delayed.
Mining public datasets using opensource tools: Zeppelin, Spark and Jujuseoul_engineer
There are plenty of public datasets out there available and the number is growing. Few recent and most useful of BigData ecosystem tools are showcased: Apache Zeppelin (incubating), Apache Spark and Juju.
This document discusses Azure Pipelines and common misconceptions about it. It notes that Azure Pipelines can be used for both cloud and on-premises workloads, not just Microsoft technologies, and that maintaining agents is simplified. The document traces the history of Azure Pipelines and its predecessors. It promotes the benefits of defining pipelines in YAML, including storing them in source control, easy copying between repos, and support in Visual Studio Code. Future improvements may include multi-stage pipelines and releasing directly to environments using YAML.
Sergii Bielskyi "Using Kafka and Azure Event hub together for streaming Big d...Lviv Startup Club
Using Kafka and Azure Event hub together for streaming Big data
- Azure Event Hub is a managed streaming data ingestion service that can be used with Kafka. It provides integration with other Azure services and auto-scaling.
- Kafka can be deployed on-premises or on Azure. When deployed on Azure, it uses managed disks for storage. When integrated with Event Hubs, Kafka clients can publish/subscribe to Event Hubs namespaces.
- Event Hubs and Kafka both can be used for messaging, activity tracking, data aggregation, and transformation through stream processing of big data streams.
Must Know Azure Kubernetes Best Practices And Features For Better Resiliency ...CodeOps Technologies LLP
Running day-1 Ops on your Kubernetes is somewhat easy, but it is quite daunting to manage day two challenges. Learn about AKS best practices for your cloud-native applications so that you can avoid blow up your workloads.
Cloud-Based Event Stream Processing Architectures and Patterns with Apache Ka...HostedbyConfluent
The Apache Kafka ecosystem is very rich with components and pieces that make for designing and implementing secure, efficient, fault-tolerant and scalable event stream processing (ESP) systems. Using real-world examples, this talk covers why Apache Kafka is an excellent choice for cloud-native and hybrid architectures, how to go about designing, implementing and maintaining ESP systems, best practices and patterns for migrating to the cloud or hybrid configurations, when to go with PaaS or IaaS, what options are available for running Kafka in cloud or hybrid environments and what you need to build and maintain successful ESP systems that are secure, performant, reliable, highly-available and scalable.
Event driven workloads on Kubernetes with KEDANilesh Gule
Slide deck of the presentation done at the Pune User Group on 27th February 2021. Demonstrate how Kubernetes based event driven autoscaling (KEDA) can be used with RabbitMQ as the event source.
This document discusses how to architect cloud applications using Azure Functions to save costs up to 90%. It introduces Azure Functions and describes how it allows running small pieces of code in the cloud without needing dedicated servers. It covers types of triggers for Functions and pricing plans. It also discusses using Durable Functions to manage state and common patterns like function chaining. It demonstrates how costs can be reduced compared to always-on web apps by paying per use and scaling on demand with Functions.
Cloud solutions could not be best solution if it is not chosen. One factor businesses deviates from cloud solutions is unawareness of getting best out of cloud solutions with increasing efficiency.
This presentation addresses gaps between discussion had at the global azure bootcamp New Jersey.
Azure Automation delivers cloud-based automation, operating system updates, and configuration service that supports consistent management across your Azure and non-Azure environments. It includes process automation, configuration management, update management, shared capabilities, and heterogeneous features.
Monitor Azure HDInsight with Azure Log AnalyticsAshish Thapliyal
This document lists various tools and services for monitoring an HDInsight Hadoop cluster deployed on Azure. It includes tools for monitoring application health and status, Yarn and Tez UI, Grafana for metrics, Spark history server, and HBase UI. It also describes using the Operations Management Suite (OMS) agent to collect logs and metrics from HDInsight nodes and services to analyze in Log Analytics.
Azure satpn19 time series analytics with azure adxRiccardo Zamana
The document discusses Azure Data Explorer (ADX), a fully managed data analytics service for real-time analysis on large volumes of data. It provides an overview of ADX, describing its key features such as fast query performance, optimized ingestion for streaming data, and its ability to enable data exploration. Examples of typical use cases for ADX including telemetry analytics and providing a backend for multi-tenant SaaS solutions are also presented. The document then dives into various ADX concepts like clusters, databases, ingestion techniques, supported data formats, and language examples to help users get started with the service.
Azure Databricks - An Introduction 2019 Roadshow.pptxpascalsegoul
Structure proposée du PowerPoint
1. Introduction au contexte
Objectif métier
Pourquoi Snowflake ?
Pourquoi Data Vault ?
2. Architecture cible
Schéma simplifié : zone RAW → Data Vault → Data Marts
Description des schémas : RAW, DV, DM
3. Données sources
Exemple : fichier CSV de commandes (client, produit, date, montant, etc.)
Structure des fichiers
4. Zone de staging (RAW)
CREATE STAGE
COPY INTO → vers table RAW
Screenshot du script SQL + résultat
5. Création des HUBs
HUB_CLIENT, HUB_PRODUIT…
Définition métier
Script SQL avec INSERT DISTINCT
6. Création des LINKS
LINK_COMMANDE (Client ↔ Produit ↔ Date)
Structure avec clés techniques
Script SQL + logique métier
7. Création des SATELLITES
SAT_CLIENT_DETAILS, SAT_PRODUIT_DETAILS…
Historisation avec LOAD_DATE, END_DATE, HASH_DIFF
Script SQL (MERGE ou INSERT conditionnel)
8. Orchestration
Exemple de flux via dbt ou Airflow (ou simplement séquence SQL)
Screenshot modèle YAML dbt ou DAG Airflow
9. Création des vues métiers (DM)
Vue agrégée des ventes mensuelles
SELECT complexe sur HUB + LINK + SAT
Screenshot ou exemple de résultat
10. Visualisation
Connexion à Power BI / Tableau
Screenshot d’un graphique simple basé sur une vue DM
11. Conclusion et bénéfices
Fiabilité, auditabilité, versioning, historique
Adapté aux environnements de production
Data saturday Oslo Azure Purview Erwin de KreukErwin de Kreuk
Azure Purview provides unified data governance capabilities including automated data discovery, classification, and lineage visualization. It helps organizations overcome data governance silos, comply with regulations, and increase data agility. The key components of Azure Purview include the Data Map for automated metadata extraction and lineage, the Data Catalog for data discovery and governance, and Insights for monitoring data usage. It supports governance of data across cloud and on-premises environments in a serverless and fully managed platform.
Slides from my talk at Big Data Conference 2018 in Vilnius
Doing data science today is far more difficult than it will be in the next 5-10 years. Sharing, collaborating on data science workflows in painful, pushing models into production is challenging.
Let’s explore what Azure provides to ease Data Scientists’ pains. What tools and services can we choose based on a problem definition, skillset or infrastructure requirements?
In this talk, you will learn about Azure Machine Learning Studio, Azure Databricks, Data Science Virtual Machines and Cognitive Services, with all the perks and limitations.
Datasaturday Pordenone Azure Purview Erwin de KreukErwin de Kreuk
Azure Purview is Microsoft's solution for unified data governance. It includes three main components:
1. The Purview Data Map automates metadata scanning and lineage identification across hybrid data stores and applies over 100 classifiers and Microsoft sensitivity labels.
2. The Purview Data Catalog enables effortless discovery through semantic search and a business glossary, and shows data lineage with sources, owners, and transformations.
3. Purview Insights provides reports on assets, scans, the glossary, classification, and sensitive data labeling to give visibility into data usage across the estate.
This document discusses the future of data and the Azure data ecosystem. It highlights that by 2025 there will be 175 zettabytes of data in the world and the average person will have over 5,000 digital interactions per day. It promotes Azure services like Power BI, Azure Synapse Analytics, Azure Data Factory and Azure Machine Learning for extracting value from data through analytics, visualization and machine learning. The document provides overviews of key Azure data and analytics services and how they fit together in an end-to-end data platform for business intelligence, artificial intelligence and continuous intelligence applications.
This document discusses Azure Machine Learning services for data scientists. It provides an overview of Azure Machine Learning Studio for building and deploying machine learning models with over 100 modules. Numbers show hundreds of thousands of deployed models serving billions of requests. It also discusses Azure Batch AI for scalable machine learning training without managing infrastructure, and Azure Databricks for Apache Spark as a managed service on Azure. The document outlines the machine learning development lifecycle supported in Azure and tools for experimentation, model management, and operationalization of models.
The document discusses Azure Data Factory and its capabilities for cloud-first data integration and transformation. ADF allows orchestrating data movement and transforming data at scale across hybrid and multi-cloud environments using a visual, code-free interface. It provides serverless scalability without infrastructure to manage along with capabilities for lifting and running SQL Server Integration Services packages in Azure.
Data weekender4.2 azure purview erwin de kreukErwin de Kreuk
This document provides information about Azure Purview and its capabilities for unified data governance. It discusses:
- Azure Purview allows for automated discovery of data across on-premises, multicloud and SaaS sources through its data map. It enables classification, lineage tracking and compliance.
- The data catalog provides semantic search and browse capabilities along with a business glossary and data lineage visualizations.
- Insights features provide reporting on assets, scans, the business glossary, classifications and labeling to give visibility into data usage across the organization.
- The document demonstrates registering and scanning a Power BI tenant to discover data with Azure Purview.
The document discusses building an end-to-end analytic solution in the cloud using Microsoft Azure tools, including ingesting data from various sources into Azure Data Factory, storing it in Azure Data Lake, transforming the data using U-SQL scripts in Azure Data Lake Analytics, developing predictive models with Azure Machine Learning Studio, and visualizing insights with Power BI. It provides examples of how each tool in the analytic lifecycle can be leveraged as part of an overall cloud-based analytics solution handling large volumes of data.
This document provides an overview of a course on implementing a modern data platform architecture using Azure services. The course objectives are to understand cloud and big data concepts, the role of Azure data services in a modern data platform, and how to implement a reference architecture using Azure data services. The course will provide an ARM template for a data platform solution that can address most data challenges.
Dans cette session nous vous présenterons les différentes manières d'utiliser SQL Server dans une infrastructure Cloud (Microsoft Azure). Seront présentés des scénarios hybrides, de migration, de backup, et d'hébergement de bases de données SQL Server en mode IaaS ou PaaS.
Azure Databricks - An Introduction (by Kris Bock)Daniel Toomey
Azure Databricks is a fast, easy to use, and collaborative Apache Spark-based analytics platform optimized for Azure. It allows for interactive collaboration through a unified workspace, enables sharing of insights through integration with Power BI, and provides native integration with other Azure services. It also offers enterprise-grade security through integration with Azure Active Directory and compliance features.
DataMinds 2022 Azure Purview Erwin de KreukErwin de Kreuk
Azure Purview is Microsoft's solution for data governance and data lineage. It provides unified data governance across on-premises, multi-cloud and Software as a Service data sources. Azure Purview consists of three main components - the Data Map automates metadata extraction and data lineage, the Data Catalog enables effortless discovery, and Data Insights provides governance over data usage. It is a fully managed cloud service that eliminates the need for manual or homegrown data governance solutions.
Praveen Nair is a program director at Adfolks LLC and formerly held roles at Orion Business Innovation and PIT Solutions. He is a Microsoft MVP and certified in various Microsoft, PMP, and CSPO programs. Azure Monitor is a monitoring solution that collects, analyzes, and acts on telemetry data from Azure and on-premises environments. It helps maximize application performance and availability and proactively identify problems. Azure Monitor provides a unified view of applications, infrastructure, and networks using collected metrics and logs analyzed with Kusto query language.
Migrating on premises workload to azure sql databasePARIKSHIT SAVJANI
This document provides an overview of migrating databases from on-premises SQL Server to Azure SQL Database Managed Instance. It discusses why companies are moving to the cloud, challenges with migration, and the tools and services available to help with assessment and migration including Data Migration Service. Key steps in the migration workflow include assessing the database and application, addressing compatibility issues, and deploying the converted schema to Managed Instance which provides high compatibility with on-premises SQL Server in a fully managed platform as a service model.
Ho-Ho-Hold onto Your Hats! Real-Time Data Magic from Santa’s Sleigh with Azur...Callon Campbell
This holiday season, unwrap the gift of a jolly technical presentation on constructing a real-time medallion architecture tailored for telemetry data from Santa’s sleigh. This merry session will showcase how to leverage Azure Data Explorer and Microsoft Fabric Real-Time Intelligence to ingest, process, and visualize high-velocity data streams as Santa dashes through the night.
Azure Data Engineer Online Training | Microsoft Azure Data Engineereshwarvisualpath
Visualpath is one of the Best Azure Data Engineer Online Training. providing azure data engineer training with real-time Projects with highly skilled and certified trainers. Enroll for a Free Demo. Call us: - +91-9989971070.
Visit: https://www.visualpath.in/online-azure-data-engineer-course.html
Visit: https://visualpathblogs.com/
Join Us Whatsapp : https://www.whatsapp.com/catalog/919989971070/
How to Architect a Serverless Cloud Data Lake for Enhanced Data AnalyticsInformatica
This presentation is geared toward enterprise architects and senior IT leaders looking to drive more value from their data by learning about cloud data lake management.
As businesses focus on leveraging big data to drive digital transformation, technology leaders are struggling to keep pace with the high volume of data coming in at high speed and rapidly evolving technologies. What's needed is an approach that helps you turn petabytes into profit.
Cloud data lakes and cloud data warehouses have emerged as a popular architectural pattern to support next-generation analytics. Informatica's comprehensive AI-driven cloud data lake management solution natively ingests, streams, integrates, cleanses, governs, protects and processes big data workloads in multi-cloud environments.
Please leave any questions or comments below.
Telangana State, India’s newest state that was carved from the erstwhile state of Andhra
Pradesh in 2014 has launched the Water Grid Scheme named as ‘Mission Bhagiratha (MB)’
to seek a permanent and sustainable solution to the drinking water problem in the state. MB is
designed to provide potable drinking water to every household in their premises through
piped water supply (PWS) by 2018. The vision of the project is to ensure safe and sustainable
piped drinking water supply from surface water sources
AI Competitor Analysis: How to Monitor and Outperform Your CompetitorsContify
AI competitor analysis helps businesses watch and understand what their competitors are doing. Using smart competitor intelligence tools, you can track their moves, learn from their strategies, and find ways to do better. Stay smart, act fast, and grow your business with the power of AI insights.
For more information please visit here https://www.contify.com/
By James Francis, CEO of Paradigm Asset Management
In the landscape of urban safety innovation, Mt. Vernon is emerging as a compelling case study for neighboring Westchester County cities. The municipality’s recently launched Public Safety Camera Program not only represents a significant advancement in community protection but also offers valuable insights for New Rochelle and White Plains as they consider their own safety infrastructure enhancements.
Thingyan is now a global treasure! See how people around the world are search...Pixellion
We explored how the world searches for 'Thingyan' and 'သင်္ကြန်' and this year, it’s extra special. Thingyan is now officially recognized as a World Intangible Cultural Heritage by UNESCO! Dive into the trends and celebrate with us!
2. #azuresatpn
Nice to meet you
Riccardo Perico | [email protected] | @R1k91
SolidQ
Data Platform & BI Specialist
10 years working, training and speaking in Microsoft «Data Realm»
MCP: MTA, MCSA
https://www.linkedin.com/in/riccardo-perico-8b942384/
5. #azuresatpn
What ADF really is?
Cloud based
Data
integration
service
Orchestrates &
Automates
Data
movement and
transformation
Allows
Monitoring
and Debugging
Programmable
7. #azuresatpn
Sample Workflow
On-premises
data mart
Customer
web logs
Product table
Azure DB
Product
recommendations
Visualize
Azure Blob storage
Customer web
Logs
Product table
Data set
(Collection of files,
DB table, etc.)
Pipeline: A sequence of
activities (logical group)
Activity: A processing step
(Hadoop job, custom code, ML model, etc.)
…
Data sources Ingest Transform and analyze Publish
Combined
input table
Mapping
Transform,
combine, etc. Analyze Move
8. #azuresatpn
Devices Device Connectivity Storage Analytics Presentation & Action
Event Hubs SQL Database
Machine
Learning
App Service
IoT Hubs
Table/Blob
Storage
Stream Analytics Power BI
Service Bus Cosmos DB HDInsight
Notification
Hubs
External Data
Sources
External Data
Sources
Data Factory Mobile Services
BizTalk Services
Data Lake
Analytics
11. #azuresatpn
Activities & Pipelines
An Activity is a single task in workflow:
• Copy from input to output
• Transform
• C#
• Stored Procedure
• Hadoop (Map/Reduce, Hive, Pig)
• ML, Data Lake Analytics
• Databricks
• Control
• IF, ForEach, Until, Wait, Execute Pipeline
• Web
Pipeline groups activities
SQL
Serve
r
SQL
DB
SQL
Server
VMs
12. #azuresatpn
Integration Runtime
• Bridge between Activity and Linked Service
• Compute environment where activity runs or it’s dispatched from
3 types of IR:
• IR Azure
• IR Self-hosted
• IR Azure-SSIS
14. #azuresatpn
ADF Location vs IR Location
• ADF location metadata store and triggering pipeline start
• IR location backend compute engine location (data movement,
activity dispatch and SSIS execution)
ADF Location and IR location could be different
IR can use “Auto Resolve”
15. #azuresatpn
Mapping Data Flows
• Based on Spark
• Use Databricks behind the scene
• A lot of transformations already available
• Few sources available for now
• This week GA announced!
17. #azuresatpn
Developer Tools
• Azure Portal: Create, Edit. Visual and Textual
• Visual Studio: Integrated in VS project
• Powershell: cmdlets https://docs.microsoft.com/en-
us/powershell/module/azurerm.datafactories/?view=azurermps-
6.13.0
• Azure RM Template
18. #azuresatpn
Pricing
Multiple factors affect pricing
• Number of Activities run
• Volume of data moved
• SQL Server Integration Services Compute Hours
• Whether you re-running an activity
https://azure.microsoft.com/en-us/pricing/details/data-factory/v2/
#7: Enterprises have data of various types that are located in disparate sources on-premises, in the cloud, structured, unstructured, and semi-structured, all arriving at different intervals and speeds.
The first step in building an information production system is to connect to all the required sources.
Without Data Factory, enterprises must build custom data movement components, they often lack the enterprise-grade monitoring, alerting, and the controls that a fully managed service can offer.
After data is present in a centralized data store in the cloud, process or transform the collected data by using compute services such as HDInsight Hadoop, Spark, Data Lake Analytics, and Machine Learning.
After the raw data has been refined into a business-ready consumable form, load the data into Azure Data Warehouse, Azure SQL Database, Azure CosmosDB, or whichever analytics engine your business users can point to from their business intelligence tools.
#11: Data sets identify the data from different data stores.
#14: Azure: public accessible endpoints, serverless, fully managed, pay for use only, scaled up automagically according to copy activity properties
Self-hosted: everything works in a private network behind corporate firewall, only HTTP outbound. A Windows server is needed and IR must be installed. Supports active-active load balancing.
Azure SSIS: Set of VMs natively executes SSIS. Supports BYO SSISDB on Azure SQL DB or Managed Instance. To On-prem use Azure Virtual Network with VPN site-to-site.
#16: Mapping Data Flows are visually designed data transformations in Azure Data Factory
#17: Copy activity from S3 to SQL
Rest to SQL
Datasets transform with SP vs MDF
Trigger & Monitor