Say goodbye to data silos! Analytics in a Day will simplify and accelerate your journey towards the modern data warehouse. Join CCG and Microsoft for a half-day virtual workshop, hosted by James McAuliffe.
The document discusses the challenges of maintaining separate data lake and data warehouse systems. It notes that businesses need to integrate these areas to overcome issues like managing diverse workloads, providing consistent security and user management across uses cases, and enabling data sharing between data science and business analytics teams. An integrated system is needed that can support both structured analytics and big data/semi-structured workloads from a single platform.
Azure Synapse Analytics is Azure SQL Data Warehouse evolved: a limitless analytics service, that brings together enterprise data warehousing and Big Data analytics into a single service. It gives you the freedom to query data on your terms, using either serverless on-demand or provisioned resources, at scale. Azure Synapse brings these two worlds together with a unified experience to ingest, prepare, manage, and serve data for immediate business intelligence and machine learning needs. This is a huge deck with lots of screenshots so you can see exactly how it works.
Analytics in a Day Ft. Synapse Virtual WorkshopCCG
Say goodbye to data silos! Analytics in a Day will simplify and accelerate your journey towards the modern data warehouse. Join CCG and Microsoft for a half-day virtual workshop, hosted by James McAuliffe.
Analytics in a Day Ft. Synapse Virtual WorkshopCCG
Say goodbye to data silos! Analytics in a Day will simplify and accelerate your journey towards the modern data warehouse. Join CCG and Microsoft for a half-day virtual workshop, hosted by James McAuliffe.
Embarking on building a modern data warehouse in the cloud can be an overwhelming experience due to the sheer number of products that can be used, especially when the use cases for many products overlap others. In this talk I will cover the use cases of many of the Microsoft products that you can use when building a modern data warehouse, broken down into four areas: ingest, store, prep, and model & serve. It’s a complicated story that I will try to simplify, giving blunt opinions of when to use what products and the pros/cons of each.
Say goodbye to data silos! Analytics in a Day will simplify and accelerate your journey towards the modern data warehouse. Join CCG and Microsoft for a two-day virtual workshop, hosted by James McAuliffe.
The breath and depth of Azure products that fall under the AI and ML umbrella can be difficult to follow. In this presentation I’ll first define exactly what AI, ML, and deep learning is, and then go over the various Microsoft AI and ML products and their use cases.
Data Quality in the Data Hub with RedPointGlobalCaserta
At a Big Data Warehousing Meetup, George Corugedo, CTO of RedPoint Global demonstrated how to use your big data platform for data integration, data quality and identity resolution to provide a true 360 degree view of your customer on Hadoop using the RedPoint product.
For more information or questions, please contact us at www.casertaconcepts.com.
Ai & Data Analytics 2018 - Azure Databricks for data scientistAlberto Diaz Martin
This document summarizes a presentation given by Alberto Diaz Martin on Azure Databricks for data scientists. The presentation covered how Databricks can be used for infrastructure management, data exploration and visualization at scale, reducing time to value through model iterations and integrating various ML tools. It also discussed challenges for data scientists and how Databricks addresses them through features like notebooks, frameworks, and optimized infrastructure for deep learning. Demo sections showed EDA, ML pipelines, model export, and deep learning modeling capabilities in Databricks.
IlOUG Tech Days 2016 - Unlock the Value in your Data Reservoir using Oracle B...Mark Rittman
Mark Rittman from Rittman Mead presented on Oracle Big Data Discovery. He discussed how many organizations are running big data initiatives involving loading large amounts of raw data into data lakes for analysis. Oracle Big Data Discovery provides a visual interface for exploring, analyzing, and transforming this raw data. It allows users to understand relationships in the data, perform enrichments, and prepare the data for use in tools like Oracle Business Intelligence.
Data weekender4.2 azure purview erwin de kreukErwin de Kreuk
This document provides information about Azure Purview and its capabilities for unified data governance. It discusses:
- Azure Purview allows for automated discovery of data across on-premises, multicloud and SaaS sources through its data map. It enables classification, lineage tracking and compliance.
- The data catalog provides semantic search and browse capabilities along with a business glossary and data lineage visualizations.
- Insights features provide reporting on assets, scans, the business glossary, classifications and labeling to give visibility into data usage across the organization.
- The document demonstrates registering and scanning a Power BI tenant to discover data with Azure Purview.
Analytics in a Day Ft. Synapse Virtual WorkshopCCG
Say goodbye to data silos! Analytics in a Day will simplify and accelerate your journey towards the modern data warehouse. Join CCG and Microsoft for a half-day virtual workshop, hosted by James McAuliffe.
Big Data Analytics in the Cloud with Microsoft AzureMark Kromer
Big Data Analytics in the Cloud using Microsoft Azure services was discussed. Key points included:
1) Azure provides tools for collecting, processing, analyzing and visualizing big data including Azure Data Lake, HDInsight, Data Factory, Machine Learning, and Power BI. These services can be used to build solutions for common big data use cases and architectures.
2) U-SQL is a language for preparing, transforming and analyzing data that allows users to focus on the what rather than the how of problems. It uses SQL and C# and can operate on structured and unstructured data.
3) Visual Studio provides an integrated environment for authoring, debugging, and monitoring U-SQL scripts and jobs. This allows
This document discusses techniques for optimizing Power BI performance. It recommends tracing queries using DAX Studio to identify slow queries and refresh times. Tracing tools like SQL Profiler and log files can provide insights into issues occurring in the data sources, Power BI layer, and across the network. Focusing on optimization by addressing wait times through a scientific process can help resolve long-term performance problems.
Organizations have been collecting, storing, and accessing data from the beginning of computerization. Insights gained from analyzing the data enable them to identify new opportunities, improve core processes, enable continuous learning and differentiation, remain competitive, and thrive in an increasingly challenging business environment.
The well-established data architecture, consisting of a data warehouse, fed from multiple operational data stores, and fronted by BI tools, has served most organizations well. However, over the last two decades, with the explosion of internet-scale data, and the advent of new approaches to data and computational processing, this tried-and-true data architecture has come under strain, and has created both challenges and opportunities for organizations.
In this green paper, we will discuss modern approaches to data architecture that have evolved to address these challenges and provide a framework for companies to build a data architecture and better adapt to increasing demands of the modern business environment. This discussion of data architecture will be tied to the Data Maturity Journey introduced in EQengineered’s June 2021 green paper on Data Modernization.
DataOps for the Modern Data Warehouse on Microsoft Azure @ NDCOslo 2020 - Lac...Lace Lofranco
Talk Description:
The Modern Data Warehouse architecture is a response to the emergence of Big Data, Machine Learning and Advanced Analytics. DevOps is a key aspect of successfully operationalising a multi-source Modern Data Warehouse.
While there are many examples of how to build CI/CD pipelines for traditional applications, applying these concepts to Big Data Analytical Pipelines is a relatively new and emerging area. In this demo heavy session, we will see how to apply DevOps principles to an end-to-end Data Pipeline built on the Microsoft Azure Data Platform with technologies such as Data Factory, Databricks, Data Lake Gen2, Azure Synapse, and AzureDevOps.
Resources: https://aka.ms/mdw-dataops
Data warehouse con azure synapse analyticsEduardo Castro
Azure Synapse is the evolution of Azure SQL Data Warehouse, combining big data, data storage and data integration into a single service for end-to-end cloud scale analytics. It provides unlimited analytics with unparalleled speed to gain insights. Azure Synapse brings together enterprise data warehousing and big data analytics to give a unified experience with the advantages of both worlds.
Big Data and Data Warehousing Together with Azure Synapse Analytics (SQLBits ...Michael Rys
SQLBits 2020 presentation on how you can build solutions based on the modern data warehouse pattern with Azure Synapse Spark and SQL including demos of Azure Synapse.
This document discusses developing analytics applications using machine learning on Azure Databricks and Apache Spark. It begins with an introduction to Richard Garris and the agenda. It then covers the data science lifecycle including data ingestion, understanding, modeling, and integrating models into applications. Finally, it demonstrates end-to-end examples of predicting power output, scoring leads, and predicting ratings from reviews.
A modern data warehouse lets you bring together all your data at any scale easily, and to get insights through analytical dashboards, operational reports, or advanced analytics for all your users.
Accelerating Innovation with Unified Analytics with Ali GhodsiDatabricks
Today at the 10th Spark Summit, Databricks CEO & Co-founder revealed Databricks Serverless, a new initiative to offer serverless computing for complex data science and Apache Spark workloads. Databricks Serverless is the first product to offer a serverless API for Apache Spark, greatly simplifying and unifying data science and big data workloads for both end-users and DevOps.
This document provides an overview of loading data into Azure SQL DW (Synapse Analytics). It discusses extracting source data into text files, landing the data into Azure Data Lake Store Gen2, preparing the data for loading into staging tables using PolyBase or COPY commands, transforming the data, and inserting it into production tables. It also compares ETL vs ELT approaches and SSIS vs Azure Data Factory for data integration. The presenter then demonstrates loading data in Synapse SQL pool and invites any questions.
Is the traditional data warehouse dead?James Serra
With new technologies such as Hive LLAP or Spark SQL, do I still need a data warehouse or can I just put everything in a data lake and report off of that? No! In the presentation I’ll discuss why you still need a relational data warehouse and how to use a data lake and a RDBMS data warehouse to get the best of both worlds. I will go into detail on the characteristics of a data lake and its benefits and why you still need data governance tasks in a data lake. I’ll also discuss using Hadoop as the data lake, data virtualization, and the need for OLAP in a big data solution. And I’ll put it all together by showing common big data architectures.
Streaming Real-time Data to Azure Data Lake Storage Gen 2Carole Gunst
Check out this presentation to learn the basics of using Attunity Replicate to stream real-time data to Azure Data Lake Storage Gen2 for analytics projects.
Leveraging Azure Analysis Services Tabular Data Models with Power BI by Tim M...KTL Solutions
We will take a look at an introduction and overview of Azure Analysis Services: Microsoft‘s cloud-based analytical engine and Platform as a Service (PaaS) offerings and how to leverage SQL Server Data Tools to build and deploy a tabular data model to Azure Analysis Services.
We will then connect with Power BI Desktop and the Power BI portal to build visualizations. We will discuss Azure Analysis Services features and capabilities, use cases, provisioning and deployment, managing and monitoring, tools, and report creation. Azure Analysis Service became Globally Available in April 2017, and Power
BI has released several major updates as well.
This document provides an overview of big data and how Azure HDInsight can be used to work with big data. It discusses the evolution of data from gigabytes to exabytes and the big data utility gap where most data is stored but not analyzed. It then discusses how to store everything, analyze anything, and build the right thing using big data. Examples are provided of companies generating large amounts of data. An overview of the Hadoop ecosystem is given along with examples of using Hive and Pig on HDInsight to query and analyze large datasets. A case study of Klout is also summarized.
Harnessing Microsoft Fabric and Azure Service Fabric Analytics as a Service a...Microsoft Dynamics
Understand the key capabilities of Microsoft Fabric Services and how they offer solutions for today's data and analytics needs.
https://dynatechconsultancy.com/microsoft-fabric
Data Quality in the Data Hub with RedPointGlobalCaserta
At a Big Data Warehousing Meetup, George Corugedo, CTO of RedPoint Global demonstrated how to use your big data platform for data integration, data quality and identity resolution to provide a true 360 degree view of your customer on Hadoop using the RedPoint product.
For more information or questions, please contact us at www.casertaconcepts.com.
Ai & Data Analytics 2018 - Azure Databricks for data scientistAlberto Diaz Martin
This document summarizes a presentation given by Alberto Diaz Martin on Azure Databricks for data scientists. The presentation covered how Databricks can be used for infrastructure management, data exploration and visualization at scale, reducing time to value through model iterations and integrating various ML tools. It also discussed challenges for data scientists and how Databricks addresses them through features like notebooks, frameworks, and optimized infrastructure for deep learning. Demo sections showed EDA, ML pipelines, model export, and deep learning modeling capabilities in Databricks.
IlOUG Tech Days 2016 - Unlock the Value in your Data Reservoir using Oracle B...Mark Rittman
Mark Rittman from Rittman Mead presented on Oracle Big Data Discovery. He discussed how many organizations are running big data initiatives involving loading large amounts of raw data into data lakes for analysis. Oracle Big Data Discovery provides a visual interface for exploring, analyzing, and transforming this raw data. It allows users to understand relationships in the data, perform enrichments, and prepare the data for use in tools like Oracle Business Intelligence.
Data weekender4.2 azure purview erwin de kreukErwin de Kreuk
This document provides information about Azure Purview and its capabilities for unified data governance. It discusses:
- Azure Purview allows for automated discovery of data across on-premises, multicloud and SaaS sources through its data map. It enables classification, lineage tracking and compliance.
- The data catalog provides semantic search and browse capabilities along with a business glossary and data lineage visualizations.
- Insights features provide reporting on assets, scans, the business glossary, classifications and labeling to give visibility into data usage across the organization.
- The document demonstrates registering and scanning a Power BI tenant to discover data with Azure Purview.
Analytics in a Day Ft. Synapse Virtual WorkshopCCG
Say goodbye to data silos! Analytics in a Day will simplify and accelerate your journey towards the modern data warehouse. Join CCG and Microsoft for a half-day virtual workshop, hosted by James McAuliffe.
Big Data Analytics in the Cloud with Microsoft AzureMark Kromer
Big Data Analytics in the Cloud using Microsoft Azure services was discussed. Key points included:
1) Azure provides tools for collecting, processing, analyzing and visualizing big data including Azure Data Lake, HDInsight, Data Factory, Machine Learning, and Power BI. These services can be used to build solutions for common big data use cases and architectures.
2) U-SQL is a language for preparing, transforming and analyzing data that allows users to focus on the what rather than the how of problems. It uses SQL and C# and can operate on structured and unstructured data.
3) Visual Studio provides an integrated environment for authoring, debugging, and monitoring U-SQL scripts and jobs. This allows
This document discusses techniques for optimizing Power BI performance. It recommends tracing queries using DAX Studio to identify slow queries and refresh times. Tracing tools like SQL Profiler and log files can provide insights into issues occurring in the data sources, Power BI layer, and across the network. Focusing on optimization by addressing wait times through a scientific process can help resolve long-term performance problems.
Organizations have been collecting, storing, and accessing data from the beginning of computerization. Insights gained from analyzing the data enable them to identify new opportunities, improve core processes, enable continuous learning and differentiation, remain competitive, and thrive in an increasingly challenging business environment.
The well-established data architecture, consisting of a data warehouse, fed from multiple operational data stores, and fronted by BI tools, has served most organizations well. However, over the last two decades, with the explosion of internet-scale data, and the advent of new approaches to data and computational processing, this tried-and-true data architecture has come under strain, and has created both challenges and opportunities for organizations.
In this green paper, we will discuss modern approaches to data architecture that have evolved to address these challenges and provide a framework for companies to build a data architecture and better adapt to increasing demands of the modern business environment. This discussion of data architecture will be tied to the Data Maturity Journey introduced in EQengineered’s June 2021 green paper on Data Modernization.
DataOps for the Modern Data Warehouse on Microsoft Azure @ NDCOslo 2020 - Lac...Lace Lofranco
Talk Description:
The Modern Data Warehouse architecture is a response to the emergence of Big Data, Machine Learning and Advanced Analytics. DevOps is a key aspect of successfully operationalising a multi-source Modern Data Warehouse.
While there are many examples of how to build CI/CD pipelines for traditional applications, applying these concepts to Big Data Analytical Pipelines is a relatively new and emerging area. In this demo heavy session, we will see how to apply DevOps principles to an end-to-end Data Pipeline built on the Microsoft Azure Data Platform with technologies such as Data Factory, Databricks, Data Lake Gen2, Azure Synapse, and AzureDevOps.
Resources: https://aka.ms/mdw-dataops
Data warehouse con azure synapse analyticsEduardo Castro
Azure Synapse is the evolution of Azure SQL Data Warehouse, combining big data, data storage and data integration into a single service for end-to-end cloud scale analytics. It provides unlimited analytics with unparalleled speed to gain insights. Azure Synapse brings together enterprise data warehousing and big data analytics to give a unified experience with the advantages of both worlds.
Big Data and Data Warehousing Together with Azure Synapse Analytics (SQLBits ...Michael Rys
SQLBits 2020 presentation on how you can build solutions based on the modern data warehouse pattern with Azure Synapse Spark and SQL including demos of Azure Synapse.
This document discusses developing analytics applications using machine learning on Azure Databricks and Apache Spark. It begins with an introduction to Richard Garris and the agenda. It then covers the data science lifecycle including data ingestion, understanding, modeling, and integrating models into applications. Finally, it demonstrates end-to-end examples of predicting power output, scoring leads, and predicting ratings from reviews.
A modern data warehouse lets you bring together all your data at any scale easily, and to get insights through analytical dashboards, operational reports, or advanced analytics for all your users.
Accelerating Innovation with Unified Analytics with Ali GhodsiDatabricks
Today at the 10th Spark Summit, Databricks CEO & Co-founder revealed Databricks Serverless, a new initiative to offer serverless computing for complex data science and Apache Spark workloads. Databricks Serverless is the first product to offer a serverless API for Apache Spark, greatly simplifying and unifying data science and big data workloads for both end-users and DevOps.
This document provides an overview of loading data into Azure SQL DW (Synapse Analytics). It discusses extracting source data into text files, landing the data into Azure Data Lake Store Gen2, preparing the data for loading into staging tables using PolyBase or COPY commands, transforming the data, and inserting it into production tables. It also compares ETL vs ELT approaches and SSIS vs Azure Data Factory for data integration. The presenter then demonstrates loading data in Synapse SQL pool and invites any questions.
Is the traditional data warehouse dead?James Serra
With new technologies such as Hive LLAP or Spark SQL, do I still need a data warehouse or can I just put everything in a data lake and report off of that? No! In the presentation I’ll discuss why you still need a relational data warehouse and how to use a data lake and a RDBMS data warehouse to get the best of both worlds. I will go into detail on the characteristics of a data lake and its benefits and why you still need data governance tasks in a data lake. I’ll also discuss using Hadoop as the data lake, data virtualization, and the need for OLAP in a big data solution. And I’ll put it all together by showing common big data architectures.
Streaming Real-time Data to Azure Data Lake Storage Gen 2Carole Gunst
Check out this presentation to learn the basics of using Attunity Replicate to stream real-time data to Azure Data Lake Storage Gen2 for analytics projects.
Leveraging Azure Analysis Services Tabular Data Models with Power BI by Tim M...KTL Solutions
We will take a look at an introduction and overview of Azure Analysis Services: Microsoft‘s cloud-based analytical engine and Platform as a Service (PaaS) offerings and how to leverage SQL Server Data Tools to build and deploy a tabular data model to Azure Analysis Services.
We will then connect with Power BI Desktop and the Power BI portal to build visualizations. We will discuss Azure Analysis Services features and capabilities, use cases, provisioning and deployment, managing and monitoring, tools, and report creation. Azure Analysis Service became Globally Available in April 2017, and Power
BI has released several major updates as well.
This document provides an overview of big data and how Azure HDInsight can be used to work with big data. It discusses the evolution of data from gigabytes to exabytes and the big data utility gap where most data is stored but not analyzed. It then discusses how to store everything, analyze anything, and build the right thing using big data. Examples are provided of companies generating large amounts of data. An overview of the Hadoop ecosystem is given along with examples of using Hive and Pig on HDInsight to query and analyze large datasets. A case study of Klout is also summarized.
Harnessing Microsoft Fabric and Azure Service Fabric Analytics as a Service a...Microsoft Dynamics
Understand the key capabilities of Microsoft Fabric Services and how they offer solutions for today's data and analytics needs.
https://dynatechconsultancy.com/microsoft-fabric
Microsoft Fabric is the next version of Azure Data Factory, Azure Data Explorer, Azure Synapse Analytics, and Power BI. It brings all of these capabilities together into a single unified analytics platform that goes from the data lake to the business user in a SaaS-like environment. Therefore, the vision of Fabric is to be a one-stop shop for all the analytical needs for every enterprise and one platform for everyone from a citizen developer to a data engineer. Fabric will cover the complete spectrum of services including data movement, data lake, data engineering, data integration and data science, observational analytics, and business intelligence. With Fabric, there is no need to stitch together different services from multiple vendors. Instead, the customer enjoys end-to-end, highly integrated, single offering that is easy to understand, onboard, create and operate.
This is a hugely important new product from Microsoft and I will simplify your understanding of it via a presentation and demo.
Agenda:
What is Microsoft Fabric?
Workspaces and capacities
OneLake
Lakehouse
Data Warehouse
ADF
Power BI / DirectLake
Resources
1 Introduction to Microsoft data platform analytics for releaseJen Stirrup
Part 1 of a conference workshop. This forms the morning session, which looks at moving from Business Intelligence to Analytics.
Topics Covered: Azure Data Explorer, Azure Data Factory, Azure Synapse Analytics, Event Hubs, HDInsight, Big Data
From Business Hindsight to Foresight with Azure Synapse AnalyticsKorcomptenz Inc
From Business Hindsight to Foresight with Azure Synapse Analytics
The document discusses how Azure Synapse Analytics can help organizations transition from descriptive analytics of past data to predictive analytics and prescriptive insights. It provides an overview of Azure Synapse's capabilities for data integration, warehousing, and big data analytics. Case studies demonstrate how customers have used Azure Synapse and Power BI to improve operations, customer experiences, and enable predictive maintenance.
Think of big data as all data, no matter what the volume, velocity, or variety. The simple truth is a traditional on-prem data warehouse will not handle big data. So what is Microsoft’s strategy for building a big data solution? And why is it best to have this solution in the cloud? That is what this presentation will cover. Be prepared to discover all the various Microsoft technologies and products from collecting data, transforming it, storing it, to visualizing it. My goal is to help you not only understand each product but understand how they all fit together, so you can be the hero who builds your companies big data solution.
Azure Data Explorer deep dive - review 04.2020Riccardo Zamana
Modern Data Science Lifecycle with ADX & Azure
This document discusses using Azure Data Explorer (ADX) for data science workflows. ADX is a fully managed analytics service for real-time analysis of streaming data. It allows for ad-hoc querying of data using Kusto Query Language (KQL) and integrates with various Azure data ingestion sources. The document provides an overview of the ADX architecture and compares it to other time series databases. It also covers best practices for ingesting data, visualizing results, and automating workflows using tools like Azure Data Factory.
Azure Synapse is Microsoft's new cloud analytics service offering that combines enterprise data warehouse and Big Data analytics capabilities. It offers a powerful and streamlined platform to facilitate the process of consolidating, storing, curating and analysing your data to generate reliable and actionable business insights.
Customer Migration to Azure SQL Database_2024.pdfGeorge Walters
Customer Migration to Azure SQL Database 2024 --
Hear how a tier 1 financial ISV application got migrated from on-premises to the Azure Cloud! This includes issues with existing application, building out an Azure Database practice, and migration. We finish up with how to do pieces of this application with the latest Azure additions.
This document discusses the future of data and the Azure data ecosystem. It highlights that by 2025 there will be 175 zettabytes of data in the world and the average person will have over 5,000 digital interactions per day. It promotes Azure services like Power BI, Azure Synapse Analytics, Azure Data Factory and Azure Machine Learning for extracting value from data through analytics, visualization and machine learning. The document provides overviews of key Azure data and analytics services and how they fit together in an end-to-end data platform for business intelligence, artificial intelligence and continuous intelligence applications.
Introduces the Microsoft’s Data Platform for on premise and cloud. Challenges businesses are facing with data and sources of data. Understand about Evolution of Database Systems in the modern world and what business are doing with their data and what their new needs are with respect to changing industry landscapes.
Dive into the Opportunities available for businesses and industry verticals: the ones which are identified already and the ones which are not explored yet.
Understand the Microsoft’s Cloud vision and what is Microsoft’s Azure platform is offering, for Infrastructure as a Service or Platform as a Service for you to build your own offerings.
Introduce and demo some of the Real World Scenarios/Case Studies where Businesses have used the Cloud/Azure for creating New and Innovative solutions to unlock these potentials.
Azure Data Engineer Course | Azure Data Engineer TraininAccentfuture
AccentFuture offers top Azure Data Engineer training. Enroll in our Azure Data Engineer course online and master skills with expert-led Azure Data Engineer online course and hands-on training.
This document provides an overview of a course on implementing a modern data platform architecture using Azure services. The course objectives are to understand cloud and big data concepts, the role of Azure data services in a modern data platform, and how to implement a reference architecture using Azure data services. The course will provide an ARM template for a data platform solution that can address most data challenges.
Azure provides cloud computing services including infrastructure as a service (IaaS), platform as a service (PaaS), and software as a service (SaaS) that allow users to rapidly setup environments, scale resources to meet demands, and increase efficiency. Azure offers a wide range of services such as compute, storage, databases, analytics, machine learning, IoT, and security to help users migrate existing applications or build new cloud-native applications. The document outlines key scenarios for using Azure such as development/testing, lift and shift of existing applications, big data analytics, and identity management to provide a starting point for leveraging the cloud platform
Apache Spark is a fast and general engine for large-scale data processing. It was created by UC Berkeley and is now the dominant framework in big data. Spark can run programs over 100x faster than Hadoop in memory, or more than 10x faster on disk. It supports Scala, Java, Python, and R. Databricks provides a Spark platform on Azure that is optimized for performance and integrates tightly with other Azure services. Key benefits of Databricks on Azure include security, ease of use, data access, high performance, and the ability to solve complex analytics problems.
IBM Cloud Pak for Data is a unified platform that simplifies data collection, organization, and analysis through an integrated cloud-native architecture. It allows enterprises to turn data into insights by unifying various data sources and providing a catalog of microservices for additional functionality. The platform addresses challenges organizations face in leveraging data due to legacy systems, regulatory constraints, and time spent preparing data. It provides a single interface for data teams to collaborate and access over 45 integrated services to more efficiently gain insights from data.
A dive into Microsoft Fabric/AI Solutions offering. For the event: AI, Data, and CRM: Shaping Business through Unique Experiences. By D. Koutsanastasis, Microsoft
Big Data, IoT, data lake, unstructured data, Hadoop, cloud, and massively parallel processing (MPP) are all just fancy words unless you can find uses cases for all this technology. Join me as I talk about the many use cases I have seen, from streaming data to advanced analytics, broken down by industry. I’ll show you how all this technology fits together by discussing various architectures and the most common approaches to solving data problems and hopefully set off light bulbs in your head on how big data can help your organization make better business decisions.
Introduction to Machine Learning with Azure & DatabricksCCG
Join CCG and Microsoft for a hands-on demonstration of Azure’s machine learning capabilities. During the workshop, we will:
- Hold a Machine Learning 101 session to explain what machine learning is and how it fits in the analytics landscape
- Demonstrate Azure Databricks’ capabilities for building custom machine learning models
- Take a tour of the Azure Machine Learning’s capabilities for MLOps, Automated Machine Learning, and code-free Machine Learning
By the end of the workshop, you’ll have the tools you need to begin your own journey to AI.
The document outlines several upcoming workshops hosted by CCG, an analytics consulting firm, including:
- An Analytics in a Day workshop focusing on Synapse on March 16th and April 20th.
- An Introduction to Machine Learning workshop on March 23rd.
- A Data Modernization workshop on March 30th.
- A Data Governance workshop with CCG and Profisee on May 4th focusing on leveraging MDM within data governance.
More details and registration information can be found on ccganalytics.com/events. The document encourages following CCG on LinkedIn for event updates.
How to Monetize Your Data Assets and Gain a Competitive AdvantageCCG
Join us for this session where Doug Laney will share insights from his best-selling book, Infonomics, about how organizations can actually treat information as an enterprise asset.
You had a strategy. You were executing it. You were then side-swiped by COVID, spending countless cycles blocking and tackling. It is now time to step back onto your path.
CCG is holding a workshop to help you update your roadmap and get your team back on track and review how Microsoft Azure Solutions can be leveraged to build a strong foundation for governed data insights.
Power BI Advanced Data Modeling Virtual WorkshopCCG
Join CCG and Microsoft for a virtual workshop, hosted by Solution Architect, Doug McClurg, to learn how to create professional, frustration-free data models that engage your customers.
Machine Learning with Azure and Databricks Virtual WorkshopCCG
Join CCG and Microsoft for a hands-on demonstration of Azure’s machine learning capabilities. During the workshop, we will:
- Hold a Machine Learning 101 session to explain what machine learning is and how it fits in the analytics landscape
- Demonstrate Azure Databricks’ capabilities for building custom machine learning models
- Take a tour of the Azure Machine Learning’s capabilities for MLOps, Automated Machine Learning, and code-free Machine Learning
By the end of the workshop, you’ll have the tools you need to begin your own journey to AI.
Join Brian Beesley, Director of Data Science, for an executive-level tour of AI capabilities. Get an inside peek at how others have used AI, and learn how you can harness the power of AI to transform your business.
Virtual Governance in a Time of Crisis WorkshopCCG
The CCGDG framework is focused on the following 5 key competencies. These 5 competencies were identified as areas within DG that have the biggest ROI for you, our customer. The pandemic has uncovered many challenges related to governance, therefore the backbone of this model is the emphasis on risk mitigation.
1. Program Management
2. Data Quality
3. Data Architecture
4. Metadata Management
5. Privacy
Advance Data Visualization and Storytelling Virtual WorkshopCCG
Join CCG and Microsoft for a virtual workshop, hosted by Senior BI Architect, Martin Rivera, taking you through a journey of advanced data visualization and storytelling.
In early 2019, Microsoft created the AZ-900 Microsoft Azure Fundamentals certification. This is a certification for all individuals, IT or non IT background, who want to further their careers and learn how to navigate the Azure cloud platform.
Learn about AZ-900 exam concepts and how to prepare and pass the exam
This document provides an overview and agenda for a Power BI Advanced training course. The course objectives are outlined, which include understanding data modeling concepts, calculated columns and measures, and evaluation contexts in DAX. The agenda lists the modules to be covered, including data modeling best practices, modeling scenarios, and DAX. Housekeeping items are provided, instructing participants to send questions to Sami and mute their lines. It is noted the session will be recorded.
This document provides an overview of Azure core services, including compute, storage, and networking options. It discusses Azure management tools like the portal, PowerShell, and CLI. For compute, it covers virtual machines, containers, App Service, and serverless options. For storage, it discusses SQL Database, Cosmos DB, blob, file, queue, and data lake storage. It also discusses networking concepts like load balancing and traffic management. The document ends with potential exam questions related to Azure services.
This document provides an agenda and objectives for an advanced Power BI training session. The agenda includes sections on Power BI M transformations, merge types, creating a BudgetFact table using multiple queries, and data profiling. The objectives are to understand M transformations, merging queries, using multiple queries for advanced transformations, and data profiling. Attendees will learn key M transformations like transpose, pivot columns, and unpivot columns. They will also learn about different merge types in Power BI.
This document provides an overview of Azure cloud concepts for exam preparation. It begins with an introduction to cloud computing benefits like scalability, reliability and cost effectiveness. It then covers Azure architecture including regions, availability zones and performance service level agreements. The document reviews cloud deployment models and compares infrastructure as a service, platform as a service and software as a service. It also discusses how to use the Azure pricing calculator and reduce infrastructure costs. Potential exam questions are provided at the end.
Business intelligence dashboards and data visualizations serve as a launching point for better business decision making. Learn how you can leverage Power BI to easily build reports and dashboards with interactive visualizations.
Data Governance and MDM | Profisse, Microsoft, and CCGCCG
CCG will introduce a methodology and framework for DG that allows organizations to assess DG faster, deriving actionable insights that can be quickly implemented with minimal disruption. CCG will also review how Microsoft Azure Solutions can be leveraged to build a strong foundation for governed data insights. In addition, Profisee will introduce a popular component of data governance, MDM.
Enable Better Decision Making with Power BI Visualizations & Modern Data EstateCCG
Self-service BI empowers users to reach analytic outputs through data visualizations and reporting tools. Solution Architect and Cloud Solution Specialist, James McAuliffe, will be taking you through a journey of Azure's Modern Data Estate.
Data Governance with Profisee, Microsoft & CCG CCG
1. The workshop agenda covers data governance fundamentals, assessing an organization's data governance maturity using the CCGDG framework, and prioritizing a roadmap for improvement.
2. The Profisee presentation promotes their master data management solution for enabling digital transformation by providing a single view of critical data across systems.
3. Profisee's solution focuses on five key areas: stewardship, matching configuration, adjusting the configuration, operational matching, and workflow management to ensure data quality.
[Webinar] Top Power BI Updates You *Acutally* Need to Know CCG
1)Summary of the over 25 feature improvements made by Power BI in 2019
2) Top ways to leverage the changes in functionality
3) Ways to get buy-in and further utilize your Microsoft Power BI investment
Key takeaways:
-Identify with the key reasons for failing Data Governance initiatives
-Uncover the commonly used Data Governance terms and their meanings
-Learn the Framework for a successful Data Governance Program
Just-in-time: Repetitive production system in which processing and movement of materials and goods occur just as they are needed, usually in small batches
JIT is characteristic of lean production systems
JIT operates with very little “fat”
This comprehensive Data Science course is designed to equip learners with the essential skills and knowledge required to analyze, interpret, and visualize complex data. Covering both theoretical concepts and practical applications, the course introduces tools and techniques used in the data science field, such as Python programming, data wrangling, statistical analysis, machine learning, and data visualization.
By James Francis, CEO of Paradigm Asset Management
In the landscape of urban safety innovation, Mt. Vernon is emerging as a compelling case study for neighboring Westchester County cities. The municipality’s recently launched Public Safety Camera Program not only represents a significant advancement in community protection but also offers valuable insights for New Rochelle and White Plains as they consider their own safety infrastructure enhancements.
Mieke Jans is a Manager at Deloitte Analytics Belgium. She learned about process mining from her PhD supervisor while she was collaborating with a large SAP-using company for her dissertation.
Mieke extended her research topic to investigate the data availability of process mining data in SAP and the new analysis possibilities that emerge from it. It took her 8-9 months to find the right data and prepare it for her process mining analysis. She needed insights from both process owners and IT experts. For example, one person knew exactly how the procurement process took place at the front end of SAP, and another person helped her with the structure of the SAP-tables. She then combined the knowledge of these different persons.
Telangana State, India’s newest state that was carved from the erstwhile state of Andhra
Pradesh in 2014 has launched the Water Grid Scheme named as ‘Mission Bhagiratha (MB)’
to seek a permanent and sustainable solution to the drinking water problem in the state. MB is
designed to provide potable drinking water to every household in their premises through
piped water supply (PWS) by 2018. The vision of the project is to ensure safe and sustainable
piped drinking water supply from surface water sources
Thingyan is now a global treasure! See how people around the world are search...Pixellion
We explored how the world searches for 'Thingyan' and 'သင်္ကြန်' and this year, it’s extra special. Thingyan is now officially recognized as a World Intangible Cultural Heritage by UNESCO! Dive into the trends and celebrate with us!
1. Analytics in a day
Cloud analytics in the age of
self-service and data science
2. Agenda
Workshop Instruction & Discussion
9:00 – 9:30 Harness the power of analytics
9:30 – 10:15 Experience a new class of analytics with Azure Synapse
Break for 15 min and Set Up Lab
Hands-on lab
10:30 – 1:00 Hands-on lab using Azure Synapse and Power BI
Times are approximate and will be fluid with the class
3. Housekeeping
Please message Sami
with any questions,
concerns or if you
need assistance during
this workshop.
Please mute your line!
We will be applying mute.
This session will be
recorded.
If you do not want to be
recorded, please disconnect at
this time.
Links:
See chat window.
Worksheet:
See handouts.
To make presentation
larger, draw the
bottom half of screen
‘up’.
4. James McAuliffe,
Cloud Solution Architect
James McAuliffe is a Cloud Solution Architect with over 20 years of technology
industry experience. During this journey into data and analytics, he’s held all of the
traditional Business Intelligence Solution project roles, ranging from design and
development to complete life cycle BI implementations. He is a Microsoft Preferred
Partner Solutions expert and has worked with clients of all sizes, from local
businesses to Fortune 500 companies.
And I like old Italian cars.
linkedin.com/in/jamesmcauliffesql/
6. A premier Microsoft partner, CCG uses leading cloud
platforms to develop solutions and provide analytics that
help customers advance their digital strategies.
6
PARTNERSHIP SPOTLIGHT: MICROSOFT
Certifications
Gold Partner
Independent System Vendor (ISV)
and Co-Seller
AI Inner Circle Partner
Technologies
Azure Data Services
Azure Data Factory
Azure Data Lake Store
Azure Databricks
Azure Cognitive Services
Azure Machine Learning
Azure Stream Analytics
Azure Analysis Services
Azure Synapse Analytics
Power BI Platform
7. Offerings Overview
7
Data and
Analytics Strategy
Advanced Analytics,
Machine Learning, and AI
Data Management
and Data Governance
Enterprise Business
Intelligence
Cloud Strategy, Migration,
And Management
8. Harness the power of analytics
Section 1
All businesses are data businesses
Section 2
The cloud for modern analytics
Section 3
The paradox of analytics
Section 4
The analytics continuum
16. Structured, unstructured, and streaming data
integrated in a single, scalable, environment
A cloud analytics platform
is the hub for all data models
18. Data Landscape – Volume and Pressure
IDC Data Age 2025 - The Digitization of the World
19. While data grows 400%, less than 30% gets analyzed
2025
2020
44ZB 175ZB
20. * Harvard Business Review (2019), Understanding why analytics strategies fall short for some, but not for others
Analytics & AI is the #1 investment for business leaders,
however they struggle to maximize ROI
80% 55%
?
22. Each new technology creates
another siloed operation
Big data
Data integration
Machine learning
Business intelligence
Data governance
Security
paradox
32. The first unified, cloud native
platform for converged analytics
Azure Synapse is the only unified platform for
analytics, blending big data, data warehousing,
and data integration into a single cloud native
service for end-to-end analytics at cloud scale.
33. Powered by a new cloud native
distributed SQL engine
34. Flexible consumption models
Serverless pay-per-query ideal for ad-hoc data lake
exploration and transformation
Dedicated clusters optimized mission-critical data
warehouse workloads
Serverless Dedicated
35. ➢ Comprehensive security and compliance
➢ Streamlined data integration
➢ Flexible data warehousing
➢ Real-time operational analytics
➢ Integrated machine learning
➢ Power BI + Azure Synapse
37. Category Feature
Data Protection
Data in transit
Data encryption at rest
Data discovery and classification
Access Control
Object level security (tables/views)
Row level security
Column level security
Dynamic data masking
Column level encryption
Authentication
SQL login
Azure active directory
Multi-factor authentication
Network Security
Managed virtual network
Custom virtual network
Firewall
Azure ExpressRoute
Azure Private Link
Threat protection
Threat detection
Auditing
Vulnerability assessment
Isolation
Dedicated metadata store
Hosted in customer tenant
Best-in-class security
Customer & System Managed Keys
All data encrypted by default
Up to 3x levels of data encryption at rest
Democratize data at scale with fine-grained ACL
Proactive protection
Comprehensive Compliance
38. Eliminate network maintenance
One-click enables automated management of virtual
networks between cluster endpoints
Synapse resources only ever interop with private
endpoints
No management of subnets or IP Ranges
Prevents data exfiltration
Compliance boundary
39. More than just data security
Native integration with Azure Purview
Automatically discover and classify data assets
End-to-end data lineage
41. Fully-managed elastic platform
Elastic compute that can be easily optimized to
different classes of workload
All features available in a single tier
Infinite cost effective PAYG storage
42. SQL Editor
Automatic code completion (Intellisense)
Script collaboration within the Workspace
Built-in visualizations
Easily switch between clusters
43. TPC-H and TPC-DS Leader
Price/performance leadership relative to other cloud
data warehouses
“Polaris” is the only query engine to successfully
complete TPC-H at 1PB scale
https://aka.ms/synapse-dqp
11.5
7
62
9.5
28
2.5
30.5
48.5
99
22.5
9
5
11.5
6.5
2 5
27
99.5
18.5 21
95.5
8
Q1 Q2 Q3 Q4 Q5 Q6 Q7 Q8 Q9 Q10 Q11 Q12 Q13 Q14 Q15 Q16 Q17 Q18 Q19 Q20 Q21 Q22
TPC-H 1 petabyte execution times
$47
$152
$564
$51
DW30000C 4X-Large BigQuery dc2.8xlarge
60N
Test-H 30TB
Price/performance @30TB
($ per query per hour) lower is better
$153
$286 $309
$570
Azure
Synapse
Redshift Snowflake
Enterprise
BigQuery Flat
Rate
Test-DS 30TB
Price/performance comparison (lower is better)
* “GigaOm Analytics Field Test-H Benchmark Report” January 2019; “GigaOm Analytics Field Test-DS Benchmark Report” April 2019
44. Only platform to complete TPC-H
benchmark at 1 Petabyte
Massive concurrency
Global workload graph
Workload aware query scheduling
https://aka.ms/synapse-dqp
45. Workload management
Azure Synapse supports a more diverse set of workload management tools through workload importance, intra-cluster isolation, and elastic clusters.
Scale in Scale out
Workload Importance Workload Isolation
Workload Group B 40%
Elastic Cluster (Scale Up)
2000 cDWU
46. Chooses the most secure cloud DW, Azure Synapse Analytics,
to transform two business critical Teradata systems
Challenge
Solution
52. Real-time operational analytics
One-click enablement in Azure Portal
No data integration pipelines required
Near-zero impact on operational systems
Latency <90s at 99th percentile
Azure Cosmos DB
Analytical Store
Column store optimized
for analytical queries
Transactional Store
Row store optimized for
transactional operations
Azure Synapse
Analytics
Cloud-Native HTAP
Azure Synapse Link
53. Event Hubs
IoT Hub
T-SQL language
Built-in streaming ingestion & analytics
Streaming Ingestion Dedicated SQL Pool
Synapse SQL
IoT ingestion without aggregation
Ingress up to 720 gigabytes/hour of raw events
Analyze data in-flight using SQL language Azure
Stream Analytics
Join streaming data with other data assets in the
data warehouse and data lake
54. “Azure data services have been
seamlessly integrated into existing
infrastructure, which was especially
helpful with respect to authentication
and user access management.”
New ways for optimizing Snow production
and operational costs with Azure Synapse
Challenge
Solution
56. Democratize predictive power
Synapse makes predictive analytics accessible to all
Notebooks provides a code authoring experience for
complex predictive models
Automatic ML graphical interface provides a no-
code experience for creating ML models
Native integration with Azure Cognitive Search
provides access to pre-built models
All Code Low/no-Code Pre-built models
57. Code-first ML model development
PySpark, Scala, and C# languages supported
Automatic code completion (Intellisense)
Author multiple languages in a single notebook
Analyze data from the data warehouse, data lake,
and real-time operational data from one place
58. Data + Languages
Languages such as SQL, PySpark, Scala and
C# in support of data science and data
warehouse workloads
The data lake supports and unlimited set of file
formats including Parquet, ORC and Json as well as
audio, image, and video formats
Language
Data
59. All you need is data
Fully automated feature exploration
60. Code-free in Synapse Studio
No-code creation on Machine Learning models
Democratize ML to everyone since no data science
domain knowledge required
Support for ensemble models
Supports classification, regression, and
time-series forecasting
61. Code-free in Synapse Studio
No-code references to machine learning models
Democratize ML to everyone since no data science
domain knowledge required
Easily embed in SQL stored procedures for
transformation of Views for reporting
62. SELECT d.*, p.Score FROM PREDICT(MODEL = @onnx_model, …
In-engine ML scoring
Machine learning models executed using SQL
“In-engine” for performance and scalability
No data leaves the platform for scoring
No additional cost for scoring
T-SQL Language
Synapse SQL
Model Data Predictions
63. Power BI + Azure Synapse
An unmatched combination
64. Where do you find yourself on the curve?
Hindsight Insight Foresight
Value
Difficulty
What happened?
Descriptive Analysis
Why did it happen?
Diagnostic Analysis
What will happen?
Predictive Analysis
How can we make it happen?
Prescriptive Analysis
65. Where do you find yourself on the curve?
Hindsight Insight Foresight
Value
Difficulty
What happened?
Descriptive Analysis
Why did it happen?
Diagnostic Analysis
What will happen?
Predictive Analysis
How can we make it happen?
Prescriptive Analysis
BI
66. BI + Analytics unlock the door to AI, machine learning, and
real-time insights
Hindsight Insight Foresight
Value
Difficulty
What happened?
Descriptive Analysis
Why did it happen?
Diagnostic Analysis
What will happen?
Predictive Analysis
How can we make it happen?
Prescriptive Analysis
Analytics
BI
67. Unified experience to enrich data
Automated ML for rapid development
Seamless collaboration
68. Build dashboard in Synapse Studio
Code-free experience for development rich
visualizations
One-click publishing to for secure consumption
across the enterprise
69. Visualize
and report
Power BI
Model
& serve
Azure Synapse
Analytics
CDM
Folders
Azure Data
Lake Storage
Respond instantly
Enable instant response times with
Power BI Aggregations on massive
datasets when querying at the
aggregated level
Get granular with your data
Queries at the granular level are
sent to Azure Synapse Analytics
with DirectQuery leveraging its
industry-leading performance
Save money with
industry-leading performance
Azure Synapse Analytics is up to
14x faster and 94% cheaper than
other cloud providers
View reports with
a single pane of glass
Skip the configuration when
connecting to Power BI with
integrated Power BI-authoring
directly in the Azure Synapse Studio
Accelerate business value with a powerful analytics platform
70. Customers using Azure Synapse & Power BI today
are transforming their business with purpose
27%
Faster time
to insights
271% Average ROI
26%
Lower total cost
of ownership
60%
Increased customer
satisfaction
* Forrester, October 2019, “The Total Economic Impact of Microsoft Azure Analytics with Power BI”
83. Cloud native ETL/ELT
95+ connectors available
Secure connectivity to on-premise data sources,
other clouds, and SaaS applications
Code-first and low/no code design interfaces
Schedule and Event based triggering
Code-free
106. Get started today
Create a free Azure account and get started with Azure Synapse Analytics:
https://azure.microsoft.com/en-us/free/synapse-analytics/
Submit info for free proof of value package for modernizing SAP workloads:
https://aka.ms/synapse-qlik
Submit info for free proof of value package for migrating on-prem data warehouse:
http://aka.ms/synapse-informatica
Learn more:
https://aka.ms/synapse
107. Use the limited time offer to save up to 76
percent when migrating to Azure Synapse
https://aka.ms/synapse-migration-offer
108. Analytics in a Day
Thank You!
James McAuliffe
[email protected]
https://www.linkedin.com/in/jamesmcauliffesql/
https://ccganalytics.com/
110. Supply Chain Intelligence
and Finance Analytics Migrate and Modernize
Data from
SAP ECC
Legacy DW
appliance
Each motion has a customized free proof of value package
111. Azure Synapse + Power BI + Qlik Data Integration
Free Proof of Value for SAP ERP
Free Deep Dive Session
Free ½ Day Solution
Architecture Workshop
Free Software
Subscriptions
Free Technical and
Subject Matter Expertise
Azure Synapse Expert Support
Azure Synapse Expert Support
Azure Synapse Analytics
(data warehousing) + Power BI
(for duration of POV)
Azure Synapse Analytics and
Power BI Expertise
(for duration of POV)
SAP Real-time Data Integration
Expert Support
1:1 Deep-Dive with Qlik Data
Integration Team
Qlik Data Integration for Data
Warehouse Automation
(for duration of POV)
Qlik Data Integration and
SAP Expertise
(for duration of POV)
112. Get started today with your free proof
of value from Microsoft and Qlik
https://aka.ms/synapse-qlik
113. Azure Synapse + Informatica
Free Proof of Value
Free Hands-On
Migration Workshop
TCO Assessment
Free Data Catalog
& ETL Migration
Free Code Conversion
Global Black Belt Support
On-premises Data Warehouse
Code Conversion
Azure Synapse Analytics (data
warehousing) Subscription
(for duration of PoV up to 30 days)
Cloud Data Warehouse Modernization
Workshop on Azure
Customized Migration
TCO Assessment
Enterprise Data Catalog (EDC)
Intelligent Cloud Services (IICS)
(for duration of PoV up to 30 days)
Free Azure Synapse
Subscription
114. Get started today with your free proof
of value from Microsoft and Informatica
http://aka.ms/synapse-informatica