SlideShare a Scribd company logo
SSAS 2008 Data Mining Lynn Langit/MSDN Developer Evangelist Microsoft http://blogs.msdn.com/SoCalDevGal
Session Prerequisites Working SQL Server 2008 Developer Understanding of OLAP concepts Working SQL Server Analysis Server 2005 Developer Interest in or basic knowledge of Data Mining concepts
Objectives and Agenda Understand what, why, when & how of SQL Server 2008 Data Mining Examine the core functionality of the Data Mining Extensions Hear about the new and/or advanced functionality of Data Mining
What and Why Data Mining? Predictive Analytics Presentation Exploration Discovery Passive Interactive Proactive Role of Software Business Insight Canned reporting Ad-hoc reporting OLAP Data mining
Cubes vs. Data Mining
DM - Scenarios to Tasks
Tasks to Techniques
BI for Everyone Individual – Excel  Project – Share Point
Microsoft’s Predictive Analytics Data Mining SQL extensions (DMX) Application Developer Data Mining Specialist Microsoft Dynamics CRM Analytics Foundation SQL Server 2008  Business Intelligence Development Studio Microsoft SQL Server 2008 Analysis Services Information  Worker Data Mining Add-ins for  the 2007 Microsoft Office system Microsoft SQL Server 2008 Data Mining BI Analyst Custom Algorithms SQL Services Azure
Data Mining Add-ins for Office 2007 Table Analysis Tools for Excel 2007 Data Mining Template for Visio 2007 Data Mining Client for Excel 2007 Information  Worker BI Analyst Data Mining Specialist
Microsoft Data Mining Lifecycle  CRISP-DM SSAS (Data Mining) Excel SSAS (DSV) Query Excel SSIS SSAS SSRS Excel Your Apps SSIS SSAS Excel Data www.crisp-dm.org Business Understanding Data Understanding Data Preparation Modeling Evaluation Deployment
Understand & Prepare specifics
Demo 1 – Explore / Clean / Partition Data 2 – Prepare Data
Modeling Specifics
Demo 3 – Select algorithm 4 – Create model
Evaluation Specifics
Demo 5 – Evaluate Model 6 – Deploy model 7- Update model 8 – Query model
Data Mining – Logical Model algorithm Mining Model Mining Model Training Data DB data Client data Application data Data Mining Engine To Predict Predicted Data Mining Model DB data Client data Application data “ Just one row ” Data Mining Engine
Data Mining - Physical Model Analysis Services Server Mining Model Data Mining Algorithm Data Source Your Application OLE DB/ ADOMD/ XMLA Deploy BI Dev Studio  (Visual Studio) App Data
Data Mining Interfaces – APIs XMLA Over TCP/IP XMLA Over HTTP Analysis Server (msmdsrv.exe) OLAP Data Mining Server ADOMD.NET .Net Stored Procedures Microsoft Algorithms Third Party Algorithms OLEDB for OLAP/DM ADO/DSO Any Platform, Any Device C++ App VB App .Net App AMO Any App ADOMD.NET WAN DM Interfaces
Configuration & Deployment Model Creation/Management Database Administrators Session Mining Models Model Application Permissions on models Permissions on data sources Browse Copy to Excel Drillthrough Query Default Advanced Excel Services Manage models and structures Export/Import Rename Connection Database Trace
Data Mining Extensions (DMX)  CREATE MINING MODEL  CreditRisk (CustID   LONG KEY, Gender  TEXT DISCRETE, Income    LONG CONTINUOUS, Profession  TEXT DISCRETE, Risk   TEXT DISCRETE PREDICT) USING  Microsoft_Decision_Trees INSERT INTO   CreditRisk  (CustId, Gender, Income, Profession, Risk) Select  CustomerID, Gender, Income, Profession,Risk From Customers Select  NewCustomers.CustomerID, CreditRisk.Risk,  PredictProbability(CreditRisk.Risk) FROM  CreditRisk  PREDICTION JOIN  NewCustomers ON   CreditRisk.Gender=NewCustomer.Gender   AND  CreditRisk.Income=NewCustomer.Income AND  CreditRisk.Profession=NewCustomer.Profession
DMX Column Expressions Predictable Columns Source Data Columns Functions -  Predict “ Workhorse” Discrete scalar values Continuous scalar values Associative nested tables Sequence nested tables Time Series Overloaded to PredictAssociation PredictSequence PredictTimeSeries PredictProbability PredictSupport PredictHistogram Cluster ClusterProbability GetNodeId IsInNode Arithmetic operators Stored Procedure Subselect Select from nested tables
Demo – Data Mining & Excel 20007 integration
Excel Functions* DMPREDICTTABLEROW  ( Connection, ModelName,   PredictionResult, TableRowRange [, string CommaSeparatedColumnNames] ) DMPREDICT  ( Connection, Model, PredictionResult, Value1, Name1,   [...,Value32, Name32] ) DMCONTENTQUERY  (Connection, Model, PredictionResult [, WhereClause])
DM in the Cloud Test Data Types Relational CSV SQL Services (Azure Services)
Try it in the cloud…
Analysis Results in the Cloud…
Calling the Cloud…(from Excel 2007)
New to SQL Server 2008 DM Microsoft Time Series algorithm improved  ARIMA plus ARTxp method, and a blending algorithm  = better results  New prediction mode allows adding new data to time series models Holdout Support added Easily partition data into training and test sets that are stored in mining structure & available to query after processing Ability to build mining models based on filtered subsets added Results in less structures, i.e. can just filter existing Drillthrough functionality extended  makes all mining structure columns available, not just columns included in the model allows you to build more compact models Cross-validation added allows users to quickly validate their modeling approach by automatically building temporary models and evaluating accuracy measures across K folds. The feature is available through a new cross-validation tab under Accuracy Charts in BIDS, in addition to being accessible programmatically via a stored procedure call.
Summary Data Mining in SQL Server 2008 is mature, powerful and accessible Can use Excel 2007 Familiar client for BI – OLAP cubes AND Data Mining models Model Creators /  Users Excel Data or Server Data SSAS and Excel both support the full DM Cycle Data Understanding & Data Preparation Modeling, Validation & Deployment SQL Services Incubations available now Data Mining from the Cloud More
DM Webcasts Fri, 02 Nov 2007 MSDN Webcast: Build Smart Web Applications with SQL Server Data Mining (Level 200) Thu, 08 Nov 2007 MSDN Webcast: Building Adaptive Applications with SQL Server Data Mining (Level 300) Mon, 19 Nov 2007 MSDN Webcast: Extending and Customizing SQL Server Data Mining (Level 300) Fri, 30 Nov 2007 MSDN Webcast: Creating Visualizations for SQL Server Data Mining (Level 300) Thu, 01 Nov 2007 TechNet Webcast: Deliver Actionable Insight Throughout Your Organization with Data Mining (Part 1 of 3): Your First Project with SQL Server Data Mining (Level 200) Thu, 15 Nov 2007 TechNet Webcast: Deliver Actionable Insight Throughout Your Organization with Data Mining (Part 2 of 3): Understand SQL Server Data Mining Add-ins for the 2007 Office System (Level 200) Thu, 29 Nov 2007 TechNet Webcast: Deliver Actionable Insight Throughout Your Organization with Data Mining (Part 3 of 3): Use Predictive Intelligence to Create Smarter KPIs (Level 200)
DM Resources Technical Communities, Webcasts, Blogs, Chats & User Groups http://www.microsoft.com/communities/default.mspx   Microsoft Developer Network (MSDN) & TechNet  http://microsoft.com/msdn   http://microsoft.com/technet   Trial Software and Virtual Labs http://www.microsoft.com/technet/downloads/trials/default.mspx   Microsoft Learning and Certification http://www.microsoft.com/learning/default.mspx   SQL Server Data Mining http://www.sqlserverdatamining.com http://www.microsoft.com/bi/bicapabilities/data-mining.aspx
BI Resources from Lynn Langit http :// blogs.msdn.com/SoCalDevGal “ How Do I…BI?” screencast series on MSDN “ Smart Business Intelligence Solutions with Microsoft SQL Server 2008”  MSPress Feb 2009 “ Foundations of SQL Server 2005 Business Intelligence ”  APress April 2007
 

More Related Content

What's hot (20)

Azure - Data Platform
Azure - Data PlatformAzure - Data Platform
Azure - Data Platform
giventocode
 
SQL Server 2008 for Developers
SQL Server 2008 for DevelopersSQL Server 2008 for Developers
SQL Server 2008 for Developers
ukdpe
 
Data Platform Overview
Data Platform OverviewData Platform Overview
Data Platform Overview
Hamid J. Fard
 
SQL to Azure Migrations
SQL to Azure MigrationsSQL to Azure Migrations
SQL to Azure Migrations
Datavail
 
Azure Cosmos DB + Gremlin API in Action
Azure Cosmos DB + Gremlin API in ActionAzure Cosmos DB + Gremlin API in Action
Azure Cosmos DB + Gremlin API in Action
Denys Chamberland
 
Migrating on premises workload to azure sql database
Migrating on premises workload to azure sql databaseMigrating on premises workload to azure sql database
Migrating on premises workload to azure sql database
PARIKSHIT SAVJANI
 
An Introduction to Cloud Computing by Robert Grossman 08-06-09 (v19)
An Introduction to Cloud Computing by Robert Grossman 08-06-09 (v19)An Introduction to Cloud Computing by Robert Grossman 08-06-09 (v19)
An Introduction to Cloud Computing by Robert Grossman 08-06-09 (v19)
Robert Grossman
 
Azure Cloud Dev Camp - Introduction
Azure Cloud Dev Camp - IntroductionAzure Cloud Dev Camp - Introduction
Azure Cloud Dev Camp - Introduction
giventocode
 
Taming the shrew Power BI
Taming the shrew Power BITaming the shrew Power BI
Taming the shrew Power BI
Kellyn Pot'Vin-Gorman
 
Geek Sync | Planning a SQL Server to Azure Migration in 2021 - Brent Ozar
Geek Sync | Planning a SQL Server to Azure Migration in 2021 - Brent OzarGeek Sync | Planning a SQL Server to Azure Migration in 2021 - Brent Ozar
Geek Sync | Planning a SQL Server to Azure Migration in 2021 - Brent Ozar
IDERA Software
 
Microsoft Database Options
Microsoft Database OptionsMicrosoft Database Options
Microsoft Database Options
David Chou
 
Accessing Google Cloud APIs
Accessing Google Cloud APIsAccessing Google Cloud APIs
Accessing Google Cloud APIs
wesley chun
 
Data Migration and Data-Tier Applications with SQL Azure
Data Migration and Data-Tier Applications with SQL AzureData Migration and Data-Tier Applications with SQL Azure
Data Migration and Data-Tier Applications with SQL Azure
Mark Kromer
 
Microsoft Azure Offerings and New Services
Microsoft Azure Offerings and New Services Microsoft Azure Offerings and New Services
Microsoft Azure Offerings and New Services
Mohamed Tawfik
 
Customer migration to azure sql database from on-premises SQL, for a SaaS app...
Customer migration to azure sql database from on-premises SQL, for a SaaS app...Customer migration to azure sql database from on-premises SQL, for a SaaS app...
Customer migration to azure sql database from on-premises SQL, for a SaaS app...
George Walters
 
SQL Server End Of Support
SQL Server End Of SupportSQL Server End Of Support
SQL Server End Of Support
Mariano Kovo
 
Developing with SQL Server Analysis Services 201310
Developing with SQL Server Analysis Services 201310Developing with SQL Server Analysis Services 201310
Developing with SQL Server Analysis Services 201310
Mark Tabladillo
 
Designing a modern data warehouse in azure
Designing a modern data warehouse in azure   Designing a modern data warehouse in azure
Designing a modern data warehouse in azure
Antonios Chatzipavlis
 
Jean-René Roy : The Modern DBA
Jean-René Roy : The Modern DBAJean-René Roy : The Modern DBA
Jean-René Roy : The Modern DBA
MSDEVMTL
 
Developing ssas cube
Developing ssas cubeDeveloping ssas cube
Developing ssas cube
Slava Kokaev
 
Azure - Data Platform
Azure - Data PlatformAzure - Data Platform
Azure - Data Platform
giventocode
 
SQL Server 2008 for Developers
SQL Server 2008 for DevelopersSQL Server 2008 for Developers
SQL Server 2008 for Developers
ukdpe
 
Data Platform Overview
Data Platform OverviewData Platform Overview
Data Platform Overview
Hamid J. Fard
 
SQL to Azure Migrations
SQL to Azure MigrationsSQL to Azure Migrations
SQL to Azure Migrations
Datavail
 
Azure Cosmos DB + Gremlin API in Action
Azure Cosmos DB + Gremlin API in ActionAzure Cosmos DB + Gremlin API in Action
Azure Cosmos DB + Gremlin API in Action
Denys Chamberland
 
Migrating on premises workload to azure sql database
Migrating on premises workload to azure sql databaseMigrating on premises workload to azure sql database
Migrating on premises workload to azure sql database
PARIKSHIT SAVJANI
 
An Introduction to Cloud Computing by Robert Grossman 08-06-09 (v19)
An Introduction to Cloud Computing by Robert Grossman 08-06-09 (v19)An Introduction to Cloud Computing by Robert Grossman 08-06-09 (v19)
An Introduction to Cloud Computing by Robert Grossman 08-06-09 (v19)
Robert Grossman
 
Azure Cloud Dev Camp - Introduction
Azure Cloud Dev Camp - IntroductionAzure Cloud Dev Camp - Introduction
Azure Cloud Dev Camp - Introduction
giventocode
 
Geek Sync | Planning a SQL Server to Azure Migration in 2021 - Brent Ozar
Geek Sync | Planning a SQL Server to Azure Migration in 2021 - Brent OzarGeek Sync | Planning a SQL Server to Azure Migration in 2021 - Brent Ozar
Geek Sync | Planning a SQL Server to Azure Migration in 2021 - Brent Ozar
IDERA Software
 
Microsoft Database Options
Microsoft Database OptionsMicrosoft Database Options
Microsoft Database Options
David Chou
 
Accessing Google Cloud APIs
Accessing Google Cloud APIsAccessing Google Cloud APIs
Accessing Google Cloud APIs
wesley chun
 
Data Migration and Data-Tier Applications with SQL Azure
Data Migration and Data-Tier Applications with SQL AzureData Migration and Data-Tier Applications with SQL Azure
Data Migration and Data-Tier Applications with SQL Azure
Mark Kromer
 
Microsoft Azure Offerings and New Services
Microsoft Azure Offerings and New Services Microsoft Azure Offerings and New Services
Microsoft Azure Offerings and New Services
Mohamed Tawfik
 
Customer migration to azure sql database from on-premises SQL, for a SaaS app...
Customer migration to azure sql database from on-premises SQL, for a SaaS app...Customer migration to azure sql database from on-premises SQL, for a SaaS app...
Customer migration to azure sql database from on-premises SQL, for a SaaS app...
George Walters
 
SQL Server End Of Support
SQL Server End Of SupportSQL Server End Of Support
SQL Server End Of Support
Mariano Kovo
 
Developing with SQL Server Analysis Services 201310
Developing with SQL Server Analysis Services 201310Developing with SQL Server Analysis Services 201310
Developing with SQL Server Analysis Services 201310
Mark Tabladillo
 
Designing a modern data warehouse in azure
Designing a modern data warehouse in azure   Designing a modern data warehouse in azure
Designing a modern data warehouse in azure
Antonios Chatzipavlis
 
Jean-René Roy : The Modern DBA
Jean-René Roy : The Modern DBAJean-René Roy : The Modern DBA
Jean-René Roy : The Modern DBA
MSDEVMTL
 
Developing ssas cube
Developing ssas cubeDeveloping ssas cube
Developing ssas cube
Slava Kokaev
 

Similar to SQL Server 2008 Data Mining (20)

CV Chandrajit Samanta
CV Chandrajit SamantaCV Chandrajit Samanta
CV Chandrajit Samanta
Chandrajit Samanta ([email protected])
 
BI in SQL Server 2008 for Architects
BI in SQL Server 2008 for ArchitectsBI in SQL Server 2008 for Architects
BI in SQL Server 2008 for Architects
llangit
 
Introduction To Sql Server Data Mining
Introduction To Sql Server Data MiningIntroduction To Sql Server Data Mining
Introduction To Sql Server Data Mining
Hugo Olivera Alonso
 
Bi2008 Plus Cloud Preview
Bi2008 Plus Cloud PreviewBi2008 Plus Cloud Preview
Bi2008 Plus Cloud Preview
llangit
 
Msbi online training
Msbi online trainingMsbi online training
Msbi online training
Glory IT Technologies Pvt. Ltd.
 
Analysis Services en SQL Server 2008
Analysis Services en SQL Server 2008Analysis Services en SQL Server 2008
Analysis Services en SQL Server 2008
Eduardo Castro
 
Samuel Bayeta
Samuel BayetaSamuel Bayeta
Samuel Bayeta
Sam B
 
Secrets of Enterprise Data Mining 201310
Secrets of Enterprise Data Mining 201310Secrets of Enterprise Data Mining 201310
Secrets of Enterprise Data Mining 201310
Mark Tabladillo
 
Introduction To Sql Services
Introduction To Sql ServicesIntroduction To Sql Services
Introduction To Sql Services
llangit
 
It ready dw_day3_rev00
It ready dw_day3_rev00It ready dw_day3_rev00
It ready dw_day3_rev00
Siwawong Wuttipongprasert
 
A Primer To Sybase Iq Development July 13
A Primer To Sybase Iq Development July 13A Primer To Sybase Iq Development July 13
A Primer To Sybase Iq Development July 13
sparkwan
 
MSBI Online Training in India
MSBI Online Training in IndiaMSBI Online Training in India
MSBI Online Training in India
united global soft
 
MSBI Online Training
MSBI Online Training MSBI Online Training
MSBI Online Training
united global soft
 
MSBI Online Training in Hyderabad
MSBI Online Training in HyderabadMSBI Online Training in Hyderabad
MSBI Online Training in Hyderabad
united global soft
 
Michael Liang Resume_Irvine_CA_ShortVersion
Michael Liang Resume_Irvine_CA_ShortVersionMichael Liang Resume_Irvine_CA_ShortVersion
Michael Liang Resume_Irvine_CA_ShortVersion
MICHAEL LIANG
 
Msbi online training
Msbi online trainingMsbi online training
Msbi online training
Divya Shree
 
Naveen CV
Naveen CVNaveen CV
Naveen CV
NaveenKumar GV
 
Cepta The Future of Data with Power BI
Cepta The Future of Data with Power BICepta The Future of Data with Power BI
Cepta The Future of Data with Power BI
Kellyn Pot'Vin-Gorman
 
Microsoft Data Science Technologies 201608
Microsoft Data Science Technologies 201608Microsoft Data Science Technologies 201608
Microsoft Data Science Technologies 201608
Mark Tabladillo
 
BRK2045_Upgrade-SQL-Server-2017-On-Prem-IaaS-and-PaaS_final.pptx
BRK2045_Upgrade-SQL-Server-2017-On-Prem-IaaS-and-PaaS_final.pptxBRK2045_Upgrade-SQL-Server-2017-On-Prem-IaaS-and-PaaS_final.pptx
BRK2045_Upgrade-SQL-Server-2017-On-Prem-IaaS-and-PaaS_final.pptx
Farid Adam
 
BI in SQL Server 2008 for Architects
BI in SQL Server 2008 for ArchitectsBI in SQL Server 2008 for Architects
BI in SQL Server 2008 for Architects
llangit
 
Introduction To Sql Server Data Mining
Introduction To Sql Server Data MiningIntroduction To Sql Server Data Mining
Introduction To Sql Server Data Mining
Hugo Olivera Alonso
 
Bi2008 Plus Cloud Preview
Bi2008 Plus Cloud PreviewBi2008 Plus Cloud Preview
Bi2008 Plus Cloud Preview
llangit
 
Analysis Services en SQL Server 2008
Analysis Services en SQL Server 2008Analysis Services en SQL Server 2008
Analysis Services en SQL Server 2008
Eduardo Castro
 
Samuel Bayeta
Samuel BayetaSamuel Bayeta
Samuel Bayeta
Sam B
 
Secrets of Enterprise Data Mining 201310
Secrets of Enterprise Data Mining 201310Secrets of Enterprise Data Mining 201310
Secrets of Enterprise Data Mining 201310
Mark Tabladillo
 
Introduction To Sql Services
Introduction To Sql ServicesIntroduction To Sql Services
Introduction To Sql Services
llangit
 
A Primer To Sybase Iq Development July 13
A Primer To Sybase Iq Development July 13A Primer To Sybase Iq Development July 13
A Primer To Sybase Iq Development July 13
sparkwan
 
MSBI Online Training in Hyderabad
MSBI Online Training in HyderabadMSBI Online Training in Hyderabad
MSBI Online Training in Hyderabad
united global soft
 
Michael Liang Resume_Irvine_CA_ShortVersion
Michael Liang Resume_Irvine_CA_ShortVersionMichael Liang Resume_Irvine_CA_ShortVersion
Michael Liang Resume_Irvine_CA_ShortVersion
MICHAEL LIANG
 
Msbi online training
Msbi online trainingMsbi online training
Msbi online training
Divya Shree
 
Cepta The Future of Data with Power BI
Cepta The Future of Data with Power BICepta The Future of Data with Power BI
Cepta The Future of Data with Power BI
Kellyn Pot'Vin-Gorman
 
Microsoft Data Science Technologies 201608
Microsoft Data Science Technologies 201608Microsoft Data Science Technologies 201608
Microsoft Data Science Technologies 201608
Mark Tabladillo
 
BRK2045_Upgrade-SQL-Server-2017-On-Prem-IaaS-and-PaaS_final.pptx
BRK2045_Upgrade-SQL-Server-2017-On-Prem-IaaS-and-PaaS_final.pptxBRK2045_Upgrade-SQL-Server-2017-On-Prem-IaaS-and-PaaS_final.pptx
BRK2045_Upgrade-SQL-Server-2017-On-Prem-IaaS-and-PaaS_final.pptx
Farid Adam
 

More from llangit (20)

2 Win7 For Devs Ux Touch Sensors
2 Win7 For Devs Ux Touch Sensors2 Win7 For Devs Ux Touch Sensors
2 Win7 For Devs Ux Touch Sensors
llangit
 
1 Win7 For Devs Fund Search
1 Win7 For Devs Fund Search1 Win7 For Devs Fund Search
1 Win7 For Devs Fund Search
llangit
 
3 Kodu
3 Kodu3 Kodu
3 Kodu
llangit
 
5 Digigirlz Xna
5 Digigirlz Xna5 Digigirlz Xna
5 Digigirlz Xna
llangit
 
4 Making Movies
4 Making Movies4 Making Movies
4 Making Movies
llangit
 
2 Digi Girlz Small Basic
2 Digi Girlz Small Basic2 Digi Girlz Small Basic
2 Digi Girlz Small Basic
llangit
 
1 Digi Girlz So Cal Databases Kims Final
1 Digi Girlz So Cal Databases Kims Final1 Digi Girlz So Cal Databases Kims Final
1 Digi Girlz So Cal Databases Kims Final
llangit
 
SQL Server 2008 for Developers
SQL Server 2008 for DevelopersSQL Server 2008 for Developers
SQL Server 2008 for Developers
llangit
 
SQL Server 2008 for .NET Developers
SQL Server 2008 for .NET DevelopersSQL Server 2008 for .NET Developers
SQL Server 2008 for .NET Developers
llangit
 
Tech Days09 Sqldev
Tech Days09 SqldevTech Days09 Sqldev
Tech Days09 Sqldev
llangit
 
Windows Azure for .NET Developers
Windows Azure for .NET DevelopersWindows Azure for .NET Developers
Windows Azure for .NET Developers
llangit
 
Making of GirlGamer
Making of GirlGamerMaking of GirlGamer
Making of GirlGamer
llangit
 
Kodu
KoduKodu
Kodu
llangit
 
DigiGirlz_SoCal_Databases
DigiGirlz_SoCal_DatabasesDigiGirlz_SoCal_Databases
DigiGirlz_SoCal_Databases
llangit
 
DigiGirlzSmallBasic
DigiGirlzSmallBasicDigiGirlzSmallBasic
DigiGirlzSmallBasic
llangit
 
The Role Of An Architect
The Role Of An ArchitectThe Role Of An Architect
The Role Of An Architect
llangit
 
SQL Server 2008 for Developers
SQL Server 2008 for DevelopersSQL Server 2008 for Developers
SQL Server 2008 for Developers
llangit
 
Parallel Programming and F#
Parallel Programming and F#Parallel Programming and F#
Parallel Programming and F#
llangit
 
BI2008newFeatures
BI2008newFeaturesBI2008newFeatures
BI2008newFeatures
llangit
 
Net35 Overview
Net35 OverviewNet35 Overview
Net35 Overview
llangit
 
2 Win7 For Devs Ux Touch Sensors
2 Win7 For Devs Ux Touch Sensors2 Win7 For Devs Ux Touch Sensors
2 Win7 For Devs Ux Touch Sensors
llangit
 
1 Win7 For Devs Fund Search
1 Win7 For Devs Fund Search1 Win7 For Devs Fund Search
1 Win7 For Devs Fund Search
llangit
 
5 Digigirlz Xna
5 Digigirlz Xna5 Digigirlz Xna
5 Digigirlz Xna
llangit
 
4 Making Movies
4 Making Movies4 Making Movies
4 Making Movies
llangit
 
2 Digi Girlz Small Basic
2 Digi Girlz Small Basic2 Digi Girlz Small Basic
2 Digi Girlz Small Basic
llangit
 
1 Digi Girlz So Cal Databases Kims Final
1 Digi Girlz So Cal Databases Kims Final1 Digi Girlz So Cal Databases Kims Final
1 Digi Girlz So Cal Databases Kims Final
llangit
 
SQL Server 2008 for Developers
SQL Server 2008 for DevelopersSQL Server 2008 for Developers
SQL Server 2008 for Developers
llangit
 
SQL Server 2008 for .NET Developers
SQL Server 2008 for .NET DevelopersSQL Server 2008 for .NET Developers
SQL Server 2008 for .NET Developers
llangit
 
Tech Days09 Sqldev
Tech Days09 SqldevTech Days09 Sqldev
Tech Days09 Sqldev
llangit
 
Windows Azure for .NET Developers
Windows Azure for .NET DevelopersWindows Azure for .NET Developers
Windows Azure for .NET Developers
llangit
 
Making of GirlGamer
Making of GirlGamerMaking of GirlGamer
Making of GirlGamer
llangit
 
DigiGirlz_SoCal_Databases
DigiGirlz_SoCal_DatabasesDigiGirlz_SoCal_Databases
DigiGirlz_SoCal_Databases
llangit
 
DigiGirlzSmallBasic
DigiGirlzSmallBasicDigiGirlzSmallBasic
DigiGirlzSmallBasic
llangit
 
The Role Of An Architect
The Role Of An ArchitectThe Role Of An Architect
The Role Of An Architect
llangit
 
SQL Server 2008 for Developers
SQL Server 2008 for DevelopersSQL Server 2008 for Developers
SQL Server 2008 for Developers
llangit
 
Parallel Programming and F#
Parallel Programming and F#Parallel Programming and F#
Parallel Programming and F#
llangit
 
BI2008newFeatures
BI2008newFeaturesBI2008newFeatures
BI2008newFeatures
llangit
 
Net35 Overview
Net35 OverviewNet35 Overview
Net35 Overview
llangit
 

Recently uploaded (20)

Procurement Insights Cost To Value Guide.pptx
Procurement Insights Cost To Value Guide.pptxProcurement Insights Cost To Value Guide.pptx
Procurement Insights Cost To Value Guide.pptx
Jon Hansen
 
ThousandEyes Partner Innovation Updates for May 2025
ThousandEyes Partner Innovation Updates for May 2025ThousandEyes Partner Innovation Updates for May 2025
ThousandEyes Partner Innovation Updates for May 2025
ThousandEyes
 
Increasing Retail Store Efficiency How can Planograms Save Time and Money.pptx
Increasing Retail Store Efficiency How can Planograms Save Time and Money.pptxIncreasing Retail Store Efficiency How can Planograms Save Time and Money.pptx
Increasing Retail Store Efficiency How can Planograms Save Time and Money.pptx
Anoop Ashok
 
How analogue intelligence complements AI
How analogue intelligence complements AIHow analogue intelligence complements AI
How analogue intelligence complements AI
Paul Rowe
 
Massive Power Outage Hits Spain, Portugal, and France: Causes, Impact, and On...
Massive Power Outage Hits Spain, Portugal, and France: Causes, Impact, and On...Massive Power Outage Hits Spain, Portugal, and France: Causes, Impact, and On...
Massive Power Outage Hits Spain, Portugal, and France: Causes, Impact, and On...
Aqusag Technologies
 
Linux Professional Institute LPIC-1 Exam.pdf
Linux Professional Institute LPIC-1 Exam.pdfLinux Professional Institute LPIC-1 Exam.pdf
Linux Professional Institute LPIC-1 Exam.pdf
RHCSA Guru
 
Generative Artificial Intelligence (GenAI) in Business
Generative Artificial Intelligence (GenAI) in BusinessGenerative Artificial Intelligence (GenAI) in Business
Generative Artificial Intelligence (GenAI) in Business
Dr. Tathagat Varma
 
Special Meetup Edition - TDX Bengaluru Meetup #52.pptx
Special Meetup Edition - TDX Bengaluru Meetup #52.pptxSpecial Meetup Edition - TDX Bengaluru Meetup #52.pptx
Special Meetup Edition - TDX Bengaluru Meetup #52.pptx
shyamraj55
 
Semantic Cultivators : The Critical Future Role to Enable AI
Semantic Cultivators : The Critical Future Role to Enable AISemantic Cultivators : The Critical Future Role to Enable AI
Semantic Cultivators : The Critical Future Role to Enable AI
artmondano
 
Enhancing ICU Intelligence: How Our Functional Testing Enabled a Healthcare I...
Enhancing ICU Intelligence: How Our Functional Testing Enabled a Healthcare I...Enhancing ICU Intelligence: How Our Functional Testing Enabled a Healthcare I...
Enhancing ICU Intelligence: How Our Functional Testing Enabled a Healthcare I...
Impelsys Inc.
 
tecnologias de las primeras civilizaciones.pdf
tecnologias de las primeras civilizaciones.pdftecnologias de las primeras civilizaciones.pdf
tecnologias de las primeras civilizaciones.pdf
fjgm517
 
How Can I use the AI Hype in my Business Context?
How Can I use the AI Hype in my Business Context?How Can I use the AI Hype in my Business Context?
How Can I use the AI Hype in my Business Context?
Daniel Lehner
 
Heap, Types of Heap, Insertion and Deletion
Heap, Types of Heap, Insertion and DeletionHeap, Types of Heap, Insertion and Deletion
Heap, Types of Heap, Insertion and Deletion
Jaydeep Kale
 
Mobile App Development Company in Saudi Arabia
Mobile App Development Company in Saudi ArabiaMobile App Development Company in Saudi Arabia
Mobile App Development Company in Saudi Arabia
Steve Jonas
 
Rusty Waters: Elevating Lakehouses Beyond Spark
Rusty Waters: Elevating Lakehouses Beyond SparkRusty Waters: Elevating Lakehouses Beyond Spark
Rusty Waters: Elevating Lakehouses Beyond Spark
carlyakerly1
 
AI EngineHost Review: Revolutionary USA Datacenter-Based Hosting with NVIDIA ...
AI EngineHost Review: Revolutionary USA Datacenter-Based Hosting with NVIDIA ...AI EngineHost Review: Revolutionary USA Datacenter-Based Hosting with NVIDIA ...
AI EngineHost Review: Revolutionary USA Datacenter-Based Hosting with NVIDIA ...
SOFTTECHHUB
 
HCL Nomad Web – Best Practices und Verwaltung von Multiuser-Umgebungen
HCL Nomad Web – Best Practices und Verwaltung von Multiuser-UmgebungenHCL Nomad Web – Best Practices und Verwaltung von Multiuser-Umgebungen
HCL Nomad Web – Best Practices und Verwaltung von Multiuser-Umgebungen
panagenda
 
Electronic_Mail_Attacks-1-35.pdf by xploit
Electronic_Mail_Attacks-1-35.pdf by xploitElectronic_Mail_Attacks-1-35.pdf by xploit
Electronic_Mail_Attacks-1-35.pdf by xploit
niftliyevhuseyn
 
TrsLabs - Fintech Product & Business Consulting
TrsLabs - Fintech Product & Business ConsultingTrsLabs - Fintech Product & Business Consulting
TrsLabs - Fintech Product & Business Consulting
Trs Labs
 
Cybersecurity Identity and Access Solutions using Azure AD
Cybersecurity Identity and Access Solutions using Azure ADCybersecurity Identity and Access Solutions using Azure AD
Cybersecurity Identity and Access Solutions using Azure AD
VICTOR MAESTRE RAMIREZ
 
Procurement Insights Cost To Value Guide.pptx
Procurement Insights Cost To Value Guide.pptxProcurement Insights Cost To Value Guide.pptx
Procurement Insights Cost To Value Guide.pptx
Jon Hansen
 
ThousandEyes Partner Innovation Updates for May 2025
ThousandEyes Partner Innovation Updates for May 2025ThousandEyes Partner Innovation Updates for May 2025
ThousandEyes Partner Innovation Updates for May 2025
ThousandEyes
 
Increasing Retail Store Efficiency How can Planograms Save Time and Money.pptx
Increasing Retail Store Efficiency How can Planograms Save Time and Money.pptxIncreasing Retail Store Efficiency How can Planograms Save Time and Money.pptx
Increasing Retail Store Efficiency How can Planograms Save Time and Money.pptx
Anoop Ashok
 
How analogue intelligence complements AI
How analogue intelligence complements AIHow analogue intelligence complements AI
How analogue intelligence complements AI
Paul Rowe
 
Massive Power Outage Hits Spain, Portugal, and France: Causes, Impact, and On...
Massive Power Outage Hits Spain, Portugal, and France: Causes, Impact, and On...Massive Power Outage Hits Spain, Portugal, and France: Causes, Impact, and On...
Massive Power Outage Hits Spain, Portugal, and France: Causes, Impact, and On...
Aqusag Technologies
 
Linux Professional Institute LPIC-1 Exam.pdf
Linux Professional Institute LPIC-1 Exam.pdfLinux Professional Institute LPIC-1 Exam.pdf
Linux Professional Institute LPIC-1 Exam.pdf
RHCSA Guru
 
Generative Artificial Intelligence (GenAI) in Business
Generative Artificial Intelligence (GenAI) in BusinessGenerative Artificial Intelligence (GenAI) in Business
Generative Artificial Intelligence (GenAI) in Business
Dr. Tathagat Varma
 
Special Meetup Edition - TDX Bengaluru Meetup #52.pptx
Special Meetup Edition - TDX Bengaluru Meetup #52.pptxSpecial Meetup Edition - TDX Bengaluru Meetup #52.pptx
Special Meetup Edition - TDX Bengaluru Meetup #52.pptx
shyamraj55
 
Semantic Cultivators : The Critical Future Role to Enable AI
Semantic Cultivators : The Critical Future Role to Enable AISemantic Cultivators : The Critical Future Role to Enable AI
Semantic Cultivators : The Critical Future Role to Enable AI
artmondano
 
Enhancing ICU Intelligence: How Our Functional Testing Enabled a Healthcare I...
Enhancing ICU Intelligence: How Our Functional Testing Enabled a Healthcare I...Enhancing ICU Intelligence: How Our Functional Testing Enabled a Healthcare I...
Enhancing ICU Intelligence: How Our Functional Testing Enabled a Healthcare I...
Impelsys Inc.
 
tecnologias de las primeras civilizaciones.pdf
tecnologias de las primeras civilizaciones.pdftecnologias de las primeras civilizaciones.pdf
tecnologias de las primeras civilizaciones.pdf
fjgm517
 
How Can I use the AI Hype in my Business Context?
How Can I use the AI Hype in my Business Context?How Can I use the AI Hype in my Business Context?
How Can I use the AI Hype in my Business Context?
Daniel Lehner
 
Heap, Types of Heap, Insertion and Deletion
Heap, Types of Heap, Insertion and DeletionHeap, Types of Heap, Insertion and Deletion
Heap, Types of Heap, Insertion and Deletion
Jaydeep Kale
 
Mobile App Development Company in Saudi Arabia
Mobile App Development Company in Saudi ArabiaMobile App Development Company in Saudi Arabia
Mobile App Development Company in Saudi Arabia
Steve Jonas
 
Rusty Waters: Elevating Lakehouses Beyond Spark
Rusty Waters: Elevating Lakehouses Beyond SparkRusty Waters: Elevating Lakehouses Beyond Spark
Rusty Waters: Elevating Lakehouses Beyond Spark
carlyakerly1
 
AI EngineHost Review: Revolutionary USA Datacenter-Based Hosting with NVIDIA ...
AI EngineHost Review: Revolutionary USA Datacenter-Based Hosting with NVIDIA ...AI EngineHost Review: Revolutionary USA Datacenter-Based Hosting with NVIDIA ...
AI EngineHost Review: Revolutionary USA Datacenter-Based Hosting with NVIDIA ...
SOFTTECHHUB
 
HCL Nomad Web – Best Practices und Verwaltung von Multiuser-Umgebungen
HCL Nomad Web – Best Practices und Verwaltung von Multiuser-UmgebungenHCL Nomad Web – Best Practices und Verwaltung von Multiuser-Umgebungen
HCL Nomad Web – Best Practices und Verwaltung von Multiuser-Umgebungen
panagenda
 
Electronic_Mail_Attacks-1-35.pdf by xploit
Electronic_Mail_Attacks-1-35.pdf by xploitElectronic_Mail_Attacks-1-35.pdf by xploit
Electronic_Mail_Attacks-1-35.pdf by xploit
niftliyevhuseyn
 
TrsLabs - Fintech Product & Business Consulting
TrsLabs - Fintech Product & Business ConsultingTrsLabs - Fintech Product & Business Consulting
TrsLabs - Fintech Product & Business Consulting
Trs Labs
 
Cybersecurity Identity and Access Solutions using Azure AD
Cybersecurity Identity and Access Solutions using Azure ADCybersecurity Identity and Access Solutions using Azure AD
Cybersecurity Identity and Access Solutions using Azure AD
VICTOR MAESTRE RAMIREZ
 

SQL Server 2008 Data Mining

  • 1. SSAS 2008 Data Mining Lynn Langit/MSDN Developer Evangelist Microsoft http://blogs.msdn.com/SoCalDevGal
  • 2. Session Prerequisites Working SQL Server 2008 Developer Understanding of OLAP concepts Working SQL Server Analysis Server 2005 Developer Interest in or basic knowledge of Data Mining concepts
  • 3. Objectives and Agenda Understand what, why, when & how of SQL Server 2008 Data Mining Examine the core functionality of the Data Mining Extensions Hear about the new and/or advanced functionality of Data Mining
  • 4. What and Why Data Mining? Predictive Analytics Presentation Exploration Discovery Passive Interactive Proactive Role of Software Business Insight Canned reporting Ad-hoc reporting OLAP Data mining
  • 5. Cubes vs. Data Mining
  • 6. DM - Scenarios to Tasks
  • 8. BI for Everyone Individual – Excel Project – Share Point
  • 9. Microsoft’s Predictive Analytics Data Mining SQL extensions (DMX) Application Developer Data Mining Specialist Microsoft Dynamics CRM Analytics Foundation SQL Server 2008 Business Intelligence Development Studio Microsoft SQL Server 2008 Analysis Services Information Worker Data Mining Add-ins for the 2007 Microsoft Office system Microsoft SQL Server 2008 Data Mining BI Analyst Custom Algorithms SQL Services Azure
  • 10. Data Mining Add-ins for Office 2007 Table Analysis Tools for Excel 2007 Data Mining Template for Visio 2007 Data Mining Client for Excel 2007 Information Worker BI Analyst Data Mining Specialist
  • 11. Microsoft Data Mining Lifecycle CRISP-DM SSAS (Data Mining) Excel SSAS (DSV) Query Excel SSIS SSAS SSRS Excel Your Apps SSIS SSAS Excel Data www.crisp-dm.org Business Understanding Data Understanding Data Preparation Modeling Evaluation Deployment
  • 12. Understand & Prepare specifics
  • 13. Demo 1 – Explore / Clean / Partition Data 2 – Prepare Data
  • 15. Demo 3 – Select algorithm 4 – Create model
  • 17. Demo 5 – Evaluate Model 6 – Deploy model 7- Update model 8 – Query model
  • 18. Data Mining – Logical Model algorithm Mining Model Mining Model Training Data DB data Client data Application data Data Mining Engine To Predict Predicted Data Mining Model DB data Client data Application data “ Just one row ” Data Mining Engine
  • 19. Data Mining - Physical Model Analysis Services Server Mining Model Data Mining Algorithm Data Source Your Application OLE DB/ ADOMD/ XMLA Deploy BI Dev Studio (Visual Studio) App Data
  • 20. Data Mining Interfaces – APIs XMLA Over TCP/IP XMLA Over HTTP Analysis Server (msmdsrv.exe) OLAP Data Mining Server ADOMD.NET .Net Stored Procedures Microsoft Algorithms Third Party Algorithms OLEDB for OLAP/DM ADO/DSO Any Platform, Any Device C++ App VB App .Net App AMO Any App ADOMD.NET WAN DM Interfaces
  • 21. Configuration & Deployment Model Creation/Management Database Administrators Session Mining Models Model Application Permissions on models Permissions on data sources Browse Copy to Excel Drillthrough Query Default Advanced Excel Services Manage models and structures Export/Import Rename Connection Database Trace
  • 22. Data Mining Extensions (DMX) CREATE MINING MODEL CreditRisk (CustID LONG KEY, Gender TEXT DISCRETE, Income LONG CONTINUOUS, Profession TEXT DISCRETE, Risk TEXT DISCRETE PREDICT) USING Microsoft_Decision_Trees INSERT INTO CreditRisk (CustId, Gender, Income, Profession, Risk) Select CustomerID, Gender, Income, Profession,Risk From Customers Select NewCustomers.CustomerID, CreditRisk.Risk, PredictProbability(CreditRisk.Risk) FROM CreditRisk PREDICTION JOIN NewCustomers ON CreditRisk.Gender=NewCustomer.Gender AND CreditRisk.Income=NewCustomer.Income AND CreditRisk.Profession=NewCustomer.Profession
  • 23. DMX Column Expressions Predictable Columns Source Data Columns Functions - Predict “ Workhorse” Discrete scalar values Continuous scalar values Associative nested tables Sequence nested tables Time Series Overloaded to PredictAssociation PredictSequence PredictTimeSeries PredictProbability PredictSupport PredictHistogram Cluster ClusterProbability GetNodeId IsInNode Arithmetic operators Stored Procedure Subselect Select from nested tables
  • 24. Demo – Data Mining & Excel 20007 integration
  • 25. Excel Functions* DMPREDICTTABLEROW ( Connection, ModelName, PredictionResult, TableRowRange [, string CommaSeparatedColumnNames] ) DMPREDICT ( Connection, Model, PredictionResult, Value1, Name1, [...,Value32, Name32] ) DMCONTENTQUERY (Connection, Model, PredictionResult [, WhereClause])
  • 26. DM in the Cloud Test Data Types Relational CSV SQL Services (Azure Services)
  • 27. Try it in the cloud…
  • 28. Analysis Results in the Cloud…
  • 30. New to SQL Server 2008 DM Microsoft Time Series algorithm improved ARIMA plus ARTxp method, and a blending algorithm = better results New prediction mode allows adding new data to time series models Holdout Support added Easily partition data into training and test sets that are stored in mining structure & available to query after processing Ability to build mining models based on filtered subsets added Results in less structures, i.e. can just filter existing Drillthrough functionality extended makes all mining structure columns available, not just columns included in the model allows you to build more compact models Cross-validation added allows users to quickly validate their modeling approach by automatically building temporary models and evaluating accuracy measures across K folds. The feature is available through a new cross-validation tab under Accuracy Charts in BIDS, in addition to being accessible programmatically via a stored procedure call.
  • 31. Summary Data Mining in SQL Server 2008 is mature, powerful and accessible Can use Excel 2007 Familiar client for BI – OLAP cubes AND Data Mining models Model Creators / Users Excel Data or Server Data SSAS and Excel both support the full DM Cycle Data Understanding & Data Preparation Modeling, Validation & Deployment SQL Services Incubations available now Data Mining from the Cloud More
  • 32. DM Webcasts Fri, 02 Nov 2007 MSDN Webcast: Build Smart Web Applications with SQL Server Data Mining (Level 200) Thu, 08 Nov 2007 MSDN Webcast: Building Adaptive Applications with SQL Server Data Mining (Level 300) Mon, 19 Nov 2007 MSDN Webcast: Extending and Customizing SQL Server Data Mining (Level 300) Fri, 30 Nov 2007 MSDN Webcast: Creating Visualizations for SQL Server Data Mining (Level 300) Thu, 01 Nov 2007 TechNet Webcast: Deliver Actionable Insight Throughout Your Organization with Data Mining (Part 1 of 3): Your First Project with SQL Server Data Mining (Level 200) Thu, 15 Nov 2007 TechNet Webcast: Deliver Actionable Insight Throughout Your Organization with Data Mining (Part 2 of 3): Understand SQL Server Data Mining Add-ins for the 2007 Office System (Level 200) Thu, 29 Nov 2007 TechNet Webcast: Deliver Actionable Insight Throughout Your Organization with Data Mining (Part 3 of 3): Use Predictive Intelligence to Create Smarter KPIs (Level 200)
  • 33. DM Resources Technical Communities, Webcasts, Blogs, Chats & User Groups http://www.microsoft.com/communities/default.mspx Microsoft Developer Network (MSDN) & TechNet http://microsoft.com/msdn http://microsoft.com/technet Trial Software and Virtual Labs http://www.microsoft.com/technet/downloads/trials/default.mspx Microsoft Learning and Certification http://www.microsoft.com/learning/default.mspx SQL Server Data Mining http://www.sqlserverdatamining.com http://www.microsoft.com/bi/bicapabilities/data-mining.aspx
  • 34. BI Resources from Lynn Langit http :// blogs.msdn.com/SoCalDevGal “ How Do I…BI?” screencast series on MSDN “ Smart Business Intelligence Solutions with Microsoft SQL Server 2008” MSPress Feb 2009 “ Foundations of SQL Server 2005 Business Intelligence ” APress April 2007
  • 35.