14 SQL SERVER: INTRODUCTION TO DATA MINING USING SQL SERVER

What is Data Mining?
Data mining is the process of analyzing a data set to find patterns.
Data mining can also be defined as deriving knowledge from raw data.

Aliases
Data mining is also known by the following terms:

Importance of Data Mining
The amount of data in the contemporary world is enormous. By studying this data and understanding its trends and patterns, one can understand the underlying system better. Data mining can lead to conclusions that are profitable for an organization, or to decisions that help, for example, a librarian manage books better.
Pervasiveness of data:
  CRM (Customer Relationship Management)
  ERP (Enterprise Resource Planning)
  Database servers
  Data pool
  Web server logs

Data Mining
The traditional SQL queries that we have learnt so far follow the method of ‘querying’ and then, based upon the response, ‘exploring’ the system further.
[Diagrams: Query and Exploration Method; Data Mining Method]
The data mining methodology hence takes the opposite direction to that of query methods.
Here, the important attribute on which the analysis is based is the ‘name’. Hence, it is called the class.

Applications
The applications of data mining cover a wide domain. Data mining can be applied wherever data is involved. Some of the real-world applications of data mining are as follows:

Algorithms for Data Mining
Data mining systems utilize a wide variety of algorithms. The four common algorithm types are:

Tasks Involved in Data Mining
The process of data mining is divided into the following tasks:
  Classification
  Clustering
  Association
  Regression
  Forecasting
Let us have a look at them.

Classification
Classification is the process of grouping items into meaningful groups. The groups are later treated as single elements, and the relations between the groups are analyzed. Simply put, it is the task of assigning a group to each case.
[Example figure: Data Set]

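As a rough illustration of "assigning a group to each case", the sketch below labels a new case with the class of its closest already-labeled case (a 1-nearest-neighbour rule). The feature values, class names, and the function name `classify` are hypothetical and chosen only for illustration; this is not one of the SQL Server data mining algorithms.

```python
import math

# Hypothetical labeled cases: (age, yearly income in thousands) -> class label.
TRAINING_CASES = [
    ((25, 30), "student"),
    ((27, 35), "student"),
    ((45, 90), "professional"),
    ((50, 110), "professional"),
]


def classify(case):
    """Assign the class of the closest labeled case (1-nearest-neighbour)."""
    _, nearest_label = min(
        TRAINING_CASES, key=lambda item: math.dist(item[0], case)
    )
    return nearest_label


if __name__ == "__main__":
    print(classify((26, 32)))
    print(classify((48, 100)))
```

The class label plays the role of the target attribute: every new case ends up in exactly one of the known groups.
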
Clustering
Clustering is the process of grouping data items based on some attributes.
[Example figure: Data Set, clustered based on nearness]

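Clustering "based on nearness" can be sketched in one dimension: sort the values and start a new cluster whenever the gap to the previous value is too large. The function name `cluster_by_nearness` and the `max_gap` threshold are assumptions made for this sketch; it is a simple gap-based grouping, not the clustering algorithm shipped with SQL Server.

```python
def cluster_by_nearness(values, max_gap):
    """Group values so that neighbours within max_gap share a cluster."""
    clusters = []
    for value in sorted(values):
        # Extend the current cluster if this value is near its last member.
        if clusters and value - clusters[-1][-1] <= max_gap:
            clusters[-1].append(value)
        else:
            # Otherwise the value is too far away: open a new cluster.
            clusters.append([value])
    return clusters


if __name__ == "__main__":
    print(cluster_by_nearness([10, 1, 3, 25, 2, 11], max_gap=2))
```

Note that no group labels are supplied here: the groups emerge from the data itself.
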
Data Mining Algorithms
Data mining is a complex methodology which needs advanced algorithms operating on useful data.
Data mining algorithms are mainly divided into 2 types:
  Supervised algorithms
  Unsupervised algorithms
In a supervised algorithm, the system needs a target (which may be a set of attributes) to learn against.
An unsupervised algorithm, in contrast, has no such target and iterates until the boundaries of the problem are reached.

Regression and Forecasting
REGRESSION: In some problems, instead of looking for patterns that describe prime attributes (classes), we look for patterns in numerical values.
There are 2 types of regression:
  1. Linear regression
  2. Logistic regression
Regression is used to solve many business problems, as well as for predicting sea-wave patterns, temperature, air pressure, and humidity.
FORECASTING: As the name suggests, it is the foretelling of future data from the data that currently exists.
E.g.: Election results forecast

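To make the idea of "patterns in numerical values" concrete, here is a minimal sketch of linear regression using the ordinary least-squares formulas, followed by a one-step forecast. The temperature readings and the name `fit_line` are hypothetical; the sketch only illustrates the principle and is not how SQL Server implements regression.

```python
def fit_line(xs, ys):
    """Return (slope, intercept) of the least-squares line through the points."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Covariance-like and variance-like sums around the means.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical readings: hour of day vs. temperature in degrees Celsius.
hours = [1, 2, 3, 4]
temps = [12.0, 14.0, 16.0, 18.0]

slope, intercept = fit_line(hours, temps)
# Forecast: extend the fitted line to an hour that has not been observed yet.
forecast_hour_5 = slope * 5 + intercept

if __name__ == "__main__":
    print(slope, intercept, forecast_hour_5)
```

Logistic regression follows the same fitting idea but predicts the probability of a yes/no outcome rather than a numerical value.
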
Steps to Take
The process of data mining consists of various steps, which are listed below:
  Data Collection: Collect data.
  Data Cleaning: Eliminate unwanted, irrelevant and wrong data.
  Data Transformation: Change data into a form that can be used for data mining. The types of data transformation are:
    Numerical Transformation
    Grouping
    Aggregation: Form groups of minute data items and handle them as aggregates. It makes the process much easier.
    Missing Value Handling: Predict missing values or eliminate all such values.
    Removing Outliers: Remove invalid data.
  Model Building: Build the data mining model.
  Model Assessment: Test with a large amount of data. If a model needs change, make it immediately.

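The cleaning and transformation steps listed above can be sketched as a small pipeline. Everything here is an assumption for illustration: the record layout, the rule that negative amounts are "wrong data", the outlier limit, and filling missing values with the mean. The sketch also removes outliers before filling missing values so that the mean is not skewed, which is a design choice of this example rather than an order prescribed by the slides.

```python
from statistics import mean

# Hypothetical raw records: (customer, purchase amount); None marks a missing value.
RAW = [
    ("ann", 20.0), ("bob", None), ("ann", 30.0),
    ("bob", 40.0), ("cid", -5.0), ("cid", 9000.0),
]


def clean(records):
    """Data cleaning: drop records whose amount is wrong (negative)."""
    return [(c, a) for c, a in records if a is None or a >= 0]


def remove_outliers(records, limit):
    """Removing outliers: drop amounts above a caller-chosen limit."""
    return [(c, a) for c, a in records if a is None or a <= limit]


def fill_missing(records):
    """Missing value handling: replace None with the mean of the known amounts."""
    fill = mean(a for _, a in records if a is not None)
    return [(c, fill if a is None else a) for c, a in records]


def aggregate(records):
    """Aggregation: total amount per customer."""
    totals = {}
    for customer, amount in records:
        totals[customer] = totals.get(customer, 0.0) + amount
    return totals


def prepare(records, limit=1000.0):
    """Run the steps in order and return data ready for model building."""
    return aggregate(fill_missing(remove_outliers(clean(records), limit)))


if __name__ == "__main__":
    print(prepare(RAW))
```

The resulting per-customer totals would then be the input to the model building step.
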
What to Do Next?
Microsoft Office 2007 supports a wide variety of data mining tools. Visit the site www.sqlserverdatamining.com and download the MS Access 2007 Add-on for data mining. Install the add-on.
Working with the Access 07 data mining tools will be handled in the next set of presentations.
Summary: Data mining

Visit more self-help tutorials
Steps involved:
  Pick a tutorial of your choice and browse through it at your own pace.
  The tutorials section is free, self-guiding and will not involve any additional support.
Visit us at www.dataminingtools.net