DATA WAREHOUSING AND DATA MINING Presented by: Anil Sharma, B-Tech (IT) MBA-A, Reg. No. 3470070100; Pankaj Jarial, B-Tech (IT) MBA-A, Reg. No. 3470070086
DATA WAREHOUSING Data warehousing is the combining of data from multiple sources into one comprehensive, easily manipulated database. Its primary aim is to provide businesses with analytics results from data mining, OLAP, scorecarding, and reporting.
NEED FOR DATA WAREHOUSING Information is now considered key to all kinds of work. Those who gather, analyze, understand, and act upon information are winners. Information has no limits, and it is very hard to collect from various sources, so we need a data warehouse from which we can get all the information.
TODAY'S BUSINESS INFORMATION
DATA WAREHOUSING INCLUDES: Retrieving data, analyzing data, extracting data, loading data, transforming data, managing data.
DATA WAREHOUSE ARCHITECTURE Data warehousing is designed to provide an architecture that makes corporate data accessible and useful to users. There is no right or wrong architecture. The worth of an architecture can be judged by its use and the concept behind it. Data warehouses can be architected in many different ways, depending on the specific needs of a business.
Typical Data Warehousing Environment
An operational data store (ODS) is a database used as a temporary storage area for a data warehouse. Its primary purpose is to handle data that is actively in use. An operational data store contains data that is constantly updated through the course of business operations.
ETL (Extract, Transform, Load) is used to copy data from: the ODS to the data warehouse staging area; the data warehouse staging area to the data warehouse; the data warehouse to a data mart. ETL extracts data, transforms the values of inconsistent data, cleanses "bad" data, filters data, and loads data into a target database.
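The extract, transform, and load steps described on this slide can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the sample rows, the `COUNTRY_CODES` mapping, and the function names are hypothetical, and a plain list stands in for the staging area.

```python
# Hypothetical source rows, as they might arrive from an ODS.
SOURCE_ROWS = [
    {"customer": " Alice ", "country": "IN", "amount": "120.50"},
    {"customer": "Bob", "country": "india", "amount": "80"},
    {"customer": "", "country": "US", "amount": "15.00"},     # "bad": no customer
    {"customer": "Carol", "country": "US", "amount": "n/a"},  # "bad": no amount
]

# Assumed mapping used to make inconsistent country values uniform.
COUNTRY_CODES = {"in": "IN", "india": "IN", "us": "US", "usa": "US"}


def extract(rows):
    """Extract: copy rows out of the source without modifying it."""
    return [dict(row) for row in rows]


def transform(rows):
    """Transform: standardize values and filter out rows that cannot be cleansed."""
    clean = []
    for row in rows:
        customer = row["customer"].strip()
        country = COUNTRY_CODES.get(row["country"].strip().lower())
        try:
            amount = float(row["amount"])
        except ValueError:
            amount = None
        if customer and country and amount is not None:
            clean.append({"customer": customer, "country": country, "amount": amount})
    return clean


def load(rows, target):
    """Load: append the cleansed rows to the target (a list standing in for a table)."""
    target.extend(rows)
    return len(rows)


staging_area = []
loaded = load(transform(extract(SOURCE_ROWS)), staging_area)
print(loaded, staging_area)
```

Keeping the three steps as separate functions mirrors the slide: the same pipeline shape could be reused for each hop (ODS to staging area, staging area to warehouse, warehouse to data mart) with a different source and target.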
The data warehouse staging area is a temporary location to which data from source systems is copied. It speeds up the data warehouse architecture, which is essential because the volume of data grows day by day.
The purpose of the data warehouse is to integrate corporate data. The amount of data in the data warehouse is massive. Data is stored at a very deep level of detail, which allows it to be grouped in ways not anticipated in advance. A data warehouse does not contain all the data in the organization; its purpose is to provide the base of data that the organization needs for strategic and tactical decision making.
ETL extracts data from the data warehouse and sends it to one or more data marts for users. A data mart acts as a shortcut to the data warehouse that saves time. It is just a partition of the data present in the data warehouse. Each data mart can contain different combinations of tables, columns, and rows from the enterprise data warehouse.
REASONS FOR CREATING A DATA MART Easy access to frequently needed data. Creates a collective view for a group of users. Improves user response time. Ease of creation. Lower cost than implementing a full data warehouse.
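As a rough illustration of a data mart being a partition of the warehouse, the sketch below uses Python's standard `sqlite3` module with an in-memory database. The table names `warehouse_sales` and `mart_north_sales`, the columns, and the sample rows are all hypothetical; the point is only that the mart holds a chosen subset of columns and rows.

```python
import sqlite3

# In-memory database standing in for the enterprise data warehouse.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE warehouse_sales "
    "(sale_id INTEGER, region TEXT, product TEXT, amount REAL, cost REAL)"
)
conn.executemany(
    "INSERT INTO warehouse_sales VALUES (?, ?, ?, ?, ?)",
    [
        (1, "North", "Widget", 100.0, 60.0),
        (2, "South", "Widget", 150.0, 90.0),
        (3, "North", "Gadget", 200.0, 120.0),
    ],
)

# The mart keeps only some columns (no cost) and some rows (North region).
conn.execute(
    """
    CREATE TABLE mart_north_sales AS
    SELECT sale_id, product, amount
    FROM warehouse_sales
    WHERE region = 'North'
    """
)

mart_rows = conn.execute(
    "SELECT sale_id, product, amount FROM mart_north_sales ORDER BY sale_id"
).fetchall()
print(mart_rows)
```

A second mart for another group of users would be built the same way with a different column list and filter, which is what "different combinations of tables, columns, and rows" amounts to in practice.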
DATA MINING The non-trivial extraction of implicit, previously unknown, and potentially useful information from  large  databases. –  Extremely large datasets –  Useful knowledge that can improve processes –  Cannot be done manually
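To make "implicit, previously unknown" information concrete, here is a minimal sketch of one common mining technique, counting which items frequently occur together in transactions. The slides do not name a specific technique, so this choice, the sample baskets, and the `frequent_pairs` helper are assumptions made for illustration.

```python
from collections import Counter
from itertools import combinations

# Hypothetical shopping baskets; a real dataset would be far larger.
TRANSACTIONS = [
    {"bread", "milk"},
    {"bread", "butter", "milk"},
    {"butter", "jam"},
    {"bread", "milk", "jam"},
]


def frequent_pairs(transactions, min_support):
    """Return item pairs whose share of transactions is at least min_support."""
    counts = Counter()
    for basket in transactions:
        # Sort so that each pair is always counted under the same key.
        for pair in combinations(sorted(basket), 2):
            counts[pair] += 1
    total = len(transactions)
    return {pair: n / total for pair, n in counts.items() if n / total >= min_support}


pairs = frequent_pairs(TRANSACTIONS, min_support=0.5)
print(pairs)
```

With four baskets this can be checked by hand, but the number of candidate pairs grows quickly with the number of items and rows, which is why the slide says the work cannot be done manually on large databases.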
Where Has It Come From?
Motivation Databases today are huge: – More than 1,000,000 entities/records/rows – From 10 to 10,000 fields/attributes/variables – Gigabytes and terabytes. Databases are growing at an unprecedented rate. The corporate world is a cut-throat world: – Decisions must be made rapidly – Decisions must be made with maximum knowledge
How does data mining work? Extract, transform, and load transaction data onto the data warehouse system. Store and manage the data in a multidimensional database system. Provide data access to business analysts and information technology professionals. Analyze the data with application software. Present the data in a useful format, such as a graph or table.
DATA MINING MEASURES Accuracy, clarity, dirty data, scalability, speed, validation
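The "store in a multidimensional system, analyze, present as a table" steps can be sketched as a roll-up over fact rows. This is an assumption-laden toy: the `FACTS` rows, the `DIMENSIONS` names, and the `roll_up` helper are hypothetical, and a real multidimensional database would do this work internally.

```python
from collections import defaultdict

# Hypothetical fact rows: (quarter, region, product, sales amount).
FACTS = [
    ("2008-Q1", "North", "Widget", 100.0),
    ("2008-Q1", "South", "Widget", 150.0),
    ("2008-Q2", "North", "Gadget", 200.0),
    ("2008-Q2", "North", "Widget", 50.0),
]
DIMENSIONS = ("quarter", "region", "product")


def roll_up(facts, keep):
    """Sum the sales amount over every dimension not listed in keep."""
    idx = [DIMENSIONS.index(name) for name in keep]
    totals = defaultdict(float)
    for fact in facts:
        key = tuple(fact[i] for i in idx)
        totals[key] += fact[-1]
    return dict(totals)


by_region = roll_up(FACTS, keep=("region",))
by_quarter_region = roll_up(FACTS, keep=("quarter", "region"))

# Present the result in a simple tabular format.
for key, total in sorted(by_quarter_region.items()):
    print(f"{key[0]:<8} {key[1]:<6} {total:>8.2f}")
```

Choosing a different `keep` tuple gives analysts a different view of the same stored facts without reloading the data.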
Typical Applications of Data Mining
ADVANTAGES OF DATA MINING Engineering and Technology, Medical Science, Business, Combating Terrorism, Games, Research and Development
Engineering and Technology In electrical power engineering, data mining is used for condition monitoring of high-voltage electrical equipment and for vibration monitoring and analysis of transformer on-load tap-changers. In education, it is used to concentrate knowledge.
Medical Science Data mining has been widely used in bioinformatics and genetics, relating DNA sequences to variability in disease susceptibility, which is very important for improving the diagnosis, prevention, and treatment of diseases.
BUSINESS Used in Customer Relationship Management applications. It translates data from customer to merchant accurately. Distributes business processes. A powerful tool for marketing.
Combating terrorism A concept used by Interpol to search records on terrorists. Applied in the Multistate Anti-Terrorism Information Exchange, the Secure Flight program, the Computer Assisted Passenger Prescreening System, and Semantic Enhancement.
Games Oracles for certain combinatorial games, also called tablebases (e.g. for 3x3 chess), have opened an area for data mining: the extraction of human-usable strategies. Berlekamp in dots-and-boxes and John Nunn in chess endgames are notable examples.
Research And Development Helps to develop search algorithms. Offers large libraries of graphing and visualisation software. Users can easily create optimal models.
List of top data-mining software vendors in 2008: Angoss Software, Infor CRM Epiphany, Portrait Software, SAS, G-Stat, SPSS, ThinkAnalytics, Unica, Viscovery
THANK YOU
