SlideShare a Scribd company logo
4
Most read
7
Most read
8
Most read
INTRODUCTION TO
DATA WAREHOUSING
By: Eng. Eyad R. Manaa
INTRODUCTION
• Data: Meaningful facts, text, graphics, images,
sound, video segments.
• Database: An organized collection of logically
related data.
• Information: Data processed to be useful in
decision making.
• Metadata: Data that describes data.
ADVANTAGES OF THE DATABASE APPROACH
• Data Independence/Reduced Maintenance
• Improved Data Sharing
• Increased Application Development Productivity
• Enforcement of Standards
• Improved Data Quality (Constraints)
• Better Data Accessibility/ Responsiveness
• Security, Backup/Recovery, Concurrency
PROBLEM:
HETEROGENEOUS INFORMATION SOURCES
“Heterogeneities are
everywhere” Personal
Databases
Digital Libraries
Scientific Databases
World
Wide
Web
 Different interfaces
 Different data representations
 Duplicate and inconsistent information
PROBLEM: DATA MANAGEMENT IN LARGE
ENTERPRISES
 fragmentation of informational systems
 Result of application (user)-driven
development of operational systems
Sales Administration Finance Manufacturing ...
Sales Planning
Stock Mngmt
...
Suppliers
...
Debt Mngmt
Num. Control
...
Inventory
SOLUTION: UNIFIED ACCESS TO DATA
Integration System
Collects and combines information
Provides integrated view, uniform user interface
Supports sharing
World
Wide
Web
Digital Libraries Scientific Databases
Personal
Databases
WHAT IS A DATA WAREHOUSE?
“A data warehouse is simply a single,
complete, and consistent store of data
obtained from a variety of sources and
made available to end users in a way they
can understand and use it in a business
context.”
WHAT IS A DATA WAREHOUSE?
“A DW is a
subject-oriented,
integrated,
time-varying,
non-volatile
collection of data that is used primarily in
organizational decision making.”
A DATA WAREHOUSE IS
Stored collection of diverse data
A solution to data integration problem
Single repository of information
Subject-oriented
Organized by subject, not by application
Used for analysis, data mining, etc.
Optimized differently from transaction-
oriented db
A DATA WAREHOUSE IS
Large volume of data (Gb, Tb)
Non-volatile
Historical
Time attributes are important
Updates infrequent
OLTP VS. OLAP
OLTP: On Line Transaction Processing
Describes processing at operational sites
OLAP: On Line Analytical Processing
Describes processing at warehouse
WAREHOUSE IS A SPECIALIZED DB
Standard DB (OLTP)
 Mostly updates
 Many small transactions
 Mb - Gb of data
 Current snapshot
 Raw data
 Thousands of users (e.g.,
clerical users)
Warehouse (OLAP)
Mostly reads
Queries are long and complex
Gb - Tb of data
History
Summarized data
Hundreds of users
(e.g., decision-
makers, analysts)
GENERIC WAREHOUSE ARCHITECTURE
Extractor/
Monitor
Extractor/
Monitor
Extractor/
Monitor
Integrator
Warehouse
Client Client
Design Phase
Maintenance
Loading
...
Metadata
Optimization
Query & Analysis
 ETL Concept
WAREHOUSING PROCESS
ETL CONCEPT
ETL CONCEPT
ISSUES IN DATA WAREHOUSING
Warehouse Design
Extraction
Wrappers, monitors (change detectors)
Integration
Cleansing & merging
Warehousing specification &
Maintenance
Optimizations

More Related Content

PPTX
DATA WAREHOUSING
Rishikese MR
 
PDF
Data warehouse
Ramkrishna bhagat
 
PPTX
Data warehouse
Yogendra Uikey
 
PPTX
Data warehouse
Sonali Chawla
 
PPT
Data warehouse
shachibattar
 
PPTX
Data Warehouse
MadhuriNigam1
 
PPTX
Data mart
Prachi Agarwal
 
PDF
Data Warehouse Tutorial For Beginners | Data Warehouse Concepts | Data Wareho...
Edureka!
 
DATA WAREHOUSING
Rishikese MR
 
Data warehouse
Ramkrishna bhagat
 
Data warehouse
Yogendra Uikey
 
Data warehouse
Sonali Chawla
 
Data warehouse
shachibattar
 
Data Warehouse
MadhuriNigam1
 
Data mart
Prachi Agarwal
 
Data Warehouse Tutorial For Beginners | Data Warehouse Concepts | Data Wareho...
Edureka!
 

What's hot (20)

PPTX
Data warehousing
Anshika Nigam
 
PPTX
OLAP & DATA WAREHOUSE
Zalpa Rathod
 
PPTX
Ppt
bullsrockr666
 
PPT
Introduction to Data Warehouse
Shanthi Mukkavilli
 
PPTX
Data warehousing
Shruti Dalela
 
PDF
Data warehousing
Juhi Mahajan
 
PPT
1.4 data warehouse
Krish_ver2
 
PDF
Data Warehousing
Karthik Srini B R
 
PPTX
Business intelligence ppt
sujithkylm007
 
PPT
Data warehouse
krishna kumar singh
 
PPT
Date warehousing concepts
pcherukumalla
 
PPTX
Business intelligence
Randy L. Archambault
 
PPSX
OLAP OnLine Analytical Processing
Walid Elbadawy
 
PPT
Datawarehousing and Business Intelligence
Prithwis Mukerjee
 
PDF
Data warehouse architecture
pcherukumalla
 
ODP
Introduction To Analytics
Alex Meadows
 
PPTX
Data Analytics
Srinimf-Slides
 
PPT
Data warehouse
Medma Infomatix (P) Ltd.
 
PPTX
Data modeling star schema
Sayed Ahmed
 
PDF
Introduction to Data Warehouse
SOMASUNDARAM T
 
Data warehousing
Anshika Nigam
 
OLAP & DATA WAREHOUSE
Zalpa Rathod
 
Introduction to Data Warehouse
Shanthi Mukkavilli
 
Data warehousing
Shruti Dalela
 
Data warehousing
Juhi Mahajan
 
1.4 data warehouse
Krish_ver2
 
Data Warehousing
Karthik Srini B R
 
Business intelligence ppt
sujithkylm007
 
Data warehouse
krishna kumar singh
 
Date warehousing concepts
pcherukumalla
 
Business intelligence
Randy L. Archambault
 
OLAP OnLine Analytical Processing
Walid Elbadawy
 
Datawarehousing and Business Intelligence
Prithwis Mukerjee
 
Data warehouse architecture
pcherukumalla
 
Introduction To Analytics
Alex Meadows
 
Data Analytics
Srinimf-Slides
 
Data warehouse
Medma Infomatix (P) Ltd.
 
Data modeling star schema
Sayed Ahmed
 
Introduction to Data Warehouse
SOMASUNDARAM T
 
Ad

Viewers also liked (20)

PPS
Introduction to Data Warehousing
Jason S
 
PPTX
DATA WAREHOUSING
King Julian
 
PPT
Data Warehousing and Data Mining
idnats
 
PPT
introduction to datawarehouse
kiran14360
 
PPTX
Data warehousing
Varun Jain
 
PPTX
introduction to data warehousing and mining
Rajesh Chandra
 
PPT
An introduction to data warehousing
Shahed Khalili
 
PPT
data warehousing
Jagnesh Chawla
 
PPT
Lecture 13
Shani729
 
PPT
Lecture 1
Shani729
 
PPTX
Introduction to Data Warehousing
Gurpreet Singh Sachdeva
 
PDF
Data Warehousing & Basic Architectural Framework
Dr. Sunil Kr. Pandey
 
DOC
Data Warehouse (ETL) testing process
Rakesh Hansalia
 
PPTX
Introduction to ETL process
Omid Vahdaty
 
PPTX
Database and types of databases
baabtra.com - No. 1 supplier of quality freshers
 
PPT
Data-ware Housing
Prof.Nilesh Magar
 
PPT
Data Mining and Data Warehousing
Aswathy S Nair
 
PDF
Data mining & data warehousing (ppt)
Harish Chand
 
PPTX
Types of database
faizan1712818
 
PPTX
Etl process in data warehouse
Komal Choudhary
 
Introduction to Data Warehousing
Jason S
 
DATA WAREHOUSING
King Julian
 
Data Warehousing and Data Mining
idnats
 
introduction to datawarehouse
kiran14360
 
Data warehousing
Varun Jain
 
introduction to data warehousing and mining
Rajesh Chandra
 
An introduction to data warehousing
Shahed Khalili
 
data warehousing
Jagnesh Chawla
 
Lecture 13
Shani729
 
Lecture 1
Shani729
 
Introduction to Data Warehousing
Gurpreet Singh Sachdeva
 
Data Warehousing & Basic Architectural Framework
Dr. Sunil Kr. Pandey
 
Data Warehouse (ETL) testing process
Rakesh Hansalia
 
Introduction to ETL process
Omid Vahdaty
 
Database and types of databases
baabtra.com - No. 1 supplier of quality freshers
 
Data-ware Housing
Prof.Nilesh Magar
 
Data Mining and Data Warehousing
Aswathy S Nair
 
Data mining & data warehousing (ppt)
Harish Chand
 
Types of database
faizan1712818
 
Etl process in data warehouse
Komal Choudhary
 
Ad

Similar to Introduction to Data Warehousing (20)

PPTX
presentationofism-complete-1-100227093028-phpapp01.pptx
vipush1
 
PPTX
Datawarehouse
Ashish Kargwal
 
PPTX
Data warehouse-complete-1-100227093028-phpapp01.pptx
ArunPatrick2
 
PDF
Dwbasics
Sailendra Behera
 
PPT
Final presentation
Dave Nawazish Ali
 
PDF
Data Mining is the process ofData Mining is the process ofData Mining is the ...
naveedabbas61
 
PPTX
Data warehouse
MR Z
 
PPT
Data Warehouse By Piyush
astronish
 
PPTX
Module 1_Data Warehousing Fundamentals.pptx
nikshaikh786
 
PPT
11667 Bitt I 2008 Lect4
ambujm
 
PPTX
DATA WAREHOUSING.2.pptx
GraceJoyMoleroCarwan
 
PPT
DATA WAREHOUSING
Sejal Gaikwad
 
PPTX
Data warehousing ppt
Ashish Kumar Thakur
 
PPT
11666 Bitt I 2008 Lect3
ambujm
 
PPT
Data ware housing - Introduction to data ware housing process.
Vibrant Technologies & Computers
 
PDF
Data Warehouse: A Primer
IJRTEMJOURNAL
 
PPTX
Data Warehousing – Core Concepts and Components
logeswarisaravanan
 
PDF
Cognos datawarehouse
ssuser7fc7eb
 
PPTX
WEEK 1 - Data mining and Warehouse.pptx
noblerexford
 
PPT
DW (1).ppt
RahulSingh986955
 
presentationofism-complete-1-100227093028-phpapp01.pptx
vipush1
 
Datawarehouse
Ashish Kargwal
 
Data warehouse-complete-1-100227093028-phpapp01.pptx
ArunPatrick2
 
Final presentation
Dave Nawazish Ali
 
Data Mining is the process ofData Mining is the process ofData Mining is the ...
naveedabbas61
 
Data warehouse
MR Z
 
Data Warehouse By Piyush
astronish
 
Module 1_Data Warehousing Fundamentals.pptx
nikshaikh786
 
11667 Bitt I 2008 Lect4
ambujm
 
DATA WAREHOUSING.2.pptx
GraceJoyMoleroCarwan
 
DATA WAREHOUSING
Sejal Gaikwad
 
Data warehousing ppt
Ashish Kumar Thakur
 
11666 Bitt I 2008 Lect3
ambujm
 
Data ware housing - Introduction to data ware housing process.
Vibrant Technologies & Computers
 
Data Warehouse: A Primer
IJRTEMJOURNAL
 
Data Warehousing – Core Concepts and Components
logeswarisaravanan
 
Cognos datawarehouse
ssuser7fc7eb
 
WEEK 1 - Data mining and Warehouse.pptx
noblerexford
 
DW (1).ppt
RahulSingh986955
 

Recently uploaded (20)

PPTX
IT Runs Better with ThousandEyes AI-driven Assurance
ThousandEyes
 
PDF
A Strategic Analysis of the MVNO Wave in Emerging Markets.pdf
IPLOOK Networks
 
PDF
Trying to figure out MCP by actually building an app from scratch with open s...
Julien SIMON
 
PPTX
New ThousandEyes Product Innovations: Cisco Live June 2025
ThousandEyes
 
PDF
Presentation about Hardware and Software in Computer
snehamodhawadiya
 
PDF
Tea4chat - another LLM Project by Kerem Atam
a0m0rajab1
 
PDF
Responsible AI and AI Ethics - By Sylvester Ebhonu
Sylvester Ebhonu
 
PDF
GDG Cloud Munich - Intro - Luiz Carneiro - #BuildWithAI - July - Abdel.pdf
Luiz Carneiro
 
PDF
Structs to JSON: How Go Powers REST APIs
Emily Achieng
 
PDF
Orbitly Pitch Deck|A Mission-Driven Platform for Side Project Collaboration (...
zz41354899
 
PPTX
AI in Daily Life: How Artificial Intelligence Helps Us Every Day
vanshrpatil7
 
PDF
CIFDAQ's Market Wrap : Bears Back in Control?
CIFDAQ
 
PDF
Accelerating Oracle Database 23ai Troubleshooting with Oracle AHF Fleet Insig...
Sandesh Rao
 
PPTX
The Future of AI & Machine Learning.pptx
pritsen4700
 
PDF
Research-Fundamentals-and-Topic-Development.pdf
ayesha butalia
 
PPTX
Introduction to Flutter by Ayush Desai.pptx
ayushdesai204
 
PPTX
OA presentation.pptx OA presentation.pptx
pateldhruv002338
 
PDF
How ETL Control Logic Keeps Your Pipelines Safe and Reliable.pdf
Stryv Solutions Pvt. Ltd.
 
PDF
AI Unleashed - Shaping the Future -Starting Today - AIOUG Yatra 2025 - For Co...
Sandesh Rao
 
PDF
Unlocking the Future- AI Agents Meet Oracle Database 23ai - AIOUG Yatra 2025.pdf
Sandesh Rao
 
IT Runs Better with ThousandEyes AI-driven Assurance
ThousandEyes
 
A Strategic Analysis of the MVNO Wave in Emerging Markets.pdf
IPLOOK Networks
 
Trying to figure out MCP by actually building an app from scratch with open s...
Julien SIMON
 
New ThousandEyes Product Innovations: Cisco Live June 2025
ThousandEyes
 
Presentation about Hardware and Software in Computer
snehamodhawadiya
 
Tea4chat - another LLM Project by Kerem Atam
a0m0rajab1
 
Responsible AI and AI Ethics - By Sylvester Ebhonu
Sylvester Ebhonu
 
GDG Cloud Munich - Intro - Luiz Carneiro - #BuildWithAI - July - Abdel.pdf
Luiz Carneiro
 
Structs to JSON: How Go Powers REST APIs
Emily Achieng
 
Orbitly Pitch Deck|A Mission-Driven Platform for Side Project Collaboration (...
zz41354899
 
AI in Daily Life: How Artificial Intelligence Helps Us Every Day
vanshrpatil7
 
CIFDAQ's Market Wrap : Bears Back in Control?
CIFDAQ
 
Accelerating Oracle Database 23ai Troubleshooting with Oracle AHF Fleet Insig...
Sandesh Rao
 
The Future of AI & Machine Learning.pptx
pritsen4700
 
Research-Fundamentals-and-Topic-Development.pdf
ayesha butalia
 
Introduction to Flutter by Ayush Desai.pptx
ayushdesai204
 
OA presentation.pptx OA presentation.pptx
pateldhruv002338
 
How ETL Control Logic Keeps Your Pipelines Safe and Reliable.pdf
Stryv Solutions Pvt. Ltd.
 
AI Unleashed - Shaping the Future -Starting Today - AIOUG Yatra 2025 - For Co...
Sandesh Rao
 
Unlocking the Future- AI Agents Meet Oracle Database 23ai - AIOUG Yatra 2025.pdf
Sandesh Rao
 

Introduction to Data Warehousing

  • 2. INTRODUCTION • Data: Meaningful facts, text, graphics, images, sound, video segments. • Database: An organized collection of logically related data. • Information: Data processed to be useful in decision making. • Metadata: Data that describes data.
  • 3. ADVANTAGES OF THE DATABASE APPROACH • Data Independence/Reduced Maintenance • Improved Data Sharing • Increased Application Development Productivity • Enforcement of Standards • Improved Data Quality (Constraints) • Better Data Accessibility/ Responsiveness • Security, Backup/Recovery, Concurrency
  • 4. PROBLEM: HETEROGENEOUS INFORMATION SOURCES “Heterogeneities are everywhere” Personal Databases Digital Libraries Scientific Databases World Wide Web  Different interfaces  Different data representations  Duplicate and inconsistent information
  • 5. PROBLEM: DATA MANAGEMENT IN LARGE ENTERPRISES  fragmentation of informational systems  Result of application (user)-driven development of operational systems Sales Administration Finance Manufacturing ... Sales Planning Stock Mngmt ... Suppliers ... Debt Mngmt Num. Control ... Inventory
  • 6. SOLUTION: UNIFIED ACCESS TO DATA Integration System Collects and combines information Provides integrated view, uniform user interface Supports sharing World Wide Web Digital Libraries Scientific Databases Personal Databases
  • 7. WHAT IS A DATA WAREHOUSE? “A data warehouse is simply a single, complete, and consistent store of data obtained from a variety of sources and made available to end users in a way they can understand and use it in a business context.”
  • 8. WHAT IS A DATA WAREHOUSE? “A DW is a subject-oriented, integrated, time-varying, non-volatile collection of data that is used primarily in organizational decision making.”
  • 9. A DATA WAREHOUSE IS Stored collection of diverse data A solution to data integration problem Single repository of information Subject-oriented Organized by subject, not by application Used for analysis, data mining, etc. Optimized differently from transaction- oriented db
  • 10. A DATA WAREHOUSE IS Large volume of data (Gb, Tb) Non-volatile Historical Time attributes are important Updates infrequent
  • 11. OLTP VS. OLAP OLTP: On Line Transaction Processing Describes processing at operational sites OLAP: On Line Analytical Processing Describes processing at warehouse
  • 12. WAREHOUSE IS A SPECIALIZED DB Standard DB (OLTP)  Mostly updates  Many small transactions  Mb - Gb of data  Current snapshot  Raw data  Thousands of users (e.g., clerical users) Warehouse (OLAP) Mostly reads Queries are long and complex Gb - Tb of data History Summarized data Hundreds of users (e.g., decision- makers, analysts)
  • 13. GENERIC WAREHOUSE ARCHITECTURE Extractor/ Monitor Extractor/ Monitor Extractor/ Monitor Integrator Warehouse Client Client Design Phase Maintenance Loading ... Metadata Optimization Query & Analysis
  • 17. ISSUES IN DATA WAREHOUSING Warehouse Design Extraction Wrappers, monitors (change detectors) Integration Cleansing & merging Warehousing specification & Maintenance Optimizations