0% found this document useful (0 votes)
211 views

Data Warehousing - AIS

A data warehouse is a large database used for analysis rather than transactions. It contains extracted, cleansed, and transformed data from operational databases and other sources. The data is organized for quick searching, retrieval, and querying. The five stages of populating a data warehouse are: 1) modeling the data, 2) extracting from operational databases, 3) cleansing the extracted data, 4) transforming the data, and 5) loading it into the warehouse database. Data mining tools like neural networks and association analysis are then used to discover patterns in the data warehouse.

Uploaded by

Arlene Garcia
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
211 views

Data Warehousing - AIS

A data warehouse is a large database used for analysis rather than transactions. It contains extracted, cleansed, and transformed data from operational databases and other sources. The data is organized for quick searching, retrieval, and querying. The five stages of populating a data warehouse are: 1) modeling the data, 2) extracting from operational databases, 3) cleansing the extracted data, 4) transforming the data, and 5) loading it into the warehouse database. Data mining tools like neural networks and association analysis are then used to discover patterns in the data warehouse.

Uploaded by

Arlene Garcia
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 12

DATA

WAREHOUSING
DATA WAREHOUSE
 A multi-dimensional database often using hundreds of
gigabytes or even terabytes of memory
 Data are extracted periodically from operational
databases or from public information services.

 A database constructed for quick searching, retrieval, ad-


hoc queries, and ease of use
DATA WAREHOUSE
 ERP systems can exist without data warehouses.
 However, most large ERP implementations include separate
operational and data warehouse databases.

 Otherwise, management data analysis may result in pulling


system resources away from operational use.

 Also, there are many sophisticated data-mining tools.


DATA MINING TOOLS
 Neural Computing- an approach by which historical data
can be examined for patterns.
 Intelligent Agents- an approach to retrieve information
from the internet or from intranet-based databases.
 Association Analysis- uses a specialized set of algorithms
that sort through large data sets and expresses statistical
rules among items.
DATA WAREHOUSE PROCESS
The five stages of the data warehousing process:
1. modeling data for the data warehouse
2. extracting data from operational databases
3. cleansing extracted data
4. transforming data into the warehouse model
5. loading data into the data warehouse database
DATA WAREHOUSE PROCESS
(STAGE 1)
 Modeling data for the data warehouse
 Focuses on data normalization.
 Relational theory does not apply to a data warehousing system.
 Normalized tables pertaining to selected events may be
consolidated into de-normalized tables.
DATA WAREHOUSE PROCESS
(STAGE 2)
 Extracting data from operational databases
 The process of collecting data from operational
databases, flat-files, archives, and external data sources.
 Snapshots vs. stabilized data
 A key feature of a data warehouse is that the data contained in it
are in a non-volatile (stable) state.
DATA WAREHOUSE PROCESS
(STAGE 3)
 Cleansing extracted data
 Involves filtering out or repairing invalid data prior to
being stored in the warehouse
 Operational data are “dirty” for many reasons: clerical, data
entry, computer program errors, misspelled names and blank
fields.
 Also involves transforming data into standard business
terms with standard data values
DATA WAREHOUSE PROCESS
(STAGE 4)
 Transforming data into the warehouse model
 To improve efficiency, data are transformed into
summary views before being loaded.
 Unlike operational views, which are virtual in nature
with underlying base tables, data warehouse views are
physical tables.
 OLAP permits users to construct virtual views.
DATA WAREHOUSE PROCESS
(STAGE 5)
 Loading data into the data warehouse database
 Data warehouses must be created & maintained
separately from the operational databases.
 internal efficiency
 integration of legacy systems
 consolidation of global data
APPLICATIONS OF DATA
MINING

You might also like