Jeffrey A. Hoffer, Mary B. Prescott, Fred R. Mcfadden: Modern Database Management 10 Edition
Jeffrey A. Hoffer, Mary B. Prescott, Fred R. Mcfadden: Modern Database Management 10 Edition
10th Edition
Jeffrey A. Hoffer, Mary B. Prescott,
Fred R. McFadden
1
Requirement for Integrated, company-
wide view of high-quality information
(from different databases)
Separation of operational and
informational systems and data (for
improved performance)
◦ Operational systems is used to run the
business in real time, based on current data.
Also called system of record
◦ Information systems are designed to support
decision making based on historical data for
complex queries or data mining applications.
2
Why organizations need to bring data from various
system of records?
Answer: Speed and flexibility in decision making…
3
Data Mart: A data warehouse that is limited in
scope
4
Generic Two-Level Architecture
Independent Data Mart
Dependent Data Mart and Operational Data
Store
Logical Data Mart and @ctive Warehouse
Three-Layer architecture
5
Figure 09-2: Generic two-level architecture (4 steps)
L
One,
company-
wide
T warehouse
6
Figure 09-3: Independent data mart
Data marts:
Mini-warehouses, limited in scope
T
E
7
Why created?
◦ Short-term business objectives
◦ Lower-cost to implement
◦ Organizationally and politically easier to implement
◦ Due to size limitation of the adopted technology
Benefits
◦ We may work in series of small projects to prove
the concept of data warehousing
◦ Start getting benefits of data warehousing earlier
before waiting for all data to be centralized
8
1. Costly redundant data and processing due to
separate ETL process
2. Data marts may not be consistent with each other
3. Limited analysis (difficulty in making joins across
different platforms of separate data marts)
4. High cost of scaling for new data marts due to
expected changes in the technology over the time
5. High cost of making separate data marts consistent
with each other
9
Figure 09-4: ODS provides option for
Dependent data mart with operational data store obtaining current data
T
E Simpler data access
Single ETL for
enterprise data warehouse Dependent data marts
(EDW) loaded from EDW
10
1st and 2nd limitation is addresses by loading the
dependent data marts from an enterprise data ware
(EDW) house which is single control point / version of
truth.
Redundancy across DDM is planned and consistent
Often called Hub and Spoke approach.
The 3rd limitation is addressed by providing updatable
and current-valued operational data store (ODS) for
decision support processing to serve op users.
Contrary to EDW, the ODS doesn't contain history data.
ODS may be fed from ERP database and serves as
staging area from loading data to EDW.
11
This is practical for only
◦ Moderate size data warehouse or
◦ When using high-performance DW technology,
such as Teradata (NCR)
12
ODS and data warehouse
are one and the same
T
E
Near real-time ETL for Data marts are NOT separate databases,
@active Data Warehouse but logical views of the data warehouse
Easier to create new data marts
13