Week 5 Chapter 6
Week 5 Chapter 6
• Warehouse Storage
• Data marts
• Metadata
• Information Access/Delivery
• Report writers
• Query processors
• OLAP
• Alert systems
• DSS applications
• Data mining
Data Storage Specifications - 1
• If your company is adopting the top-down approach of developing
the data warehouse, then you have to define the storage
specifications for
• The data staging area
• The overall corporate data warehouse
• Each of the dependent data marts, beginning with the first
• Any multidimensional databases for OLAP
• Alternatively, if your company opts for the bottom-up approach, you
need specifications for
• The data staging area
• Each of the conformed data marts, beginning with the first
• Any multidimensional databases for OLAP
• Typically, the overall corporate data warehouse will be based on the
relational model supported by a relational database management
system (RDBMS).
Data Storage Specifications - 2
• Whatever your choice of the database management system may be,
that system will have to interact with back-end and front-end tools.
• The back-end tools are the products for data transformation, data
cleansing, and data loading. The front-end tools relate to information
delivery to the users.
• While defining requirements, bear in mind their influence on data
storage specifications and collect all the necessary details about the
back-end and the frontend architectural components
DBMS Selection:
• The following elements of business requirements affect the choice of
the DBMS:
1. Level of user experience. If the users are totally inexperienced with
database systems, the DBMS must have features to monitor and control
runaway queries. DBMS must support an easy SQL-type language interface.
Data Storage Specifications - 3
2. Types of queries. The DBMS must have a powerful optimizer if most of
the queries are complex and produce large result sets.
3. Need for openness. The degree of openness depends on the back-end
and front-end architectural components and those, in turn, depend on the
business requirements.
4. Data loads. The data volumes and load frequencies determine the
strengths in the areas of data loading, recovery, and restart.
5. Metadata management. Let your requirements definition reflect the type
and extent of the metadata framework.
6. Data repository locations. Is your data warehouse going to reside in one
central location, or is it going to be distributed? The answer to this question
will establish whether the selected DBMS must support distributed
databases.
7. Data warehouse growth. Your business requirements definition must
contain information on the estimated growth in the number of users, and in
the number and complexity of queries. The growth estimates will have a
direct relation to how the selected DBMS supports scalability.
Data Storage Specifications - 4
Storage Sizing
• You need to estimate the storage sizes for the following in the
requirements definition phase:
• Data staging area. Calculate storage estimates for data staging area
from sizes of source system data structures for each business subject.
• Overall corporate data warehouse. For each business subject, list
the various attributes, estimate their field lengths, and arrive at the
calculation for the storage needed for that subject.
• Data Marts, dependent or conformed. Use the details of the
business dimensions and business measures found in the information
diagrams to estimate the storage size for the data marts.
• Multidimensional databases. Work out the details of. OLAP planned
for your users and then use those details to estimate storage for these
multidimensional databases.
Information Delivery Strategy - 1
• The impact of business requirements on the information delivery
mechanism in a data warehouse is straightforward.
• During the requirements definition phase, users tell you what
information they want to retrieve from the data warehouse.
• You record these requirements in the requirements definition
document.
• You then provide all the desired features and content in the
information delivery component.
• Although the impact appears to be straightforward and simple, there
are several issues to be considered.
• Many different aspects of the requirements impact various elements of
the information delivery component in different ways.
Information Delivery Strategy - 2
• The broad areas of the information delivery component directly
impacted by business requirements are:
• Queries and reports
• Types of analysis
• Information distribution
• Decision support applications
• Growth and expansion