4 DWH - Process - Managers
4 DWH - Process - Managers
Process managers are responsible for maintaining the flow of data both into and out of the data
warehouse. There are three different types of process managers:
Load manager
Warehouse manager
Query manager
Perform simple transformations into structure similar to the one in the data warehouse.
Fast Load
In order to minimize the total load window, the data needs to be loaded into the warehouse
in the fastest possible time.
Gateway technology is not suitable, since they are inefficient when large data volumes are
involved.
Simple Transformations
While loading, it may be required to perform simple transformations. After completing simple
transformations, we can do complex checks. Suppose we are loading the EPOS sales transaction,
we need to perform the following checks:
Strip out all the columns that are not required within the warehouse.
Convert all the values to required data types.
Warehouse Manager
The warehouse manager is responsible for the warehouse management process. It consists of a
third-party system software, C programs, and shell scripts. The size and complexity of a warehouse
manager varies between specific solutions.
Creates indexes, business views, partition views against the base data.
Transforms and merges the source data of the temporary store into the published data
warehouse.
Archives the data that has reached the end of its captured life.
Note: A warehouse Manager analyzes query profiles to determine whether the index and
aggregations are appropriate.
Query Manager
The query manager is responsible for directing the queries to suitable tables. By directing the
queries to appropriate tables, it speeds up the query request and response process. In addition,
the query manager is responsible for scheduling the execution of the queries posted by the user.
It stores query profiles to allow the warehouse manager to determine which indexes and
aggregations are appropriate.
Loading [MathJax]/jax/output/HTML-CSS/jax.js