Copy of 2m unit2
What is ETL – ETL Vs ELT – Types of Data Warehouses – Data Warehouse Design and Modeling –
Delivery Process – Online Analytical Processing (OLAP) – Characteristics of OLAP – Online
Transaction Processing (OLTP) Vs OLAP – OLAP Operations – Types of OLAP – ROLAP Vs
MOLAP Vs HOLAP.
1.What is ETL?
The mechanism of extracting information from source systems and bringing it into the
data warehouse is commonly called ETL, which stands for Extraction, Transformation and
Loading.
2.What is Extraction?
o Extraction is the operation of extracting information from a source system for further use
in a data warehouse environment. This is the first stage of the ETL process.
o The extraction process is often one of the most time-consuming tasks in ETL.
3.Define cleansing?
4.Define Transformation?
Transformation is the core of the reconciliation phase. It converts records from their
operational source format into a particular data warehouse format. If we implement a three-layer
architecture, this phase outputs our reconciled data layer.
5.Define Loading?
Loading is the process of writing the data into the target database. During the load step,
it is necessary to ensure that the load is performed correctly and with as few resources as
possible. Loading can be carried out in two ways:
1. Refresh
2. Update
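The two loading modes can be sketched as follows. This is only a minimal illustration using Python's built-in sqlite3 module. It assumes that Refresh means the target table is completely rewritten and that Update means only changed or new rows are applied (done here as an upsert, which is an assumption). The table name sales_dw and the helper names load_refresh and load_update are hypothetical.

```python
import sqlite3

def make_warehouse():
    # In-memory database standing in for the data warehouse (hypothetical schema).
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE sales_dw (id INTEGER PRIMARY KEY, amount REAL)")
    return conn

def load_refresh(conn, rows):
    # Refresh: the target table is emptied and completely rewritten.
    with conn:
        conn.execute("DELETE FROM sales_dw")
        conn.executemany("INSERT INTO sales_dw (id, amount) VALUES (?, ?)", rows)

def load_update(conn, changed_rows):
    # Update: only changed or new source rows are applied; other rows are untouched.
    with conn:
        conn.executemany(
            "INSERT OR REPLACE INTO sales_dw (id, amount) VALUES (?, ?)",
            changed_rows,
        )

def contents(conn):
    return conn.execute("SELECT id, amount FROM sales_dw ORDER BY id").fetchall()

conn = make_warehouse()
load_refresh(conn, [(1, 10.0), (2, 20.0)])
after_refresh = contents(conn)
load_update(conn, [(2, 25.0), (3, 30.0)])
after_update = contents(conn)
print(after_refresh)
print(after_update)
```

After the update, row 1 is unchanged, row 2 carries the new amount, and row 3 is newly added.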
Process
o ETL: Data is transferred to the ETL server and moved back to the DB. High network bandwidth is required.
o ELT: Data remains in the DB except for cross-database loads (e.g. source to object).
Transformation
o ETL: Transformations are performed in the ETL server.
o ELT: Transformations are performed (in the source or) in the target.
Time-Maintenance
o ETL: It needs high maintenance, as you need to select the data to load and transform.
o ELT: Low maintenance, as data is always available.
Calculations
o ETL: Overwrites an existing column, or needs to append the dataset and push it to the target platform.
o ELT: Easily adds the calculated column to the existing table.
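The difference in where the transformation runs can be sketched as follows. This is a minimal illustration, not a production pipeline: plain Python code stands in for the ETL server, an in-memory SQLite database stands in for the target, and the sample records, table names, and function names are all hypothetical.

```python
import sqlite3

raw_rows = [("  alice ", "10"), ("BOB", "20")]  # hypothetical source records

def etl(rows):
    # ETL: transform outside the database (the "ETL server"), then load.
    cleaned = [(name.strip().lower(), int(amount)) for name, amount in rows]
    db = sqlite3.connect(":memory:")
    db.execute("CREATE TABLE sales (name TEXT, amount INTEGER)")
    db.executemany("INSERT INTO sales VALUES (?, ?)", cleaned)
    return db.execute("SELECT name, amount FROM sales ORDER BY name").fetchall()

def elt(rows):
    # ELT: load the raw data first, then transform inside the target with SQL.
    db = sqlite3.connect(":memory:")
    db.execute("CREATE TABLE raw_sales (name TEXT, amount TEXT)")
    db.executemany("INSERT INTO raw_sales VALUES (?, ?)", rows)
    db.execute(
        "CREATE TABLE sales AS "
        "SELECT lower(trim(name)) AS name, CAST(amount AS INTEGER) AS amount "
        "FROM raw_sales"
    )
    return db.execute("SELECT name, amount FROM sales ORDER BY name").fetchall()

etl_result = etl(raw_rows)
elt_result = elt(raw_rows)
print(etl_result)
print(elt_result)
```

Both routes produce the same cleaned table. In the ELT version the raw table also stays in the target, so further calculated columns can be derived there without extracting the data again.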
Approach
o Top-down: Breaks the vast problem into smaller subproblems.
o Bottom-up: Solves the essential low-level problems and integrates them into a higher one.
Architecture
o Top-down: Inherently architected, not a union of several data marts.
o Bottom-up: Inherently incremental; can schedule essential data marts first.
Storage
o Top-down: Single, central storage of information about the content.
o Bottom-up: Departmental information stored.
11.Benefits of OLAP?
1. OLAP helps managers make decisions by efficiently providing multidimensional views of
records, which increases their productivity.
2. OLAP functions are self-sufficient owing to the inherent flexibility they bring to the
organized databases.
1 Basic
o OLAP: It is used for data analysis.
o OLTP: It is used to manage a very large number of short online transactions.
3 Data Modification
o OLAP: It is mainly used for data reading.
o OLTP: It manages all insert, update and delete transactions.
5 Normalization
o OLAP: Tables in an OLAP database are not normalized.
o OLTP: Tables in an OLTP database are normalized.
13.Operations of OLAP?
Roll up
Drill down
Slice
Dice
Pivot
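The five operations listed above can be sketched on a tiny in-memory fact table. This is only an illustration in plain Python: the fact rows, the dimension names (year, quarter, city, product), and the aggregate helper are hypothetical, and a real OLAP server would run these operations against a cube or a warehouse rather than a list.

```python
from collections import defaultdict

# Hypothetical fact rows: (year, quarter, city, product, sales)
facts = [
    (2023, "Q1", "Delhi", "Phone", 100),
    (2023, "Q1", "Mumbai", "Phone", 80),
    (2023, "Q2", "Delhi", "Laptop", 150),
    (2024, "Q1", "Delhi", "Phone", 120),
    (2024, "Q1", "Mumbai", "Laptop", 90),
]
DIMS = {"year": 0, "quarter": 1, "city": 2, "product": 3}

def aggregate(rows, dims):
    # Sum the sales measure grouped by the given dimensions.
    totals = defaultdict(int)
    for row in rows:
        key = tuple(row[DIMS[d]] for d in dims)
        totals[key] += row[4]
    return dict(totals)

# Roll up: aggregate to a coarser level (year only).
roll_up = aggregate(facts, ["year"])
# Drill down: move to a finer level (year and quarter).
drill_down = aggregate(facts, ["year", "quarter"])
# Slice: fix one dimension to a single value.
slice_2023 = [r for r in facts if r[DIMS["year"]] == 2023]
# Dice: select a sub-cube using conditions on two or more dimensions.
dice = [
    r for r in facts
    if r[DIMS["city"]] == "Delhi" and r[DIMS["product"]] == "Phone"
]
# Pivot: rotate the view by swapping the order of the dimensions in the key.
by_city_product = aggregate(facts, ["city", "product"])
pivoted = {(p, c): v for (c, p), v in by_city_product.items()}

print(roll_up)
print(drill_down)
```

Roll up and drill down are inverses of each other along a dimension hierarchy, while slice and dice only filter the cube and pivot only changes how it is presented.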
14.Types of OLAP
o ROLAP
o MOLAP
o HOLAP
15.Define ROLAP
ROLAP servers are intermediate servers that sit between a relational back-end server and
user front-end tools.
They use a relational or extended-relational DBMS to store and manage warehouse data, and OLAP
middleware to provide the missing pieces.
A ROLAP architecture includes the following components:
o Database server.
o ROLAP server.
o Front-end tool.
ROLAP Vs MOLAP
Stands for
o ROLAP: Relational Online Analytical Processing.
o MOLAP: Multidimensional Online Analytical Processing.
Usage
o ROLAP: It is usually used when the data warehouse contains relational data.
o MOLAP: It is used when the data warehouse contains relational as well as non-relational data.
17.Define MOLAP?
19.Define HOLAP?
HOLAP incorporates the best features of MOLAP and ROLAP into a single
architecture. HOLAP systems store larger quantities of detailed data in relational
tables, while the aggregations are stored in pre-calculated cubes.
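The split described above can be sketched as follows. This is a minimal illustration under stated assumptions, not how any particular HOLAP product works: an in-memory SQLite table stands in for the relational store of detailed data, a Python dict stands in for the pre-calculated cube, and the table and function names are hypothetical.

```python
import sqlite3

# Relational store holds the detailed data (the ROLAP side).
db = sqlite3.connect(":memory:")
db.execute("CREATE TABLE sales (year INTEGER, city TEXT, amount INTEGER)")
db.executemany(
    "INSERT INTO sales VALUES (?, ?, ?)",
    [(2023, "Delhi", 100), (2023, "Mumbai", 80), (2024, "Delhi", 120)],
)

# Pre-calculated cube holds the aggregations (the MOLAP side).
cube = dict(db.execute("SELECT year, SUM(amount) FROM sales GROUP BY year"))

def total_for_year(year):
    # Summary query: answered directly from the pre-calculated cube.
    return cube[year]

def details_for_year(year):
    # Detail query: goes through to the relational table.
    return db.execute(
        "SELECT city, amount FROM sales WHERE year = ? ORDER BY city", (year,)
    ).fetchall()

print(total_for_year(2023))
print(details_for_year(2023))
```

Summary queries avoid scanning the detailed rows, while detail queries still have access to every record, which is the trade-off HOLAP aims for.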