Copy of 2m unit2

2 marks with Answer

Uploaded by

M Kalaiselvi
Copyright
© © All Rights Reserved

CCS431 DATA WAREHOUSING

2 MARK QUESTION AND ANSWERS-UNIT2

UNIT II ETL AND OLAP TECHNOLOGY 6

What is ETL – ETL Vs ELT – Types of Data warehouses - Data warehouse Design and Modeling
-Delivery Process - Online Analytical Processing (OLAP) - Characteristics of OLAP – Online
Transaction Processing (OLTP) Vs OLAP - OLAP operations- Types of OLAP- ROLAP Vs
MOLAP Vs HOLAP.

1. What is ETL?

The mechanism of extracting information from source systems and bringing it into the
data warehouse is commonly called ETL, which stands for Extraction, Transformation and
Loading.

2. What is Extraction?

o Extraction is the operation of pulling information from a source system for further use
in a data warehouse environment. This is the first stage of the ETL process.
o The extraction process is often one of the most time-consuming tasks in ETL.

3. Define cleansing?

The cleansing stage is crucial in a data warehouse technique because it is supposed to
improve data quality. The primary data cleansing features found in ETL tools are rectification and
homogenization. They use specific dictionaries to rectify typing mistakes and to recognize
synonyms, as well as rule-based cleansing to enforce domain-specific rules and define
appropriate associations between values.
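The two cleansing features described above can be sketched as follows. This is only an illustration: the dictionaries, the field names (`age`, `country`, `pin_code`) and the rules are hypothetical, not taken from any particular ETL tool.

```python
# Hypothetical dictionaries; a real ETL tool would use much larger ones.
TYPO_FIXES = {"Chenai": "Chennai", "Bangalor": "Bangalore"}
SYNONYMS = {"Bengaluru": "Bangalore", "Madras": "Chennai"}

def cleanse_city(raw):
    """Rectify typing mistakes, then homogenize synonyms to one canonical name."""
    value = raw.strip().title()
    value = TYPO_FIXES.get(value, value)   # rectification
    return SYNONYMS.get(value, value)      # homogenization

def check_rules(record):
    """Rule-based cleansing: return the list of violated domain rules."""
    errors = []
    # Domain-specific rule on a single value.
    if not 0 <= record["age"] <= 120:
        errors.append("age out of range")
    # Rule on the association between two values.
    if record["country"] == "India" and len(record["pin_code"]) != 6:
        errors.append("Indian PIN code must have 6 digits")
    return errors
```

For example, `cleanse_city("madras")` and `cleanse_city(" chenai ")` both map to the same canonical value, so later stages see one spelling per city.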

4. Define Transformation?

Transformation is the core of the reconciliation phase. It converts records from their
operational source format into a particular data warehouse format. If we implement a three-layer
architecture, this phase outputs our reconciled data layer.

5. Define Loading?

Loading is the process of writing the data into the target database. During the load step,
it is necessary to ensure that the load is performed correctly and with as few resources as
possible. Loading can be carried out in two ways:

1. Refresh
2. Update
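The two loading modes can be contrasted with a small sketch. It assumes the usual meaning of the terms (refresh rewrites the target completely, update applies only the changes) and uses a hypothetical `sales` table in an in-memory SQLite database as the target.

```python
import sqlite3

def refresh_load(conn, rows):
    """Refresh: rewrite the target table completely from the source rows."""
    with conn:  # one transaction: old data is replaced as a whole
        conn.execute("DELETE FROM sales")
        conn.executemany("INSERT INTO sales (id, amount) VALUES (?, ?)", rows)

def update_load(conn, changed_rows):
    """Update: apply only new or changed rows, leaving the rest untouched."""
    with conn:
        conn.executemany(
            "INSERT OR REPLACE INTO sales (id, amount) VALUES (?, ?)",
            changed_rows,
        )

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (id INTEGER PRIMARY KEY, amount REAL)")
refresh_load(conn, [(1, 10.0), (2, 20.0)])   # initial full load
update_load(conn, [(2, 25.0), (3, 30.0)])    # only the changes since then
rows = conn.execute("SELECT id, amount FROM sales ORDER BY id").fetchall()
```

In this sketch a changed row overwrites its old version. A warehouse that keeps history would append the new version instead.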

6. Difference between ETL and ELT

| Basics | ETL | ELT |
|---|---|---|
| Process | Data is transferred to the ETL server and moved back to the DB. High network bandwidth is required. | Data remains in the DB except for cross-database loads (e.g. source to object). |
| Transformation | Transformations are performed in the ETL server. | Transformations are performed (in the source or) in the target. |
| Time-Maintenance | It needs high maintenance, as you need to select the data to load and transform. | Low maintenance, as data is always available. |
| Calculations | Overwrites the existing column, or needs to append the dataset and push it to the target platform. | Easily adds the calculated column to the existing table. |
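The difference in where the transformation runs can be sketched with an in-memory SQLite database standing in for the target. The table names and the `quantity * unit_price` calculation are assumptions made for the illustration.

```python
import sqlite3

# Hypothetical source rows: (order_id, quantity, unit_price)
raw_orders = [(1, 2, 50.0), (2, 1, 75.0)]

def etl(conn, rows):
    """ETL: transform outside the database, then load the finished rows."""
    transformed = [(oid, qty * price) for oid, qty, price in rows]
    conn.execute("CREATE TABLE etl_orders (order_id INTEGER, total REAL)")
    conn.executemany("INSERT INTO etl_orders VALUES (?, ?)", transformed)

def elt(conn, rows):
    """ELT: load the raw rows first, then transform with SQL inside the target."""
    conn.execute(
        "CREATE TABLE raw_orders (order_id INTEGER, quantity INTEGER, unit_price REAL)"
    )
    conn.executemany("INSERT INTO raw_orders VALUES (?, ?, ?)", rows)
    conn.execute(
        "CREATE TABLE elt_orders AS "
        "SELECT order_id, quantity * unit_price AS total FROM raw_orders"
    )

conn = sqlite3.connect(":memory:")
etl(conn, raw_orders)
elt(conn, raw_orders)
etl_result = conn.execute("SELECT * FROM etl_orders ORDER BY order_id").fetchall()
elt_result = conn.execute("SELECT * FROM elt_orders ORDER BY order_id").fetchall()
```

Both paths end with the same totals. In the ELT path the raw rows stay available in the target, which is why a new calculated column can be added later without reloading.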

7. Types of Data Warehouses?

o Host-Based (MVS) Data Warehouses
o Host-Based (UNIX) Data Warehouses
o LAN-Based Workgroup Data Warehouses
o Host-Based Single Stage (LAN) Data Warehouses
o Multi-Stage Data Warehouses
o Stationary Data Warehouses
o Distributed Data Warehouses
o Virtual Data Warehouses

8. Types of Data Warehouse model?

o Enterprise Warehouse
o Data Mart
o Virtual Warehouse

9.Differentiate Top-down and Bottom-up Approach?

| Top-Down Design Approach | Bottom-Up Design Approach |
|---|---|
| Breaks the vast problem into smaller subproblems. | Solves the essential low-level problems and integrates them into a higher one. |
| Inherently architected; not a union of several data marts. | Inherently incremental; can schedule essential data marts first. |
| Single, central storage of information about the content. | Departmental information stored. |
| Centralized rules and control. | Departmental rules and control. |

10. What is OLAP (Online Analytical Processing)?

OLAP stands for On-Line Analytical Processing. OLAP is a category of software
technology that enables analysts, managers, and executives to gain insight into information
through fast, consistent, interactive access to a wide variety of possible views of data that has been
transformed from raw information to reflect the real dimensionality of the enterprise as understood
by the clients.

11. Benefits of OLAP?

1. OLAP helps managers in decision-making through the multidimensional views of records
that it provides efficiently, thus increasing their productivity.
2. OLAP functions are self-sufficient owing to the inherent flexibility provided to the
organized databases.

12.Difference between OLTP and OLAP?

| Sr. No. | Key | OLAP | OLTP |
|---|---|---|---|
| 1 | Basic | It is used for data analysis. | It is used to manage a very large number of short online transactions. |
| 2 | Database Type | It uses a data warehouse. | It uses a traditional DBMS. |
| 3 | Data Modification | It is mainly used for data reading. | It manages all insert, update and delete transactions. |
| 4 | Response Time | Processing is a little slow. | In milliseconds. |
| 5 | Normalization | Tables in an OLAP database are not normalized. | Tables in an OLTP database are normalized. |
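The contrast in workload can be illustrated with one short write transaction and one read-only summary query. This is a minimal sketch on a hypothetical `account` table. A single in-memory SQLite database plays both roles here, whereas in practice the OLAP query would run against a separate data warehouse.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE account (id INTEGER PRIMARY KEY, region TEXT, balance REAL)")
conn.executemany(
    "INSERT INTO account VALUES (?, ?, ?)",
    [(1, "South", 100.0), (2, "South", 250.0), (3, "North", 400.0)],
)

# OLTP-style work: a short transaction that modifies a single row.
with conn:
    conn.execute("UPDATE account SET balance = balance - 40 WHERE id = ?", (1,))

# OLAP-style work: a read-only query that scans and summarizes many rows.
totals = conn.execute(
    "SELECT region, SUM(balance) FROM account GROUP BY region ORDER BY region"
).fetchall()
```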

13. Operations of OLAP?

o Roll up
o Drill down
o Slice
o Dice
o Pivot

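The five operations can be tried out on a tiny cube held in plain Python. The fact rows and dimension names below are invented for the illustration, and roll up is shown in its dimension-reduction form (dropping a dimension) rather than climbing a concept hierarchy.

```python
from collections import defaultdict

# Hypothetical fact rows: (year, quarter, city, product, sales)
FACTS = [
    (2023, "Q1", "Chennai", "Laptop", 10),
    (2023, "Q1", "Chennai", "Phone", 20),
    (2023, "Q2", "Chennai", "Laptop", 15),
    (2023, "Q1", "Mumbai", "Laptop", 5),
    (2023, "Q2", "Mumbai", "Phone", 25),
]
DIMS = ("year", "quarter", "city", "product")

def aggregate(facts, keep):
    """Sum sales over the dimensions named in `keep`."""
    idx = [DIMS.index(d) for d in keep]
    totals = defaultdict(int)
    for row in facts:
        totals[tuple(row[i] for i in idx)] += row[-1]
    return dict(totals)

def slice_(facts, dim, value):
    """Slice: fix one dimension to a single value."""
    i = DIMS.index(dim)
    return [r for r in facts if r[i] == value]

def dice(facts, conditions):
    """Dice: restrict two or more dimensions to sets of values."""
    return [r for r in facts
            if all(r[DIMS.index(d)] in vals for d, vals in conditions.items())]

def pivot(table):
    """Pivot: swap the two axes of a two-dimensional view."""
    return {(b, a): v for (a, b), v in table.items()}

by_quarter_city = aggregate(FACTS, ("quarter", "city"))
rolled_up = aggregate(FACTS, ("city",))                          # roll up: drop quarter
drilled_down = aggregate(FACTS, ("quarter", "city", "product"))  # drill down: add product
q1_only = slice_(FACTS, "quarter", "Q1")
chennai_laptops = dice(FACTS, {"city": {"Chennai"}, "product": {"Laptop"}})
by_city_quarter = pivot(by_quarter_city)
```

Roll up and drill down are opposites here: both call `aggregate`, one with fewer dimensions and one with more.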
14. Types of OLAP

o ROLAP
o MOLAP
o HOLAP

15. Define ROLAP

ROLAP servers are intermediate servers that stand between a relational back-end server and
user front-end tools. They use a relational or extended-relational DBMS to store and manage
warehouse data, and OLAP middleware to provide the missing pieces.
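The role of the intermediate server can be sketched as a function that turns a multidimensional request into SQL for the relational back end. The `sales` table, its columns and the `rolap_query` helper are hypothetical, and an in-memory SQLite database stands in for the relational DBMS.

```python
import sqlite3

# Hypothetical relational back end holding the warehouse data.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (region TEXT, product TEXT, year INTEGER, amount REAL)")
conn.executemany(
    "INSERT INTO sales VALUES (?, ?, ?, ?)",
    [
        ("South", "Laptop", 2023, 100.0),
        ("South", "Phone", 2023, 50.0),
        ("North", "Laptop", 2023, 70.0),
        ("South", "Laptop", 2024, 30.0),
    ],
)

ALLOWED_DIMENSIONS = ("region", "product", "year")

def rolap_query(dimensions):
    """Middleware step: turn a multidimensional request into a SQL GROUP BY."""
    # Only known column names are ever placed into the SQL text.
    if not dimensions or any(d not in ALLOWED_DIMENSIONS for d in dimensions):
        raise ValueError("unknown dimension")
    cols = ", ".join(dimensions)
    sql = f"SELECT {cols}, SUM(amount) FROM sales GROUP BY {cols} ORDER BY {cols}"
    return conn.execute(sql).fetchall()

by_region = rolap_query(["region"])
by_region_product = rolap_query(["region", "product"])
```

The front-end tool only names dimensions. The aggregation itself is computed by the relational engine at query time.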

16. ROLAP Architecture Components?

The ROLAP architecture includes the following components:

o Database server.
o ROLAP server.
o Front-end tool.

17. Define MOLAP?

A MOLAP system is based on a native logical model that directly supports
multidimensional data and operations. Data are stored physically in multidimensional arrays,
and positional techniques are used to access them.

18. Differentiate ROLAP and MOLAP

| ROLAP | MOLAP |
|---|---|
| ROLAP stands for Relational Online Analytical Processing. | MOLAP stands for Multidimensional Online Analytical Processing. |
| It is usually used when the data warehouse contains relational data. | It is used when the data warehouse contains relational as well as non-relational data. |
| It contains an analytical server. | It contains the MDDB server. |

19. Define HOLAP?

HOLAP incorporates the best features of MOLAP and ROLAP into a single
architecture. HOLAP systems store large quantities of detailed data in relational
tables, while the aggregations are stored in pre-calculated cubes.
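The split between detailed data and pre-calculated aggregations can be sketched as follows. The `sales_detail` table and the dictionary used as a "cube" are simplifications chosen for the illustration, not how any specific HOLAP product stores data.

```python
import sqlite3

# Detailed data lives in a relational table (the ROLAP side).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales_detail (region TEXT, product TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO sales_detail VALUES (?, ?, ?)",
    [("South", "Laptop", 100.0), ("South", "Phone", 50.0), ("North", "Laptop", 70.0)],
)

# Aggregations are calculated once, ahead of time, and kept in a cube (the MOLAP side).
cube = dict(conn.execute("SELECT region, SUM(amount) FROM sales_detail GROUP BY region"))

def total_for(region):
    """Summary queries are answered from the pre-calculated cube."""
    return cube[region]

def detail_for(region):
    """Detail queries go to the relational table."""
    return conn.execute(
        "SELECT product, amount FROM sales_detail WHERE region = ? ORDER BY product",
        (region,),
    ).fetchall()
```

A summary such as `total_for("South")` needs no scan of the detail rows, while `detail_for("South")` still reaches every underlying record.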

20. MOLAP Architecture Components?

The MOLAP architecture includes the following components:

o Database server.
o MOLAP server.
o Front-end tool.
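Inside the MOLAP server, the storage idea from the MOLAP definition (multidimensional arrays accessed by position) can be sketched with a flat array and a computed offset. The dimension members and the row-major layout are assumptions made for this example.

```python
# Hypothetical dimension members; their position in each list is the array index.
REGIONS = ["North", "South"]
PRODUCTS = ["Laptop", "Phone", "Tablet"]
QUARTERS = ["Q1", "Q2", "Q3", "Q4"]

# One cell per (region, product, quarter) combination, stored in a flat array.
cells = [0.0] * (len(REGIONS) * len(PRODUCTS) * len(QUARTERS))

def offset(region, product, quarter):
    """Compute a cell's position from its coordinates (row-major order)."""
    r = REGIONS.index(region)
    p = PRODUCTS.index(product)
    q = QUARTERS.index(quarter)
    return (r * len(PRODUCTS) + p) * len(QUARTERS) + q

def store(region, product, quarter, value):
    """Write a measure value into the cell at the given coordinates."""
    cells[offset(region, product, quarter)] = value

def lookup(region, product, quarter):
    """Read a cell directly by position; no search over rows is needed."""
    return cells[offset(region, product, quarter)]

store("South", "Phone", "Q3", 120.0)
```

Every combination gets a cell whether or not it holds data, which is why sparse cubes can waste space in this kind of layout.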