0% found this document useful (0 votes)
18 views

Data Sources For Data Warehouse

Data Warehouse
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOC, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
18 views

Data Sources For Data Warehouse

Data Warehouse
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOC, PDF, TXT or read online on Scribd
You are on page 1/ 2

Data Sources for Data Warehouse

The internet is the "mother of all data warehouses" but few corporations have
begun to harvest the hundreds of sources of data that will allow them to look
at what is going on outside the enterprise.

Data warehouses are intended to provide information to decision makers. To


do so, data warehouses must gather and consolidate data from many sources
in the organization into a consistent set of data that accurately reflects the
organization's business operation and history.

Organizations often have multiple online transaction processing (OLTP)


systems to capture daily business operations. These OLTP systems are
seldom designed at the same time as data warehouses. They may even be
designed by different organizations, which is often the case when
organizations grow through acquisitions and mergers. Database schemas and
data element identification keys often vary from database to database. For
example, the customer table in the OLTP of an acquired company may contain
many of the same customers and products as the acquiring company but use
a different identification system. Data extracted from these OLTP systems
must be transformed into a common representation.

Legacy systems that have been in use for many years often contain
denormalized data as well as unusual data identification designs and limited
query flexibility.

Data critical for business analysis may even reside on individual desktop
computers in personal databases and spreadsheets, especially in
organizations that developed and grew without a central information
technology group. Such data must also be captured into the data warehouse.

Sources of data to be used in the data warehouse must be identified and


techniques developed for extracting the data from them. Data Transformation
Services (DTS) provides powerful tools for extracting and transforming data
from diverse data sources.
Creating and Using Data Warehouses Overview
Organizations collect data in the normal course of business operations. The
purpose of a data warehouse is to consolidate and organize this data so it can
be analyzed and used to support business decisions. In many cases a data
warehouse contains the living history of the organization.

Data warehouses usually contain historical data, often collected from a variety
of disparate sources such as web, online transaction processing (OLTP)
systems, legacy systems, text files, or spreadsheets. A data warehouse
combines this data, cleanses it for accuracy and consistency, and organizes it
for ease and efficiency of querying.

Some definitions of a data warehouse include several elements such as a data


preparation area, the cleansing process, the database that holds the data
warehouse data, and the tools that organize and present the data to client
applications. Other definitions restrict the data warehouse to the database
that contains the data warehouse data. In large data warehousing
applications, data is often segmented into specialized components, called data
marts, that address individual components of the organization. Some
definitions consider data marts to be part of the data warehouse; other
definitions consider them to be separate entities. The intended meaning of the
term data warehouse is usually clear from the context in which it is used. The
topics in this section generally use the broadest definition and address
individual elements as components within the context of the data warehouse.

You might also like