0% found this document useful (0 votes)
122 views

Applications of Data Warehousing

Data warehouses are used in public applications like business, finance, banking, and healthcare to integrate data from various sources into a central repository for analysis and strategic decision-making. They allow organizations to access large amounts of historical data in a consistent format to make timely, well-informed decisions. While traditional databases struggle with increasing demands for online access, retrieval, and maintenance of large data volumes, data warehouses provide efficient solutions through techniques like extract-transform-load processing and online analytical processing. Various case studies demonstrate how data warehouses improve processes and help achieve strategic objectives in government and business domains.

Uploaded by

Anan Sirwal
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
122 views

Applications of Data Warehousing

Data warehouses are used in public applications like business, finance, banking, and healthcare to integrate data from various sources into a central repository for analysis and strategic decision-making. They allow organizations to access large amounts of historical data in a consistent format to make timely, well-informed decisions. While traditional databases struggle with increasing demands for online access, retrieval, and maintenance of large data volumes, data warehouses provide efficient solutions through techniques like extract-transform-load processing and online analytical processing. Various case studies demonstrate how data warehouses improve processes and help achieve strategic objectives in government and business domains.

Uploaded by

Anan Sirwal
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 10

Applications of Data Warehouse in Public

Data warehouse is the important means of preparing the update efficiently which greatly impacts every industry [5].
peoples and government to face the challenges of the new So companies start seeking the solution for all their
millennium. Today data warehouse is used worldwide in problems and adopt DWH technology.
various applications. Today due to increase in data With sharp and harder competition, enterprises are targeting
complexity and manageability issues, data warehousing has in availing fast and pinpoint information to have best
attracted a great deal of interest in public life applications decisions. Furthermore, with the thirst for huge chunks of
like business, finance, Inventory, Banking, healthcare and information, enterprises’ traditional DB (database) is off no
industries. As the importance of retrieving the information use of smartly managing the increasing needs of online
from knowledge-base cannot be denied, data warehousing information update, access, maintenance, and retrieval. This
is all about making the information available for decision lagging impressively effects the efficiently and effectively
making. Data warehouse is taken as the heart of the latest usage of internal data by the management to hold decision-
decision support systems (DSS). Due to the eagerness of making in time. As a result, to search for various ways and
data warehouse in public life, the need for the design and means to store, access, handle, and utilize the huge chunks
implementation of data warehouse in different applications of data in an effective manner, is the main concern of every
is becoming crucial. Information from various operational business [5].
data sources are integrated by data warehousing into a Organizations requires a database system for their daily
central repository to start the process of analysis and mining decision making, with better adaptability, top flexibility,
of integrated information and primarily used in strategic and best support. Considering the past decade, the
decision making by means of online analytical processing educational (academia) side and the industry side, both
techniques (OLAP). have progressively plated different layouts to solve the
problems and to present solution to craft an aforementioned
system [5]. Adopting the data warehouse technology is one
I. INTRODUCTION of the solutions to that. DWH was defined by Inmon [3, 4]
Operational and transactional systems are the new as, ‘‘pooling data from multiple separate sources to
generation systems which are different from 1970’s construct a main DWH”. Proper data-analyzing tools can be
decision support systems (DSS) [1]. In order to complete used by different users to analyze and store required data.
the life cycle, DSS needs a Data Warehouse (DWH). A Data Warehouse’s purpose is to take large data from
DWH pools the available data which is spread all over the heterogeneous sources and furnish them in known formats
organization, and makes a unify pool (like data structure) that helps in understanding and for making smart decisions
having the presence of similar and linked formats [2]. [6]. The Benefits linked to the DWH applications include
Data warehousing takes off in the 1980s as an answer to the the region of time saving, with the availability of clean and
very little or no availability of information propagated by handful of information, tough and exact decisions making
online application systems, online applications were praised in accordance with the improvement of processes related to
by a very limited domains of users, and integration was not business and to help achieving strategic business objectives
there even [3]. Historical data kept by online applications [2, 4, 5, 6].
are very little as they deposit their historical data for high Realizing the need after researching literature and for
performance in faster way. Thus organizations hold very further exploring on this research article, taking in account
little information as compared to data [3]. the importance of the applications of DWH in public life
With the thirst and huge need for large blocks of and the shortfall of the factual research, we have all the
information, DWH gain much importance and became an concreate reason to explore the most applications of DWH
essential strategy component for medium and large in public life. In this paper we discussed different
organizations. Timely and accurately decision making at applications of DWH in public life along with available
management level becomes difficult due to the incapability case studies. Its sections as follows; Section 2 presents
of traditional databases to handle increasing demands of DWH technology. Section 3 presents the applications of
online information access, retrieval, maintenance and data warehousing in different domains. Section 4 provides a
descriptive view of different case studies under the
umbrella of government and business categories. Section 5
provides a brief usage analysis of Data Warehouse
applications. Finally, conclusion is presented in Section 6.
II. DATA WAREHOUSE TECHNOLOGY function. As shown below in figure 1 there are four separate
A data warehouse is a repository of information collected and distinct components of data warehouse:
from multiple sources, stored under a unified schema, and
that usually resides at a single site and it is subject-oriented, 1 Operational Source Systems These are the operational
integrated, nonvolatile, and time-variant. Data warehouses systems of record that collect data from multiple sources
are constructed via a process of data cleaning, data such as all applications, databases, computer system within
integration, data transformation, data loading, and periodic the enterprise etc. The operational source systems are
data refreshing. To facilitate decision making, the data in a thought of as outside the data warehouse because we have
data warehouse are organized around major subjects, such no control over the content and format of the data in these
as customer, item, supplier, and activity. The data are stored operational systems. Because operational data is
to provide information from a historical perspective (such fragmented, inconsistent and not organized around subjects
as from the past 5-10 years) and are typically summarized (such as customer, vendor and product), so it must be
[8]. The primary goals of a data warehouse are the cleaned up to make it consistent and useful for decision
following: making [9].
- Make organization’s information easily accessible.
- Present the organization’s information consistently. 2 Data Staging Area The data staging area is everything
- Provide capacity to separate and combine data. between the operational source systems and the data
- Inclusion of tools set to query, analyze and present presentation area (see fig 1). The data staging area of data
information. warehouse is both a storage area and a set of process
- Publish used data. commonly referred to as extract-transform-load (ETL).
- Serve as the foundation for business decision Extraction is the first step in the process of getting data into
making. the data warehouse environment. Extracting means reading
Feature of Data Warehouse and understanding the source data and coping the data
1. Subject-oriented: A data warehouse is organized needed for the data warehouse into the staging area for
around major subjects such as customer, supplier, further manipulation.Once the data is extracted to staging
product and sales. Rather than concentrating on the area there are numerous transformations such as cleansing
day-to-day operation and transaction processing of an the data, combining from multiple sources, deduplicating
organization data- warehouse focuses on the modeling data and assigning warehouse keys etc.
and analysis of data for decision makers. Hence by applying these transformations the raw
2. Integrated: A data warehouse is usually constructed by operational data is transformed into a warehouse deliverable
integrating multiple heterogeneous sources, such as fit for user queries. In data staging area simple activities of
relational database, flat files and on-line transaction sorting and sequential processing are also done. The final
records. Data cleaning and data integration techniques step of the ETL process is the loading of data. Loading in
are applied to ensure consistency in naming, the data warehouse usually takes the form of flat files,
conventions, encoding structures, attributes measures relational tables, quality assured dimensional tables.
and so on.
3. Time-variant: Data are stored to provide information
from a historical perspective (e.g., the past 5-10 years).
Every key structure in the data warehouse contains,
either implicitly or explicitly, an element of time[9].
4. Nonvolatile: Data in a data warehouse is never updated
but used only for queries. Thus such data can only be
loaded from other data bases such as the operational
databases. End users who want to update the data must
be the operational database, as only it can be changed,
updated or deleted. This means that a data warehouse
will always be filled with historical data .In sum, a data
warehouse is a semantically consistent data store that
serves as a physical implementation of a decision
support data model and stores the information on which
an enterprise needs to make strategic decision. A data
warehouse is also often viewed as architecture,
constructed by integrating data from multiple
heterogeneous sources to support structured and/or ad
hoc queries, analytical reporting and decision making.
3 Data Presentation Area The data presentation area is
Components of data warehouse where data is organized, stored, and made available for
The components makes a complete environment of data direct querying by user, report writers, and other analytical
warehouse. Each of these components performs specific applications (i.e. data access tools). The presentation area is
a series of integrated data marts containing many fact tables
and decision tables. The dimensional modeling is the most 2 Applications Tools These application tools provides
viable technique for delivering data to data warehouse users more facilities to end users than the built-in query and
for making databases simple and understandable. reporting tools. These tools provide complex set of queries
The dimensional modeling is quite different from third and data models. So the end user must be SQL and data
normal form (3NF) modeling is a design technique that modelling expert. Example of such application tools is
removes data redundancies. Hence dimensional model Power Builder from Power Soft, Forte from Forte Software
contains the same information as a normalized model but etc [8].
packages the data in a format whose design goals are user
understandability, query performance and resilience to 3 OLAP Tools Online analytical processing (OLAP) tools
change. are based on the concept of multidimensional databases.
In presentation area data mart may contain detailed, atomic OLAP tools are capable of analyzing online a large number
data. All the data marts must be build using common of past transactions and large number of records and
dimensions and facts. Without shared, conformed summarize them. This type of data is usually
dimensions and facts, a data mart is a stand alone. Each multidimensional in nature i.e. the data is organized in a
data mart may contain several fact tables, each with 5 to 15 multidimensional model and supported by a
dimension tables. If the design has been done correctly multidimensional database. The OLAP tools can be of three
many of these dimension tables will be shared from fact types: Multidimensional OLAP tools (MOLAP), Relational
table to fact table. OLAP tools (ROLAP), Hybrid OLAP tools (HOLAP).
The data be presented, stored and accessed in dimensional Examples of some OLAP tools are Meta Cube from
schema. If the presentation area is based on relational Informix, Oracle Express, Cognos Power Play etc.
database, then these dimensionally modeled tables are
referred to as star schema. If the presentation area is based III APPLICATIONS OF DATA WAREHOUSE IN PUBLIC LIFE
on multidimensional database or Online Analytic Importance of DWH cannot be denied due to its benefits
Processing (OLAP) technology then the data is stored in because decisions at management level will no longer need
cubes. to be taken on the limited and inaccurate data and it also
Dimensional modeling is applicable to both relational and helps the companies to avoid different challenges. So it
multidimensional databases. Both have common logical becomes the need of every individual company to
design with recognizable dimensions but their physical implement data warehouse. It is estimated that by 2020
implementation differs [7]. around 200% more devices will join the Internet and share
data. DWH strongly depends upon devices and inter linked
4 Data Access Models The principal purpose of data data. The more interlinked devices are, the more powerful
warehouse is to provide information to business users for and useful DWH. According to the forecast by many
decision making. These users interact with data warehouse organization [25, 26] by 2016 around 6.4 billion connected
using these data access tools. Many of these tools require an peers will join the room globally, an increase of 30% from
information specialist or a domain expert, who can analyze 2015. Cisco and other research agencies [25, 26] think that
the information and can interact with the data warehouse approximately 20 - 50 billion devices will be connected by
environment in order to reach meaningful conclusions. 2020[25,26].
There are following four category of data access tools: Other side of the picture is that cost will increase too. If we
Query and Reporting Tools, Application Tools, OLAP talk about spending on hardware, the applications related to
Tools, Data Mining Tools consumer will hit to $546 billion by the end of 2016; apart
from that the usage of connected items in the organization
1 Query and Reporting Tools Query and reporting tools will be somewhere around $868 billion by the end of 2016
are simple and built-in data access tools. This category of [25, 26].
tools can be further subdivided into two groups: reporting As we can see that different levels are defined. These levels
tools and managed query tools. are associated with the hierarchy such that first level is the
core component. The first level is always be a central DWH
Reporting Tools These tools can be further subdivided into (core system(s), hardware system(s)). Furthermore, 2nd
two groups: Production Reporting Tools and Report level is associated with one of the world’s top domains
Writers. Production reporting tools let companies to (Root level, business and Government). The reason behind
generate regular operational reports. Report writers are the selecting Business and Government as top
tools designed for end users. Figure 2 shows the cycle of public life applications of data
warehouse in different fields and how they are interrelated
Managed Query Tools These tools makes the query task according to user preference. As we can see that different
easier for end user by eliminating complexity of SQL and levels are defined. These levels are associated with the
database structures and inserting the meta layer between hierarchy such that first level is the core component. The
end user and database. Meta layer is a software that first level is always be a central DWH (core system(s),
provides subject oriented view of data base. Hence these hardware system(s)). Furthermore, 2nd level is associated
tools are designed for ease to use and can automatically with one of the world’s top domains (Root level, business
generate SQL statements and accept SQL statements. and Government).
Fig Applications of Data Warehouse in Public Life

The reason behind selecting Business and Government as 2) Construction (material based industries)
top of hierarchy is a handful of literature, and all other Data warehouse approach in construction industry seems to
domains are encapsulated under them. With the presence of be efficient in decision making as it provides construction
2nd level all other sublevel gets populated. The 2nd level managers the complete internal and external knowledge
serves as the only pillar that supports all other domains. 2nd about available data so that they can measure and monitor
level is said to be a specific level. 3rd level domains are the the construction performance.
more general than specific. The Nth level is the most Application of DWH in construction industry clearly shows
general level that holds all minor to major domains that construction bosses can smartly judge the stock
A. Buisness remaining, inventory related trend linked to the materials,
Improvement related to decision making and increasing the amount and quantity of each material and also the price
organizational performances are the basic reasons to adopt of all materials [30, 56]. It would also be helpful in
DWH in business [27]. Business holds a key location in reasonable resource allocation to fulfill the required
applications of data warehouse. All other private and semi- services, maintenance and operation of the systems,
private organizations come under its umbrella. allocation of financial budgets, effective managing of
In DWH, for easiness a single repository is used to store investment related long term plans and identification of
data, which is extracted from different databases. This data potential risks [31].
repository provides forecasting which helps the business
personals and business managers. This complete cycle is 3) Manufacturing Industry
used to help in identifying the requirements for business DWH plays a vital role in daily house to industrial hold
and to draft a plan for business [28] as shown in fig 2. things. Manufacturing industry includes product and
process design, scheduling, planning, production,
1) Social media websites maintenance and huge investments in equipment,
Social media is a great example of data warehousing. Social manpower and heavy machinery. In this scenario, decisions
media industry is emerging and so is the need to implement taken will have wide-ranging effects in terms of
DWH in it. A number of features from Facebook, Twitter profitability and long-term strategic issues. Many industries
and other social media sites are also based on analyzing are trying to convert themselves and many should adopt
large data sets [29]. It gathers all data like groups, likes, DWH technology rather than traditional decision making so
friends, location mapping etc. and stores it in a single that a warehouse gathers, standardizes and stores data from
central repository. Although all this information is stored in various applications for improvement in processes and
separated databases but the most relevant and significant increasing its efficiency as analyzing the data in separate
information is stored in a central aggregated database [28]. applications is time-consuming. At this stage, some
transaction processing systems, which are updated timely, DWH provides an integrated and total view of an institute
are often hired to propagate the routine business of [33]. Most of the related departments use data warehouse as
manufacturing and construction companies [56, 57]. a source of information about faculty and students. DWH
helps the students in getting their results and notes from a
4) Marketing web enabled database quickly through a student portal and
Every business is not successful without proper marketing last but not the least it helps in decision making by
and marketing is not successful without knowing the latest providing current and historical information of the institute.
trends and demands. Relationship marketing is a new On a large scale, a DWH can integrate the information of
terminology linked with how different businesses handle different institutes into a single central repository for
their customers and the relationships in between that are analysis and strategic decision making.
assets for them and how they can be improved for long-
term profitability. DWH in marketing is used to examine 7) Finance
the patterns of customer’s behavior and use this customer With the advancement in technology, especially IT industry
information for implementing relationship marketing. They has opened the doors to the new ways of handling business
play a vital role in identifying and targeting the profitable considering financial systems. Government and Business
customers [32]. domain holds equal part in finance. Financial systems may
a) Trend Analysis include banks, post offices, insurance companies, income
It is a technique that is used to predict future outcomes tax and all other tax departments etc. Implementation of
from historical results or information. Different data warehouse in financial industry has several benefits
medium to large scale enterprises are converting to e.g. it can maintain transparency in account opening and
this. In trend analysis, DWH can be used to examine transactions. Similarly, government can take decisions
the behaviors of the customer by using historical against any financial crises. These systems are intelligent
records over consecutive months. enough to spot the defaulters and may act according to the
b) Web Marketing situation. As data warehousing is maintained in this
Web is a hub of billions of devices and around 20 – 50 scenario so efficient decision making process can easily be
billion devices till 2020. It refers to a category of performed. These data warehouses in finance applications
advertising that includes any marketing activity can also be used for the analyzation and to have forecasting
conducted online. Facebook, google, and many major of different aspects of business, stock and bond
to minor such like sites uses web marketing and are performance analysis [34, 58, 60].
relying on latest updated data warehouse.
c) Market Segmentation 8) Inventory
Behavior identification is the top most priority of any It can be defined as storage of items and availability of
organization. Market segmentation is the identification items. In any departmental store whose monthly turn over is
of the customer’s behavior and common characteristics more than 1000000/- it deals with hundreds of items. It has
related to the purchases made against that product of good turn over and reputation in city. Firm employed 10 or
related company. Many organizations are focusing on more employees. One person maintains the account of firm.
integrating data warehouse to get best behavior Seven staff members are for counter sale and three for
analysis. outstation work time. The account gets information from
counter salesman such as daily sales, cash slips, information
5) Banking purchase all goods from wholesaler firm adopt a simple
The banking industry is categorized as one of the highest policy for ordering new items when stock goes down to P-
information demanding industry in the business world. items (called reorder point) and will order Q more items
With the advancement in information technology sector, the (called reorder quantity) from wholesaler. The firm has to
role of business intelligence (BI) increases with great balance the reorder print and reorder quantity for properly
number in the process of banking operations [54]. The running the store.
increased business speed and growing competition has Thus the firm build a project for data warehouse in which
shown the need of banking intelligence dramatically. Bank huge amount of information (such as daily sale, purchase,
intelligence is the ability to gather, manage, and analyze a cash, credit, back sales, bills, reorder cost, reorder point,
large amount of data on bank customers, products, reorder quantity of hundred of items) is stored on the daily
operations, services, suppliers, partners and all the basis. After a long run of firm it may be needed for owner
transactions. As data increases, it becomes difficult to of firm to analyze the total investment and total profit from
collect, handle and transform it into useful knowledge and firm in monthly, yearly basis. This analysis can be done by
DWH solves this problem. Many data warehouse flavors applying data mining tools on data stored in data warehouse.
are designed for the support of banking industry.
9) Telecommunication
6) Education A large number of wireless telecommunication companies
DWH in education field is becoming popular day by day. is interested in data warehouse technology which already
Use of DWH in educational field presents several potential has collected raw material from multiple sources concerned
benefits in making appropriate decisions and for evaluating with calls incoming or outgoing, technical data, customer
data in time which is the basic target of DWH process. billing data. The data warehouse team will format raw data
for consistency in data warehouse and check the data for find patterns in geography. So far, data mining and
validity and quality Assurance. Then this data will be fitted Geographic Information Systems (GIS) have existed as two
into data model of data warehouse. The data mining tools separate technologies, each with its own methods, traditions
and techniques are applied on this data stored in data and approaches to visualization and data analysis.
warehouse for telecommunication companies and extracting Particularly, most contemporary GIS have only very basic
the useful and hidden information to take future decisions. spatial analysis functionality. The immense explosion in
geographically referenced data occasioned by developments
10) CRM (Customer Relationship Management) in IT, digital mapping, remote sensing, and the global
CRM is the process that manages that interaction between diffusion of GIS emphasizes the importance of developing
company and its customer. The interacting with customers data driven inductive approaches to geographical analysis
is not simple as it has been in past. The way in which and modeling.
companies interact with their customers has changed. A Data mining, which is the partially automated search for
customer is continue business is no longer granted. As a hidden patterns in large databases, offers great potential
result companies have found that they need to understand benefits for applied GIS-based decision-making.
their customers better. Company need to know their
customers preference not only for products but also for B. Government
style, service etc. They need to manage the relationship Amongst the two major sub-divisions of DWH industry,
with each and every customer and make each as profitable government holds equal division. Government can use data
customer. Those companies that are successful will find warehousing technique in different fields e.g. for searching
increased revenue at lower cost of sales and work time. The terrorist profile and threat assessments, in agriculture, in
methodology that makes this possible is called customer educational industry, in financing department, medical
relationship management (CRM). departments and for fraud detection. The
Many organizations have collected and stored a huge telecommunication industry and Banking industry holds
amount of data about their customers, suppliers and many issues related to user frauds. Figure 2 shows
business partners. However inability to discover valuable application of data warehouse in government departments.
information hidden in the data prevents them to take
benefits from this large database. The business desire is to 1) NIC
extract valid, previously unknown information from this The general information service terminal of national
large data base and use it for profit. To fulfill these goals informatics center (GISTNIC) data warehouse for Tamil
the organization needs data ware- house to manage this Nadu was implemented in 1998. In this data warehouse the
large amount of data and then apply the data mining information was collected on village level such sector as
techniques to extract the useful information for decision education, health, rainfall, data on individuals below
making [7]. poverty line (BPL Survey) etc. This GISTNIC data
warehouse is a web enabled data warehouse which provides
11) Science and Engineering information on national issues ranging across diverse
Advances in technology are making massive data sets subject like food, agriculture and latest updates in science
common in many scientific disciplines. To find useful and technology. This information was collected to fullfill
information in these data sets, scientists and engineers are the needs of politicians, economists and to all the citizens.
turning to data mining techniques. The web enabled SAS software is used as data mining tool
In recent years, data mining has been widely used in area of which provide online information to these decisions makers
science and engineering, such as bioinformatics, genetics, in the government sector [9].
medicine, education and electrical power engineering
,medical imaging, astronomy, chemistry, remote sensing, 2) World Bank : The world bank collects and maintains
and physics. In the area of study on human genetics, the huge amount of data of economic and development of all
important goal is to understand the mapping relationship the countries of world. For the purpose of monitoring of
between the inter-individual variation in human DNA various world bank’s project in all the countries of world
sequences and variability in disease susceptibility. In many the world bank started collecting data and analysing it. For
terms, it is to find out how the changes in an individual's more than hundred developing countries the world banks
DNA sequence affect the risk of developing common also captured data on their economics and financial credit
diseases such as cancer. This is very important to help [9]. The world bank build data warehouse and analyse it
improve the diagnosis, prevention and treatment of the using OLAP tools which is not possible manually.
diseases. The data mining technique that is used to perform
this task is known as multifactor dimensionality reduction. 3) Medical
Medical sector is emerging as the highest DWH
12) Space Applications implementer industry. In health-care, data quality and
Space Applications is the application of data warehouse demand for quality medical services has become
techniques to spatial data. It is called Spatial data mining. increasingly important [55, 59]. Due to the intricacy and
Spatial data mining is the application of data mining variety of medical cum clinical data, the adoption of data
techniques to spatial data. Spatial data mining follows along warehouses by health care was slow as compared to other
the same functions in data mining, with the end objective to fields. Over the past few years it was reported that the usage
of DWH increased by the administrative and clinical areas. Figure 3, which is related to the contribution made by
Data warehouses can help in improving the care of specific business and government domains to DWH. It is clearly
patients. These health-care institutions are adopting data observed by the survey that 80% of Business and 20%
warehousing for strategic decision making as a decision Government related organizations are contributing in the
supporting tool. It provides the tools for acquiring medical progress of data warehouse.
data, for extracting the relevant information from that data A. Business
and finally making this knowledge available to all the DWH in business is now emerging like a hurricane. Around
concerned persons. Administrative data in data warehouse 80% of data warehouse implementation is captured by
can help in providing the information about skilled staff business. Following are few case studies related to business
needed for a particular treatment and this information implementation of data warehouse.
further used for the treatment scheduling and to help 1) Finance
supporting medical personals in human resources area [36]. Financial services company (FSC) is considered to be the
leading marketer of investment besides banking for
4) Data warehouse for Ministry of Rural Development: products. They implemented DWH named as VISION. The
The Ministry of Rural Development (MRD) has been user of VISION consists of financial and marketing analysts,
regularly reviewing the data on Rurals. The analysis of this managers. It was developed with substantial business and
large amount of data become difficult. But the data technical goals that can gave a factual and precise picture of
warehouse technology can make it easy to analyse which best customers of banks and also about most important
includes all analysis variables into consideration. Rural products [27].
development data warehouse can be further developed by 2) Medical
collecting more detail information about the infrastructure This case study is based on generation of evidence-based
of the districts. Thus rural development data warehouse guidelines performed by University Health Network
aims at making the rural infrastructure data’s beneficial for (Toronto) which clearly showed that it is authentic,
the society. It also provides ready to use data to the people influential and user-friendly to have a DWH related to
at Ministry of Rural Development. clinic for best strategic decision making. Without this IT
1) To reduce the amount of resources – time and support, it would not be imaginable to look for evidence-
manpower- spent on managing the volumes and variety of based medicine as it is difficult for clinicians to gather data
databases handled by Ministry of Rural Development. for a specific disease [36].
2) To provide powerful decision making tools in the hands 3) Banking
of end-users (at Ministry of Rural Development) in order to Their research problem is based on the factors that banking
facilitate better decision making for rural development industry should consider before and during the adoption of
programs. DWH technology. Their results revealed the number of
banks in Taiwan that adopted this technology and also the
5) Fraud and Threat detection architectures that these banks implemented [5].
Governments are playing their part to detect any threat and 4) Manufacturing
fraud caused by ill-minded people. Unfortunately, almost Large Manufacturing Company (LMC) is making its way to
no specific data warehouse implementation that is known is top for production of home related appliances. LMC
available. Data warehouse access to governments are there, implemented data warehouse technology as there is a great
but they need a data warehouse system that is linked with need to improve the technical infrastructure of the
every corner so that threats and terrorists will be monitored company. Before this, data was scattered in different
. CASE STUDIES formats throughout the company and this makes normal and
In this section few case studies are discussed. As discussed basic functioning difficult for business units. This
earlier data warehouse world is a blend of two parts i.e. warehouse provides support to marketing, manufacturing
business side and government side. Both sides have their and logistic applications by providing data to dependent
own further divisions and any other increment will be data marts [27].
added under them. A graphical view is presented in the

www.ijcsit.com 2550
B. Government Their DWH is updated each night that includes: Large
Data warehouse in government plays a vital and critical Metadata Repository, Security and auditing infrastructure
role. Around 20-35% of data warehouse industry is and Master Reference Data. By using latest information
captured by government. Many developing countries are from data warehouse patients with high risk are identified
now transferring to the use of data warehouse. Few case and their reports were sent at every hospital or clinic [35].
studies related to government and usages of data warehouse
are as follows. 3) Finance
Internal Revenue Service is the agency of U.S. that is
1) NIC responsible for tax collection and tax laws imposition. They
The general information service terminal of national implemented data warehouse CRIS as there is no way to
informatics center (GISTNIC) data warehouse for Tamil recoup entity with convinced attribute and perform some
Nadu was implemented in 1998. In this data warehouse the analysis on these marked entities. This implemented DWH
information was collected on village level such sector as consisted of five domains: business entity, tax returns
education, health, rainfall, data on individuals below entity, related to taxpayer transactions entity, peoples’
poverty line (BPL Survey) etc. This GISTNIC data income sources entity and tax payments details entity [27].
warehouse is a web enabled data warehouse which provides
information on national issues ranging across diverse ANALYSIS AND RESULTS
subject like food, agriculture and latest updates in science In this section we will see the areas, cross domains and
and technology. This information was collected to fullfill usage of data warehouse around the world and the graphical
the needs of politicians, economists and to all the citizens. view of inter related data effecting data warehouse.
The web enabled SAS software is used as data mining tool
which provide online information to these decisions makers A. Comparison of different cross domain areas affecting
in the government sector [9]. data warehouse
Table 1 shows the comparison of different cross domain
2) Medical areas and their interlinked data.
In Utah and Idaho, Intermountain Healthcare implemented
EDW. This healthcare system operates 22 hospitals, 179 B. Graphical representation of Survey
clinics, physician offices. This case study is about venous Following graph shows percentage captured by different
thrombosis patients. Datasets consists of: records of areas in DWH around the world. As we can see from the
Inpatients, columns of outpatient, financial data linked to or Figure 4, medical holds top position in using DWH
from patient’s accounts, data from laboratories related to technology.
clinics for the process of imaging and surgery [35] etc.

Fig 4 Percentage Distribution of DWH in Public Life

www.ijcsit.com 2551
TABLE I. COMPARISON OF DIFFERENT CROSS DOMAIN AREAS AFFECTING DATA WAREHOUSE
Percentage
Domain Area of Usage Cross domain Reference No.
used
Hospitals, Clinics,
Medical 23.30% Government/Business [37,38,40,39,35,41,42,36,28,12]
Physician offices
Finance Tax departments 6.20% Government/Business [34,45,28,27]
Baking industry all
Banking 6.20% Business [28,5,46]
around the world
Manufacturing Home appliances 1.90% Business [27]
Schools, colleges,
Education 3.80% Government/Business [33]
universities
Customer relationship
management, trend
Marketing 16% Business [28, 47, 48, 49, 50,32, 52]
analysis and
information system
Facebook, Twitter,
Social Media 6.20% Business [29]
others.
Infrastructure
Construction 8.70% Business [43,44,31]
management
Agricultural production
Agriculture 3.80% Government/Business [6]
department
Airports, Crime
Fraud Detection 1.90% Government [37,38,40,39,35,41,42,36,28,12]
Agencies
Airports, Crime
Threat Analysis 1.90% Government [34,45,28,27]
Agencies
Others Others 20% All [28,5,46]

DISCUSSION AND CONCLUSION last but not the least problems holding with recent data
This research survey describes the applications of data management systems’ infrastructure etc.
warehouse in various domains including government and
non-government organizations. Our analysis is based on the REFERENCES
literature review and case studies provided in this survey. [1] T. Ariyachandra, H. J. Watson, “Key organizational factors in data
warehouse architecture selection”, Decision Support Systems 49
The analysis of this study shows that the non-governmental
(2010) 200–212.
organizations use data warehouse technology much more [2] T. R. Sahama, P. R. Croll, “A Data Warehouse Architecture for
than the government organizations. The governments Clinical Data Warehousing”, in Roddick, J. F. and Warren, J. R.,
mostly use data warehouse for controlling the crime and Eds. Proceedings Australasian Workshop on Health Knowledge
fraud. Non-governmental organizations mostly use DWH Management and Discovery (HKMD 2007) CRPIT, 68, pages pp.
227-232, Ballarat, Victoria.
for data analysis, prediction and making decisions. Case [3] W.H. Inmon., “DW 2.0 Architecture for the Next Generation of Data
studies describe the importance of data warehouse in four Warehousing”, DM Review, Apr 2006, Vol. 16 Issue 4, p.8-25.
domains; Healthcare, Banking, Finance and Manufacturing. [4] W.H. Inmon, “Building the Data Warehouse”, Third Edition, York:
John Wiley & Sons, 2002.
The details of these case studies and their use of data
[5] Hwang, Hsin-Ginn, et al. "Critical factors influencing the adoption of
warehouse have been discussed in the Section 4. The data warehouse technology: a study of the banking industry in
analysis of the Table 1 shows that data warehouse is being Taiwan." Decision Support Systems 37.1 (2004): 1-21.
used in many application domains. The Figure 4 clearly [6] Nilakanta, Sree, Kevin Scheibe, and Anil Rai. "Dimensional issues in
agricultural data warehouse designs." Computers and electronics in
depicts the areas that are using data warehouse. It shows
agriculture 60.2 (2008): 263-278.
that medical and marketing areas are using data warehouse [7] Alex Berson, Stephen Smith, Kurt Thearling “Building Data Mining
much more than the other domains, whereas manufacturing, Applications for CRM”,2001.
agriculture, education, and government sector are rarely [8] C.S.R.Prabhu, ”Datawarehouse Concepts, Techniques, Products and
using data warehouse. Applications”,2nd Edition 2004.
[9] Margy Ross, Ralph Kimball, “The Data warehouse Toolkit”, 2nd
The analysis shows that data warehouse technology have Edition 2002.
been adopted in business as well as in government [10] J.V.D. Hoven, Data warehousing: bringing it all together,
organizations for managing their huge data and for decision Information Systems Management (1998 Spring) 92 – 96.
making. Still many organizations have not gone for the [11] R. Kimball, The Data Warehouse Toolkit, Wiley, New York, 1996.
[12] R.M.T. Lu, K.A. Mazouz, A conceptual model of data warehousing
adoption of DWH technology. Either they do not realize its for medical device manufacturers, Proc. of the 22nd Annual EMBS
importance or there may be difficulties in its adoption. The International Conference 2000 (July).
reasons for ignoring the importance of implementing DWH [13] D. Powell, To outsource or not to outsource? Netwo rking
Management (1993) 56 – 59.
technology have been discussed in literature that include
[14] Y. Yao, H. He, Data warehousing and the Internet’s impact on ERP,
quite large investment in terms of capital, more time IT Professional (2000 March) 37–41.
utilization, looking for intangible benefits are difficult, the

www.ijcsit.com 2552
[15] Rob, P., Coronel, C., 2006. Database Systems: Design, [40] Yoo, Sooyoung, et al. "Electronically implemented clinical indicators
Implementation, and Management. Course Technology. based on a data warehouse in a tertiary hospital: its clinical benefit
[16] Sen, A., Sinha, A.P., 2005. A comparison of data warehousing and effectiveness." International journal of medical informatics 83.7
methodologies. Commun. ACM 48 (3), 79–84 (2014): 507-516.
[17] Kimball, R., 2002. The Data Warehouse Toolkit: The Complete [41] Prokosch, Hans-Ulrich, and T. Ganslandt. "Perspectives for medical
Guide to Dimensional Modeling. John Wiley & Sons, Inc. informatics." Methods Inf Med 48.1 (2009): 38-44.
[18] Alsquor, M., Matouk, K., Owoc, M. L., A survey of data warehouse [42] Adlassnig, Klaus-Peter, et al. "Fuzziness in healthcare-associated
architectures:preliminary results. Proceedings of the Federated infection monitoring and surveillance." Norbert Wiener in the 21st
Conference on Computer Scienceand Information Century (21CW), 2014 IEEE Conference on. IEEE, 2014.
Systems,Wroclaw,2012,Sivut 1121-1126. [43] Chong, Heap Yih, Rosli Mohamad Zin, and Siong Choy Chong.
[19] Hackney, D., 2002. Architectures and Approaches for Successful "Employing data warehousing for contract administration: e-dispute
Data Warehouses, Oracle White Paper. resolution prototype." Journal of Construction Engineering and
[20] CHAKIR, Aziza, Hicham MEDROMI, and Adil SAYOUTI. Management 139.6 (2012): 611-619.
"Actions for data warehouse success." Editorial Preface 4.8 (2013). [44] Chau, Kwok-Wing, et al. "Application of data warehouse and
[21] Chaudhuri, S., Dayal, U., 1997. An Overview of Data Warehousing decision support system in construction management." Automation in
and OLAP Technology. ACM SIGMOD Record 26 (1), 64–74. construction 12.2 (2003): 213-224.
[22] Thakur, Garima, and Anjana Gosain. "A Comprehensive Analysis of [45] Chen, Wenzhe. "The Application of Data Warehouse Technology in
Materialized Views in a Data Warehouse Environment." IJACSA) Modern Finance." 2015 International Conference on Advances in
International Journal of Advanced Computer Science and Mechanical Engineering and Industrial Informatics. Atlantis Press,
Applications 2.5 (2011). 2015.
[23] Watson, H.J., Haley, B.J., 1997. Data warehousing: a framework and [46] Lin, Zhonglin, et al. "Banking intelligence: application of data
survey of current practices. J. Data Warehousing 2 (1), 10–17. warehouse in bank operations." Service Operations and Logistics,
[24] Watson, H.J., Gerard, J.G., Gonzalez, L.E., Haywood, M.E., Fenton, and Informatics, 2008. IEEE/SOLI 2008. IEEE International
D., 1999. Data warehousing failures: case studies and findings. J. Conference on. Vol. 1. IEEE, 2008.
Data Warehousing 4 (1), 44–55 [47] Shaw, Michael J., et al. "Knowledge management and data mining
[25] www.informationweek.com/mobile/mobile-devices/gartner-21- for marketing." Decision support systems 31.1 (2001): 127-137.
billion-iot-devices-to-invade-by-2020/d/d-id/1323081 [48] Ngai, Eric WT, Li Xiu, and Dorothy CK Chau. "Application of data
[26] www.gartner.com/newsroom/id/3165317 mining techniques in customer relationship management: A literature
[27] Watson, Hugh J., Dale L. Goodhue, and Barbara H. Wixom. "The review and classification." Expert systems with applications 36.2
benefits of data warehousing: why some organizations realize (2009): 2592-2602.
exceptional payoffs." Information & Management 39.6 (2002): 491- [49] Nedeva, Veselina Ivanova. "ANALYSIS OF MARKETING
502. INFORMATION SYSTEMS AND CONCEPTION OF AN
[28] Joseph, Madhuri V. "Significance of Data Warehousing and Data INTEGRATED MARKETING INFORMATION SYSTEM."
Mining in Business Applications." International Journal of Soft International Journal of Computing 3.2 (2014): 127-133.
Computing and Engineering (IJSCE) ISSN (2013): 2231-2307. [50] Payton, Fay, and Debra Zahay. "Why doesn't marketing use the
[29] Thusoo, Ashish, et al. "Data warehousing and analytics infrastructure corporate data warehouse? The role of trust and quality in adoption
at facebook." Proceedings of the 2010 ACM SIGMOD International of data-warehousing technology for CRM applications." Journal of
Conference on Management of data. ACM, 2010. Business & Industrial Marketing 20.4/5 (2005): 237-244.
[30] Chowdhury, Rajdeep, et al. "Implementation of Central Dogma [51] Thomas, Davenport, et al. "Data to Knowledge to Results, Building
Based Cryptographic Algorithm in Data Warehouse Architecture for an Analytic Capability." California Management Review 43.2
Performance Enhancement." International Journal of Advanced (2001).
Computer Science & Applications 1.6: 29-34. [52] Cunningham, Colleen, Il-Yeol Song, and Peter P. Chen. "Data
[31] Park, Taeil, and Hyoungkwan Kim. "A data warehouse-based warehouse design to support customer relationship management
decision support system for sewer infrastructure management." analyses." Proceedings of the 7th ACM international workshop on
Automation in Construction 30 (2013): 37-49. Data warehousing and OLAP. ACM, 2004.
[32] Ryals, Lynette, and Adrian Payne. "Customer relationship [53] Watson, Hugh J., Celia Fuller, and Thilini Ariyachandra. "Data
management in financial services: towards information-enabled warehouse governance: best practices at Blue Cross and Blue Shield
relationship marketing." Journal of strategic marketing 9.1 (2001): 3- of North Carolina." Decision Support Systems 38.3 (2004): 435-450.
27. [54] Sarkar, Anirban. "Data Warehouse Requirements Analysis
[33] Goyal, Monika, and Rajan Vohra. "Applications of data mining in Framework: Business-Object Based Approach." International Journal
higher education." International journal of computer science 9.2 3 (2012).
(2012): 113. [55] Diana, Nova Eka, and Aan Kardiana. "Comprehensive Centralized-
[34] Bhedi, Vaibhav R., Shrinivas P. Deshpande, and Ujwal A. Lanjewar. Data Warehouse for Managing Malaria Cases." International Journal
"Data Warehouse Architecture for Financial Institutes to Become of Advanced Computer Science & Applications 1.6: 40-46.
Robust Integrated Core Financial System using BUID." International [56] N.L. Sarda, Temporal issues in data warehouse systems, Database
Journal of Advanced Research in Computer and Communication Applications in Non-Traditional Environments ’99, TheProceedings
Engineering 3.3 (2014): 2278 of the 1999 International Symposium on Database Application in
[35] Evans, R. Scott, James F. Lloyd, and Lee A. Pierce. "Clinical use of Non-traditional Environments (DANTE ’99), IEEE Computer
an enterprise data warehouse." AMIA Annual Symposium Society, Los Alamitos, 1999, pp. 27–34
Proceedings. Vol. 2012. American Medical Informatics Association, [57] J.-B. Yang, N.-J. Yau, Application of case-based reasoning in
2012. construction engineering and management, Proceedings of the Third
[36] Stolba, Nevena, and A. Min Tjoa. "The relevance of data Congress held in conjunction with A/E/C Systems 1996, Computing
warehousing and data mining in the field of evidence-based medicine in Civil Engineering, American Society of Civil Engineers, New
to support healthcare decision making." International Journal of York, 1996, pp. 663–669.
Computer Systems Science and Engineering 3.3 (2006): 143-148. [58] K.W. Chau, Y. Cao, M. Anson, J.P. Zhang, Application of data
[37] Schubart, Jane R., and Jonathan S. Einbinder. "Evaluation of a data warehouse and decision support system in construction management,
warehouse in an academic health sciences center." International Automation in Construction 12 (2) (2002) 213–224.
journal of medical informatics 60.3 (2000): 319-333. [59] J. Vanegas, P. Chinowsky, Computing in Civil Engineering,
[38] Liu, Baoyan, et al. "Data processing and analysis in real world American Society of Civil Engineers, New York, 1996.
traditional Chinese medicine clinical data: challenges and [60] J. Dyche, e-Data Turning Data into Information with Data
approaches." Statistics in medicine 31.7 (2012): 653-660. Warehousing, Addison-Wesley, Reading, 2000
[39] Leitheiser, Robert L. "Data quality in health care data warehouse
environments." System Sciences, 2001. Proceedings of the 34th
Annual Hawaii International Conference on. IEEE, 2001.

www.ijcsit.com 2553

You might also like