Data warehouse fourth unit notes
Data warehouse fourth unit notes
The process architecture defines an architecture in which the data from the data
warehouse is processed for a particular computation.
In this architecture, the data is collected into single centralized storage and
processed upon completion by a single machine with a huge structure in terms of
memory, processor, and storage.
It is very successful when the collection and consumption of data occur at the same
location.
Distributed Process Architecture
In this architecture, information and its processing are allocated across data
centers, and its processing is distributed across data centers, and processing of data
is localized with the group of the results into centralized storage. Distributed
architectures are used to overcome the limitations of the centralized process
architectures where all the information needs to be collected to one central
location, and results are available in one central location.
Client-Server
In this architecture, the user does all the information collecting and presentation,
while the server does the processing and management of data.
Three-tier Architecture
With client-server architecture, the client machines need to be connected to a
server machine, thus mandating finite states and introducing latencies and
overhead in terms of record to be carried between clients and servers.
N-tier Architecture
Cluster Architecture
Peer-to-Peer Architecture
This is a type of architecture where there are no dedicated servers and clients.
Instead, all the processing responsibilities are allocated among all machines, called
peers. Each machine can perform the function of a client or server or just process
data.
Intraquery Parallelism
Interquery parallelism does not help in this function since each query is run
sequentially.
This application of parallelism decomposes the serial SQL, query into lower-level
operations such as scan, join, sort, and aggregation.
Interquery Parallelism