Topic 5
Topic 5
Features of a DBMS
i. Databases is the collection are logically interrelated with each other. Often they represent
a single logical database.
ii. Data is physically stored across multiple sites. Data in each site can be managed by a
DBMS independent of the other sites.
iii. The processors in the sites are connected via a network. They do not have any
multiprocessor configuration.
iv. A distributed database is not a loosely connected file system.
v. A distributed database incorporates transaction processing, but it is not synonymous with
a transaction processing system.
Features of a DDBMS
i. It is used to create, retrieve, update and delete distributed databases.
ii. It synchronizes the database periodically and provides access mechanisms by the virtue of
which the distribution becomes transparent to the users.
iii. It ensures that the data modified at any site is universally updated.
iv. It is used in application areas where large volumes of data are processed and accessed by
numerous users simultaneously.
v. It is designed for heterogeneous database platforms.
vi. It maintains confidentiality and data integrity of the databases.
Fully Replicated
In this design alternative, at each site, one copy of all the database tables is stored. Since, each site
has its own copy of the entire database, queries are very fast requiring negligible communication
cost. On the contrary, the massive redundancy in data requires huge cost during update operations.
Hence, this is suitable for systems where a large number of queries is required to be handled
whereas the number of database updates is low.
Partially Replicated
Copies of tables or portions of tables are stored at different sites. The distribution of the tables is
done in accordance to the frequency of access. This takes into consideration the fact that the
frequency of accessing the tables vary considerably from site to site. The number of copies of the
tables (or portions) depends on how frequently the access queries execute and the site which
generate the access queries.
Fragmented
In this design, a table is divided into two or more pieces referred to as fragments or partitions, and
each fragment can be stored at different sites. This considers the fact that it seldom happens that
all data stored in a table is required at a given site. Moreover, fragmentation increases parallelism
and provides better disaster recovery. Here, there is only one copy of each fragment in the system,
i.e. no redundant data.
Mixed Distribution
This is a combination of fragmentation and partial replications. Here, the tables are initially
fragmented in any form (horizontal or vertical), and then these fragments are partially replicated
across the different sites according to the frequency of accessing the fragments.
5.4.2 Fragmentation
Fragmentation is the task of dividing a table into a set of smaller tables. The subsets of the table
are called fragments. Fragmentation can be of three types: horizontal, vertical, and hybrid
(combination of horizontal and vertical). Horizontal fragmentation can further be classified into
two techniques: primary horizontal fragmentation and derived horizontal fragmentation.
Fragmentation should be done in a way so that the original table can be reconstructed from the
fragments. This is needed so that the original table can be reconstructed from the fragments
whenever required. This requirement is called “reconstructiveness.”
Advantages of Fragmentation
i. Since data is stored close to the site of usage, efficiency of the database system is increased.
ii. Local query optimization techniques are sufficient for most queries since data is locally
available.
iii. Since irrelevant data is not available at the sites, security and privacy of the database system
can be maintained.
Disadvantages of Fragmentation
i. When data from different fragments are required, the access speeds may be very high.
ii. In case of recursive fragmentations, the job of reconstruction will need expensive
techniques.
iii. Lack of back-up copies of data in different sites may render the database ineffective in case
of failure of a site.
For example, let us consider that a University database keeps records of all registered students in
a Student table having the following schema.
STUDENT
Horizontal Fragmentation
Horizontal fragmentation groups the tuples of a table in accordance to values of one or more fields.
Horizontal fragmentation should also confirm to the rule of reconstructiveness. Each horizontal
fragment must have all columns of the original base table.
For example, in the student schema, if the details of all students of Computer Science Course needs
to be maintained at the School of Computer Science, then the designer will horizontally fragment
the database as follows −
CREATE COMP_STD AS
SELECT * FROM STUDENT
WHERE COURSE = "Computer Science"
Hybrid Fragmentation
In hybrid fragmentation, a combination of horizontal and vertical fragmentation techniques are
used. This is the most flexible fragmentation technique since it generates fragments with minimal
extraneous information. However, reconstruction of the original table is often an expensive task.
Improve performance:
The performance of the system can be improved by connecting multiple CPU and disks in
parallel. Many small processors can also be connected in parallel.
Improve availability of data:
Data can be copied to multiple locations to improve the availability of data.
For example: if a module contains a relation (table in database) which is unavailable then it is
important to make it available from another module.
Improve reliability:
Reliability of system is improved with completeness, accuracy and availability of data.
Provide distributed access of data:
Companies having many branches in multiple cities can access data with the help of parallel
database system.