Database Management NOTES
Database Management NOTES
UNIT I
Lesson 4 Managing Data Resources
Learning Objectives
To analyse the reasons for organisations having trouble in finding their required information.
To understand the benefits of database management system in .improving the organisation of
information
To learn the various types of database models affecting the way a business can access and use
information
To know about the requirements of a database environment
4.1 Introduction
Effective use of information depends on how data are stored, organised and accessed in an
organisation. Proper delivery of information not only depends upon the capabilities of computer hardware
and software but also on the organisations ability to manage data as an important resource. After seeing the
various methods and approaches of System Development, now, let us have some understanding of how
important are the data resources in an organisation and how the information system we developed is going to
handle them.
It has been very difficult for organizations to manage their data effectively. In trying to do so, we
have to meet two very big challenges which are standing out. Implementing a database requires a widespread
organisational change in the role of information and information managers, the allocation of power al senior
levels, the ownership and sharing of information, and patterns of organisational agreement. A database
management system (DBMS) challenges the existing power arrangements in an organisation and for that
reason often generate political resistance. In a traditional file environment, each department constructed files
and programs to fulfill its specific needs. Now, with a database, files and programs must be built that take
into account the full organizations interest in data. Although the organisation has spent the money on
hardware and software for a database environment, it may not reap the benefits it should if it is unwilling to
make the requisite organisational changes.
Moving to database environment can be a costly long-term process. In addition to the cost of DBMS
Software, related hardware, and data modeling, organizations should anticipate heavy expenditure for
integrating, merging and standardizing their data that will populate their database to eliminate
inconsistencies, redundancies and errors that typically arise when overlapping data are stored and maintained
by different systems and different functional areas.
You should understand the managerial and organisational requirements as well as the technologies
for managing data as a resource. Organizations need to manage their data assets very carefully to make sure
that the data can be easily accessed and managed by the managers and employees across the organisation.
First, we describe the typical challenges facing business trying to access information using
traditional file management technologies. Then we describe the technology of database management
systems, which can overcome many of the drawbacks of traditional file management systems and provide the
firm wide integration of information required for digital firm applications.
11.602
65
11.602
66
11.602
67
11.602
68
11.602
69
Self Explanatory
Self Explanatory
Primary Key----->
Secondary Key
Order Table
Order Number
Order Item
Number of Items Ordered
Customer ID
Primary Key
Self Explanatory
Self Explanatory
Secondary Key
11.602
70
The biggest problem with these databases is the misconception that every data element should be stored
in the same table. In fact, each data element should be analyzed in relation to other data elements with the
goal of making the tables as small in size as possible. The ideal relational database will have many small
tables, not one big one. On the surface that may seem like extra work and effort, but by keeping the tables
small, they can serve a wider audience because they are more flexible. This setup is especially helpful in
reducing redundancy and increasing the usefulness of data.
Advantages and Disadvantages
Hierarchical and network databases can be very efficient as long as you plan ahead. But as you
know, needs change, and neither one of these databases offers a lot of flexibility to change with business
needs. It's sort of like parents and children; once you establish the tie, it's pretty hard to amend.
Relational database management systems are more flexible, especially if you keep the tables small. It
is much easier for non-techies to create the query language in a relational system. It's also easier to add new
data elements, although if you do, you'll have to go back and fill in the missing information for the old
records or just forget them altogether.
4.3.3 Comparing of Database Alternatives
Type of
Decisions
Hierarchical
Network
Relational
Processing
Efficiency
High
Medium-High
Low but
Improving
Low
Low-Medium
End-user
friendliness
Low
Low-Medium
Programming
Flexibility
High
High
High
High
Low
Flexibility
The above table compares these alternatives on several dimensions to show you the advantages and
disadvantages of each.
What you should remember is that none of these databases is very good if you don't keep the end
user in mind. If you're not careful, you'll wind up with lots of information that no one can use.
4.3.4 Creating a database
First, you should think long and hard about how you use the available information in your current
situation. Think of the good and the bad of how it is organized, stored, and used. Now imagine how this
information could be organized better and used more easily throughout the organization. What part of the
current system would you be willing to get rid of and what would you add? Involve as many users in this
planning stage as possible. They are the ones who will prosper or suffer because of the decisions you make at
this point.
Determine the relationships between each data element that you currently have (entity-relationship
diagram). The data don't necessarily have to be in a computer for you to consider the impact. Determine
which data elements work best together and how you will organize them in tables. Break your groups of data
into as small a unit as possible (normalization). Even when you say it's as small as it can get, go back again.
Avoid redundancy between tables. Decide what the key identifier will be for each record. See, you've done
all this and you haven't even touched the computer yet!
11.602
71
Distributed databases are usually found in very large corporations that require multiple sites to
have immediate, fast access to data. As the book points out, there are lots of disadvantages, so you
should be careful in determining if this is the right way for you to run your business.
Object-Oriented and Hypermedia Databases: Many companies are steering away from strictly
text-based database systems. Data as objects can be pictures, groups of text, voice, audio, etc.
Object-oriented databases bring the various objects from many different sources and get them all
working together.
As we move away from strictly text-based information systems and incorporate video and sound,
graphics and text, the hypermedia database will become more common. The below given figure
helps explain the concept of a hypermedia database by showing how the various elements are
networked. The attraction to this type of database is that it allows the user to decide which path to
follow from one node to another.
Hypermedia database
11.602
72
Data Warehouses: As organizations want and need more information about the company, the
products, and the customers, the concept of data warehousing has become very popular. Remember
those islands of information we keep talking about? Unfortunately, too many of them have
proliferated over the years, and now companies are trying to rein them in using data warehousing.
No data warehouses are not great big buildings with shelves and shelves of bits and bytes stored on
them. They are huge computer files that store old and new data about anything and everything a
company wants to maintain information on.
Since the data warehouse can be cumbersome, a company can break the information into smaller
groups called data marts. It's easier and cheaper to sort through smaller groups of data. It's still
useful to have a huge data warehouse, though, so that information is available to everyone who
wants or needs it. You can let the user determine how the data will be manipulated and used. Using a
data warehouse correctly can give management a tremendous amount of information that can be used
to trim costs, reduce inventory, put products in the right stores, etc.
11.602
73
No one part of the organization should feel it owns information to the exclusion of other departments
or people in the organization. A certain department may have the primary responsibility for updating and
maintaining the information, but that department still has to share it across the whole company. Well-written
information policies can outline the rules for using this important resource, including how it will be shared,
maintained, distributed, and updated.
4.6.2 Data planning
At the beginning we said that as many users as possible should be brought together to plan the
database. We believed it so much then that we'll say it again here. By excluding groups of users in the
planning stages, no matter how insignificant that group may seem a company courts trouble.
11.602
74
As with any other resource, managers must administer data, plan their uses, and discover new
opportunities for the data to serve the organization through changing technologies.
11.602
75
To emphasize the redundancies of the application approach, let us review another example related to
banks.
Bank Example: Consider a new customer, Smith, opening a savings account. The personal data and the
account information are entered into the Savings file. At a later time, Joe opens a money market account with
the Money Market Department. Same information is entered. Finally, same information is entered again by
the Loan Department where Joe obtains a car loan. These files are shown below:
Savings File in the Savings Department
Account
Number
Depositor
Name
S-100
Smith, Joe
S-101
Doe, Jones
SSN
Address
111-111111
Phone
Deposit
$amount
(215) 2041234
$1.000
Phone
Deposit
$amount
222-111234
Depositor
Name
Smith, Joe
SSN
111-111111
Address
100 Main St., Phila, PA
19100
(215)2041234
$500.
11.602
76
Depositor
Name
SSN
Address
Phone
Loan
$amount
LD-123
Smith, Joe
111-111111
(215)2041234
$2500.
By seeing the above example we may conclude the disadvantages of the file system as the
following:
Lengthy Development Time and maintenance (Because there are many duplicated programs to
process same data)
Now, try to answer to the following question.
How would you intuitively 'redesign' the PVFC and the bank file system?
Just observe the following database design where the above listed disadvantages of file system are
completely eliminated.
Compare this database design with the previous 'traditional' file processing bank example, and note
how the redundancies were removed and that the customer record has to be 'related' to the accounts of the
same customer.
4.7.1 How are these relational databases developed?
We begin by analyzing the user's requirements .In the process of the analysis we develop the data
model (conceptual schema) representing the data requirements. The conceptual schema is independent of the
11.602
77
Program - Transaction
User View 1
(Report)
User View 2
(Screen Display)
User View 3
(Transactions)
Conceptual Schema
Physical Schema
Part of the conceptual schema for the PVFC example would actually look like this:
11.602
78
We would take this output and start to build a conceptual schema from it. Then we would build the physical
schema. The conceptual schema represents the overall logical structure of the database. It is contains all
entities and the list of attributes for these entities. The conceptual schema is used as the bases for
representing primary and foreign keys, and drawing links among the entities. The conceptual schema is also
later used in normalization -- in seeing the need for normalization and in carrying out the normalization
tasks.
Building the conceptual schema requires a careful analysis of ALL output requirements for a
system. This analysis CANNOT be carried out if you do not clearly understand the business parameters or
rules that govern the use of the system. You have to understand the problem domain, not just what is
needed. You need to know
11.602
What is needed?
What is possible?
What is not possible (or at least not likely)?
79